NOVEL COMPOSITIONS AND METHODS FOR CANCER 

The present application is a continuing application of U.S.S.N.s 09/747,377, filed December 
22, 2000 and 09/798,586, filed March 2, 2001, and applications entitled Novel Compositions 
and Methods for Cancer filed October 23, 2001, November 8, 2001, November 30, 2001, and 
December 20, 2001, all of which are expressly incorporated herein by reference. 

FIELD OF THE INVENTION 

The present invention relates to novel sequences for use in diagnosis and treatment of 
cancer, especially carcinomas, as well as the use of the novel compositions in screening 
methods. 

BACKGROUND OF THE INVENTION 

Oncogenes are genes that can cause cancer. Carcinogenesis can occur by a wide variety of 
mechanisms, including infection of cells by viruses containing oncogenes, activation of 
protooncogenes in the host genome, and mutations of protooncogenes and tumor suppressor 
genes. 

There are a number of viruses known to be involved in human cancer as well as in animal 
cancer. Of particular interest here are viruses that do not contain oncogenes themselves; 
these are slow-transforming retroviruses. They induce tumors by integrating into the host 
genome and affecting neighboring protooncogenes in a variety of ways, including promoter 
insertion, enhancer insertion, and/or truncation of a protooncogene or tumor suppressor gene. 
The analysis of sequences at or near the insertion sites has led to the identification of a number of 
new protooncogenes. 

With respect to lymphoma and leukemia, murine leukemia retrovirus (MuLV), such as SL3-3 
or Akv, is a potent inducer of tumors when inoculated into susceptible newborn mice, or when 
carried in the germline. A number of sequences have been identified as relevant in the 
induction of lymphoma and leukemia by analyzing the insertion sites; see Sorensen et al., J. 
of Virology 74:2161 (2000); Hansen et al., Genome Res. 10(2):237-43 (2000); Sorensen et 
al., J. Virology 70:4063 (1996); Sorensen et al., J. Virology 67:7118 (1993); Joosten et al., 
Virology 268:308 (2000); and Li et al., Nature Genetics 23:348 (1999); all of which are 
expressly incorporated by reference herein. 

Lymphomas are a collection of cancers involving the lymphatic system and are generally 
categorized as Hodgkin's disease and Non-Hodgkin lymphoma. Hodgkin's lymphomas are of 
B lymphocyte origin. Non-Hodgkin lymphomas are a collection of over 30 different types of 
cancers including T and B lymphomas. Leukemia is a disease of the blood forming tissues 
and includes B and T cell lymphocytic leukemias. It is characterized by an abnormal and 
persistent increase in the number of leukocytes and the amount of bone marrow, with 
enlargement of the spleen and lymph nodes. 

Breast cancer is one of the most significant diseases that affects women. At the current rate, 
American women have a 1 in 8 risk of developing breast cancer by age 95 (American Cancer 
Society, 1992). Treatment of breast cancer at later stages is often futile and disfiguring, 
making early detection a high priority in medical management of the disease. 

Accordingly, it is an object of the invention to provide sequences involved in cancer and in 
particular in oncogenesis. 

SUMMARY OF THE INVENTION 

In accordance with the objects outlined above, the present invention provides methods for 
screening for compositions which modulate carcinomas, especially lymphoma and leukemia. 
Also provided herein are methods of inhibiting proliferation of a cell, preferably a lymphoma 
cell. Methods of treatment of carcinomas, including diagnosis, are also provided herein. 

In one aspect, a method of screening drug candidates comprises providing a cell that 
expresses a carcinoma associated (CA) gene or fragments thereof. Preferred embodiments 
of CA genes are genes which are differentially expressed in cancer cells, preferably 
lymphatic, breast, prostate or epithelial cells, compared to other cells. Preferred 
embodiments of CA genes used in the methods herein include, but are not limited to the 
nucleic acids selected from Tables 1-112. The method further includes adding a drug 
candidate to the cell and determining the effect of the drug candidate on the expression of the 
CA gene. 

In one embodiment, the method of screening drug candidates includes comparing the level of 
expression in the absence of the drug candidate to the level of expression in the presence of 
the drug candidate. 

Also provided herein is a method of screening for a bioactive agent capable of binding to a 
CA protein (CAP), the method comprising combining the CAP and a candidate bioactive 
agent and determining the binding of the candidate agent to the CAP. 

Further provided herein is a method for screening for a bioactive agent capable of modulating 
the activity of a CAP. In one embodiment, the method comprises combining the CAP and a 
candidate bioactive agent, and determining the effect of the candidate agent on the bioactivity 
of the CAP. 

Also provided is a method of evaluating the effect of a candidate carcinoma drug comprising 
administering the drug to a patient and removing a cell sample from the patient. The 
expression profile of the cell is then determined. This method may further comprise 
comparing the expression profile of the patient to an expression profile of a healthy individual. 

In a further aspect, a method for inhibiting the activity of a CA protein is provided. In one 
embodiment, the method comprises administering to a patient an inhibitor of a CA protein 
preferably selected from the group consisting of the sequences outlined in Tables 1-112 or 
their complements. 

A method of neutralizing the effect of a CA protein, preferably a protein encoded by a nucleic 
acid selected from the group of sequences outlined in Tables 1-112, is also provided. 
Preferably, the method comprises contacting an agent specific for said protein with said 
protein in an amount sufficient to effect neutralization. 

Moreover, provided herein is a biochip comprising a nucleic acid segment which encodes a 
CA protein, preferably selected from the sequences outlined in Tables 1-112. 

Also provided herein is a method for diagnosing or determining the propensity to carcinomas, 
especially lymphoma or leukemia, by sequencing at least one carcinoma or lymphoma gene 
of an individual. In yet another aspect of the invention, a method is provided for determining 
carcinoma including lymphoma and leukemia gene copy number in an individual. 

Novel sequences are also provided herein. Other aspects of the invention will become 
apparent to the skilled artisan by the following description of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to a number of sequences associated with carcinomas, 
especially lymphoma, breast cancer or prostate cancer. The relatively tight linkage between 
clonally-integrated proviruses and protooncogenes forms the basis of "provirus tagging", in which 
slow-transforming retroviruses that act by an insertion mutation mechanism are used to isolate 
protooncogenes. In some models, uninfected animals have low cancer rates, and infected 
animals have high cancer rates. It is known that many of the retroviruses involved do not 
carry transduced host protooncogenes or pathogenic trans-acting viral genes, and thus the 
cancer incidence must therefore be a direct consequence of proviral integration effects into 
host protooncogenes. Since proviral integration is random, rare integrants will "activate" host 
protooncogenes that provide a selective growth advantage, and these rare events result in 
new proviruses at clonal stoichiometries in tumors. 

The use of oncogenic retroviruses, whose sequences insert into the genome of the host 
organism resulting in carcinoma, allows the identification of host sequences involved in 
carcinoma. These sequences may then be used in a number of different ways, including 
diagnosis, prognosis, screening for modulators (including both agonists and antagonists), 
antibody generation (for immunotherapy and imaging), etc. However, as will be appreciated 
by those in the art, oncogenes that are identified in one type of cancer such as lymphoma or 
leukemia have a strong likelihood of being involved in other types of cancers as well. Thus, 
while the sequences outlined herein are initially identified as correlated with lymphoma, they 
can also be found in other types of cancers, as outlined below. 

Accordingly, the present invention provides nucleic acid and protein sequences that are 
associated with carcinoma, herein termed "carcinoma associated" or "CA" sequences. In a 
preferred embodiment, the present invention provides nucleic acid and protein sequences 
that are associated with carcinomas which originate in lymphatic tissue, herein termed 
"lymphoma associated", "leukemia associated" or "LA" sequences. 

Suitable cancers which can be diagnosed or screened for using the methods of the present 
invention include cancers classified by site or by histological type. Cancers classified by site 
include cancer of the oral cavity and pharynx (lip, tongue, salivary gland, floor of mouth, gum 
and other mouth, nasopharynx, tonsil, oropharynx, hypopharynx, other oral/pharynx); cancers 
of the digestive system (esophagus; stomach; small intestine; colon and rectum; anus, anal 
canal, and anorectum; liver, intrahepatic bile duct; gallbladder; other biliary; pancreas; 
retroperitoneum; peritoneum, omentum, and mesentery; other digestive); cancers of the 
respiratory system (nasal cavity, middle ear, and sinuses; larynx; lung and bronchus; pleura; 
trachea, mediastinum, and other respiratory); mesothelioma; cancers of the bones and joints 
and of soft tissue, including heart; skin cancers, including melanomas and other non-epithelial 
skin cancers; Kaposi's sarcoma and breast cancer; cancer of the female genital system 
(cervix uteri; corpus uteri; uterus, nos; ovary; vagina; vulva; and other female genital); cancers 
of the male genital system (prostate gland; testis; penis; and other male genital); cancers of 
the urinary system (urinary bladder, kidney and renal pelvis; ureter; and other urinary); 
cancers of the eye and orbit; cancers of the brain and nervous system (brain; and other 
nervous system); cancers of the endocrine system (thyroid gland and other endocrine, 
including thymus); lymphomas (Hodgkin's disease and non-Hodgkin's 
lymphoma), multiple myeloma, and leukemias (lymphocytic leukemia; myeloid leukemia; 
monocytic leukemia; and other leukemias). 

Other cancers, classified by histological type, that may be associated with the sequences of 
the invention include, but are not limited to, Neoplasm, malignant; Carcinoma, NOS; 
Carcinoma, undifferentiated, NOS; Giant and spindle cell carcinoma; Small cell carcinoma, 
NOS; Papillary carcinoma, NOS; Squamous cell carcinoma, NOS; Lymphoepithelial 
carcinoma; Basal cell carcinoma, NOS; Pilomatrix carcinoma; Transitional cell carcinoma, 
NOS; Papillary transitional cell carcinoma; Adenocarcinoma, NOS; Gastrinoma, malignant; 
Cholangiocarcinoma; Hepatocellular carcinoma, NOS; Combined hepatocellular carcinoma 
and cholangiocarcinoma; Trabecular adenocarcinoma; Adenoid cystic carcinoma; 
Adenocarcinoma in adenomatous polyp; Adenocarcinoma, familial polyposis coli; Solid 
carcinoma, NOS; Carcinoid tumor, malignant; Bronchiolo-alveolar adenocarcinoma; Papillary 
adenocarcinoma, NOS; Chromophobe carcinoma; Acidophil carcinoma; Oxyphilic 
adenocarcinoma; Basophil carcinoma; Clear cell adenocarcinoma, NOS; Granular cell 
carcinoma; Follicular adenocarcinoma, NOS; Papillary and follicular adenocarcinoma; 
Nonencapsulating sclerosing carcinoma; Adrenal cortical carcinoma; Endometrioid carcinoma; 
Skin appendage carcinoma; Apocrine adenocarcinoma; Sebaceous adenocarcinoma; 
Ceruminous adenocarcinoma; Mucoepidermoid carcinoma; Cystadenocarcinoma, NOS; 
Papillary cystadenocarcinoma, NOS; Papillary serous cystadenocarcinoma; Mucinous 
cystadenocarcinoma, NOS; Mucinous adenocarcinoma; Signet ring cell carcinoma; Infiltrating 
duct carcinoma; Medullary carcinoma, NOS; Lobular carcinoma; Inflammatory carcinoma; 
Paget's disease, mammary; Acinar cell carcinoma; Adenosquamous carcinoma; 
Adenocarcinoma with squamous metaplasia; Thymoma, malignant; Ovarian stromal tumor, 
malignant; Thecoma, malignant; Granulosa cell tumor, malignant; Androblastoma, malignant; 
Sertoli cell carcinoma; Leydig cell tumor, malignant; Lipid cell tumor, malignant; 
Paraganglioma, malignant; Extra-mammary paraganglioma, malignant; Pheochromocytoma; 
Glomangiosarcoma; Malignant melanoma, NOS; Amelanotic melanoma; Superficial 
spreading melanoma; Malignant melanoma in giant pigmented nevus; Epithelioid cell melanoma; 
Blue nevus, malignant; Sarcoma, NOS; Fibrosarcoma, NOS; Fibrous histiocytoma, malignant; 
Myxosarcoma; Liposarcoma, NOS; Leiomyosarcoma, NOS; Rhabdomyosarcoma, NOS; 
Embryonal rhabdomyosarcoma; Alveolar rhabdomyosarcoma; Stromal sarcoma, NOS; Mixed 
tumor, malignant, NOS; Mullerian mixed tumor; Nephroblastoma; Hepatoblastoma; 
Carcinosarcoma, NOS; Mesenchymoma, malignant; Brenner tumor, malignant; Phyllodes 
tumor, malignant; Synovial sarcoma, NOS; Mesothelioma, malignant; Dysgerminoma; 
Embryonal carcinoma, NOS; Teratoma, malignant, NOS; Struma ovarii, malignant; 
Choriocarcinoma; Mesonephroma, malignant; Hemangiosarcoma; Hemangioendothelioma, 
malignant; Kaposi's sarcoma; Hemangiopericytoma, malignant; Lymphangiosarcoma; 
Osteosarcoma, NOS; Juxtacortical osteosarcoma; Chondrosarcoma, NOS; 
Chondroblastoma, malignant; Mesenchymal chondrosarcoma; Giant cell tumor of bone; 
Ewing's sarcoma; Odontogenic tumor, malignant; Ameloblastic odontosarcoma; 
Ameloblastoma, malignant; Ameloblastic fibrosarcoma; Pinealoma, malignant; Chordoma; 
Glioma, malignant; Ependymoma, NOS; Astrocytoma, NOS; Protoplasmic astrocytoma; 
Fibrillary astrocytoma; Astroblastoma; Glioblastoma, NOS; Oligodendroglioma, NOS; 
Oligodendroblastoma; Primitive neuroectodermal; Cerebellar sarcoma, NOS; 
Ganglioneuroblastoma; Neuroblastoma, NOS; Retinoblastoma, NOS; Olfactory neurogenic 
tumor; Meningioma, malignant; Neurofibrosarcoma; Neurilemmoma, malignant; Granular cell 
tumor, malignant; Malignant lymphoma, NOS; Hodgkin's disease, NOS; Hodgkin's 
paragranuloma, NOS; Malignant lymphoma, small lymphocytic; Malignant lymphoma, large 
cell, diffuse; Malignant lymphoma, follicular, NOS; Mycosis fungoides; Other specified 
non-Hodgkin's lymphomas; Malignant histiocytosis; Multiple myeloma; Mast cell sarcoma; 
Immunoproliferative small intestinal disease; Leukemia, NOS; Lymphoid leukemia, NOS; 
Plasma cell leukemia; Erythroleukemia; Lymphosarcoma cell leukemia; Myeloid leukemia, 
NOS; Basophilic leukemia; Eosinophilic leukemia; Monocytic leukemia, NOS; Mast cell 
leukemia; Megakaryoblastic leukemia; Myeloid sarcoma; and Hairy cell leukemia. 

In addition, the genes may be involved in other diseases, such as but not limited to diseases 
associated with aging or neurodegenerative diseases. 

Association in this context means that the nucleotide or protein sequences are either 
differentially expressed, activated, inactivated or altered in carcinomas as compared to 
normal tissue. As outlined below, CA sequences include those that are up-regulated (i.e. 
expressed at a higher level), as well as those that are down-regulated (i.e. expressed at a 
lower level), in carcinomas. CA sequences also include sequences which have been altered 
(i.e., truncated sequences or sequences with substitutions, deletions or insertions, including 
point mutations) and show either the same expression profile or an altered profile. In a 
preferred embodiment, the CA sequences are from humans; however, as will be appreciated 
by those in the art, CA sequences from other organisms may be useful in animal models of 
disease and drug evaluation; thus, other CA sequences are provided, from vertebrates, 
including mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, and farm 
animals (including sheep, goats, pigs, cows, horses, etc.). In some cases, prokaryotic CA 
sequences may be useful. CA sequences from other organisms may be obtained using the 
techniques outlined below. 

CA sequences can include both nucleic acid and amino acid sequences. In a preferred 
embodiment, the CA sequences are recombinant nucleic acids. By the term "recombinant 
nucleic acid" herein is meant nucleic acid, originally formed in vitro, in general, by the 
manipulation of nucleic acid by polymerases and endonucleases, in a form not normally found 
in nature. Thus an isolated nucleic acid, in a linear form, or an expression vector formed in 
vitro by ligating DNA molecules that are not normally joined, are both considered recombinant 
for the purposes of this invention. It is understood that once a recombinant nucleic acid is 
made and reintroduced into a host cell or organism, it will replicate non-recombinantly, i.e. 
using the in vivo cellular machinery of the host cell rather than in vitro manipulations; 
however, such nucleic acids, once produced recombinantly, although subsequently replicated 
non-recombinantly, are still considered recombinant for the purposes of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant techniques, i.e. 
through the expression of a recombinant nucleic acid as depicted above. A recombinant 
protein is distinguished from naturally occurring protein by at least one or more 
characteristics. For example, the protein may be isolated or purified away from some or all of 
the proteins and compounds with which it is normally associated in its wild type host, and thus 
may be substantially pure. For example, an isolated protein is unaccompanied by at least 
some of the material with which it is normally associated in its natural state, preferably 
constituting at least about 0.5%, more preferably at least about 5% by weight of the total 
protein in a given sample. A substantially pure protein comprises at least about 75% by 
weight of the total protein, with at least about 80% being preferred, and at least about 90% 
being particularly preferred. The definition includes the production of a CA protein from one 
organism in a different organism or host cell. Alternatively, the protein may be made at a 
significantly higher concentration than is normally seen, through the use of an inducible 
promoter or high expression promoter, such that the protein is made at increased 
concentration levels. Alternatively, the protein may be in a form not normally found in nature, 
as in the addition of an epitope tag or amino acid substitutions, insertions and deletions, as 
discussed below. 

In a preferred embodiment, the CA sequences are nucleic acids. As will be appreciated by 
those in the art and is more fully outlined below, CA sequences are useful in a variety of 
applications, including diagnostic applications, which will detect naturally occurring nucleic 
acids, as well as screening applications; for example, biochips comprising nucleic acid probes 
to the CA sequences can be generated. In the broadest sense, then, by "nucleic acid" or 
"oligonucleotide" or grammatical equivalents herein is meant at least two nucleotides 
covalently linked together. A nucleic acid of the present invention will generally contain 
phosphodiester bonds, although in some cases, as outlined below (for example in antisense 
applications or when a candidate agent is a nucleic acid), nucleic acid analogs may be used 
that have alternate backbones, comprising, for example, phosphoramidate (Beaucage et al., 
Tetrahedron 49(10):1925 (1993) and references therein; Letsinger, J. Org. Chem. 35:3800 
(1970); Sprinzl et al., Eur. J. Biochem. 81:579 (1977); Letsinger et al., Nucl. Acids Res. 
14:3487 (1986); Sawai et al., Chem. Lett. 805 (1984); Letsinger et al., J. Am. Chem. Soc. 
110:4470 (1988); and Pauwels et al., Chemica Scripta 26:141 (1986)), phosphorothioate 
(Mag et al., Nucleic Acids Res. 19:1437 (1991); and U.S. Patent No. 5,644,048), 
phosphorodithioate (Briu et al., J. Am. Chem. Soc. 111:2321 (1989)), 
O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical 
Approach, Oxford University Press), and peptide nucleic acid backbones and linkages (see 
Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl. 31:1008 
(1992); Nielsen, Nature, 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all of which 
are incorporated by reference). Other analog nucleic acids include those with positive 
backbones (Denpcy et al., Proc. Natl. Acad. Sci. USA 92:6097 (1995)); non-ionic backbones 
(U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; Kiedrowshi et 
al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am. Chem. Soc. 
110:4470 (1988); Letsinger et al., Nucleoside & Nucleotide 13:1597 (1994); Chapters 2 and 3, 
ASC Symposium Series 580, "Carbohydrate Modifications in Antisense Research", Ed. Y.S. 
Sanghui and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal Chem. Lett. 4:395 
(1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 37:743 (1996)) and 
non-ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 
5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate Modifications 
in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or 
more carbocyclic sugars are also included within one definition of nucleic acids (see Jenkins 
et al., Chem. Soc. Rev. (1995) pp. 169-176). Several nucleic acid analogs are described in 
Rawls, C & E News June 2, 1997 page 35. All of these references are hereby expressly 
incorporated by reference. These modifications of the ribose-phosphate backbone may be 
done for a variety of reasons, for example to increase the stability and half-life of such 
molecules in physiological environments for use in anti-sense applications or as probes on a 
biochip. 

As will be appreciated by those in the art, all of these nucleic acid analogs may find use in the 
present invention. In addition, mixtures of naturally occurring nucleic acids and analogs can 
be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally 
occurring nucleic acids and analogs may be made. 

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. As will be appreciated by 
those in the art, the depiction of a single strand "Watson" also defines the sequence of the 
other strand "Crick"; thus the sequences described herein also include the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, 
where the nucleic acid contains any combination of deoxyribo- and ribonucleotides, and any 
combination of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, 
hypoxanthine, isocytosine, isoguanine, etc. As used herein, the term "nucleoside" includes 
nucleotides and nucleoside and nucleotide analogs, and modified nucleosides such as amino 
modified nucleosides. In addition, "nucleoside" includes non-naturally occurring analog 
structures. Thus for example the individual units of a peptide nucleic acid, each containing a 
base, are referred to herein as a nucleoside. 
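
By way of illustration only, the statement that the depiction of one strand also defines the other strand can be sketched in a few lines of Python. The function name and the restriction to the unmodified DNA bases A, C, G, T (plus N for an unknown base) are assumptions made for this sketch and are not part of the disclosure; analogs and modified bases would need their own pairing rules.

```python
# Hypothetical helper: derive the "Crick" strand from a depicted "Watson" strand.
# Only unmodified DNA bases (and N) are handled; this is an assumption of the sketch.
_DNA_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return dna.translate(_DNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    watson = "ATGGCC"
    print(watson, "->", reverse_complement(watson))
```

Applying the function twice returns the original strand, which is a convenient sanity check when handling both strands of a sequence.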

A CA sequence can be initially identified by substantial nucleic acid and/or amino acid 
sequence homology to the CA sequences outlined herein. Such homology can be based 
upon the overall nucleic acid or amino acid sequence, and is generally determined as outlined 
below, using either homology programs or hybridization conditions. 

The CA sequences of the invention were initially identified as described herein; basically, 
infection of mice with murine leukemia viruses (MLV) resulted in lymphoma, although many 
of these sequences will also be involved in other cancers as is generally outlined herein. 

The CA sequences outlined herein comprise the insertion sites for the virus. In general, the 
retrovirus can cause carcinomas in three basic ways: first, by inserting upstream of a 
normally silent host gene and activating it (e.g. promoter insertion); second, by truncating a 
host gene in a way that leads to oncogenesis; or third, by enhancing the transcription of a 
neighboring gene. For example, retrovirus enhancers, including SL3-3, are known to act on 
genes located up to approximately 200 kilobases from the insertion site. 
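
As a minimal sketch of how the approximately 200 kilobase range might be applied in practice, the following Python fragment lists annotated genes lying within that distance of a mapped insertion site. The Gene record, the gene names and the coordinates are hypothetical and are used only to make the example self-contained; no particular annotation source is implied.

```python
from dataclasses import dataclass

# "Approximately 200 kilobases" from the text, expressed in base pairs.
WINDOW_BP = 200_000


@dataclass(frozen=True)
class Gene:
    """Hypothetical annotation record used only for this illustration."""
    name: str
    chrom: str
    start: int
    end: int


def distance_to_gene(pos: int, gene: Gene) -> int:
    """Distance in bp from an insertion position to the nearest edge of a gene."""
    if gene.start <= pos <= gene.end:
        return 0  # the insertion lies inside the gene
    return min(abs(pos - gene.start), abs(pos - gene.end))


def candidate_genes(chrom: str, pos: int, genes, window: int = WINDOW_BP):
    """Return (distance, name) pairs for genes within `window` bp, nearest first."""
    hits = [(distance_to_gene(pos, g), g.name) for g in genes if g.chrom == chrom]
    return sorted(h for h in hits if h[0] <= window)


if __name__ == "__main__":
    annotation = [
        Gene("geneA", "chr1", 100_000, 120_000),
        Gene("geneB", "chr1", 500_000, 510_000),
        Gene("geneC", "chr2", 100_000, 120_000),
    ]
    print(candidate_genes("chr1", 250_000, annotation))
```

Genes on other chromosomes are excluded outright, and an insertion within a gene is reported at distance zero, which corresponds to the truncation case described above.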

In a preferred embodiment, CA sequences are those that are up-regulated in carcinomas; that 
is, the expression of these genes is higher in carcinoma tissue as compared to normal tissue 
of the same differentiation stage. "Up-regulation" as used herein means at least about 50%, 
more preferably at least about 100%, more preferably at least about 150%, more preferably, 
at least about 200%, with from 300 to at least 1000% being especially preferred. 
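
The percentage tiers recited above can be illustrated with a short Python sketch. It assumes, for illustration only, that the percentage refers to the increase in expression in carcinoma tissue relative to normal tissue, i.e. (tumor - normal) / normal x 100; the function names and the example expression values are hypothetical.

```python
# Tiers recited in the text, in percent. How the percentage is computed
# (increase relative to normal tissue) is an assumption of this sketch.
UP_REGULATION_TIERS = (50.0, 100.0, 150.0, 200.0, 300.0, 1000.0)


def percent_increase(tumor: float, normal: float) -> float:
    """Percent increase of tumor expression over normal expression."""
    if normal <= 0:
        raise ValueError("normal expression must be positive")
    return (tumor - normal) / normal * 100.0


def highest_tier_met(tumor: float, normal: float):
    """Return the highest recited tier reached, or None if below the lowest tier."""
    pct = percent_increase(tumor, normal)
    met = [tier for tier in UP_REGULATION_TIERS if pct >= tier]
    return max(met) if met else None


if __name__ == "__main__":
    # A threefold expression level corresponds to a 200% increase.
    print(percent_increase(3.0, 1.0), highest_tier_met(3.0, 1.0))
```

Under this reading, a twofold expression level is a 100% increase and a threefold level is a 200% increase.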

In a preferred embodiment, CA sequences are those that are down-regulated in carcinomas; 
that is, the expression of these genes is lower in carcinoma tissue as compared to normal 
tissue of the same differentiation stage. "Down-regulation" as used herein means at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
preferably, at least about 200%, with from 300 to at least 1000% being especially preferred. 

In a preferred embodiment, CA sequences are those that are altered but show either the same 
expression profile or an altered profile as compared to normal lymphoid tissue of the same 
differentiation stage. "Altered CA sequences" as used herein refers to sequences which are 
truncated, contain insertions or contain point mutations. 

CA proteins of the present invention may be classified as secreted proteins, transmembrane 
proteins or intracellular proteins. 

In a preferred embodiment the CA protein is an intracellular protein. Intracellular proteins 
may be found in the cytoplasm and/or in the nucleus. Intracellular proteins are involved in all 
aspects of cellular function and replication (including, for example, signaling pathways); 
aberrant expression of such proteins results in unregulated or dysregulated cellular processes. 
For example, many intracellular proteins have enzymatic activity such as protein kinase 
activity, protein phosphatase activity, protease activity, nucleotide cyclase activity, 
polymerase activity and the like. Intracellular proteins also serve as docking proteins that are 
involved in organizing complexes of proteins, or targeting proteins to various subcellular 
localizations, and are involved in maintaining the structural integrity of organelles. 

An increasingly appreciated concept in characterizing intracellular proteins is the presence in 
the proteins of one or more motifs to which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich targets. 
In addition, PH domains, tetratricopeptide repeats and WD domains, to name only a few, have 
been shown to mediate protein-protein interactions. Some of these may also be involved in 
binding to phospholipids or other second messengers. As will be appreciated by one of 
ordinary skill in the art, these motifs can be identified on the basis of primary sequence; thus, 
an analysis of the sequence of proteins may provide insight into both the enzymatic potential 
of the molecule and/or molecules with which the protein may associate. 

In a preferred embodiment, the CA sequences are transmembrane proteins. Transmembrane 
proteins are molecules that span the phospholipid bilayer of a cell. They may have an 
intracellular domain, an extracellular domain, or both. The intracellular domains of such 
proteins may have a number of functions including those already described for intracellular 
proteins. For example, the intracellular domain may have enzymatic activity and/or may 
serve as a binding site for additional proteins. Frequently the intracellular domain of 
transmembrane proteins serves both roles. For example, certain receptor tyrosine kinases 
have both protein kinase activity and SH2 domains. In addition, autophosphorylation of 
tyrosines on the receptor molecule itself creates binding sites for additional SH2 
domain-containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. For 
example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases 
and receptor serine/threonine protein kinases contain a single transmembrane domain. 
However, various other proteins including channels and adenylyl cyclases contain numerous 
transmembrane domains. Many important cell surface receptors are classified as "seven 
transmembrane domain" proteins, as they contain 7 membrane spanning regions. Important 
transmembrane protein receptors include, but are not limited to, insulin receptor, insulin-like 
growth factor receptor, human growth hormone receptor, glucose transporters, transferrin 
receptor, epidermal growth factor receptor, low density lipoprotein receptor, leptin receptor, 
and interleukin receptors, e.g. IL-1 receptor, IL-2 receptor, etc. 

Characteristics of transmembrane domains include approximately 20 consecutive 
hydrophobic amino acids that may be followed by charged amino acids. Therefore, upon 
analysis of the amino acid sequence of a particular protein, the localization and number of 
transmembrane domains within the protein may be predicted. 
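
The prediction described above can be sketched, in deliberately simplified form, as a scan for runs of about 20 consecutive hydrophobic residues. The set of residues treated as hydrophobic, the minimum run length and the function name are assumptions of this sketch; established prediction methods typically use sliding-window hydropathy scores rather than an all-or-nothing residue set.

```python
import re

# Residues treated as hydrophobic for this illustration (an assumption; other
# classifications exist and would give different results).
HYDROPHOBIC = frozenset("ACFILMVW")


def hydrophobic_stretches(protein: str, min_len: int = 20):
    """Return (start, end) index pairs of runs of >= min_len hydrophobic residues.

    Indices are 0-based and `end` is exclusive, as with Python slices.
    """
    pattern = re.compile("[" + "".join(sorted(HYDROPHOBIC)) + "]{" + str(min_len) + ",}")
    return [(m.start(), m.end()) for m in pattern.finditer(protein.upper())]


if __name__ == "__main__":
    # Made-up sequence: a 20-residue hydrophobic run flanked by charged residues.
    toy = "MKT" + "L" * 10 + "V" * 10 + "KRRS"
    print(hydrophobic_stretches(toy))
```

The number of runs returned gives a rough estimate of the number of candidate transmembrane domains, and the indices give their approximate location within the protein.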

The extracellular domains of transmembrane proteins are diverse; however, conserved motifs 
are found repeatedly among various extracellular domains. Conserved structure and/or 
functions have been ascribed to different extracellular motifs. For example, cytokine 
receptors are characterized by a cluster of cysteines and a WSXWS (W = tryptophan, S = 
serine, X = any amino acid) motif. Immunoglobulin-like domains are highly conserved. 
Mucin-like domains may be involved in cell adhesion and leucine-rich repeats participate in 
protein-protein interactions. 
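
Because the WSXWS motif is defined purely by primary sequence, its presence can be checked with a simple pattern search. The following Python sketch is illustrative only; the function name and the example sequences are hypothetical, and the presence of the motif alone would not establish that a protein is a cytokine receptor.

```python
import re

# W-S-X-W-S, where X is any single amino acid (one-letter code).
WSXWS = re.compile(r"WS[A-Z]WS")


def find_wsxws(protein: str):
    """Return (0-based start index, matched text) for each non-overlapping WSXWS motif."""
    return [(m.start(), m.group()) for m in WSXWS.finditer(protein.upper())]


if __name__ == "__main__":
    # Made-up sequence containing one motif.
    print(find_wsxws("AAWSEWSGG"))
```

The search reports non-overlapping occurrences only, which is sufficient for flagging candidate sequences for closer inspection.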

Many extracellular domains are involved in binding to other molecules. In one aspect, 
extracellular domains are receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions.
Cell-associated ligands can be tethered to the cell, for example via a glycosylphosphatidylinositol
(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains also
associate with the extracellular matrix and contribute to the maintenance of the cell structure.

CA proteins that are transmembrane are particularly preferred in the present invention as they
are good targets for immunotherapeutics, as are described herein. In addition, as outlined
below, transmembrane proteins can also be useful in imaging modalities.

It will also be appreciated by those in the art that a transmembrane protein can be made
soluble by removing transmembrane sequences, for example through recombinant methods.
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In a preferred embodiment, the CA proteins are secreted proteins, the secretion of which can
be either constitutive or regulated. These proteins have a signal peptide or signal sequence
that targets the molecule to the secretory pathway. Secreted proteins are involved in
numerous physiological events; by virtue of their circulating nature, they serve to transmit
signals to various other cell types. The secreted protein may function in an autocrine manner
(acting on the cell that secreted the factor), a paracrine manner (acting on cells in close
proximity to the cell that secreted the factor) or an endocrine manner (acting on cells at a 
distance). Thus secreted molecules find use in modulating or altering numerous aspects of 
physiology. CA proteins that are secreted proteins are particularly preferred in the present 
invention as they serve as good targets for diagnostic markers, for example for blood tests. 

A CA sequence is initially identified by substantial nucleic acid and/or amino acid sequence
homology to the CA sequences outlined herein. Such homology can be based upon the 
overall nucleic acid or amino acid sequence, and is generally determined as outlined below, 
using either homology programs or hybridization conditions. 

As used herein, a nucleic acid is a "CA nucleic acid" if the overall homology of the nucleic 
acid sequence to one of the nucleic acids of Tables 1-112 is preferably greater than about 
75%, more preferably greater than about 80%, even more preferably greater than about 85% 
and most preferably greater than 90%. In some embodiments the homology will be as high
as about 93 to 95 or 98%. In a preferred embodiment, the sequences which are used to
determine sequence identity or similarity are selected from those of the nucleic acids of 
Tables 1-112. In another embodiment, the sequences are naturally occurring allelic variants 
of the sequences of the nucleic acids of Tables 1-112. In another embodiment, the 
sequences are sequence variants as further described herein. 

Homology in this context means sequence similarity or identity, with identity being preferred. 
A preferred comparison for homology purposes is to compare the sequence containing 
sequencing errors to the correct sequence. This homology will be determined using standard 
techniques known in the art, including, but not limited to, the local homology algorithm of
Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of
Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of
Pearson & Lipman, PNAS USA 85:2444 (1988), by computerized implementations of these
algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software
Package, Genetics Computer Group, 575 Science Drive, Madison, WI), the Best Fit sequence
program described by Devereux et al., Nucl. Acid Res. 12:387-395 (1984), preferably using
the default settings, or by inspection.
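
To make the idea of a global homology alignment concrete, the following is a minimal sketch of Needleman-Wunsch style dynamic programming that returns only the optimal alignment score. The scoring values (match +1, mismatch -1, linear gap -1) and the function name are illustrative assumptions; they are not the defaults of GAP, BESTFIT, or any other program named above.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Return the optimal global alignment score of sequences a and b."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap inserted in b
            left = curr[j - 1] + gap  # gap inserted in a
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]
```

Keeping only two rows of the matrix is enough when just the score is needed; recovering the alignment itself would require storing the full matrix or traceback pointers.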

One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence
alignment from a group of related sequences using progressive, pairwise alignments. It can
also plot a tree showing the clustering relationships used to create the alignment. PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol.
35:351-360 (1987); the method is similar to that described by Higgins & Sharp, CABIOS
5:151-153 (1989). Useful PILEUP parameters include a default gap weight of 3.00, a
default gap length weight of 0.10, and weighted end gaps.

Another example of a useful algorithm is the BLAST algorithm, described in Altschul et al., J.
Mol. Biol. 215:403-410 (1990) and Karlin et al., PNAS USA 90:5873-5877 (1993). A
particularly useful BLAST program is the WU-BLAST-2 program, which was obtained from
Altschul et al., Methods in Enzymology 266:460-480 (1996); http://blast.wustlj. WU-BLAST-2
uses several search parameters, most of which are set to the default values. The
adjustable parameters are set with the following values: overlap span = 1, overlap fraction =
0.125, word threshold (T) = 11. The HSP S and HSP S2 parameters are dynamic values and
are established by the program itself depending upon the composition of the particular
sequence and composition of the particular database against which the sequence of interest
is being searched; however, the values may be adjusted to increase sensitivity. A % amino
acid sequence identity value is determined by the number of matching identical residues
divided by the total number of residues of the "longer" sequence in the aligned region. The
"longer" sequence is the one having the most actual residues in the aligned region (gaps
introduced by WU-BLAST-2 to maximize the alignment score are ignored).
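
The identity calculation just described can be written out as a short sketch. It assumes the aligned region is supplied as two equal-length strings in which "-" marks a gap; the function name and that input convention are illustrative and are not part of WU-BLAST-2.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over an aligned region, relative to the 'longer' sequence.

    Matching identical residues are divided by the number of actual residues
    (gaps ignored) in whichever sequence has more residues in the region.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    residues_a = sum(1 for x in aligned_a if x != gap)
    residues_b = sum(1 for y in aligned_b if y != gap)
    longer = max(residues_a, residues_b)
    return 100.0 * matches / longer if longer else 0.0
```

For example, for the aligned pair `ACGT-A` / `ACCTGA` there are 4 identical positions and the longer sequence has 6 residues, so the identity is 4/6, or roughly 66.7%.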

Thus, "percent (%) nucleic acid sequence identity" is defined as the percentage of nucleotide 
residues in a candidate sequence that are identical with the nucleotide residues of the nucleic 
acids of Tables 1-112. A preferred method utilizes the BLASTN module of WU-BLAST-2 set
to the default parameters, with overlap span and overlap fraction set to 1 and 0.125,
respectively.

The alignment may include the introduction of gaps in the sequences to be aligned. In 
addition, for sequences which contain either more or fewer nucleotides than those of the 
nucleic acids of Tables 1-112, it is understood that the percentage of homology will be
determined based on the number of homologous nucleosides in relation to the total number of
nucleosides. Thus, for example, homology of sequences shorter than those of the sequences 
identified herein and as discussed below, will be determined using the number of nucleosides 
in the shorter sequence. 

In one embodiment, the nucleic acid homology is determined through hybridization studies. 
Thus, for example, nucleic acids which hybridize under high stringency to the nucleic acids 
identified in the figures, or their complements, are considered CA sequences. High 
stringency conditions are known in the art; see for example Maniatis et al., Molecular
Cloning: A Laboratory Manual, 2d Edition, 1989, and Short Protocols in Molecular Biology, ed.
Ausubel, et al., both of which are hereby incorporated by reference. Stringent conditions are
sequence-dependent and will be different in different circumstances. Longer sequences
hybridize specifically at higher temperatures. An extensive guide to the hybridization of
nucleic acids is found in Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization
with Nucleic Acid Probes, "Overview of principles of hybridization and the
strategy of nucleic acid assays" (1993). Generally, stringent conditions are selected to be
about 5-10°C lower than the thermal melting point (Tm) for the specific sequence at a defined
ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic
acid concentration) at which 50% of the probes complementary to the target hybridize to the
target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of
the probes are occupied at equilibrium). Stringent conditions will be those in which the salt
concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion
concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for
short probes (e.g. 10 to 50 nucleotides) and at least about 60°C for long probes (e.g. greater
than 50 nucleotides). Stringent conditions may also be achieved with the addition of
destabilizing agents such as formamide.
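
The numeric guidelines above can be collected into a small check, shown here only as a sketch. The function names are hypothetical, the "about" qualifiers in the text are treated as hard cutoffs for simplicity, and the effect of formamide or other destabilizing agents is not modeled.

```python
def is_stringent(sodium_molar, ph, temp_c, probe_len):
    """Check a hybridization condition against the stated stringency guidelines.

    Salt below 1.0 M sodium ion, pH 7.0 to 8.3, and a temperature of at least
    30 C for short probes (up to 50 nt) or at least 60 C for longer probes.
    """
    min_temp = 30.0 if probe_len <= 50 else 60.0
    return sodium_molar < 1.0 and 7.0 <= ph <= 8.3 and temp_c >= min_temp


def hybridization_window(tm_c):
    """Return the (low, high) temperature range lying 5-10 C below a given Tm."""
    return (tm_c - 10.0, tm_c - 5.0)
```

For a probe with a measured Tm of 70°C, `hybridization_window(70.0)` gives a range of 60°C to 65°C; the Tm itself must come from measurement or a separate estimate, since the text does not give a formula for it.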

In another embodiment, less stringent hybridization conditions are used; for example, 
moderate or low stringency conditions may be used, as are known in the art; see Maniatis 
and Ausubel, supra, and Tijssen, supra. 

In addition, the CA nucleic acid sequences of the invention are fragments of larger genes, i.e. 
they are nucleic acid segments. Alternatively, the CA nucleic acid sequences can serve as 
indicators of oncogene position, for example, the CA sequence may be an enhancer that 
activates a protooncogene. "Genes" in this context includes coding regions, non-coding
regions, and mixtures of coding and non-coding regions. Accordingly, as will be appreciated
by those in the art, using the sequences provided herein, additional sequences of the CA
genes can be obtained, using techniques well known in the art for cloning either longer
sequences or the full length sequences; see Maniatis et al., and Ausubel, et al., supra, hereby
expressly incorporated by reference. In general, this is done using PCR, for example, kinetic
PCR.

Once the CA nucleic acid is identified, it can be cloned and, if necessary, its constituent parts 
recombined to form the entire CA nucleic acid. Once isolated from its natural source, e.g.,
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid
segment, the recombinant CA nucleic acid can be further used as a probe to identify and 
isolate other CA nucleic acids, for example additional coding regions. It can also be used as 
a "precursor" nucleic acid to make modified or variant CA nucleic acids and proteins. 

The CA nucleic acids of the present invention are used in several ways. In a first
embodiment, nucleic acid probes to the CA nucleic acids are made and attached to biochips
to be used in screening and diagnostic methods, as outlined below, or for administration, for
example for gene therapy and/or antisense applications. Alternatively, the CA nucleic acids
that include coding regions of CA proteins can be put into expression vectors for the
expression of CA proteins, again either for screening purposes or for administration to a
patient.

In a preferred embodiment, nucleic acid probes to CA nucleic acids (both the nucleic acid
sequences outlined in the figures and/or the complements thereof) are made. The nucleic
acid probes attached to the biochip are designed to be substantially complementary to the CA
nucleic acids, i.e. the target sequence (either the target sequence of the sample or to other
probe sequences, for example in sandwich assays), such that hybridization of the target
sequence and the probes of the present invention occurs. As outlined below, this
complementarity need not be perfect; there may be any number of base pair mismatches
which will interfere with hybridization between the target sequence and the single stranded
nucleic acids of the present invention. However, if the number of mutations is so great that no
hybridization can occur under even the least stringent of hybridization conditions, the
sequence is not a complementary target sequence. Thus, by "substantially complementary"
herein is meant that the probes are sufficiently complementary to the target sequences to
hybridize under normal reaction conditions, particularly high stringency conditions, as outlined
herein.

A nucleic acid probe is generally single stranded but can be partially single and partially 
double stranded. The strandedness of the probe is dictated by the structure, composition,
and properties of the target sequence. In general, the nucleic acid probes range from about 8
to about 100 bases long, with from about 10 to about 80 bases being preferred, and from 
about 30 to about 50 bases being particularly preferred. That is, generally whole genes are 
not used. In some embodiments, much longer nucleic acids can be used, up to hundreds of 
bases.

In a preferred embodiment, more than one probe per sequence is used, with either
overlapping probes or probes to different sections of the target being used. That is, two,
three, four or more probes, with three being preferred, are used to build in a redundancy for a
particular target. The probes can be overlapping (i.e. have some sequence in common), or
separate.
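
One way to picture the redundancy described above is to tile a target with several evenly spaced probes, each the reverse complement of a window of the target. The sketch below is purely illustrative: the function names, the default length of 40 bases (within the 30 to 50 base range given above), and the even spacing rule are assumptions, not a design procedure taken from the text.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def tile_probes(target, probe_len=40, n_probes=3):
    """Design n_probes probes complementary to evenly spaced windows of target.

    Windows overlap whenever the target is shorter than n_probes * probe_len.
    """
    target = target.upper()
    if len(target) < probe_len:
        raise ValueError("target is shorter than the probe length")
    last_start = len(target) - probe_len
    if n_probes == 1:
        starts = [0]
    else:
        starts = [round(k * last_start / (n_probes - 1)) for k in range(n_probes)]
    return [reverse_complement(target[s:s + probe_len]) for s in starts]
```

With a 100-base target and the defaults, the windows start at positions 0, 30 and 60, so neighboring probes share 10 bases of sequence.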

As will be appreciated by those in the art, nucleic acids can be attached or immobilized to a
solid support in a wide variety of ways. By "immobilized" and grammatical equivalents herein
is meant the association or binding between the nucleic acid probe and the solid support is
sufficient to be stable under the conditions of binding, washing, analysis, and removal as
outlined below. The binding can be covalent or non-covalent. By "non-covalent binding" and
grammatical equivalents herein is meant one or more of either electrostatic, hydrophilic, and
hydrophobic interactions. Included in non-covalent binding is the covalent attachment of a
molecule, such as streptavidin, to the support and the non-covalent binding of the biotinylated
probe to the streptavidin. By "covalent binding" and grammatical equivalents herein is meant
that the two moieties, the solid support and the probe, are attached by at least one bond,
including sigma bonds, pi bonds and coordination bonds. Covalent bonds can be formed
directly between the probe and the solid support or can be formed by a cross linker or by
inclusion of a specific reactive group on either the solid support or the probe or both
molecules. Immobilization may also involve a combination of covalent and non-covalent
interactions.

In general, the probes are attached to the biochip in a wide variety of ways, as will be
appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or other
grammatical equivalents herein is meant any material that can be modified to contain discrete
individual sites appropriate for the attachment or association of the nucleic acid probes and is
amenable to at least one detection method. As will be appreciated by those in the art, the
number of possible substrates is very large; they include, but are not limited to, glass and
modified or functionalized glass, plastics (including acrylics, polystyrene and copolymers of
styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethanes,
Teflon™, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica-based
materials including silicon and modified silicon, carbon, metals, inorganic glasses, etc. In
general, the substrates allow optical detection and do not appreciably fluoresce.

In a preferred embodiment, the surface of the biochip and the probe may be derivatized with
chemical functional groups for subsequent attachment of the two. Thus, for example, the
biochip is derivatized with a chemical functional group including, but not limited to, amino
groups, carboxy groups, oxo groups and thiol groups, with amino groups being particularly
preferred. Using these functional groups, the probes can be attached using functional groups
on the probes. For example, nucleic acids containing amino groups can be attached to
surfaces comprising amino groups, for example using linkers as are known in the art; for
example, homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical
Company catalog, technical section on cross-linkers, pages 155-200, incorporated herein by
reference). In addition, in some cases, additional linkers, such as alkyl groups (including
substituted and heteroalkyl groups) may be used.

In this embodiment, the oligonucleotides are synthesized as is known in the art, and then 
attached to the surface of the solid support. As will be appreciated by those skilled in the art, 
either the 5' or 3' terminus may be attached to the solid support, or attachment may be via an
internal nucleoside. 

In an additional embodiment, the immobilization to the solid support may be very strong, yet 
non-covalent. For example, biotinylated oligonucleotides can be made, which bind to 
surfaces covalently coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is known in the art. 
For example, photoactivation techniques utilizing photopolymerization compounds and 
techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in
situ, using well known photolithographic techniques, such as those described in WO
95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited
within, all of which are expressly incorporated by reference; these methods of attachment 
form the basis of the Affymetrix GeneChip technology. 

In addition to the solid-phase technology represented by biochip arrays, gene expression can
also be quantified using liquid-phase arrays. One such system is kinetic polymerase chain
reaction (PCR). Kinetic PCR allows for the simultaneous amplification and quantification of
specific nucleic acid sequences. The specificity is derived from synthetic oligonucleotide
primers designed to preferentially adhere to single-stranded nucleic acid sequences
bracketing the target site. This pair of oligonucleotide primers forms specific, non-covalently
bound complexes on each strand of the target sequence. These complexes facilitate in vitro
synthesis of double-stranded DNA in opposite orientations. Temperature cycling of the
reaction mixture creates a continuous cycle of primer binding, primer extension, and re-melting of
the nucleic acid to individual strands. The result is an exponential increase of the target
dsDNA product. This product can be quantified in real time either through the use of an
intercalating dye or a sequence specific probe. SYBR® Green I is an example of an
intercalating dye that preferentially binds to dsDNA, resulting in a concomitant increase in the
fluorescent signal. Sequence specific probes, such as used with TaqMan® technology,
consist of a fluorochrome and a quenching molecule covalently bound to opposite ends of an
oligonucleotide. The probe is designed to selectively bind the target DNA sequence between
the two primers. When the DNA strands are synthesized during the PCR reaction, the
fluorochrome is cleaved from the probe by the exonuclease activity of the polymerase,
resulting in signal dequenching. The probe signaling method can be more specific than the
intercalating dye method, but in each case, signal strength is proportional to the dsDNA
product produced. Each type of quantification method can be used in multi-well liquid phase
arrays with each well representing primers and/or probes specific to nucleic acid sequences
of interest. When used with messenger RNA preparations of tissues or cell lines, an
array of probe/primer reactions can simultaneously quantify the expression of multiple gene
products of interest. See Germer, S., et al., Genome Res. 10:258-266 (2000); Heid, C. A., et
al., Genome Res. 6:986-994 (1996).
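
The exponential increase described above, and its use for quantification, can be illustrated with a simple idealized model. The model is an assumption made for this sketch: every cycle multiplies the product by (1 + efficiency), with efficiency equal to 1.0 for perfect doubling. The notion of a threshold cycle and all function names are introduced here for illustration and are not terms defined in the text.

```python
import math


def amplicon_copies(initial_copies, cycles, efficiency=1.0):
    """Copies of dsDNA product after a number of cycles under ideal amplification."""
    return initial_copies * (1.0 + efficiency) ** cycles


def threshold_cycle(initial_copies, threshold_copies, efficiency=1.0):
    """Fractional cycle at which the product reaches a detection threshold."""
    return math.log(threshold_copies / initial_copies) / math.log(1.0 + efficiency)


def fold_difference(ct_a, ct_b, efficiency=1.0):
    """Starting-template ratio of sample A to sample B from their threshold cycles."""
    return (1.0 + efficiency) ** (ct_b - ct_a)
```

Under this model a sample that crosses the threshold three cycles earlier than another started with about eight times as much template. Real reactions plateau and rarely reach perfect efficiency, so the figures are approximate.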

In a preferred embodiment, CA nucleic acids encoding CA proteins are used to make a
variety of expression vectors to express CA proteins which can then be used in screening
assays, as described below. The expression vectors may be either self-replicating
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these
expression vectors include transcriptional and translational regulatory nucleic acid operably
linked to the nucleic acid encoding the CA protein. The term "control sequences" refers to
DNA sequences necessary for the expression of an operably linked coding sequence in a
particular host organism. The control sequences that are suitable for prokaryotes, for
example, include a promoter, optionally an operator sequence, and a ribosome binding site.
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with another
nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably
linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the
secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence
if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a
coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked"
means that the DNA sequences being linked are contiguous, and, in the case of a secretory
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous.
Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist,
synthetic oligonucleotide adaptors or linkers are used in accordance with conventional
practice. The transcriptional and translational regulatory nucleic acid will generally be
appropriate to the host cell used to express the CA protein; for example, transcriptional and
translational regulatory nucleic acid sequences from Bacillus are preferably used to express
the CA protein in Bacillus. Numerous types of appropriate expression vectors, and suitable
regulatory sequences are known in the art for a variety of host cells.

In general, the transcriptional and translational regulatory sequences may include, but are not 
limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop
sequences, translational start and stop sequences, and enhancer or activator sequences. In
a preferred embodiment, the regulatory sequences include a promoter and transcriptional
start and stop sequences.

Promoter sequences encode either constitutive or inducible promoters. The promoters may 
be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
combine elements of more than one promoter, are also known in the art, and are useful in the
present invention.

In addition, the expression vector may comprise additional elements. For example, the 
expression vector may have two replication systems, thus allowing it to be maintained in two 
organisms, for example in mammalian or insect cells for expression and in a procaryotic host
for cloning and amplification. Furthermore, for integrating expression vectors, the expression
vector contains at least one sequence homologous to the host cell genome, and preferably
two homologous sequences which flank the expression construct. The integrating vector may
be directed to a specific locus in the host cell by selecting the appropriate homologous
sequence for inclusion in the vector. Constructs for integrating vectors are well known in the
art.

In addition, in a preferred embodiment, the expression vector contains a selectable marker 
gene to allow the selection of transformed host cells. Selection genes are well known in the 
art and will vary with the host cell used. 

The CA proteins of the present invention are produced by culturing a host cell transformed
with an expression vector containing nucleic acid encoding a CA protein, under the
appropriate conditions to induce or cause expression of the CA protein. The conditions
appropriate for CA protein expression will vary with the choice of the expression vector and
the host cell, and will be easily ascertained by one skilled in the art through routine
experimentation. For example, the use of constitutive promoters in the expression vector will
require optimizing the growth and proliferation of the host cell, while the use of an inducible
promoter requires the appropriate growth conditions for induction. In addition, in some
embodiments, the timing of the harvest is important. For example, the baculoviral systems
used in insect cell expression are lytic viruses, and thus harvest time selection can be crucial
for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect, plant and
animal cells, including mammalian cells. Of particular interest are Drosophila melanogaster
cells, Saccharomyces cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129
cells, 293 cells, Neurospora, BHK, CHO, COS, HeLa cells, THP1 cell line (a macrophage cell
line) and human cells and cell lines.

In a preferred embodiment, the CA proteins are expressed in mammalian cells. Mammalian
expression systems are also known in the art, and include retroviral systems. A preferred
expression vector system is a retroviral vector system such as is generally described in
PCT/US97/01019 and PCT/US97/01048, both of which are hereby expressly incorporated by
reference. Of particular use as mammalian promoters are the promoters from mammalian
viral genes, since the viral genes are often highly expressed and have a broad host range.
Examples include the SV40 early promoter, mouse mammary tumor virus LTR promoter,
adenovirus major late promoter, herpes simplex virus promoter, and the CMV promoter.
Typically, transcription termination and polyadenylation sequences recognized by mammalian
cells are regulatory regions located 3' to the translation stop codon and thus, together with the
promoter elements, flank the coding sequence. Examples of transcription terminator and
polyadenylation signals include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as other
hosts, are well known in the art, and will vary with the host cell used. Techniques include
dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei.

In a preferred embodiment, CA proteins are expressed in bacterial systems. Bacterial
expression systems are well known in the art. Promoters from bacteriophage may also be
used and are known in the art. In addition, synthetic promoters and hybrid promoters are also
useful; for example, the tac promoter is a hybrid of the trp and lac promoter sequences.
Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial
origin that have the ability to bind bacterial RNA polymerase and initiate transcription. In
addition to a functioning promoter sequence, an efficient ribosome binding site is desirable.

The expression vector may also include a signal peptide sequence that provides for secretion
of the CA protein in bacteria. The protein is either secreted into the growth media
(gram-positive bacteria) or into the periplasmic space, located between the inner and outer
membrane of the cell (gram-negative bacteria). The bacterial expression vector may also
include a selectable marker gene to allow for the selection of bacterial strains that have been
transformed. Suitable selection genes include genes which render the bacteria resistant to
drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin and
tetracycline. Selectable markers also include biosynthetic genes, such as those in the
histidine, tryptophan and leucine biosynthetic pathways. These components are assembled
into expression vectors. Expression vectors for bacteria are well known in the art, and include
vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, and Streptococcus lividans,
among others. The bacterial expression vectors are transformed into bacterial host cells
using techniques well known in the art, such as calcium chloride treatment, electroporation,
and others.

In one embodiment, CA proteins are produced in insect cells. Expression vectors for the 
transformation of insect cells, and in particular, baculovirus-based expression vectors, are 
well known in the art. 

In a preferred embodiment, CA protein is produced in yeast cells. Yeast expression systems
are well known in the art, and include expression vectors for Saccharomyces cerevisiae,
Candida albicans and C. maltosa, Hansenula polymorpha, Kluyveromyces fragilis and K.
lactis, Pichia guilliermondii and P. pastoris, Schizosaccharomyces pombe, and Yarrowia
lipolytica.

The CA protein may also be made as a fusion protein, using techniques well known in the art. 
Thus, for example, for the creation of monoclonal antibodies, if the desired epitope is small,
the CA protein may be fused to a carrier protein to form an immunogen. Alternatively, the CA
protein may be made as a fusion protein to increase expression, or for other reasons. For
example, when the CA protein is a CA peptide, the nucleic acid encoding the peptide may
be linked to other nucleic acid for expression purposes.

In one embodiment, the CA nucleic acids, proteins and antibodies of the invention are
labeled. By "labeled" herein is meant that a compound has at least one element, isotope or
chemical compound attached to enable the detection of the compound. In general, labels fall
into three classes: a) isotopic labels, which may be radioactive or heavy isotopes; b) immune
labels, which may be antibodies or antigens; and c) colored or fluorescent dyes. The labels
may be incorporated into the CA nucleic acids, proteins and antibodies at any position. For
example, the label should be capable of producing, either directly or indirectly, a detectable
signal. The detectable moiety may be a radioisotope, such as 3H, 14C, 32P, 35S, or 125I, a
fluorescent or chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine,
or luciferin, or an enzyme, such as alkaline phosphatase, beta-galactosidase or horseradish
peroxidase. Any method known in the art for conjugating the antibody to the label may be
employed, including those methods described by Hunter et al., Nature, 144:945 (1962);
David et al., Biochemistry, 13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981); and
Nygren, J. Histochem. and Cytochem., 30:407 (1982).

Accordingly, the present invention also provides CA protein sequences. A CA protein of the
present invention may be identified in several ways. "Protein" in this sense includes proteins,
polypeptides, and peptides. As will be appreciated by those in the art, the nucleic acid
sequences of the invention can be used to generate protein sequences. There are a variety
of ways to do this, including cloning the entire gene and verifying its frame and amino acid
sequence, or by comparing it to known sequences to search for homology to provide a frame,
assuming the CA protein has homology to some protein in the database being used.

Generally, the nucleic acid sequences are input into a program that will search all three
frames for homology. This is done in a preferred embodiment using the following NCBI
Advanced BLAST parameters. The program is blastx or blastn. The database is nr. The
input data is as "Sequence in FASTA format". The organism list is "none". The "expect" is
10; the filter is default. The "descriptions" is 500, the "alignments" is 500, and the "alignment
view" is pairwise. The "query Genetic Codes" is standard (1). The matrix is BLOSUM62; gap
existence cost is 11, per residue gap cost is 1; and the lambda ratio is 0.85 (default). This
results in the generation of a putative protein sequence.
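
The frame search described above can be illustrated with a short sketch. It is not the BLAST program itself; it only shows what translating a nucleotide sequence in three forward reading frames with the standard genetic code (code 1) looks like. The function name translate_frames and the use of "X" for unrecognized codons are assumptions made for this illustration.

```python
# Illustrative sketch only: three forward-frame translation using the
# standard genetic code (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Build the codon table by enumerating codons in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_frames(dna):
    """Return the translations of the three forward reading frames.

    Stop codons are written as '*'; codons containing characters other
    than A, C, G, T are written as 'X' (an assumption of this sketch).
    """
    seq = dna.upper().replace("U", "T")
    frames = []
    for offset in range(3):
        codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
        frames.append("".join(CODON_TABLE.get(codon, "X") for codon in codons))
    return frames


if __name__ == "__main__":
    for number, protein in enumerate(translate_frames("ATGGCCTAA"), start=1):
        print(f"frame {number}: {protein}")
```

Each translated frame would then be compared against a protein database to find the frame with homology; that comparison step is outside the scope of this sketch.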

Also included within one embodiment of CA proteins are amino acid variants of the naturally
occurring sequences, as determined herein. Preferably, the variants are greater
than about 75% homologous to the wild-type sequence, more preferably greater than about
80%, even more preferably greater than about 85% and most preferably greater than 90%.
In some embodiments the homology will be as high as about 93 to 95 or 98%. As for nucleic
acids, homology in this context means sequence similarity or identity, with identity being
preferred. This homology will be determined using standard techniques known in the art as
are outlined above for the nucleic acid homologies.
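
As a rough illustration of the thresholds above, the following sketch computes percent identity over two sequences that are assumed to be already aligned (equal length, "-" for gaps) and reports which stated threshold is exceeded. Ignoring gapped columns and treating the "about" values as hard cutoffs are simplifying assumptions of this example; it is not the homology method referred to in the text, and the function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over ungapped columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # assumption: columns containing a gap are not counted
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def highest_threshold_exceeded(identity):
    """Return the largest stated threshold (75, 80, 85, 90) that is exceeded."""
    for threshold in (90, 85, 80, 75):
        if identity > threshold:
            return threshold
    return None


if __name__ == "__main__":
    identity = percent_identity("MKTAYIAK", "MKTAYLAK")
    print(identity, highest_threshold_exceeded(identity))
```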

CA proteins of the present invention may be shorter or longer than the wild type amino acid 
sequences. Thus, in a preferred embodiment, included within the definition of CA proteins
are portions or fragments of the wild type sequences herein. In addition, as outlined above, 
the CA nucleic acids of the invention may be used to obtain additional coding regions, and 
thus additional protein sequence, using techniques known in the art. 

In a preferred embodiment, the CA proteins are derivative or variant CA proteins as compared
to the wild-type sequence. That is, as outlined more fully below, the derivative CA peptide will
contain at least one amino acid substitution, deletion or insertion, with amino acid
substitutions being particularly preferred. The amino acid substitution, insertion or deletion
may occur at any residue within the CA peptide.

Also included in an embodiment of CA proteins of the present invention are amino acid
sequence variants. These variants fall into one or more of three classes: substitutional,
insertional or deletional variants. These variants ordinarily are prepared by site specific
mutagenesis of nucleotides in the DNA encoding the CA protein, using cassette or PCR
mutagenesis or other techniques well known in the art to produce DNA encoding the variant,
and thereafter expressing the DNA in recombinant cell culture as outlined above. However,
variant CA protein fragments having up to about 100-150 residues may be prepared by in
vitro synthesis using established techniques. Amino acid sequence variants are
characterized by the predetermined nature of the variation, a feature that sets them apart
from naturally occurring allelic or interspecies variation of the CA protein amino acid
sequence. The variants typically exhibit the same qualitative biological activity as the
naturally occurring analogue, although variants can also be selected which have modified
characteristics as will be more fully outlined below.

While the site or region for introducing an amino acid sequence variation is predetermined,
the mutation per se need not be predetermined. For example, in order to optimize the
performance of a mutation at a given site, random mutagenesis may be conducted at the
target codon or region and the expressed CA variants screened for the optimal combination of
desired activity. Techniques for making substitution mutations at predetermined sites in DNA
having a known sequence are well known, for example, M13 primer mutagenesis and PCR
mutagenesis. Screening of the mutants is done using assays of CA protein activities.

Amino acid substitutions are typically of single residues; insertions usually will be on the order
of from about 1 to 20 amino acids, although considerably larger insertions may be tolerated.
Deletions range from about 1 to about 20 residues, although in some cases deletions may be
much larger.

Substitutions, deletions, insertions or any combination thereof may be used to arrive at a final 
derivative. Generally these changes are done on a few amino acids to minimize the alteration
of the molecule. However, larger changes may be tolerated in certain circumstances. When 
small alterations in the characteristics of the CA protein are desired, substitutions are 
generally made in accordance with the following chart: 

Chart I

Original Residue    Exemplary Substitutions
Ala                 Ser
Arg                 Lys
Asn                 Gln, His
Asp                 Glu
Cys                 Ser
Gln                 Asn
Glu                 Asp
Gly                 Pro
His                 Asn, Gln
Ile                 Leu, Val
Leu                 Ile, Val
Lys                 Arg, Gln, Glu
Met                 Leu, Ile
Phe                 Met, Leu, Tyr
Ser                 Thr
Thr                 Ser
Trp                 Tyr
Tyr                 Trp, Phe
Val                 Ile, Leu

Substantial changes in function or immunological identity are made by selecting substitutions
that are less conservative than those shown in Chart I. For example, substitutions may be
made which more significantly affect: the structure of the polypeptide backbone in the area of
the alteration, for example the alpha-helical or beta-sheet structure; the charge or
hydrophobicity of the molecule at the target site; or the bulk of the side chain. The
substitutions which in general are expected to produce the greatest changes in the
polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or
alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue
having an electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an
electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side chain,
e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine.

The variants typically exhibit the same qualitative biological activity and will elicit the same
immune response as the naturally-occurring analogue, although variants also are selected to
modify the characteristics of the CA proteins as needed. Alternatively, the variant may be
designed such that the biological activity of the CA protein is altered. For example,
glycosylation sites may be altered or removed, dominant negative mutations created, etc.

Covalent modifications of CA polypeptides are included within the scope of this invention, for
example for use in screening. One type of covalent modification includes reacting targeted
amino acid residues of a CA polypeptide with an organic derivatizing agent that is capable of
reacting with selected side chains or the N- or C-terminal residues of a CA polypeptide.
Derivatization with bifunctional agents is useful, for instance, for crosslinking CA polypeptides
to a water-insoluble support matrix or surface for use in the method for purifying anti-CA
antibodies or screening assays, as is more fully described below. Commonly used
crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde,
N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional
imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate),
bifunctional maleimides such as bis-N-maleimido-1,8-octane and agents such as
methyl-3-[(p-azidophenyl)dithio]propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to the
corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, methylation of
the α-amino groups of lysine, arginine, and histidine side chains [T.E. Creighton, Proteins:
Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)],
acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the CA polypeptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the 
native glycosylation pattern" is intended for purposes herein to mean deleting one or more 
carbohydrate moieties found in native sequence CA polypeptide, and/or adding one or more 
glycosylation sites that are not present in the native sequence CA polypeptide.

Addition of glycosylation sites to CA polypeptides may be accomplished by altering the amino 
acid sequence thereof. The alteration may be made, for example, by the addition of, or 
substitution by, one or more serine or threonine residues to the native sequence CA 
polypeptide (for O-linked glycosylation sites). The CA amino acid sequence may optionally
be altered through changes at the DNA level, particularly by mutating the DNA encoding the 
CA polypeptide at preselected bases such that codons are generated that will translate into 
the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the CA polypeptide is
by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are
described in the art, e.g., in WO 87/05330 published 11 September 1987, and in Aplin and
Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981).

Removal of carbohydrate moieties present on the CA polypeptide may be accomplished
chemically or enzymatically or by mutational substitution of codons encoding amino acid
residues that serve as targets for glycosylation. Chemical deglycosylation techniques are
known in the art and described, for instance, by Hakimuddin, et al., Arch. Biochem. Biophys.,
259:52 (1987) and by Edge et al., Anal. Biochem., 118:131 (1981). Enzymatic cleavage of
carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and
exo-glycosidases as described by Thotakura et al., Meth. Enzymol., 138:350 (1987).

Another type of covalent modification of CA comprises linking the CA polypeptide to one of a
variety of nonproteinaceous polymers, e.g., polyethylene glycol, polypropylene glycol, or
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689;
4,301,144; 4,670,417; 4,791,192 or 4,179,337.

CA polypeptides of the present invention may also be modified in a way to form chimeric
molecules comprising a CA polypeptide fused to another, heterologous polypeptide or amino
acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of a CA
polypeptide with a tag polypeptide which provides an epitope to which an anti-tag antibody
can selectively bind. The epitope tag is generally placed at the amino- or carboxyl-terminus of
the CA polypeptide, although internal fusions may also be tolerated in some instances. The
presence of such epitope-tagged forms of a CA polypeptide can be detected using an
antibody against the tag polypeptide. Also, provision of the epitope tag enables the CA
polypeptide to be readily purified by affinity purification using an anti-tag antibody or another
type of affinity matrix that binds to the epitope tag. In an alternative embodiment, the chimeric
molecule may comprise a fusion of a CA polypeptide with an immunoglobulin or a particular
region of an immunoglobulin. For a bivalent form of the chimeric molecule, such a fusion
could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the art. Examples
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag
polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol., 8:2159-2165 (1988)]; the
c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular
and Cellular Biology, 5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD)
tag and its antibody [Paborsky et al., Protein Engineering, 3(6):547-553 (1990)]. Other tag
polypeptides include the Flag-peptide [Hopp et al., BioTechnology, 6:1204-1210 (1988)]; the
KT3 epitope peptide [Martin et al., Science, 255:192-194 (1992)]; tubulin epitope peptide
[Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide
tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)].

Also included within the definition of CA protein in one embodiment are other CA proteins of the
CA family, and CA proteins from other organisms, which are cloned and expressed as
outlined below. Thus, probe or degenerate polymerase chain reaction (PCR) primer
sequences may be used to find other related CA proteins from humans or other organisms.
As will be appreciated by those in the art, particularly useful probe and/or PCR primer
sequences include the unique areas of the CA nucleic acid sequence. As is generally known
in the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with
from about 20 to about 30 being preferred, and may contain inosine as needed. The
conditions for the PCR reaction are well known in the art.
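
The primer length guidance above lends itself to a simple check. The sketch below is a minimal illustration under stated assumptions: the function name primer_length_class is hypothetical, "I" is used to denote inosine, and the "about" limits are treated as exact bounds for simplicity.

```python
ALLOWED_BASES = set("ACGTI")  # assumption: "I" denotes inosine


def primer_length_class(primer):
    """Classify a primer by the length ranges stated in the text.

    Returns "more preferred" for 20-30 nucleotides, "preferred" for
    15-35 nucleotides, and "outside stated range" otherwise.
    """
    sequence = primer.upper()
    unexpected = set(sequence) - ALLOWED_BASES
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    length = len(sequence)
    if 20 <= length <= 30:
        return "more preferred"
    if 15 <= length <= 35:
        return "preferred"
    return "outside stated range"


if __name__ == "__main__":
    print(primer_length_class("ACGTIACGTIACGTIACGTI"))
```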

In addition, as is outlined herein, CA proteins can be made that are longer than those
encoded by the nucleic acids of the figures, for example, by the elucidation of additional 
sequences, the addition of epitope or purification tags, the addition of other fusion sequences, 
etc. 

CA proteins may also be identified as being encoded by CA nucleic acids. Thus, CA proteins
are encoded by nucleic acids that will hybridize to the sequences of the sequence listings, or 
their complements, as outlined herein. 

In a preferred embodiment, the invention provides CA antibodies. In a preferred embodiment,
when the CA protein is to be used to generate antibodies, for example for immunotherapy, the
CA protein should share at least one epitope or determinant with the full length protein. By
"epitope" or "determinant" herein is meant a portion of a protein which will generate and/or
bind an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies
made to a smaller CA protein will be able to bind to the full length protein. In a preferred
embodiment, the epitope is unique; that is, antibodies generated to a unique epitope show
little or no cross-reactivity.

In one embodiment, the term "antibody" includes antibody fragments, as are known in the art,
including Fab, Fab2, single chain antibodies (Fv for example), chimeric antibodies, etc., either
produced by the modification of whole antibodies or those synthesized de novo using
recombinant DNA technologies.

Methods of preparing polyclonal antibodies are known to the skilled artisan. Polyclonal
antibodies can be raised in a mammal, for example, by one or more injections of an
immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent and/or
adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections.
The immunizing agent may include a protein encoded by a nucleic acid of the figures or
fragment thereof or a fusion protein thereof. It may be useful to conjugate the immunizing
agent to a protein known to be immunogenic in the mammal being immunized. Examples of
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which
may be employed include Freund's complete adjuvant and MPL-TDM adjuvant
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The immunization protocol
may be selected by one skilled in the art without undue experimentation.

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 
256:495 (1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal, 
is typically immunized with an immunizing agent to elicit lymphocytes that produce or are 
capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will typically 
include a polypeptide encoded by a nucleic acid of Tables 1-112, or fragment thereof or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are used if 
cells of human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 
line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
[Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59- 
103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are 
employed. The hybridoma cells may be cultured in a suitable culture medium that preferably 
contains one or more substances that inhibit the growth or survival of the unfused, 
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are
monoclonal, preferably human or humanized, antibodies that have binding specificities for at
least two different antigens. In the present case, one of the binding specificities is for a
protein encoded by a nucleic acid of Tables 1-112, or a fragment thereof; the other one is for
any other antigen, and preferably for a cell-surface protein or receptor or receptor subunit,
preferably one that is tumor specific.

In a preferred embodiment, the antibodies to CA are capable of reducing or eliminating the
biological function of CA, as is described below. That is, the addition of anti-CA antibodies
(either polyclonal or preferably monoclonal) to CA (or cells containing CA) may reduce or
eliminate the CA activity. Generally, at least a 25% decrease in activity is preferred, with at
least about 50% being particularly preferred and about a 95-100% decrease being especially
preferred.

In a preferred embodiment the antibodies to the CA proteins are humanized antibodies.
Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2
or other antigen binding subsequences of antibodies) which contain minimal sequence
derived from non-human immunoglobulin. Humanized antibodies include human
immunoglobulins (recipient antibody) in which residues from a complementarity determining
region (CDR) of the recipient are replaced by residues from a CDR of a non-human species
(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and
capacity. In some instances, Fv framework residues of the human immunoglobulin are
replaced by corresponding non-human residues. Humanized antibodies may also comprise
residues which are found neither in the recipient antibody nor in the imported CDR or
framework sequences. In general, the humanized antibody will comprise substantially all of at
least one, and typically two, variable domains, in which all or substantially all of the CDR
regions correspond to those of a non-human immunoglobulin and all or substantially all of the
framework (FR) regions are those of a human immunoglobulin consensus sequence.
The humanized antibody optimally also will comprise at least a portion of an immunoglobulin
constant region (Fc), typically that of a human immunoglobulin [Jones et al., Nature,
321:522-525 (1986); Riechmann et al., Nature, 332:323-329 (1988); and Presta, Curr. Op.
Struct. Biol., 2:593-596 (1992)].

Methods for humanizing non-human antibodies are well known in the art. Generally, a
humanized antibody has one or more amino acid residues introduced into it from a source
which is non-human. These non-human amino acid residues are often referred to as import
residues, which are typically taken from an import variable domain. Humanization can be
essentially performed following the method of Winter and co-workers [Jones et al., Nature,
321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al.,
Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a
non-human species. In practice, humanized antibodies are typically human antibodies in
which some CDR residues and possibly some FR residues are substituted by residues from
analogous sites in rodent antibodies.

Human antibodies can also be produced using various techniques known in the art, including
phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); Marks et al.,
J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and Boerner et al. are also
available for the preparation of human monoclonal antibodies [Cole et al., Monoclonal
Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and Boerner et al., J. Immunol.,
147(1):86-95 (1991)]. Similarly, human antibodies can be made by introducing human
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge, human
antibody production is observed, which closely resembles that seen in humans in all respects,
including gene rearrangement, assembly, and antibody repertoire. This approach is
described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126;
5,633,425; 5,661,016, and in the following scientific publications: Marks et al.,
Bio/Technology 10, 779-783 (1992); Lonberg et al., Nature 368, 856-859 (1994); Morrison,
Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 (1996);
Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. Immunol.
13, 65-93 (1995).

By immunotherapy is meant treatment of a carcinoma with an antibody raised against a CA
protein. As used herein, immunotherapy can be passive or active. Passive immunotherapy
as defined herein is the passive transfer of antibody to a recipient (patient). Active
immunization is the induction of antibody and/or T-cell responses in a recipient (patient).
Induction of an immune response is the result of providing the recipient with an antigen to
which antibodies are raised. As appreciated by one of ordinary skill in the art, the antigen
may be provided by injecting a polypeptide against which antibodies are desired to be raised
into a recipient, or contacting the recipient with a nucleic acid capable of expressing the
antigen and under conditions for expression of the antigen.

In a preferred embodiment, oncogenes which encode secreted growth factors may be
inhibited by raising antibodies against CA proteins that are secreted proteins as described
above. Without being bound by theory, antibodies used for treatment bind and prevent the
secreted protein from binding to its receptor, thereby inactivating the secreted CA protein.

In another preferred embodiment, the CA protein to which antibodies are raised is a
transmembrane protein. Without being bound by theory, antibodies used for treatment bind
the extracellular domain of the CA protein and prevent it from binding to other proteins, such
as circulating ligands or cell-associated molecules. The antibody may cause down-regulation
of the transmembrane CA protein. As will be appreciated by one of ordinary skill in the art,
the antibody may be a competitive, non-competitive or uncompetitive inhibitor of protein
binding to the extracellular domain of the CA protein. The antibody is also an antagonist of
the CA protein. Further, the antibody prevents activation of the transmembrane CA protein.
In one aspect, when the antibody prevents the binding of other molecules to the CA protein,
the antibody prevents growth of the cell. The antibody may also sensitize the cell to cytotoxic
agents, including, but not limited to, TNF-α, TNF-β, IL-1, IFN-γ and IL-2, or chemotherapeutic
agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, and the like. In
some instances the antibody belongs to a sub-type that activates serum complement when
complexed with the transmembrane protein, thereby mediating cytotoxicity. Thus, carcinomas
may be treated by administering to a patient antibodies directed against the transmembrane
CA protein.

In another preferred embodiment, the antibody is conjugated to a therapeutic moiety. In one 
aspect the therapeutic moiety is a small molecule that modulates the activity of the CA 
protein. In another aspect the therapeutic moiety modulates the activity of molecules
associated with or in close proximity to the CA protein. The therapeutic moiety may inhibit
enzymatic activity such as protease or protein kinase activity associated with carcinoma. 

In a preferred embodiment, the therapeutic moiety may also be a cytotoxic agent. In this
method, targeting the cytotoxic agent to tumor tissue or cells results in a reduction in the
number of afflicted cells, thereby reducing symptoms associated with carcinomas, including
lymphoma. Cytotoxic agents are numerous and varied and include, but are not limited to,
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against CA proteins,
or binding of a radionuclide to a chelating agent that has been covalently attached to the
antibody. Targeting the therapeutic moiety to transmembrane CA proteins not only serves to
increase the local concentration of therapeutic moiety in the carcinoma of interest, i.e.,
lymphoma, but also serves to reduce deleterious side effects that may be associated with the
therapeutic moiety.

In another preferred embodiment, the CA protein against which the antibodies are raised is an
intracellular protein. In this case, the antibody may be conjugated to a protein which
facilitates entry into the cell. In one case, the antibody enters the cell by endocytosis. In
another embodiment, a nucleic acid encoding the antibody is administered to the individual or
cell. Moreover, where the CA protein can be targeted within a cell, e.g., the nucleus, an
antibody thereto contains a signal for that target localization, e.g., a nuclear localization signal.

The CA antibodies of the invention specifically bind to CA proteins. By "specifically bind"
herein is meant that the antibodies bind to the protein with a binding constant in the range of
at least 10^-4 - 10^-6 M^-1, with a preferred range being 10^-7 - 10^-9 M^-1.

In a preferred embodiment, the CA protein is purified or isolated after expression. CA
proteins may be isolated or purified in a variety of ways known to those skilled in the art
depending on what other components are present in the sample. Standard purification
methods include electrophoretic, molecular, immunological and chromatographic techniques,
including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, and
chromatofocusing. For example, the CA protein may be purified using a standard anti-CA
antibody column. Ultrafiltration and diafiltration techniques, in conjunction with protein
concentration, are also useful. For general guidance in suitable purification techniques, see
Scopes, R., Protein Purification, Springer-Verlag, NY (1982). The degree of purification
necessary will vary depending on the use of the CA protein. In some instances no purification
will be necessary.

Once expressed and purified if necessary, the CA proteins and nucleic acids are useful in a 
number of applications. 

In one aspect, the expression levels of genes are determined for different cellular states in the
carcinoma phenotype; that is, the expression levels of genes in normal tissue and in
carcinoma tissue (and in some cases, for varying severities of lymphoma that relate to
prognosis, as outlined below) are evaluated to provide expression profiles. An expression
profile of a particular cell state or point of development is essentially a "fingerprint" of the
state; while two states may have any particular gene similarly expressed, the evaluation of a
number of genes simultaneously allows the generation of a gene expression profile that is
unique to the state of the cell. By comparing expression profiles of cells in different states,
information regarding which genes are important (including both up- and down-regulation of
genes) in each of these states is obtained. Then, diagnosis may be done or confirmed: does
tissue from a particular patient have the gene expression profile of normal or carcinoma
tissue?

"Differential expression," or grammatical equivalents as used herein, refers to both qualitative 
as well as quantitative differences in the genes temporal and/or cellular expression patterns 

30 within and among the cells. Thus, a differentially expressed gene can qualitatively have its 
expression altered, including an activation or inactivation, in, for example, normal versus 
carcinoma tissue. That is, genes may be turned on or turned off in a particular state, relative 
to another state. As is apparent to the skilled artisan, any comparison of two or more states 
can be made. Such a qualitatively regulated gene will exhibit an expression pattern within a 

35 state or cell type which is detectable by standard techniques in one such state or cell type, but 
is not detectable in both. Alternatively, the determination is quantitative in that expression is 
increased or decreased; that is, the expression of the gene is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip® expression arrays, Lockhart, Nature Biotechnology, 14:1675-1680(1996), hereby 
expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, Northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e. upregulation or downregulation) is at least
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
preferably, at least about 200%, with from 300 to at least 1000% being especially preferred. 
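
The qualitative/quantitative distinction and the percentage thresholds above can be sketched as follows. This is a minimal illustration: the detection limit, the function name, and the symmetric definition of percent change (measured relative to the lower of the two levels, so that up- and downregulation are treated alike) are assumptions, not definitions from the text.

```python
TIERS = (300.0, 200.0, 150.0, 100.0, 50.0)  # percent thresholds named in the text


def classify(normal, tumor, detection_limit=1.0):
    """Classify a gene as qualitatively or quantitatively differentially expressed.

    Returns (kind, direction, tier), where tier is the highest percent
    threshold met, or None when the change is below 50%.
    """
    normal_on = normal >= detection_limit
    tumor_on = tumor >= detection_limit
    if not normal_on and not tumor_on:
        return ("not detected", None, None)
    if normal_on != tumor_on:
        # Detectable in one state but not the other: a qualitative difference.
        return ("qualitative", "on" if tumor_on else "off", None)
    low, high = sorted((normal, tumor))
    change = (high - low) / low * 100.0  # symmetric percent change (assumption)
    if tumor > normal:
        direction = "up"
    elif tumor < normal:
        direction = "down"
    else:
        direction = "unchanged"
    tier = next((t for t in TIERS if change >= t), None)
    return ("quantitative", direction, tier)


print(classify(10.0, 40.0))  # both detectable, tumor higher
print(classify(0.2, 12.0))   # detectable only in tumor
```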

As will be appreciated by those in the art, this may be done by evaluation at either the gene 
transcript, or the protein level; that is, the amount of gene expression may be monitored using
nucleic acid probes to the DNA or RNA equivalent of the gene transcript, and the 
quantification of gene expression levels, or, alternatively, the final gene product itself (protein) 
can be monitored, for example through the use of antibodies to the CA protein and standard 
immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy assays, 2D 
gel electrophoresis assays, etc. Thus, the proteins corresponding to CA genes, i.e. those
identified as being important in a particular carcinoma phenotype, i.e., lymphoma, can be 
evaluated in a diagnostic test specific for that carcinoma. 

In a preferred embodiment, gene expression monitoring is done and a number of genes, i.e. 
an expression profile, is monitored simultaneously, although multiple protein expression
monitoring can be done as well. Similarly, these assays may be done on an individual basis
as well. 

In this embodiment, the CA nucleic acid probes may be attached to biochips as outlined 
herein for the detection and quantification of CA sequences in a particular cell. The assays
are done as is known in the art. As will be appreciated by those in the art, any number of 
different CA sequences may be used as probes, with single sequence assays being used in 
some cases, and a plurality of the sequences described herein being used in other 
embodiments. In addition, while solid-phase assays are described, any number of solution 
based assays may be done as well.

In a preferred embodiment, both solid and solution based assays may be used to detect CA 
sequences that are up-regulated or down-regulated in carcinomas as compared to normal 
tissue. In instances where the CA sequence has been altered but shows the same 
expression profile or an altered expression profile, the protein will be detected as outlined
herein. 

In a preferred embodiment nucleic acids encoding the CA protein are detected. Although 
DNA or RNA encoding the CA protein may be detected, of particular interest are methods
wherein the mRNA encoding a CA protein is detected. The presence of mRNA in a sample is
an indication that the CA gene has been transcribed to form the mRNA, and suggests that the 
protein is expressed. Probes to detect the mRNA can be any nucleotide/deoxynucleotide 
probe that is complementary to and base pairs with the mRNA and includes but is not limited 
to oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is detected. 
In another method detection of the mRNA is performed in situ. In this method permeabilized 
cells or tissue samples are contacted with a detectably labeled nucleic acid probe for 
sufficient time to allow the probe to hybridize with the target mRNA. Following washing to 
remove the non-specifically bound probe, the label is detected. For example, a digoxigenin-
labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a CA protein is
detected by binding the digoxigenin with an anti-digoxigenin secondary antibody and
developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-indolyl phosphate.

In a preferred embodiment, any of the three classes of proteins as described herein 
(secreted, transmembrane or intracellular proteins) are used in diagnostic assays. The CA 
proteins, antibodies, nucleic acids, modified proteins and cells containing CA sequences are 
used in diagnostic assays. This can be done on an individual gene or corresponding
polypeptide level, or as sets of assays. 

As described and defined herein, CA proteins find use as markers of carcinomas, including 
lymphomas such as, but not limited to, Hodgkin's and non-Hodgkin lymphoma. Detection of
these proteins in putative carcinoma tissue or patients allows for a determination or diagnosis
of the type of carcinoma. Numerous methods known to those of ordinary skill in the art find 
use in detecting carcinomas. In one embodiment, antibodies are used to detect CA proteins. 
A preferred method separates proteins from a sample or patient by electrophoresis on a gel 
(typically a denaturing and reducing protein gel, but may be any other type of gel including
isoelectric focusing gels and the like). Following separation of proteins, the CA protein is
detected by immunoblotting with antibodies raised against the CA protein. Methods of 
immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the CA protein find use in in situ imaging 
techniques. In this method cells are contacted with from one to many antibodies to the CA
protein(s). Following washing to remove non-specific antibody binding, the presence of the 
antibody or antibodies is detected. In one embodiment the antibody is detected by incubating 
with a secondary antibody that contains a detectable label. In another method the primary 
antibody to the CA protein(s) contains a detectable label. In another preferred embodiment
each one of multiple primary antibodies contains a distinct and detectable label. This method
finds particular use in simultaneous screening for a plurality of CA proteins. As will be 
appreciated by one of ordinary skill in the art, numerous other histological imaging techniques 
are useful in the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability to detect 
and distinguish emissions of different wavelengths. In addition, a fluorescence activated cell 
sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing carcinomas from blood
samples. As previously described, certain CA proteins are secreted/circulating molecules. 
Blood samples, therefore, are useful as samples to be probed or tested for the presence of 
secreted CA proteins. Antibodies can be used to detect the CA proteins by any of the 
previously described immunoassay techniques including ELISA, immunoblotting (Western
blotting), immunoprecipitation, BIACORE technology and the like, as will be appreciated by
one of ordinary skill in the art. 

In a preferred embodiment, in situ hybridization of labeled CA nucleic acid probes to tissue 
arrays is done. For example, arrays of tissue samples, including CA tissue and/or normal 
tissue, are made. In situ hybridization as is known in the art can then be done.

It is understood that when comparing the expression fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis as well as a prognosis. It is further 
understood that the genes which indicate the diagnosis may differ from those which indicate 
the prognosis.

In a preferred embodiment, the CA proteins, antibodies, nucleic acids, modified proteins and 
cells containing CA sequences are used in prognosis assays. As above, gene expression 
profiles can be generated that correlate to carcinoma, especially lymphoma, severity, in terms 
of long term prognosis. Again, this may be done on either a protein or gene level, with the
use of genes being preferred. As above, the CA probes are attached to biochips for the 
detection and quantification of CA sequences in a tissue or patient. The assays proceed as 
outlined for diagnosis. 

In a preferred embodiment, any of the CA sequences as described herein are used in drug
screening assays. The CA proteins, antibodies, nucleic acids, modified proteins and cells 
containing CA sequences are used in drug screening assays or by evaluating the effect of 
drug candidates on a "gene expression profile" or expression profile of polypeptides. In one 
embodiment, the expression profiles are used, preferably in conjunction with high throughput
screening techniques to allow monitoring for expression profile genes after treatment with a
candidate agent; see Zlokarnik et al., Science 279:84-8 (1998) and Heid et al., Genome Res.
6:986-994 (1996).

In a preferred embodiment, the CA proteins, antibodies, nucleic acids, modified proteins and
cells containing the native or modified CA proteins are used in screening assays. That is, the 
present invention provides novel methods for screening for compositions which modulate the 
carcinoma phenotype. As above, this can be done by screening for modulators of gene 
expression or for modulators of protein activity. Similarly, this may be done on an individual 
gene or protein level or by evaluating the effect of drug candidates on a "gene expression
profile". In a preferred embodiment, the expression profiles are used, preferably in 
conjunction with high throughput screening techniques to allow monitoring for expression 
profile genes after treatment with a candidate agent, see Zlokarnik, supra. 

Having identified the CA genes herein, a variety of assays to evaluate the effects of agents on
gene expression may be executed. In a preferred embodiment, assays may be run on an 
individual gene or protein level. That is, having identified a particular gene as aberrantly 
regulated in carcinoma, candidate bioactive agents may be screened to modulate the gene's
response. "Modulation" thus includes both an increase and a decrease in gene expression or
activity. The preferred amount of modulation will depend on the original change of the gene
expression in normal versus tumor tissue, with changes of at least 10%, preferably 50%,
more preferably 100-300%, and in some embodiments 300-1000% or greater. Thus, if a
gene exhibits a 4 fold increase in tumor compared to normal tissue, a decrease of about four
fold is desired; if a gene exhibits a 10 fold decrease in tumor compared to normal tissue, a 10
fold increase in expression in response to a candidate agent is desired, etc. Alternatively, where the CA
sequence has been altered but shows the same expression profile or an altered expression
profile, the protein will be detected as outlined herein.
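
The fold-change reasoning above (a 4 fold increase calls for a roughly four fold decrease, a 10 fold decrease for a 10 fold increase) amounts to taking the reciprocal of the observed change. A minimal sketch, in which the function name and the input values are hypothetical:

```python
def desired_modulation(normal, tumor):
    """Return (direction, fold) of the change an agent should produce in tumor tissue.

    The desired change simply reverses the observed tumor-versus-normal
    fold difference, restoring the normal expression level.
    """
    if tumor > normal:
        return ("decrease", tumor / normal)
    if tumor < normal:
        return ("increase", normal / tumor)
    return ("none", 1.0)


print(desired_modulation(10.0, 40.0))  # gene is 4 fold up in tumor
print(desired_modulation(50.0, 5.0))   # gene is 10 fold down in tumor
```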

As will be appreciated by those in the art, this may be done by evaluation at either the gene or 
the protein level; that is, the amount of gene expression may be monitored using nucleic acid
probes and the quantification of gene expression levels, or, alternatively, the level of the gene 
product itself can be monitored, for example through the use of antibodies to the CA protein 
and standard immunoassays. Alternatively, binding and bioactivity assays with the protein 
may be done as outlined below. 

In a preferred embodiment, gene expression monitoring is done and a number of genes, i.e. 
an expression profile, is monitored simultaneously, although multiple protein expression 
monitoring can be done as well.

In this embodiment, the CA nucleic acid probes are attached to biochips as outlined herein for
the detection and quantification of CA sequences in a particular cell. The assays are further 
described below. 

Generally, in a preferred embodiment, a candidate bioactive agent is added to the cells prior
to analysis. Moreover, screens are provided to identify a candidate bioactive agent which 
modulates a particular type of carcinoma, modulates CA proteins, binds to a CA protein, or 
interferes with the binding of a CA protein to an antibody.

The term "candidate bioactive agent" or "drug candidate" or grammatical equivalents as used
herein describes any molecule, e.g., protein, oligopeptide, small organic or inorganic 
molecule, polysaccharide, polynucleotide, etc., to be tested for bioactive agents that are 
capable of directly or indirectly altering either the carcinoma phenotype, binding to and/or 
modulating the bioactivity of a CA protein, or the expression of a CA sequence, including
both nucleic acid sequences and protein sequences. In a particularly preferred embodiment,
the candidate agent suppresses a CA phenotype, for example to a normal tissue fingerprint. 
Similarly, the candidate agent preferably suppresses a severe CA phenotype. Generally a 
plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
differential response to the various concentrations. Typically, one of these concentrations
serves as a negative control, i.e., at zero concentration or below the level of detection.
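
A parallel concentration series with a zero-concentration negative control might be evaluated as in the sketch below. The readings, the units, and the choice to express each signal as a fraction of the control are assumptions made purely for illustration.

```python
def relative_response(readings):
    """Express each signal as a fraction of the zero-concentration control.

    `readings` maps agent concentration to measured signal and must contain
    an entry for 0.0, which serves as the negative control.
    """
    control = readings[0.0]
    return {
        conc: signal / control
        for conc, signal in sorted(readings.items())
        if conc != 0.0
    }


# Hypothetical assay mixtures run in parallel: concentration -> signal.
readings = {0.0: 200.0, 1.0: 150.0, 10.0: 50.0}
print(relative_response(readings))
```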

In one aspect, a candidate agent will neutralize the effect of a CA protein. By "neutralize" is
meant that activity of a protein is either inhibited or counteracted so as to have
substantially no effect on a cell. 

Candidate agents encompass numerous chemical classes, though typically they are organic 
or inorganic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1500, or less than 1000, or less than 500 daltons. Candidate agents comprise
functional groups necessary for structural interaction with proteins, particularly hydrogen
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures 
substituted with one or more of the above functional groups. Candidate agents are also found
among biomolecules including peptides, saccharides, fatty acids, steroids, purines,
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred are
peptides. 
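
The molecular weight window and functional-group criteria described above could be applied to a compound list as in the following sketch. The compound records are hypothetical, and the representation of functional groups as plain strings is an assumption of this example rather than anything prescribed by the text.

```python
FUNCTIONAL_GROUPS = {"amine", "carbonyl", "hydroxyl", "carboxyl"}


def is_typical_candidate(mol_weight, groups, max_weight=2500.0):
    """True for a compound of more than 100 and less than max_weight daltons
    that carries at least one of the named functional groups."""
    n_groups = len(FUNCTIONAL_GROUPS & set(groups))
    return 100.0 < mol_weight < max_weight and n_groups >= 1


def has_preferred_groups(groups):
    """True when at least two of the named functional groups are present."""
    return len(FUNCTIONAL_GROUPS & set(groups)) >= 2


# Hypothetical library entries: (name, molecular weight in daltons, groups).
library = [
    ("cmpd-1", 342.0, ["amine", "hydroxyl"]),
    ("cmpd-2", 58.0, ["carbonyl"]),
    ("cmpd-3", 3100.0, ["carboxyl", "amine"]),
]
selected = [name for name, mw, groups in library if is_typical_candidate(mw, groups)]
print(selected)
```

Passing a smaller `max_weight` (for example 500.0) narrows the selection to the more preferred size ranges.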

Candidate agents are obtained from a wide variety of sources including libraries of synthetic
or natural compounds. For example, numerous means are available for random and directed
synthesis of a wide variety of organic compounds and biomolecules, including expression of
randomized oligonucleotides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily produced. Additionally, 
natural or synthetically produced libraries and compounds are readily modified through
conventional chemical, physical and biochemical means. Known pharmacological agents
may be subjected to directed or random chemical modifications, such as acylation, alkylation,
esterification, or amidification, to produce structural analogs.

In a preferred embodiment, the candidate bioactive agents are proteins. By "protein" herein is 
meant at least two covalently attached amino acids, which includes proteins, polypeptides, 
oligopeptides and peptides. The protein may be made up of naturally occurring amino acids 
and peptide bonds, or synthetic peptidomimetic structures. Thus "amino acid", or "peptide 
residue", as used herein means both naturally occurring and synthetic amino acids. For 
example, homo-phenylalanine, citrulline and norleucine are considered amino acids for the
purposes of the invention. "Amino acid" also includes imino acid residues such as proline and 
hydroxyproline. The side chains may be in either the (R) or the (S) configuration. In the 
preferred embodiment, the amino acids are in the (S) or L-configuration. If non-naturally 
occurring side chains are used, non-amino acid substituents may be used, for example to 
prevent or retard in vivo degradation.

In a preferred embodiment, the candidate bioactive agents are naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, for example, cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of procaryotic and eucaryotic proteins may be made for screening in the 
methods of the invention. Particularly preferred in this embodiment are libraries of bacterial, 
fungal, viral, and mammalian proteins, with the latter being preferred, and human proteins 
being especially preferred. 

In a preferred embodiment, the candidate bioactive agents are peptides of from about 5 to
about 30 amino acids, with from about 5 to about 20 amino acids being preferred, and from
about 7 to about 15 being particularly preferred. The peptides may be digests of naturally 
occurring proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence preferences or
constants at any position. In a preferred embodiment, the library is biased. That is, some 
positions within the sequence are either held constant, or are selected from a limited number 
of possibilities. For example, in a preferred embodiment, the nucleotides or amino acid
residues are randomized within a defined class, for example, of hydrophobic amino acids,
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of
nucleic acid binding domains, the creation of cysteines for cross-linking, prolines for SH-3
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to
purines, etc.
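
A biased library of the kind described above, with some positions held constant and others drawn from a limited class, can be sketched as follows. The template, the membership of the hydrophobic class, and the seed are illustrative assumptions; an actual library would be produced by chemical synthesis or expression, not by this code.

```python
import random

ALL_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"  # the twenty standard amino acids
HYDROPHOBIC = "AVLIMFW"                 # one possible hydrophobic class (assumption)


def biased_library(template, size, seed=0):
    """Generate `size` peptides from a per-position template.

    Each template entry is a string of the residues allowed at that
    position; a single-letter entry holds the position constant.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    return ["".join(rng.choice(allowed) for allowed in template) for _ in range(size)]


# Hypothetical 7-mer: constant N-terminal Cys, two hydrophobic positions,
# a constant proline, and fully random positions elsewhere.
template = ["C", ALL_RESIDUES, HYDROPHOBIC, HYDROPHOBIC, ALL_RESIDUES, "P", ALL_RESIDUES]
peptides = biased_library(template, size=5)
print(peptides)
```

A fully randomized library is the special case in which every template entry is `ALL_RESIDUES`.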

In a preferred embodiment, the candidate bioactive agents are nucleic acids, as defined 
above. 

As described above generally for proteins, nucleic acid candidate bioactive agents may be
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For
example, digests of procaryotic or eucaryotic genomes may be used as is outlined above for 
proteins. 

In a preferred embodiment, the candidate bioactive agents are organic chemical moieties, a
wide variety of which are available in the literature. 

In assays for altering the expression profile of one or more CA genes, after the candidate 
agent has been added and the cells allowed to incubate for some period of time, the sample
containing the target sequences to be analyzed is added to the biochip. If required, the target
sequence is prepared using known techniques. For example, the sample may be treated to 
lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or 
amplification such as PCR occurring as needed, as will be appreciated by those in the art. 
For example, an in vitro transcription with labels covalently attached to the nucleosides is
done. Generally, the nucleic acids are labeled with a label as defined herein, with biotin-FITC
or PE, Cy3 and Cy5 being particularly preferred.

In a preferred embodiment, the target sequence is labeled with, for example, a fluorescent, 
chemiluminescent, chemical, or radioactive signal, to provide a means of detecting the target 
sequence's specific binding to a probe. The label also can be an enzyme, such as, alkaline
phosphatase or horseradish peroxidase, which when provided with an appropriate substrate 
produces a product that can be detected. Alternatively, the label can be a labeled compound 
or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or altered by 
the enzyme. The label also can be a moiety or compound, such as, an epitope tag or biotin
which specifically binds to streptavidin. For the example of biotin, the streptavidin is labeled
as described above, thereby, providing a detectable signal for the bound target sequence. As 
known in the art, unbound labeled streptavidin is removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct hybridization assays or
can comprise "sandwich assays", which include the use of multiple probes, as is generally
outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584,
5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100,
5,124,246 and 5,681,697, all of which are hereby incorporated by reference. In this
embodiment, in general, the target nucleic acid is prepared as outlined above, and then
added to the biochip comprising a plurality of nucleic acid probes, under conditions that allow
the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including high, 
moderate and low stringency conditions as outlined above. The assays are generally run
under stringency conditions which allow formation of the label probe hybridization complex
only in the presence of target. Stringency can be controlled by altering a step parameter that
is a thermodynamic variable, including, but not limited to, temperature, formamide
concentration, salt concentration, chaotropic salt concentration, pH, organic solvent
concentration, etc.

These parameters may also be used to control non-specific binding, as is generally outlined in 
U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at higher 
stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways, as will be 
appreciated by those in the art. Components of the reaction may be added simultaneously, or 
sequentially, in any order, with preferred embodiments outlined below. In addition, a
variety of other reagents may be included in the assays. These
include reagents like salts, buffers, neutral proteins, e.g. albumin, detergents, etc., which may
be used to facilitate optimal hybridization and detection, and/or reduce non-specific or
background interactions. Also reagents that otherwise improve the efficiency of the assay,
such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used,
depending on the sample preparation methods and purity of the target. In addition, either
solid phase or solution based (e.g., kinetic PCR) assays may be used.

Once the assay is run, the data is analyzed to determine the expression levels, and changes 
in expression levels as between states, of individual genes, forming a gene expression profile.

In a preferred embodiment, as for the diagnosis and prognosis applications, having identified
the differentially expressed gene(s) or mutated gene(s) important in any one state, screens 
can be run to alter the expression of the genes individually. That is, screening for modulation 
of regulation of expression of a single gene can be done. Thus, for example, particularly in 
the case of target genes whose presence or absence is unique between two states, screening
is done for modulators of the target gene expression. 

In addition, screens can be done for novel genes that are induced in response to a candidate 
agent. After identifying a candidate agent based upon its ability to suppress a CA expression
pattern leading to a normal expression pattern, or modulate a single CA gene expression
profile so as to mimic the expression of the gene from normal tissue, a screen as described
above can be performed to identify genes that are specifically modulated in response to the
agent. Comparing expression profiles between normal tissue and agent treated CA tissue
reveals genes that are not expressed in normal tissue or CA tissue, but are expressed in
agent treated tissue. These agent specific sequences can be identified and used by any of
the methods described herein for CA genes or proteins. In particular these sequences and 
the proteins they encode find use in marking or identifying agent treated cells. In addition, 
antibodies can be raised against the agent induced proteins and used to target novel 
therapeutics to the treated CA tissue sample. 
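
The identification of agent specific sequences described above reduces to a set difference over the genes detected in each sample. In the sketch below the gene names, expression levels, and detection limit are hypothetical.

```python
def expressed(profile, detection_limit=1.0):
    """Set of genes whose level is at or above the detection limit."""
    return {gene for gene, level in profile.items() if level >= detection_limit}


def agent_specific_genes(normal, ca, treated, detection_limit=1.0):
    """Genes expressed in agent treated tissue but in neither normal nor CA tissue."""
    return (
        expressed(treated, detection_limit)
        - expressed(normal, detection_limit)
        - expressed(ca, detection_limit)
    )


# Hypothetical expression levels (arbitrary units).
normal = {"g1": 5.0, "g2": 0.1, "g3": 0.0}
ca = {"g1": 20.0, "g2": 8.0, "g3": 0.2}
treated = {"g1": 6.0, "g2": 0.3, "g3": 4.0}
print(agent_specific_genes(normal, ca, treated))
```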

Thus, in one embodiment, a candidate agent is administered to a population of CA cells, that 
thus has an associated CA expression profile. By "administration" or "contacting" herein is 
meant that the candidate agent is added to the cells in such a manner as to allow the agent to 
act upon the cell, whether by uptake and intracellular action, or by action at the cell surface. 
In some embodiments, nucleic acid encoding a proteinaceous candidate agent (i.e. a peptide)
may be put into a viral construct such as a retroviral construct and added to the cell, such that 
expression of the peptide agent is accomplished; see PCT US97/01019, hereby expressly 
incorporated by reference. 

Once the candidate agent has been administered to the cells, the cells can be washed if
desired and are allowed to incubate under preferably physiological conditions for some period
of time. The cells are then harvested and a new gene expression profile is generated, as 
outlined herein. 

Thus, for example, CA tissue may be screened for agents that reduce or suppress the CA
phenotype. A change in at least one gene of the expression profile indicates that the agent 
has an effect on CA activity. By defining such a signature for the CA phenotype, screens for 
new drugs that alter the phenotype can be devised. With this approach, the drug target need 
not be known and need not be represented in the original expression screening platform, nor
does the level of transcript for the target protein need to change.

In a preferred embodiment, as outlined above, screens may be done on individual genes and 
gene products (proteins). That is, having identified a particular differentially expressed gene 
as important in a particular state, screening of modulators of either the expression of the gene
or the gene product itself can be done. The gene products of differentially expressed genes
are sometimes referred to herein as "CA proteins" or a "CAP". The CAP may be a
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic
acids of Tables 1-112. Preferably, the CAP is a fragment. In another embodiment, the
sequences are sequence variants as further described herein.

Preferably, the CAP is a fragment approximately 14 to 24 amino acids long. More
preferably the fragment is a soluble fragment. Preferably, the fragment includes a
non-transmembrane region. In a preferred embodiment, the fragment has an N-terminal Cys to
aid in solubility. In one embodiment, the C-terminus of the fragment is kept as a free acid and
the N-terminus is a free amine to aid in coupling, i.e., to cysteine.
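
Candidate fragments in the 14 to 24 amino acid range could be enumerated as in the sketch below. Two assumptions are made for illustration: the transmembrane regions are supplied as known index ranges, and fragments overlapping any such region are excluded outright, which is stricter than the text's requirement that a fragment merely include a non-transmembrane region. The sequence shown is a placeholder, not a CA protein.

```python
def candidate_fragments(sequence, tm_regions, min_len=14, max_len=24):
    """List (start, fragment) pairs of min_len..max_len residues that avoid
    every transmembrane region, given as half-open (start, end) index pairs."""
    fragments = []
    for start in range(len(sequence)):
        for length in range(min_len, max_len + 1):
            end = start + length
            if end > len(sequence):
                break  # longer windows from this start would also run off the end
            overlaps = any(start < tm_end and tm_start < end for tm_start, tm_end in tm_regions)
            if not overlaps:
                fragments.append((start, sequence[start:end]))
    return fragments


# Placeholder 60-residue sequence with a hypothetical transmembrane span at 20..40.
sequence = "MKT" * 20
fragments = candidate_fragments(sequence, tm_regions=[(20, 40)])
print(len(fragments), fragments[0])
```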

In one embodiment the CA proteins are conjugated to an immunogenic agent as discussed 
herein. In one embodiment the CA protein is conjugated to BSA. 

In a preferred embodiment, screening is done to alter the biological function of the expression 
product of the CA gene. Again, having identified the importance of a gene in a particular 
state, screening for agents that bind and/or modulate the biological activity of the gene 
product can be run as is more fully outlined below. 

In a preferred embodiment, screens are designed to first find candidate agents that can bind 
to CA proteins, and then these agents may be used in assays that evaluate the ability of the 
candidate agent to modulate the CAP activity and the carcinoma phenotype. Thus, as will be 
appreciated by those in the art, there are a number of different assays which may be run; 
binding assays and activity assays.

In a preferred embodiment, binding assays are done. In general, purified or isolated gene 
product is used; that is, the gene products of one or more CA nucleic acids are made. In 
general, this is done as is known in the art. For example, antibodies are generated to the 
protein gene products, and standard immunoassays are run to determine the amount of
protein present. Alternatively, cells comprising the CA proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a CA protein and a 
candidate bioactive agent, and determining the binding of the candidate agent to the CA
protein. Preferred embodiments utilize the human or mouse CA protein, although other
mammalian proteins may also be used, for example for the development of animal models of 
human disease. In some embodiments, as outlined herein, variant or derivative CA proteins 
may be used. 

Generally, in a preferred embodiment of the methods herein, the CA protein or the candidate 
agent is non-diffusably bound to an insoluble support having isolated sample receiving areas
(e.g. a microtiter plate, an array, etc.). The insoluble supports may be made of any
composition to which the compositions can be bound, that is readily separated from soluble
material, and that is otherwise compatible with the overall method of screening. The surface of
such supports may be solid or porous and of any convenient shape. Examples of suitable 
insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
Teflon™, etc. Microtiter plates and arrays are especially convenient because a large number
of assays can be carried out simultaneously, using small amounts of reagents and samples.
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffusable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of the
protein or agent, excess unbound material is removed by washing. The sample receiving 
areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the CA protein is bound to the support, and a candidate bioactive
agent is added to the assay. Alternatively, the candidate agent is bound to the support and
the CA protein is added. Novel binding agents include specific antibodies, non-natural
binding agents identified in screens of chemical libraries, peptide analogs, etc. Of particular
interest are screening assays for agents that have a low toxicity for human cells. A wide
variety of assays may be used for this purpose, including labeled in vitro protein-protein
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding,
functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the candidate bioactive agent to the CA protein may be
done in a number of ways. In a preferred embodiment, the candidate bioactive agent is
labeled, and binding determined directly. For example, this may be done by attaching all or a
portion of the CA protein to a solid support, adding a labeled candidate agent (for example a
fluorescent label), washing off excess reagent, and determining whether the label is present
on the solid support. Various blocking and washing steps may be utilized as is known in the
art.

By "labeled" herein is meant that the compound is either directly or indirectly labeled with a
label which provides a detectable signal, e.g. radioisotope, fluorescers, enzyme, antibodies,
particles such as magnetic particles, chemiluminescers, or specific binding molecules, etc.
Specific binding molecules include pairs, such as biotin and streptavidin, digoxin and
antidigoxin, etc. For the specific binding members, the complementary member would
normally be labeled with a molecule which provides for detection, in accordance with known
procedures, as outlined above. The label can directly or indirectly provide a detectable signal.

In some embodiments, only one of the components is labeled. For example, the proteins (or
proteinaceous candidate agents) may be labeled at tyrosine positions using 125I, or with
fluorophores. Alternatively, more than one component may be labeled with different labels;
using 125I for the proteins, for example, and a fluorophore for the candidate agents.

In a preferred embodiment, the binding of the candidate bioactive agent is determined
through the use of competitive binding assays. In this embodiment, the competitor is a
binding moiety known to bind to the target molecule (i.e. CA protein), such as an antibody,
peptide, binding partner, ligand, etc. Under certain circumstances, there may be competitive
binding as between the bioactive agent and the binding moiety, with the binding moiety
displacing the bioactive agent.

In one embodiment, the candidate bioactive agent is labeled. Either the candidate bioactive
agent, or the competitor, or both, is added first to the protein for a time sufficient to allow
binding, if present. Incubations may be performed at any temperature which facilitates
optimal activity, typically between 4 and 40°C. Incubation periods are selected for optimum
activity, but may also be optimized to facilitate rapid high-throughput screening. Typically
between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed
away. The second component is then added, and the presence or absence of the labeled
component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the candidate bioactive
agent. Displacement of the competitor is an indication that the candidate bioactive agent is
binding to the CA protein and thus is capable of binding to, and potentially modulating, the
activity of the CA protein. In this embodiment, either component can be labeled. Thus, for
example, if the competitor is labeled, the presence of label in the wash solution indicates
displacement by the agent. Alternatively, if the candidate bioactive agent is labeled, the
presence of the label on the support indicates displacement.

In an alternative embodiment, the candidate bioactive agent is added first, with incubation and
washing, followed by the competitor. The absence of binding by the competitor may indicate
that the bioactive agent is bound to the CA protein with a higher affinity. Thus, if the
candidate bioactive agent is labeled, the presence of the label on the support, coupled with a
lack of competitor binding, may indicate that the candidate agent is capable of binding to the
CA protein.

In a preferred embodiment, the methods comprise differential screening to identify bioactive
agents that are capable of modulating the activity of the CA proteins. In this embodiment, the
methods comprise combining a CA protein and a competitor in a first sample. A second
sample comprises a candidate bioactive agent, a CA protein and a competitor. The binding of
the competitor is determined for both samples, and a change, or difference in binding
between the two samples indicates the presence of an agent capable of binding to the CA
protein and potentially modulating its activity. That is, if the binding of the competitor is
different in the second sample relative to the first sample, the agent is capable of binding to
the CA protein.
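
By way of illustration only, the comparison between the first and second samples described above can be sketched in Python. The function names, the replicate signal values and the 25% cutoff are hypothetical and are not taken from the specification; any quantitative measure of competitor binding (for example, signal from a labeled competitor) could be substituted.

```python
from statistics import mean


def percent_change_in_competitor_binding(first_sample, second_sample):
    """Percent change in competitor binding in the second sample
    (CA protein + competitor + candidate agent) relative to the first
    sample (CA protein + competitor only)."""
    baseline = mean(first_sample)
    if baseline == 0:
        raise ValueError("first sample shows no competitor binding")
    return 100.0 * (mean(second_sample) - baseline) / baseline


def indicates_binding_agent(first_sample, second_sample, min_abs_change=25.0):
    """Flag a candidate when competitor binding differs between samples.

    min_abs_change is an illustrative cutoff chosen for this sketch only.
    """
    change = percent_change_in_competitor_binding(first_sample, second_sample)
    return abs(change) >= min_abs_change


# Hypothetical replicate signals (arbitrary units).
first_sample = [1000.0, 1040.0, 960.0]   # CA protein + competitor
second_sample = [480.0, 520.0, 500.0]    # plus candidate bioactive agent

change = percent_change_in_competitor_binding(first_sample, second_sample)
print(f"change in competitor binding: {change:.1f}%")
print("candidate flagged:", indicates_binding_agent(first_sample, second_sample))
```

In this sketch either an increase or a decrease in competitor binding flags the candidate, consistent with the statement above that any difference between the two samples is informative.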

Alternatively, a preferred embodiment utilizes differential screening to identify drug candidates
that bind to the native CA protein, but cannot bind to modified CA proteins. The structure of
the CA protein may be modeled, and used in rational drug design to synthesize agents that
interact with that site. Drug candidates that affect CA bioactivity are also identified by
screening drugs for the ability to either enhance or reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably all control and
test samples are performed in at least triplicate to obtain statistically significant results.
Incubation of all samples is for a time sufficient for the binding of the agent to the protein.
Following incubation, all samples are washed free of non-specifically bound material and the
amount of bound, generally labeled agent is determined. For example, where a radiolabel is
employed, the samples may be counted in a scintillation counter to determine the amount of
bound compound.
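
As a non-limiting sketch of the replicate handling described above, the following Python fragment averages triplicate scintillation counts and subtracts the negative-control signal to estimate the amount of bound compound. The function name and the counts are illustrative assumptions, not values from the specification.

```python
from statistics import mean, stdev


def bound_signal(test_counts, negative_control_counts, min_replicates=3):
    """Return (background-corrected mean, sample standard deviation of
    the test replicates) for counts such as counts per minute."""
    if (len(test_counts) < min_replicates
            or len(negative_control_counts) < min_replicates):
        raise ValueError("each sample should be run in at least triplicate")
    corrected = mean(test_counts) - mean(negative_control_counts)
    return corrected, stdev(test_counts)


# Hypothetical triplicate counts per minute.
corrected_cpm, spread = bound_signal(
    [1200.0, 1250.0, 1150.0],   # test sample
    [200.0, 210.0, 190.0],      # negative control
)
print(corrected_cpm, spread)
```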

A variety of other reagents may be included in the screening assays. These include reagents
like salts, neutral proteins, e.g. albumin, detergents, etc., which may be used to facilitate
optimal protein-protein binding and/or reduce non-specific or background interactions. Also
reagents that otherwise improve the efficiency of the assay, such as protease inhibitors,
nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture of components
may be added in any order that provides for the requisite binding.

Screening for agents that modulate the activity of CA proteins may also be done. In a
preferred embodiment, methods for screening for a bioactive agent capable of modulating the
activity of CA proteins comprise the steps of adding a candidate bioactive agent to a sample
of CA proteins, as above, and determining an alteration in the biological activity of CA
proteins. "Modulating the activity of a CA protein" includes an increase in activity, a
decrease in activity, or a change in the type or kind of activity present. Thus, in this
embodiment, the candidate agent should both bind to CA proteins (although this may not be
necessary), and alter its biological or biochemical activity as defined herein. The methods
include both in vitro screening methods, as are generally outlined above, and in vivo
screening of cells for alterations in the presence, distribution, activity or amount of CA
proteins.

Thus, in this embodiment, the methods comprise combining a CA sample and a candidate
bioactive agent, and evaluating the effect on CA activity. By "CA activity" or grammatical
equivalents herein is meant one of the CA protein's biological activities, including, but not
limited to, its role in tumorigenesis, including cell division, preferably in lymphatic tissue, cell
proliferation, tumor growth and transformation of cells. In one embodiment, CA activity
includes activation of or by a protein encoded by a nucleic acid of Tables 1-112. An inhibitor
of CA activity inhibits any one or more CA activities.

In a preferred embodiment, the activity of the CA protein is increased; in another preferred 
embodiment, the activity of the CA protein is decreased. Thus, bioactive agents that are 
antagonists are preferred in some embodiments, and bioactive agents that are agonists may 
be preferred in other embodiments. 

In a preferred embodiment, the invention provides methods for screening for bioactive agents
capable of modulating the activity of a CA protein. The methods comprise adding a candidate
bioactive agent, as defined above, to a cell comprising CA proteins. Preferred cell types
include almost any cell. The cells contain a recombinant nucleic acid that encodes a CA
protein. In a preferred embodiment, a library of candidate agents is tested on a plurality of
cells.

In one aspect, the assays are evaluated in the presence or absence of, or with previous or
subsequent exposure to, physiological signals, for example hormones, antibodies, peptides,
antigens, cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle process.

In this way, bioactive agents are identified. Compounds with pharmacological activity are
able to enhance or interfere with the activity of the CA protein.

In one embodiment, a method of inhibiting carcinoma cancer cell division is provided. The
method comprises administration of a carcinoma cancer inhibitor.

In a preferred embodiment, a method of inhibiting lymphoma carcinoma cell division is
provided comprising administration of a lymphoma carcinoma inhibitor.

In another embodiment, a method of inhibiting tumor growth is provided. The method
comprises administration of a carcinoma cancer inhibitor. In a particularly preferred
embodiment, a method of inhibiting tumor growth in lymphatic tissue is provided comprising
administration of a lymphoma inhibitor.

In a further embodiment, methods of treating cells or individuals with cancer are provided.
The method comprises administration of a carcinoma cancer inhibitor. Preferably, the
carcinoma is a lymphoma carcinoma.

In one embodiment a carcinoma cancer inhibitor is an antibody as discussed above. In
another embodiment, the carcinoma cancer inhibitor is an antisense molecule. Antisense
molecules as used herein include antisense or sense oligonucleotides comprising a
single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA
(sense) or DNA (antisense) sequences for carcinoma cancer molecules. Antisense or sense
oligonucleotides, according to the present invention, comprise a fragment generally at least
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein,
is described in, for example, Stein and Cohen, Cancer Res. 48:2659 (1988) and van der Krol
et al., BioTechniques 6:958 (1988).
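
Purely as an illustrative sketch, candidate antisense oligonucleotides within the preferred 14 to 30 nucleotide range can be enumerated from a cDNA sequence as follows. The sketch assumes the cDNA is supplied as the sense (coding) strand written 5' to 3'; the function names and the example sequence are hypothetical, and no selection for accessibility, specificity or stability is attempted.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def candidate_antisense_oligos(sense_cdna: str, length: int = 14):
    """Yield (start, oligo) pairs, one per window of the sense strand.

    Each oligo is the reverse complement of the window beginning at
    `start`, so it can base-pair with the corresponding mRNA region.
    The 14 to 30 bound mirrors the preferred range stated in the text.
    """
    if not 14 <= length <= 30:
        raise ValueError("length outside the 14 to 30 nucleotide range")
    for start in range(len(sense_cdna) - length + 1):
        window = sense_cdna[start:start + length]
        yield start, reverse_complement(window)


example_cdna = "ATGGCCAAGCTTGACCTGAAGG"  # hypothetical 22-nucleotide sequence
oligos = list(candidate_antisense_oligos(example_cdna, length=14))
print(len(oligos), oligos[0])
```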

Antisense molecules may be introduced into a cell containing the target nucleotide sequence
by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753.
Suitable ligand binding molecules include, but are not limited to, cell surface receptors, growth
factors, other cytokines, or other ligands that bind to cell surface receptors. Preferably,
conjugation of the ligand binding molecule does not substantially interfere with the ability of
the ligand binding molecule to bind to its corresponding molecule or receptor, or block entry of
the sense or antisense oligonucleotide or its conjugated version into the cell. Alternatively, a
sense or an antisense oligonucleotide may be introduced into a cell containing the target
nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO
90/10448. It is understood that the use of antisense molecules or knock out and knock in
models may also be used in screening assays as discussed above, in addition to methods of
treatment.

The compounds having the desired pharmacological activity may be administered in a
physiologically acceptable carrier to a host, as previously described. The agents may be
administered in a variety of ways, orally, parenterally e.g., subcutaneously, intraperitoneally,
intravascularly, etc. Depending upon the manner of introduction, the compounds may be
formulated in a variety of ways. The concentration of therapeutically active compound in the
formulation may vary from about 0.1-100% wgt/vol. The agents may be administered alone
or in combination with other treatments, i.e., radiation.

The pharmaceutical compositions can be prepared in various forms, such as granules,
tablets, pills, suppositories, capsules, suspensions, salves, lotions and the like.
Pharmaceutical grade organic or inorganic carriers and/or diluents suitable for oral and topical
use can be used to make up compositions containing the therapeutically-active compounds.
Diluents known to the art include aqueous media, vegetable and animal oils and fats.
Stabilizing agents, wetting and emulsifying agents, salts for varying the osmotic pressure or
buffers for securing an adequate pH value, and skin penetration enhancers can be used as
auxiliary agents.

Without being bound by theory, it appears that the various CA sequences are important in
carcinomas. Accordingly, disorders based on mutant or variant CA genes may be
determined. In one embodiment, the invention provides methods for identifying cells
containing variant CA genes comprising determining all or part of the sequence of at least one
endogenous CA gene in a cell. As will be appreciated by those in the art, this may be done
using any number of sequencing techniques. In a preferred embodiment, the invention
provides methods of identifying the CA genotype of an individual comprising determining all or
part of the sequence of at least one CA gene of the individual. This is generally done in at
least one tissue of the individual, and may include the evaluation of a number of tissues or
different samples of the same tissue. The method may include comparing the sequence of
the sequenced CA gene to a known CA gene, i.e., a wild-type gene. As will be appreciated
by those in the art, alterations in the sequence of some oncogenes can be an indication of
either the presence of the disease, or propensity to develop the disease, or prognosis
evaluations.

The sequence of all or part of the CA gene can then be compared to the sequence of a
known CA gene to determine if any differences exist. This can be done using any number of
known homology programs, such as Bestfit, etc. In a preferred embodiment, the presence of
a difference in the sequence between the CA gene of the patient and the known CA gene is
indicative of a disease state or a propensity for a disease state, as outlined herein.
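
For illustration only, the comparison of a sequenced CA gene against a known CA gene can be sketched with the Python standard library. Here `difflib.SequenceMatcher` is used merely as a simple stand-in for a dedicated homology program such as Bestfit; it is not a biological alignment tool, and the function name and the two short sequences are hypothetical.

```python
from difflib import SequenceMatcher


def sequence_differences(known: str, sample: str):
    """List differences between a known sequence and a sample sequence.

    Each entry is (kind, position_in_known, known_segment, sample_segment),
    where kind is 'replace', 'delete' or 'insert'.
    """
    matcher = SequenceMatcher(None, known, sample, autojunk=False)
    return [
        (tag, i1, known[i1:i2], sample[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


known_gene = "ATGGCCAAGCTT"    # hypothetical fragment of a known CA gene
patient_gene = "ATGGCCTAGCTT"  # hypothetical patient fragment, one substitution

differences = sequence_differences(known_gene, patient_gene)
print(differences)
```

An empty list corresponds to no detected difference; a non-empty list would, under the embodiment above, prompt further evaluation of the variant.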
In a preferred embodiment the CA genes are used as probes to determine the number of
copies of the CA gene in the genome. For example, some cancers exhibit chromosomal
deletions or insertions, resulting in an alteration in the copy number of a gene.

In another preferred embodiment CA genes are used as probes to determine the
chromosomal location of the CA genes. Information such as chromosomal location finds use
in providing a diagnosis or prognosis in particular when chromosomal abnormalities such as
translocations, and the like are identified in CA gene loci.

Thus, in one embodiment, methods of modulating CA in cells or organisms are provided. In
one embodiment, the methods comprise administering to a cell an anti-CA antibody that
reduces or eliminates the biological activity of an endogenous CA protein. Alternatively, the
methods comprise administering to a cell or organism a recombinant nucleic acid encoding a
CA protein. As will be appreciated by those in the art, this may be accomplished in any
number of ways. In a preferred embodiment, for example when the CA sequence is
down-regulated in carcinoma, the activity of the CA gene is increased by increasing the amount of
CA in the cell, for example by overexpressing the endogenous CA or by administering a gene
encoding the CA sequence, using known gene-therapy techniques, for example. In a
preferred embodiment, the gene therapy techniques include the incorporation of the
exogenous gene using enhanced homologous recombination (EHR), for example as
described in PCT/US93/03868, hereby incorporated by reference in its entirety. Alternatively,
for example when the CA sequence is up-regulated in carcinoma, the activity of the
endogenous CA gene is decreased, for example by the administration of a CA antisense
nucleic acid.

In one embodiment, the CA proteins of the present invention may be used to generate
polyclonal and monoclonal antibodies to CA proteins, which are useful as described herein.
Similarly, the CA proteins can be coupled, using standard technology, to affinity
chromatography columns. These columns may then be used to purify CA antibodies. In a
preferred embodiment, the antibodies are generated to epitopes unique to a CA protein; that
is, the antibodies show little or no cross-reactivity to other proteins. These antibodies find use
in a number of applications. For example, the CA antibodies may be coupled to standard
affinity chromatography columns and used to purify CA proteins. The antibodies may also be
used as blocking polypeptides, as outlined above, since they will specifically bind to the CA
protein.

In one embodiment, a therapeutically effective dose of a CA or modulator thereof is
administered to a patient. By "therapeutically effective dose" herein is meant a dose that
produces the effects for which it is administered. The exact dose will depend on the purpose
of the treatment, and will be ascertainable by one skilled in the art using known techniques.
As is known in the art, adjustments for CA degradation, systemic versus localized delivery,
and rate of new protease synthesis, as well as the age, body weight, general health, sex, diet,
time of administration, drug interaction and the severity of the condition may be necessary,
and will be ascertainable with routine experimentation by those skilled in the art.

A "patient" for the purposes of the present invention includes both humans and other animals,
particularly mammals, and organisms. Thus the methods are applicable to both human
therapy and veterinary applications. In the preferred embodiment the patient is a mammal,
and in the most preferred embodiment the patient is human.

The administration of the CA proteins and modulators of the present invention can be done in
a variety of ways as discussed above, including, but not limited to, orally, subcutaneously,
intravenously, intranasally, transdermally, intraperitoneally, intramuscularly, intrapulmonary,
vaginally, rectally, or intraocularly. In some instances, for example, in the treatment of
wounds and inflammation, the CA proteins and modulators may be directly applied as a
solution or spray.

The pharmaceutical compositions of the present invention comprise a CA protein in a form
suitable for administration to a patient. In the preferred embodiment, the pharmaceutical
compositions are in a water soluble form, such as being present as pharmaceutically
acceptable salts, which is meant to include both acid and base addition salts.
"Pharmaceutically acceptable acid addition salt" refers to those salts that retain the biological
effectiveness of the free bases and that are not biologically or otherwise undesirable, formed
with inorganic acids such as hydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid,
phosphoric acid and the like, and organic acids such as acetic acid, propionic acid, glycolic
acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic acid, fumaric acid, tartaric
acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, methanesulfonic acid,
ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. "Pharmaceutically
acceptable base addition salts" include those derived from inorganic bases such as sodium,
potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, manganese,
aluminum salts and the like. Particularly preferred are the ammonium, potassium, sodium,
calcium, and magnesium salts. Salts derived from pharmaceutically acceptable organic
non-toxic bases include salts of primary, secondary, and tertiary amines, substituted amines
including naturally occurring substituted amines, cyclic amines and basic ion exchange resins,
such as isopropylamine, trimethylamine, diethylamine, triethylamine, tripropylamine, and
ethanolamine.

The pharmaceutical compositions may also include one or more of the following: carrier
proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, lactose,
corn and other starches; binding agents; sweeteners and other flavoring agents; coloring
agents; and polyethylene glycol. Additives are well known in the art, and are used in a variety
of formulations.

In a preferred embodiment, CA proteins and modulators are administered as therapeutic
agents, and can be formulated as outlined above. Similarly, CA genes (including both the
full-length sequence, partial sequences, or regulatory sequences of the CA coding regions) can
be administered in gene therapy applications, as is known in the art. These CA genes can
include antisense applications, either as gene therapy (i.e. for incorporation into the genome)
or as antisense compositions, as will be appreciated by those in the art.

In a preferred embodiment, CA genes are administered as DNA vaccines, either single genes
or combinations of CA genes. Naked DNA vaccines are generally known in the art; see Brower,
Nature Biotechnology, 16:1304-1305 (1998).

In one embodiment, CA genes of the present invention are used as DNA vaccines. Methods
for the use of genes as DNA vaccines are well known to one of ordinary skill in the art, and
include placing a CA gene or portion of a CA gene under the control of a promoter for
expression in a patient with carcinoma. The CA gene used for DNA vaccines can encode
full-length CA proteins, but more preferably encodes portions of the CA proteins including
peptides derived from the CA protein. In a preferred embodiment a patient is immunized with
a DNA vaccine comprising a plurality of nucleotide sequences derived from a CA gene.
Similarly, it is possible to immunize a patient with a plurality of CA genes or portions thereof
as defined herein. Without being bound by theory, following expression of the polypeptide
encoded by the DNA vaccine, cytotoxic T-cells, helper T-cells and antibodies are induced which
recognize and destroy or eliminate cells expressing CA proteins.

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant molecule
with the DNA vaccine. Such adjuvant molecules include cytokines that increase the
immunogenic response to the CA polypeptide encoded by the DNA vaccine. Additional or
alternative adjuvants are known to those of ordinary skill in the art and find use in the
invention.

In another preferred embodiment CA genes find use in generating animal models of
carcinomas, particularly lymphoma carcinomas. As is appreciated by one of ordinary skill in
the art, when the CA gene identified is repressed or diminished in CA tissue, gene therapy
technology in which antisense RNA is directed to the CA gene will also diminish or repress
expression of the gene. An animal generated as such serves as an animal model of CA that
finds use in screening bioactive drug candidates. Similarly, gene knockout technology, for
example as a result of homologous recombination with an appropriate gene targeting vector,
will result in the absence of the CA protein. When desired, tissue-specific expression or
knockout of the CA protein may be necessary.

It is also possible that the CA protein is overexpressed in carcinoma. As such, transgenic
animals can be generated that overexpress the CA protein. Depending on the desired
expression level, promoters of various strengths can be employed to express the transgene.
Also, the number of copies of the integrated transgene can be determined and compared for
a determination of the expression level of the transgene. Animals generated by such
methods find use as animal models of CA and are additionally useful in screening for
bioactive molecules to treat carcinoma.

The CA nucleic acid sequences of the invention are depicted in Tables 1-112. The
sequences in Tables 1 and 2 depict mouse tags, i.e. the genomic insertion sites. The
sequences in Tables 3-102 include genomic sequence, mRNA and coding sequences for
both mouse and human. N/A indicates a gene that has been identified, but for which there
has not been a name ascribed. The different sequences are assigned the following SEQ ID
Nos:

Table 3 (mouse gene: Fscn1; human gene: SNL)
Mouse genomic sequence (SEQ ID NO: 1)
Mouse mRNA sequence (SEQ ID NO: 2)
Mouse coding sequence (SEQ ID NO: 3)
Human genomic sequence (SEQ ID NO: 4)
Human mRNA sequence (SEQ ID NO: 5)
Human coding sequence (SEQ ID NO: 6)

Table 4 (mouse gene: Map3k6; human gene: MAP3K6)
Mouse genomic sequence (SEQ ID NO: 7)
Mouse mRNA sequence (SEQ ID NO: 8)
Mouse coding sequence (SEQ ID NO: 9)
Human genomic sequence (SEQ ID NO: 10)
Human mRNA sequence (SEQ ID NO: 11)
Human coding sequence (SEQ ID NO: 12)

Table 5 (mouse gene: Fosb; human gene: FOSB)
Mouse genomic sequence (SEQ ID NO: 13)
Mouse mRNA sequence (SEQ ID NO: 14)
Mouse coding sequence (SEQ ID NO: 15)
Human genomic sequence (SEQ ID NO: 16)
Human mRNA sequence (SEQ ID NO: 17)
Human coding sequence (SEQ ID NO: 18)

Table 6 (mouse gene: Cmkbr7; human gene: CCR7)
Mouse genomic sequence (SEQ ID NO: 19)
Mouse mRNA sequence (SEQ ID NO: 20)
Mouse coding sequence (SEQ ID NO: 21)
Human genomic sequence (SEQ ID NO: 22)
Human mRNA sequence (SEQ ID NO: 23)
Human coding sequence (SEQ ID NO: 24)

Table 7 (mouse gene: Ccnd1; human gene: CCND1)
Mouse genomic sequence (SEQ ID NO: 25)
Mouse mRNA sequence (SEQ ID NO: 26)
Mouse coding sequence (SEQ ID NO: 27)
Human genomic sequence (SEQ ID NO: 28)
Human mRNA sequence (SEQ ID NO: 29)
Human coding sequence (SEQ ID NO: 30)

Table 8 (mouse gene: Ccnd3; human gene: CCND3)
Mouse genomic sequence (SEQ ID NO: 31)
Mouse mRNA sequence (SEQ ID NO: 32)
Mouse coding sequence (SEQ ID NO: 33)
Human genomic sequence (SEQ ID NO: 34)
Human mRNA sequence (SEQ ID NO: 35)
Human coding sequence (SEQ ID NO: 36)

Table 9 (mouse gene: Wnt3; human gene: WNT3)
Mouse genomic sequence (SEQ ID NO: 37)
Mouse mRNA sequence (SEQ ID NO: 38)
Mouse coding sequence (SEQ ID NO: 39)
Human genomic sequence (SEQ ID NO: 40)
Human mRNA sequence (SEQ ID NO: 41)
Human coding sequence (SEQ ID NO: 42)

Table 10 (mouse gene: Batf; human gene: BATF)
Mouse genomic sequence (SEQ ID NO: 43)
Mouse mRNA sequence (SEQ ID NO: 44)
Mouse coding sequence (SEQ ID NO: 45)
Human genomic sequence (SEQ ID NO: 46)
Human mRNA sequence (SEQ ID NO: 47)
Human coding sequence (SEQ ID NO: 48)

Table 11 (mouse gene: Irf4; human gene: IRF4)
Mouse genomic sequence (SEQ ID NO: 49)
Mouse mRNA sequence (SEQ ID NO: 50)
Mouse coding sequence (SEQ ID NO: 51)
Human genomic sequence (SEQ ID NO: 52)
Human mRNA sequence (SEQ ID NO: 53)
Human coding sequence (SEQ ID NO: 54)

Table 12 (mouse gene: Notch1; human gene: NOTCH1)
Mouse genomic sequence (SEQ ID NO: 55)
Mouse mRNA sequence (SEQ ID NO: 56)
Mouse coding sequence (SEQ ID NO: 57)
Human genomic sequence (SEQ ID NO: 58)
Human mRNA sequence (SEQ ID NO: 59)
Human coding sequence (SEQ ID NO: 60)

Table 13 (mouse gene: Myc; human gene: MYC)
Mouse genomic sequence (SEQ ID NO: 61)
Mouse mRNA sequence (SEQ ID NO: 62)
Mouse coding sequence (SEQ ID NO: 63)
Human genomic sequence (SEQ ID NO: 64)
Human mRNA sequence (SEQ ID NO: 65)
Human coding sequence (SEQ ID NO: 66)

Table 14 (mouse gene: Bach2; human gene: BACH2)
Mouse genomic sequence (SEQ ID NO: 67)
Mouse mRNA sequence (SEQ ID NO: 68)
Mouse coding sequence (SEQ ID NO: 69)
Human genomic sequence (SEQ ID NO: 70)
Human mRNA sequence (SEQ ID NO: 71)
Human coding sequence (SEQ ID NO: 72)

Table 15 (mouse gene: Wnt1; human gene: WNT1)
Mouse genomic sequence (SEQ ID NO: 73)
Mouse mRNA sequence (SEQ ID NO: 74)
Mouse coding sequence (SEQ ID NO: 75)
Human genomic sequence (SEQ ID NO: 76)
Human mRNA sequence (SEQ ID NO: 77)
Human coding sequence (SEQ ID NO: 78)

Table 16 (mouse gene: Rasgrp1; human gene: RASGRP1)
Mouse genomic sequence (SEQ ID NO: 79)
Mouse mRNA sequence (SEQ ID NO: 80)
Mouse coding sequence (SEQ ID NO: 81)
Human genomic sequence (SEQ ID NO: 82)
Human mRNA sequence (SEQ ID NO: 83)
Human coding sequence (SEQ ID NO: 84)

Table 17 (mouse gene: Nmyc1; human gene: MYCN)
Mouse genomic sequence (SEQ ID NO: 85)
Mouse mRNA sequence (SEQ ID NO: 86)
Mouse coding sequence (SEQ ID NO: 87)
Human genomic sequence (SEQ ID NO: 88)
Human mRNA sequence (SEQ ID NO: 89)
Human coding sequence (SEQ ID NO: 90)

Table 18 (mouse gene: Myb; human gene: MYB)
Mouse genomic sequence (SEQ ID NO: 91)
Mouse mRNA sequence (SEQ ID NO: 92)
Mouse coding sequence (SEQ ID NO: 93)
Human genomic sequence (SEQ ID NO: 94)
Human mRNA sequence (SEQ ID NO: 95)
Human coding sequence (SEQ ID NO: 96)

Table 19 (mouse gene: Sox4; human gene: SOX4)
Mouse genomic sequence (SEQ ID NO: 97)
Mouse mRNA sequence (SEQ ID NO: 98)
Mouse coding sequence (SEQ ID NO: 99)
Human genomic sequence (SEQ ID NO: 100)
Human mRNA sequence (SEQ ID NO: 101)
Human coding sequence (SEQ ID NO: 102)

Table 20 (mouse gene: Tcof1; human gene: TCOF1)
Mouse genomic sequence (SEQ ID NO: 103)
Mouse mRNA sequence (SEQ ID NO: 104)
Mouse coding sequence (SEQ ID NO: 105)
Human genomic sequence (SEQ ID NO: 106)
Human mRNA sequence (SEQ ID NO: 107)
Human coding sequence (SEQ ID NO: 108)

Table 21 (mouse gene: Pim1; human gene: PIM1)
Mouse genomic sequence (SEQ ID NO: 109)
Mouse mRNA sequence (SEQ ID NO: 110)
Mouse coding sequence (SEQ ID NO: 111)
Human genomic sequence (SEQ ID NO: 112)
Human mRNA sequence (SEQ ID NO: 113)
Human coding sequence (SEQ ID NO: 114)

Table 22 (mouse gene: Wnt3a; human gene: WNT3A) 
Mouse genomic sequence (SEQ ID NO: 115) 
Mouse mRNA sequence (SEQ ID NO: 116) 
Mouse coding sequence (SEQ ID NO: 1 17) 
20 Human genomic sequence (SEQ ID NO: 118) 
Human mRNA sequence (SEQ ID NO: 119) 
Human coding sequence (SEQ ID NO: 120) 

Table 23 (mouse gene: Ly6e; human gene: LY6E)
Mouse genomic sequence (SEQ ID NO: 121)
Mouse mRNA sequence (SEQ ID NO: 122) 
Mouse coding sequence (SEQ ID NO: 123) 
Human genomic sequence (SEQ ID NO: 124) 
Human mRNA sequence (SEQ ID NO: 125) 
Human coding sequence (SEQ ID NO: 126)

Table 24 (mouse gene: Rasa2; human gene RASA2) 
Mouse genomic sequence (SEQ ID NO: 127) 
Mouse mRNA sequence (SEQ ID NO: 128) 
Mouse coding sequence (SEQ ID NO: 129)
Human genomic sequence (SEQ ID NO: 130) 
Human mRNA sequence (SEQ ID NO: 131) 
Human coding sequence (SEQ ID NO: 132) 

Table 25 (mouse gene: Gata1; human gene: GATA1)
Mouse genomic sequence (SEQ ID NO: 133) 
Mouse mRNA sequence (SEQ ID NO: 134) 
Mouse coding sequence (SEQ ID NO: 135) 
Human genomic sequence (SEQ ID NO: 136)
Human mRNA sequence (SEQ ID NO: 137) 
Human coding sequence (SEQ ID NO: 138) 

Table 26 (mouse gene: Fkbp5; human gene FKBP5) 
Mouse genomic sequence (SEQ ID NO: 139) 
Mouse mRNA sequence (SEQ ID NO: 140) 
Mouse coding sequence (SEQ ID NO: 141) 
Human genomic sequence (SEQ ID NO: 142) 
Human mRNA sequence (SEQ ID NO: 143) 
Human coding sequence (SEQ ID NO: 144) 

Table 27 (mouse gene: Rel; human gene REL) 
Mouse genomic sequence (SEQ ID NO: 145) 
Mouse mRNA sequence (SEQ ID NO: 146) 
Mouse coding sequence (SEQ ID NO: 147)
Human genomic sequence (SEQ ID NO: 148) 
Human mRNA sequence (SEQ ID NO: 149) 
Human coding sequence (SEQ ID NO: 150) 

Table 28 (mouse gene: Icsbp; human gene ICSBP1) 
Mouse genomic sequence (SEQ ID NO: 151)
Mouse mRNA sequence (SEQ ID NO: 152) 
Mouse coding sequence (SEQ ID NO: 153) 
Human genomic sequence (SEQ ID NO: 154) 
Human mRNA sequence (SEQ ID NO: 155) 
Human coding sequence (SEQ ID NO: 156) 

Table 29 (mouse gene: Bmi1; human gene BMI1) 
Mouse genomic sequence (SEQ ID NO: 157) 
Mouse mRNA sequence (SEQ ID NO: 158)
Mouse coding sequence (SEQ ID NO: 159) 
Human genomic sequence (SEQ ID NO: 160) 
Human mRNA sequence (SEQ ID NO: 161) 
Human coding sequence (SEQ ID NO: 162) 

Table 30 (mouse gene: Runx1; human gene: RUNX1)
Mouse genomic sequence (SEQ ID NO: 163) 
Mouse mRNA sequence (SEQ ID NO: 164) 
Mouse coding sequence (SEQ ID NO: 165)
Human genomic sequence (SEQ ID NO: 166) 
Human mRNA sequence (SEQ ID NO: 167) 
Human coding sequence (SEQ ID NO: 168) 



Table 31 (mouse gene: Il2ra; human gene: IL2RA)
Mouse genomic sequence (SEQ ID NO: 169) 
Mouse mRNA sequence (SEQ ID NO: 170) 
Mouse coding sequence (SEQ ID NO: 171) 
Human genomic sequence (SEQ ID NO: 172)
Human mRNA sequence (SEQ ID NO: 173)
Human coding sequence (SEQ ID NO: 174) 



Table 32 (mouse gene: Nfkb1; human gene: NFKB1)
Mouse genomic sequence (SEQ ID NO: 175)
Mouse mRNA sequence (SEQ ID NO: 176)
Mouse coding sequence (SEQ ID NO: 177) 
Human genomic sequence (SEQ ID NO: 178) 
Human mRNA sequence (SEQ ID NO: 179) 
Human coding sequence (SEQ ID NO: 180) 

Table 33 (mouse gene: Fyn; human gene FYN) 
Mouse genomic sequence (SEQ ID NO: 181) 
Mouse mRNA sequence (SEQ ID NO: 182) 
Mouse coding sequence (SEQ ID NO: 183) 
Human genomic sequence (SEQ ID NO: 184)
Human mRNA sequence (SEQ ID NO: 185) 
Human coding sequence (SEQ ID NO: 186) 



Table 34 (mouse gene: Nfkbil1; human gene: NFKBIL1)
Mouse genomic sequence (SEQ ID NO: 187)
Mouse mRNA sequence (SEQ ID NO: 188) 
Mouse coding sequence (SEQ ID NO: 189) 
Human genomic sequence (SEQ ID NO: 190) 
Human mRNA sequence (SEQ ID NO: 191) 
Human coding sequence (SEQ ID NO: 192)

Table 35 (mouse gene: Flt3; human gene FLT3) 
Mouse genomic sequence (SEQ ID NO: 193) 
Mouse mRNA sequence (SEQ ID NO: 194)
Mouse coding sequence (SEQ ID NO: 195) 
Human genomic sequence (SEQ ID NO: 196) 
Human mRNA sequence (SEQ ID NO: 197) 
Human coding sequence (SEQ ID NO: 198) 

Table 36 (mouse gene: Dntt; human gene DNTT) 
Mouse genomic sequence (SEQ ID NO: 199) 
Mouse mRNA sequence (SEQ ID NO: 200) 
Mouse coding sequence (SEQ ID NO: 201) 
Human genomic sequence (SEQ ID NO: 202)
Human mRNA sequence (SEQ ID NO: 203) 
Human coding sequence (SEQ ID NO: 204) 

Table 37 (mouse gene: Znfn1a1; human gene ZNFN1A1) 
Mouse genomic sequence (SEQ ID NO: 205)
Mouse mRNA sequence (SEQ ID NO: 206)
Mouse coding sequence (SEQ ID NO: 207)
Human genomic sequence (SEQ ID NO: 208)
Human mRNA sequence (SEQ ID NO: 209)
Human coding sequence (SEQ ID NO: 210)

Table 38 (mouse gene: Tbx21; human gene: TBX21)
Mouse genomic sequence (SEQ ID NO: 211) 
Mouse mRNA sequence (SEQ ID NO: 212) 
Mouse coding sequence (SEQ ID NO: 213)
Human genomic sequence (SEQ ID NO: 214) 
Human mRNA sequence (SEQ ID NO: 215) 
Human coding sequence (SEQ ID NO: 216) 

Table 39 (mouse gene: Stat5b; human gene: STAT5B)
Mouse genomic sequence (SEQ ID NO: 217) 
Mouse mRNA sequence (SEQ ID NO: 218) 
Mouse coding sequence (SEQ ID NO: 219) 
Human genomic sequence (SEQ ID NO: 220) 
Human mRNA sequence (SEQ ID NO: 221)
Human coding sequence (SEQ ID NO: 222) 

Table 40 (mouse gene: Sema4d; human gene SEMA4D) 
Mouse genomic sequence (SEQ ID NO: 223)
Mouse mRNA sequence (SEQ ID NO: 224) 
Mouse coding sequence (SEQ ID NO: 225) 
Human genomic sequence (SEQ ID NO: 226) 
Human mRNA sequence (SEQ ID NO: 227) 
Human coding sequence (SEQ ID NO: 228)

Table 41 (mouse gene: Mdm2; human gene MDM2) 
Mouse genomic sequence (SEQ ID NO: 229) 
Mouse mRNA sequence (SEQ ID NO: 230) 
Mouse coding sequence (SEQ ID NO: 231)
Human genomic sequence (SEQ ID NO: 232) 
Human mRNA sequence (SEQ ID NO: 233) 
Human coding sequence (SEQ ID NO: 234) 

Table 42 (mouse gene: Prlr; human gene: PRLR)
Mouse genomic sequence (SEQ ID NO: 235) 
Mouse mRNA sequence (SEQ ID NO: 236) 
Mouse coding sequence (SEQ ID NO: 237) 
Human genomic sequence (SEQ ID NO: 238)
Human mRNA sequence (SEQ ID NO: 239)
Human coding sequence (SEQ ID NO: 240) 

Table 43 (mouse gene: Top1; human gene: TOP1)
Mouse genomic sequence (SEQ ID NO: 241)
Mouse mRNA sequence (SEQ ID NO: 242)
Mouse coding sequence (SEQ ID NO: 243) 
Human genomic sequence (SEQ ID NO: 244) 
Human mRNA sequence (SEQ ID NO: 245) 
Human coding sequence (SEQ ID NO: 246) 

Table 44 (mouse gene: Dusp10; human gene DUSP10) 
Mouse genomic sequence (SEQ ID NO: 247) 
Mouse mRNA sequence (SEQ ID NO: 248) 
Mouse coding sequence (SEQ ID NO: 249) 
Human genomic sequence (SEQ ID NO: 250)
Human mRNA sequence (SEQ ID NO: 251)
Human coding sequence (SEQ ID NO: 252) 

Table 45 (mouse gene: Flii; human gene: FLII)
Mouse genomic sequence (SEQ ID NO: 253) 
Mouse mRNA sequence (SEQ ID NO: 254) 
Mouse coding sequence (SEQ ID NO: 255) 
Human genomic sequence (SEQ ID NO: 256) 
Human mRNA sequence (SEQ ID NO: 257)
Human coding sequence (SEQ ID NO: 258) 

Table 46 (mouse gene: Tk2; human gene TK2) 
Mouse genomic sequence (SEQ ID NO: 259) 
Mouse mRNA sequence (SEQ ID NO: 260)
Mouse coding sequence (SEQ ID NO: 261) 
Human genomic sequence (SEQ ID NO: 262) 
Human mRNA sequence (SEQ ID NO: 263) 
Human coding sequence (SEQ ID NO: 264) 

Table 47 (mouse gene: Nupr1)
Mouse genomic sequence (SEQ ID NO: 265) 
Mouse mRNA sequence (SEQ ID NO: 266) 
Mouse coding sequence (SEQ ID NO: 267) 
Human genomic sequence (SEQ ID NO: 268)
Human mRNA sequence (SEQ ID NO: 269) 
Human coding sequence (SEQ ID NO: 270) 

Table 48 (mouse gene: Zfhx1b; human gene: ZFHX1B)
Mouse genomic sequence (SEQ ID NO: 271)
Mouse mRNA sequence (SEQ ID NO: 272) 
Mouse coding sequence (SEQ ID NO: 273) 
Human genomic sequence (SEQ ID NO: 274) 
Human mRNA sequence (SEQ ID NO: 275) 
Human coding sequence (SEQ ID NO: 276)

Table 49 (mouse gene: Vdac1; human gene: VDAC1)
Mouse genomic sequence (SEQ ID NO: 277) 
Mouse mRNA sequence (SEQ ID NO: 278) 
Mouse coding sequence (SEQ ID NO: 279)
Human genomic sequence (SEQ ID NO: 280) 
Human mRNA sequence (SEQ ID NO: 281) 
Human coding sequence (SEQ ID NO: 282) 

Table 50 (mouse gene: Nfatc1; human gene: NFATC1)
Mouse genomic sequence (SEQ ID NO: 283) 
Mouse mRNA sequence (SEQ ID NO: 284) 
Mouse coding sequence (SEQ ID NO: 285) 
Human genomic sequence (SEQ ID NO: 286)
Human mRNA sequence (SEQ ID NO: 287) 
Human coding sequence (SEQ ID NO: 288) 

Table 51 (mouse gene: Syk; human gene SYK) 
Mouse genomic sequence (SEQ ID NO: 289)
Mouse mRNA sequence (SEQ ID NO: 290) 
Mouse coding sequence (SEQ ID NO: 291) 
Human genomic sequence (SEQ ID NO: 292) 
Human mRNA sequence (SEQ ID NO: 293) 
Human coding sequence (SEQ ID NO: 294)

Table 52 (mouse gene: Gnb1; human gene: GNB1)
Mouse genomic sequence (SEQ ID NO: 295)
Mouse mRNA sequence (SEQ ID NO: 296)
Mouse coding sequence (SEQ ID NO: 297)
Human genomic sequence (SEQ ID NO: 298)
Human mRNA sequence (SEQ ID NO: 299)
Human coding sequence (SEQ ID NO: 300)

Table 53 (mouse gene: Ccnd2; human gene: CCND2)
Mouse genomic sequence (SEQ ID NO: 301) 
Mouse mRNA sequence (SEQ ID NO: 302) 
Mouse coding sequence (SEQ ID NO: 303) 
Human genomic sequence (SEQ ID NO: 304)
Human mRNA sequence (SEQ ID NO: 305)
Human coding sequence (SEQ ID NO: 306) 

Table 54 (mouse gene: Tnfrsf6; human gene: TNFRSF6)
Mouse genomic sequence (SEQ ID NO: 307)
Mouse mRNA sequence (SEQ ID NO: 308)
Mouse coding sequence (SEQ ID NO: 309) 
Human genomic sequence (SEQ ID NO: 310) 
Human mRNA sequence (SEQ ID NO: 311) 
Human coding sequence (SEQ ID NO: 312)

Table 55 (mouse gene: Irf2; human gene: IRF2)
Mouse genomic sequence (SEQ ID NO: 313) 
Mouse mRNA sequence (SEQ ID NO: 314) 
Mouse coding sequence (SEQ ID NO: 315)
Human genomic sequence (SEQ ID NO: 316) 
Human mRNA sequence (SEQ ID NO: 317) 
Human coding sequence (SEQ ID NO: 318) 

Table 56 (mouse gene: Morf; human gene: MORF)
Mouse genomic sequence (SEQ ID NO: 319) 
Mouse mRNA sequence (SEQ ID NO: 320) 
Mouse coding sequence (SEQ ID NO: 321) 
Human genomic sequence (SEQ ID NO: 322)
Human mRNA sequence (SEQ ID NO: 323)
Human coding sequence (SEQ ID NO: 324) 

Table 57 (mouse gene: Runx3; human gene: RUNX3) 
Mouse genomic sequence (SEQ ID NO: 325) 
Mouse mRNA sequence (SEQ ID NO: 326)
Mouse coding sequence (SEQ ID NO: 327) 
Human genomic sequence (SEQ ID NO: 328) 
Human mRNA sequence (SEQ ID NO: 329)
Human coding sequence (SEQ ID NO: 330) 

Table 58 (mouse gene: Bcl11b; human gene: BCL11B) 
Mouse genomic sequence (SEQ ID NO: 331) 
Mouse mRNA sequence (SEQ ID NO: 332) 
Mouse coding sequence (SEQ ID NO: 333) 
Human genomic sequence (SEQ ID NO: 334)
Human mRNA sequence (SEQ ID NO: 335) 
Human coding sequence (SEQ ID NO: 336) 

Table 59 (mouse gene: Arhgef1; human gene: ARHGEF1)
Mouse genomic sequence (SEQ ID NO: 337)
Mouse mRNA sequence (SEQ ID NO: 338) 
Mouse coding sequence (SEQ ID NO: 339) 
Human genomic sequence (SEQ ID NO: 340) 
Human mRNA sequence (SEQ ID NO: 341)
Human coding sequence (SEQ ID NO: 342) 

Table 60 (mouse gene: Ptprk; human gene: PTPRK) 
Mouse genomic sequence (SEQ ID NO: 343) 
Mouse mRNA sequence (SEQ ID NO: 344)
Mouse coding sequence (SEQ ID NO: 345) 
Human genomic sequence (SEQ ID NO: 346)
Human mRNA sequence (SEQ ID NO: 347) 
Human coding sequence (SEQ ID NO: 348) 

Table 61 (mouse gene: Mcmd5; human gene: MCM5)
Mouse genomic sequence (SEQ ID NO: 349) 
Mouse mRNA sequence (SEQ ID NO: 350) 
Mouse coding sequence (SEQ ID NO: 351) 
Human genomic sequence (SEQ ID NO: 352)
Human mRNA sequence (SEQ ID NO: 353) 
Human coding sequence (SEQ ID NO: 354) 

Table 62 (mouse gene: Matn4; human gene: MATN4) 
Mouse genomic sequence (SEQ ID NO: 355)
Mouse mRNA sequence (SEQ ID NO: 356) 
Mouse coding sequence (SEQ ID NO: 357) 
Human genomic sequence (SEQ ID NO: 358) 
Human mRNA sequence (SEQ ID NO: 359) 
Human coding sequence (SEQ ID NO: 360)

Table 63 (mouse gene: Tnfsf11; human gene: TNFSF11)
Mouse genomic sequence (SEQ ID NO: 361) 
Mouse mRNA sequence (SEQ ID NO: 362) 
Mouse coding sequence (SEQ ID NO: 363)
Human genomic sequence (SEQ ID NO: 364) 
Human mRNA sequence (SEQ ID NO: 365) 
Human coding sequence (SEQ ID NO: 366) 

Table 64 (mouse gene: Itk; human gene: ITK)
Mouse genomic sequence (SEQ ID NO: 367) 
Mouse mRNA sequence (SEQ ID NO: 368) 
Mouse coding sequence (SEQ ID NO: 369) 
Human genomic sequence (SEQ ID NO: 370)
Human mRNA sequence (SEQ ID NO: 371) 
Human coding sequence (SEQ ID NO: 372) 

Table 65 (mouse gene: Fish; human gene: N/A) 
Mouse genomic sequence (SEQ ID NO: 373)
Mouse mRNA sequence (SEQ ID NO: 374) 
Mouse coding sequence (SEQ ID NO: 375) 
Human genomic sequence (SEQ ID NO: 376) 
Human mRNA sequence (SEQ ID NO: 377) 
Human coding sequence (SEQ ID NO: 378)

Table 66 (mouse gene: Egr2; human gene EGR2) 
Mouse genomic sequence (SEQ ID NO: 379) 
Mouse mRNA sequence (SEQ ID NO: 380) 
Mouse coding sequence (SEQ ID NO: 381)
Human genomic sequence (SEQ ID NO: 382) 
Human mRNA sequence (SEQ ID NO: 383) 
Human coding sequence (SEQ ID NO: 384) 

Table 67 (mouse gene: Sos1; human gene: SOS1)
Mouse genomic sequence (SEQ ID NO: 385) 
Mouse mRNA sequence (SEQ ID NO: 386) 
Mouse coding sequence (SEQ ID NO: 387) 
Human genomic sequence (SEQ ID NO: 388)
Human mRNA sequence (SEQ ID NO: 389)
Human coding sequence (SEQ ID NO: 390) 

Table 68 (mouse gene: Pou2af1; human gene POU2AF1) 
Mouse genomic sequence (SEQ ID NO: 391) 
Mouse mRNA sequence (SEQ ID NO: 392)
Mouse coding sequence (SEQ ID NO: 393) 
Human genomic sequence (SEQ ID NO: 394) 
Human mRNA sequence (SEQ ID NO: 395) 
Human coding sequence (SEQ ID NO: 396) 

Table 69 (mouse gene: Mef2c; human gene: MEF2C)
Mouse genomic sequence (SEQ ID NO: 397) 
Mouse mRNA sequence (SEQ ID NO: 398) 
Mouse coding sequence (SEQ ID NO: 399)
Human genomic sequence (SEQ ID NO: 400) 
Human mRNA sequence (SEQ ID NO: 401) 
Human coding sequence (SEQ ID NO: 402) 

Table 70 (mouse gene: Map3k8; human gene: MAP3K8)
Mouse genomic sequence (SEQ ID NO: 403) 
Mouse mRNA sequence (SEQ ID NO: 404) 
Mouse coding sequence (SEQ ID NO: 405) 
Human genomic sequence (SEQ ID NO: 406)
Human mRNA sequence (SEQ ID NO: 407)
Human coding sequence (SEQ ID NO: 408) 

Table 71 (mouse gene: Fgfr3; human gene FGFR3) 
Mouse genomic sequence (SEQ ID NO: 409) 
Mouse mRNA sequence (SEQ ID NO: 410)
Mouse coding sequence (SEQ ID NO: 411) 
Human genomic sequence (SEQ ID NO: 412) 
Human mRNA sequence (SEQ ID NO: 413) 
Human coding sequence (SEQ ID NO: 414) 

Table 72 (mouse gene: Cbx8; human gene CBX8) 
Mouse genomic sequence (SEQ ID NO: 415) 
Mouse mRNA sequence (SEQ ID NO: 416) 
Mouse coding sequence (SEQ ID NO: 417) 
Human genomic sequence (SEQ ID NO: 418)
Human mRNA sequence (SEQ ID NO: 419) 
Human coding sequence (SEQ ID NO: 420) 

Table 73 (mouse gene: Lmo2; human gene: LMO2)
Mouse genomic sequence (SEQ ID NO: 421)
Mouse mRNA sequence (SEQ ID NO: 422) 
Mouse coding sequence (SEQ ID NO: 423) 
Human genomic sequence (SEQ ID NO: 424) 
Human mRNA sequence (SEQ ID NO: 425) 
Human coding sequence (SEQ ID NO: 426)

Table 74 (mouse gene: Itpr1; human gene: ITPR1)
Mouse genomic sequence (SEQ ID NO: 427)
Mouse mRNA sequence (SEQ ID NO: 428)
Mouse coding sequence (SEQ ID NO: 429) 
Human genomic sequence (SEQ ID NO: 430) 
Human mRNA sequence (SEQ ID NO: 431) 
Human coding sequence (SEQ ID NO: 432) 

Table 75 (mouse gene: Sell; human gene SELL) 
Mouse genomic sequence (SEQ ID NO: 433) 
Mouse mRNA sequence (SEQ ID NO: 434) 
Mouse coding sequence (SEQ ID NO: 435) 
Human genomic sequence (SEQ ID NO: 436)
Human mRNA sequence (SEQ ID NO: 437) 
Human coding sequence (SEQ ID NO: 438) 

Table 76 (mouse gene: Dpt; human gene DPT) 
Mouse genomic sequence (SEQ ID NO: 439)
Mouse mRNA sequence (SEQ ID NO: 440) 
Mouse coding sequence (SEQ ID NO: 441) 
Human genomic sequence (SEQ ID NO: 442) 
Human mRNA sequence (SEQ ID NO: 443) 
Human coding sequence (SEQ ID NO: 444)

Table 77 (mouse gene: Pap; human gene PAP) 
Mouse genomic sequence (SEQ ID NO: 445) 
Mouse mRNA sequence (SEQ ID NO: 446) 
Mouse coding sequence (SEQ ID NO: 447)
Human genomic sequence (SEQ ID NO: 448) 
Human mRNA sequence (SEQ ID NO: 449) 
Human coding sequence (SEQ ID NO: 450) 

Table 78 (mouse gene: Blm; human gene: BLM)
Mouse genomic sequence (SEQ ID NO: 451) 
Mouse mRNA sequence (SEQ ID NO: 452) 
Mouse coding sequence (SEQ ID NO: 453) 
Human genomic sequence (SEQ ID NO: 454) 
Human mRNA sequence (SEQ ID NO: 455)
Human coding sequence (SEQ ID NO: 456) 

Table 79 (mouse gene: Blr1; human gene BLR1) 
Mouse genomic sequence (SEQ ID NO: 457)
Mouse mRNA sequence (SEQ ID NO: 458) 
Mouse coding sequence (SEQ ID NO: 459) 
Human genomic sequence (SEQ ID NO: 460) 
Human mRNA sequence (SEQ ID NO: 461) 
Human coding sequence (SEQ ID NO: 462)

Table 80 (mouse gene: Ptp4a2; human gene PTP4A2) 
Mouse genomic sequence (SEQ ID NO: 463) 
Mouse mRNA sequence (SEQ ID NO: 464) 
Mouse coding sequence (SEQ ID NO: 465)
Human genomic sequence (SEQ ID NO: 466) 
Human mRNA sequence (SEQ ID NO: 467) 
Human coding sequence (SEQ ID NO: 468) 

Table 81 (mouse gene: Mcm3ap; human gene: MCM3AP)
Mouse genomic sequence (SEQ ID NO: 469) 
Mouse mRNA sequence (SEQ ID NO: 470) 
Mouse coding sequence (SEQ ID NO: 471) 
Human genomic sequence (SEQ ID NO: 472)
Human mRNA sequence (SEQ ID NO: 473)
Human coding sequence (SEQ ID NO: 474) 

Table 82 (mouse gene: Jak2; human gene JAK2) 
Mouse genomic sequence (SEQ ID NO: 475) 
Mouse mRNA sequence (SEQ ID NO: 476)
Mouse coding sequence (SEQ ID NO: 477) 
Human genomic sequence (SEQ ID NO: 478) 
Human mRNA sequence (SEQ ID NO: 479) 
Human coding sequence (SEQ ID NO: 480) 

Table 83 (mouse gene: Fus1; human gene FUS1) 
Mouse genomic sequence (SEQ ID NO: 481) 
Mouse mRNA sequence (SEQ ID NO: 482) 
Mouse coding sequence (SEQ ID NO: 483) 
Human genomic sequence (SEQ ID NO: 484)
Human mRNA sequence (SEQ ID NO: 485) 
Human coding sequence (SEQ ID NO: 486) 

Table 84 (mouse gene: Rassf1; human gene: RASSF1)
Mouse genomic sequence (SEQ ID NO: 487) 
Mouse mRNA sequence (SEQ ID NO: 488) 
Mouse coding sequence (SEQ ID NO: 489) 
Human genomic sequence (SEQ ID NO: 490) 
Human mRNA sequence (SEQ ID NO: 491)
Human coding sequence (SEQ ID NO: 492) 



Table 85 (mouse gene: Pik3r1; human gene PIK3R1) 
Mouse genomic sequence (SEQ ID NO: 493) 
Mouse mRNA sequence (SEQ ID NO: 494)
Mouse coding sequence (SEQ ID NO: 495) 
Human genomic sequence (SEQ ID NO: 496) 
Human mRNA sequence (SEQ ID NO: 497) 
Human coding sequence (SEQ ID NO: 498) 

Table 86 (mouse gene: Braf; human gene BRAF) 
Mouse genomic sequence (SEQ ID NO: 499) 
Mouse mRNA sequence (SEQ ID NO: 500) 
Mouse coding sequence (SEQ ID NO: 501) 
Human genomic sequence (SEQ ID NO: 502)
Human mRNA sequence (SEQ ID NO: 503) 
Human coding sequence (SEQ ID NO: 504) 

Table 87 (mouse gene: Tle3; human gene: TLE3) 
Mouse genomic sequence (SEQ ID NO: 505)
Mouse mRNA sequence (SEQ ID NO: 506) 
Mouse coding sequence (SEQ ID NO: 507) 
Human genomic sequence (SEQ ID NO: 508) 
Human mRNA sequence (SEQ ID NO: 509) 
Human coding sequence (SEQ ID NO: 510)



Table 88 (mouse gene: Nek2; human gene NEK2) 
Mouse genomic sequence (SEQ ID NO: 511) 
Mouse mRNA sequence (SEQ ID NO: 512) 
Mouse coding sequence (SEQ ID NO: 513)
Human genomic sequence (SEQ ID NO: 514) 
Human mRNA sequence (SEQ ID NO: 515) 
Human coding sequence (SEQ ID NO: 516) 

Table 89 (mouse gene: Nr3c1; human gene NR3C1) 
Mouse genomic sequence (SEQ ID NO: 517) 
Mouse mRNA sequence (SEQ ID NO: 518) 
Mouse coding sequence (SEQ ID NO: 519) 
Human genomic sequence (SEQ ID NO: 520)
Human mRNA sequence (SEQ ID NO: 521) 
Human coding sequence (SEQ ID NO: 522) 

Table 90 (mouse gene: Dad1; human gene DAD1) 
Mouse genomic sequence (SEQ ID NO: 523)
Mouse mRNA sequence (SEQ ID NO: 524) 
Mouse coding sequence (SEQ ID NO: 525) 
Human genomic sequence (SEQ ID NO: 526) 
Human mRNA sequence (SEQ ID NO: 527) 
Human coding sequence (SEQ ID NO: 528)

Table 91 (mouse gene: Lck; human gene LCK) 
Mouse genomic sequence (SEQ ID NO: 529) 
Mouse mRNA sequence (SEQ ID NO: 530) 
Mouse coding sequence (SEQ ID NO: 531)
Human genomic sequence (SEQ ID NO: 532) 
Human mRNA sequence (SEQ ID NO: 533) 
Human coding sequence (SEQ ID NO: 534) 

Table 92 (mouse gene: Git2; human gene: GIT2)
Mouse genomic sequence (SEQ ID NO: 535) 
Mouse mRNA sequence (SEQ ID NO: 536) 
Mouse coding sequence (SEQ ID NO: 537) 
Human genomic sequence (SEQ ID NO: 538)
Human mRNA sequence (SEQ ID NO: 539)
Human coding sequence (SEQ ID NO: 540)

Table 93 (mouse gene: Anp32; human gene N/A) 
Mouse genomic sequence (SEQ ID NO: 541) 
Mouse mRNA sequence (SEQ ID NO: 542)
Mouse coding sequence (SEQ ID NO: 543) 
Human genomic sequence (SEQ ID NO: 544) 
Human mRNA sequence (SEQ ID NO: 545) 
Human coding sequence (SEQ ID NO: 546)

Table 94 (mouse gene: Map2k5; human gene MAP2K5) 
Mouse genomic sequence (SEQ ID NO: 547) 
Mouse mRNA sequence (SEQ ID NO: 548) 
Mouse coding sequence (SEQ ID NO: 549)
Human genomic sequence (SEQ ID NO: 550) 
Human mRNA sequence (SEQ ID NO: 551) 
Human coding sequence (SEQ ID NO: 552)

Table 95 (mouse gene: Cd28; human gene CD28) 
Mouse genomic sequence (SEQ ID NO: 553) 
Mouse mRNA sequence (SEQ ID NO: 554) 
Mouse coding sequence (SEQ ID NO: 555) 
Human genomic sequence (SEQ ID NO: 556) 
Human mRNA sequence (SEQ ID NO: 557)
Human coding sequence (SEQ ID NO: 558)






Table 96 (mouse gene: Sept9; human gene Msf) 
Mouse genomic sequence (SEQ ID NO: 559) 
Mouse mRNA sequence (SEQ ID NO: 560)
Mouse coding sequence (SEQ ID NO: 561) 
Human genomic sequence (SEQ ID NO: 562) 
Human mRNA sequence (SEQ ID NO: 563) 
Human coding sequence (SEQ ID NO: 564)

Table 97 (mouse gene: Fzd10; human gene FZD10) 
Mouse genomic sequence (SEQ ID NO: 565) 
Mouse mRNA sequence (SEQ ID NO: 566) 
Mouse coding sequence (SEQ ID NO: 567) 
Human genomic sequence (SEQ ID NO: 568)
Human mRNA sequence (SEQ ID NO: 569) 
Human coding sequence (SEQ ID NO: 570)

Table 98 (mouse gene: Calm2; human gene CALM2) 
Mouse genomic sequence (SEQ ID NO: 571)
Mouse mRNA sequence (SEQ ID NO: 572) 
Mouse coding sequence (SEQ ID NO: 573) 
Human genomic sequence (SEQ ID NO: 574) 
Human mRNA sequence (SEQ ID NO: 575)
Human coding sequence (SEQ ID NO: 576)

Table 99 (mouse gene: Ncf4; human gene NCF4) 
Mouse genomic sequence (SEQ ID NO: 577) 
Mouse mRNA sequence (SEQ ID NO: 578)
Mouse coding sequence (SEQ ID NO: 579) 
Human genomic sequence (SEQ ID NO: 580) 
Human mRNA sequence (SEQ ID NO: 581) 
Human coding sequence (SEQ ID NO: 582)

Table 100 (mouse gene: Rac2; human gene RAC2) 
Mouse genomic sequence (SEQ ID NO: 583) 
Mouse mRNA sequence (SEQ ID NO: 584) 
Mouse coding sequence (SEQ ID NO: 585) 
Human genomic sequence (SEQ ID NO: 586)
Human mRNA sequence (SEQ ID NO: 587) 
Human coding sequence (SEQ ID NO: 588)

Table 101 (mouse gene: Mbnl; human gene MBNL) 
Mouse genomic sequence (SEQ ID NO: 589)
Mouse mRNA sequence (SEQ ID NO: 590) 
Mouse coding sequence (SEQ ID NO: 591) 
Human genomic sequence (SEQ ID NO: 592) 
Human mRNA sequence (SEQ ID NO: 593) 
Human coding sequence (SEQ ID NO: 594)

Table 102 (mouse gene: mCG10516; human gene N/A) 
Mouse genomic sequence (SEQ ID NO: 595) 
Mouse mRNA sequence (SEQ ID NO: 596) 
Mouse coding sequence (SEQ ID NO: 597)
Human genomic sequence (SEQ ID NO: 598) 
Human mRNA sequence (SEQ ID NO: 599) 
Human coding sequence (SEQ ID NO: 600) 

Table 103 (mouse gene: Rorc; human gene: RORC)
Mouse genomic sequence (SEQ ID NO: 601) 
Mouse mRNA sequence (SEQ ID NO: 602) 
Mouse coding sequence (SEQ ID NO: 603) 
Human genomic sequence (SEQ ID NO: 604)
Human mRNA sequence (SEQ ID NO: 605) 
Human coding sequence (SEQ ID NO: 606) 

Table 104 (mouse gene: mCG15938; human gene: BAT1)
Mouse genomic sequence (SEQ ID NO: 607)
Mouse mRNA sequence (SEQ ID NO: 608) 
Mouse coding sequence (SEQ ID NO: 609) 
Human genomic sequence (SEQ ID NO: 610) 
Human mRNA sequence (SEQ ID NO: 611) 
Human coding sequence (SEQ ID NO: 612)

Table 105 (mouse gene: Iqgap1; human gene: IQGAP1)
Mouse genomic sequence (SEQ ID NO: 613) 
Mouse mRNA sequence (SEQ ID NO: 614) 
Mouse coding sequence (SEQ ID NO: 615)
Human genomic sequence (SEQ ID NO: 616) 
Human mRNA sequence (SEQ ID NO: 617) 
Human coding sequence (SEQ ID NO: 618) 

Table 106 (mouse gene: Zpf29; human gene: hCG27579)
Mouse genomic sequence (SEQ ID NO: 619) 
Mouse mRNA sequence (SEQ ID NO: 620) 
Mouse coding sequence (SEQ ID NO: 621) 
Human genomic sequence (SEQ ID NO: 622)
Human mRNA sequence (SEQ ID NO: 623)
Human coding sequence (SEQ ID NO: 624) 

Table 107 (mouse gene: Kcnj9; human gene: KCNJ9) 
Mouse genomic sequence (SEQ ID NO: 625) 
Mouse mRNA sequence (SEQ ID NO: 626)
Mouse coding sequence (SEQ ID NO: 627) 
Human genomic sequence (SEQ ID NO: 628) 
Human mRNA sequence (SEQ ID NO: 629)
Human coding sequence (SEQ ID NO: 630) 

Table 108 (mouse gene: Ppp3cc; human gene: PPP3CC)
Mouse genomic sequence (SEQ ID NO: 631) 
Mouse mRNA sequence (SEQ ID NO: 632) 
Mouse coding sequence (SEQ ID NO: 633) 
Human genomic sequence (SEQ ID NO: 634)
Human mRNA sequence (SEQ ID NO: 635) 
Human coding sequence (SEQ ID NO: 636) 

Table 109 (mouse gene: mCG9110; human gene: hCG27579) 
Mouse genomic sequence (SEQ ID NO: 637)
Mouse mRNA sequence (SEQ ID NO: 638)
Mouse coding sequence (SEQ ID NO: 639)
Human genomic sequence (SEQ ID NO: 640)
Human mRNA sequence (SEQ ID NO: 641)
Human coding sequence (SEQ ID NO: 642)

Table 110 (mouse gene: mCG2257; human gene: PRDM11)
Mouse genomic sequence (SEQ ID NO: 643) 
Mouse mRNA sequence (SEQ ID NO: 644) 
Mouse coding sequence (SEQ ID NO: 645)
Human genomic sequence (SEQ ID NO: 646) 
Human mRNA sequence (SEQ ID NO: 647) 
Human coding sequence (SEQ ID NO: 648) 

Table 111 (mouse gene: mCG17918; human gene: hCG23764)
Mouse genomic sequence (SEQ ID NO: 649)
Mouse mRNA sequence (SEQ ID NO: 650)
Mouse coding sequence (SEQ ID NO: 651)
Human genomic sequence (SEQ ID NO: 652)
Human mRNA sequence (SEQ ID NO: 653)
Human coding sequence (SEQ ID NO: 654)

Table 112 (mouse gene: Lfng; human gene: LFNG) 
Mouse genomic sequence (SEQ ID NO: 655) 
Mouse mRNA sequence (SEQ ID NO: 656)
Mouse coding sequence (SEQ ID NO: 657) 
Human genomic sequence (SEQ ID NO: 658) 
Human mRNA sequence (SEQ ID NO: 659) 
Human coding sequence (SEQ ID NO: 660)

Table 1



MUTATION 


SEQUENCE 


CLONE 


CLASS. 


GENE 


IM000619 


GATCAAAGCAATCTCTATGTCTTTCTCTGCTG 
TCCTCCTCAGACATCTCCAGAGAGCTGGGAT
Al 1 rnCTTTCCCATTTGAGATTATGAAGTTG 
TTTCTAGAGTGCATGACGCAGGTTGAAGGAT 
AAGTACACAGGTCCCAAGGAACCAAGCGTTT 
TCACTGACGGTGATGAGTCTTGTTCGTGAGA 
TTGTTGTGATTCTCAGCCTTTCTCTTCCCCTG 
TGTGTGCTCTTCA MM CTGGTTCTGTCTGCC 
TAGCACCTCCTGGGGAAGCTGCTGTGCTTT 


P000632 


A 


Spr 


IM000620 


GATCTTTGGAGCCCAGTTGTTAATCATAAGA
GCTGATATTTTGAAAGAGTGTGTCAACCTAG 
ATGCACAGGGAAGCCAAAGCATTCAGCC 


P000633 


D 




IM000621 


ATATGACCACAAGGAAATAAGATAAAGTGTT 
CATACTGAATTTATAATGAAAAGTGATC 


P000634 


C 




IM000622 


GAACAGGCATGGCTTTACTTGTACAATGAGG 

AAACCAAGGCAGAGATTGCAAAGCGGGTCC 

TACACGTTTGCTCCATGCCCTGCTTCTCTGA 

CCACAGTGTACTGAGAATATGCTGAGCCCTA 

GTTCCTGGGGAGGAGGCAGAAGAGAGCAGC 

ATCCTGCCCACTTGAAGGCGTGCACACATAG

TTCCTGTCTGATC 


p000638 


D 




IM000623 


GATCAGGAGACCACACCCAGCTAGCCTTCTC 

TGACTGGGTATCCTTGGTCAGCCAGCCTTTC 

TTCACCTCATGTTCTCATTTGCAAACTCACAT 

GAACACTATTTGACCTACACACTTCATAAAGC 

TGI 1 1 1 1 AGAAAGACGAGATAATACAGGAGG 

AACGCTACAATATTAAATGATATGTATTTATA 

T 


P000639 


D 




IM000624 


AGTGTTTAGGTCAGCTGGTGCAGGAGAAGC 
TTCTTGAGGAAGACGACCATCTGGCAAGGC 
CTGATGGTAGAAAATAATGGACTTCTCTCCA
ACTGAGTAGGAACTTGATGATC


P000640 


D 





IM000625 


ATCAGTAAGTTAATCCTAAGAATTACTATGCA 

I 1 1 1 1 uuuC i u i i 1 1 I I AACAACA n CCTCCTT 

AGCTTATATGAGGCTCTAGTGCCCGGAGACT 

TTAATACTGCCCTAACATGATGGTGGCTCTTT 

GTCCCTCTTTCTCAGCCACTGAAATCTGACA 

GTTTGGGGAAGAATAATAAGAATTTAAGAAA 

CTAGATGGTTTTAAATATAGATATAAAAACAG 

TTCTTCGACTATTCTCAATAAAGAAATTCAGT 

CAAAAGAATTTCAGTCCTAACACAATGATC 


P000641 


D


IM000626


GATCATCAGAGTCCTGCATCTTATGTGTGCA 

GTGTTTTCAGCAATACAGGCTTACCTTCAAC 

CTCTAACAGGCAACCAGATGCTACAATAGCT 

TATATTGTTTTAGAAATCACTTGGACTACTCT 

AAACAACAACTTGAGTGAAGGCTCTTTGTAT 

CTGATACTGGAGTTTGTTAGTCTATGACACTT 

GTGGGGAGACATGTCTGCACAAGTAGCATAT 

GTGTGTACATGTATATTGTATACATATATAGT 

TTTGCTCTATGTATGTATGTGTATATGTATGT 

ATGTATATGTATATGTATGTATATATATAG 


P000642 


D 




IM000627 


AAGGGACCTGATAATCGTGTTGGCAACTGG 

GCTACAATTAGTTATCAATTGCTTGCTTGCCA 

CCTGCCCTGCTCCATAGAGAATCATAGTCTG

GGGAGTGTGGAGGAATAGCGGAGTCATCTA 

AACACATCACTGCTGCCCCCACCATTTGCCT

GCCACCAGGCCCCTGCCTTTCATTTTGCATT 

TCTCCCTCTTACAAGCAAATGGCGCTCACTG 

ATA 

AlO 


p000643 


Pv 

D 




IM000628 


GTTTGGGGATTGTACAGAATGCACAGCGTAG

TATTP A /*"* /"* A A A A A /** AAA OTV^ AAA I T A A T* 

IAI 1 GAGGAAAAAGGAAACTGGGAAATTAAT 
GTATAAATTAAAATCAGCTTTTAATTAGCTTA 
ACACACACATACGAAGGCAAAAATGTAACGT 
TACTTTGATC 


p000644 


K 


Myc 


IM000629 


GATCTCATTACAGATGGTTGTGAGCTACCAT
GTGG 


P000647 


R 




IM000630 


GATCTCAGGAGGCACCGAGAGACTCAGCAT 

GGACTCAAATGAGTACCCTGGCAGCCCGCA 

ACACCAGCTGTGTAACACTACCGTGAGGGAT 

GTCTTCCCTGCCTCCCTCCAGCCCCTTCTCA 

GGCCCTGAGTCCAGTGTGCAAAGCTCATCAT 

GGTTAGTCCCCTTCACCT


P000649 


K 


Gfi1 


IM000631 


AGAGCACCCGACTGCTCTTCCGAAGGTCCA 

GAGTTCAAATCCCAGCAACCACATGGTGGCT 

CACAACCATCCGTAACAAGATC 


P000650 


R 




IM000632 


GATCAAATCCTGTCAGGGAGAGGGGCTCCT 

CCCAGTAGTGCCATCCCATAATAATAAGAAG 

GACTCCTGGGCCTCAGTGAAGTCAGGCTGA 

CCACTACTGCAGGTTAGTCATGACCAGTAGC 

CAGAATGGAACGAAGGGTGACCCAGTGTGA 

GGACACAGCCCCAGGCAACTGCTTCTGCTTT 

GAGCCAAGTTGTTACCCCAAAGCTCGTCATT 

CCGCTTGGTTTCTCATGTGTGTGAGCTGCAC 

ATATGGAGGTCCCCCTTTGTTCCCTT 


P000651 


D 


IM000633


GTGAGGAAGGTCCCTCTGCATTCTAACCTTC 
CTCAACTCCACCAGCCTCGGCGTTTAAGGGA 

PA A ATATTAppnTTf v r > P'l T~YfZ(Z(Z(T A Af^TTfS 

OnM 1 A 1 1 MUUU 1 1 1 1 1 UOOUCfMAW 1 1 O 

GAGCCAGTGAAGTAGTCGGAAATGTACAGTC 

ACAGGAAATTGCTGCTACCAAGGCTGGAGG 

AACAAAGAGAAGACTTGTCACAAGAGGCCAG 

AGAGGAAGTCACCCAGTACAAACTGAAGCG 

CGCGCGCACACACACACACACACACACACA 

CACGCACACACACACACACACGATC 


P000652 


D 




IM000634 


TGGCCGCCTAGACAAGCTGACCATCACCTC 

CCAGAACCTGCAACTGGAGAGCCTTCGCAT 

GAAGCTTCCGAAATGTGCGTGCTCCACCTGT 

CCCTCACCTCACAGACATCATTTCTCCATTTA 

GCCCCTCCCGATC 


p000654 


A 


// 


IM000635 


GATCCCCTGGAATTTACAGTCGGTTCCAACA 
ATCATGTAGATG 


p000656 


C 




IM000636 


GATCGGCTATAGCATTTGTCAATGTTTACCCA
GAAGAATAGCACAGATATATTTGCACATCAAT 
GCTTATTGCAGTATTATTCACAGTGGCTATGT
AATGGAACCAACCTACATGGCCAGCAACTGA 
ATAGATTAAGAAAATATATATACACAATGGTG 
CI 1 1 1 1 1 CGGCTATAAAGAAGAATGAAGTTAT 

ATGATAATATCAAGTGCACAGTCAACCTCTCT 

CTCTCACCTCCCCCGCCCCGCTCTTTCTCTC 

TCATATACATTTGAGAGTAGCAGTAAACTGTC 

TGAGAACAAAGGGGATTAATGGGAGGGGAG 

AAGATTAAGGAGCGGAAGGGTAGTAGGTAG 

TAT 


p000659 


A 


Cr2 


IM000637 


GATCGGCTTCTATGGACTGAGTGTGTAAGAA 
AACATT 


P000661 


D 




IM000638 


TTAGGAGGGTAGAGAACATTCAGGAATCAAG 
AACAAGCATTTTAACACCCACTGAGCTATCC 
TGTGGATGGTGGTGGTTTTGTTTGTTTGTTG
GTTTTGTTTTAGGAAGTCAGGGATGGGGTGG 

ATCCCAAATTCTGGAATGATAAGCAAGAGAG 
CTGTCTAGTCCCAGTCTCAGATACATGCTGT 
TAATTTTCTACTACTGCTATAACACATAGGCT
CAAATGCGGTGGCTTACCTAACACACCCTGT 
GCAGTTCTGAAAGTCGTAACTCTGGCACGAT 
C 


P000662 


D 




IM000639 


ATGCTAAGCTGTGACTCCTCTCGATACGAGA 
CCCTGGCTGCCCTCCTTTCCCGATC 


P000663 


D 
IM000640


GATCGTCTGGAAGAGCAGTCAGTATTCTTAA 

CTGCTGAGCCATCTTTGCAGCCCCCAGTTCT 

TTGGGGTTTTTTGTTTGTTTGTTTGGTTGGTT

GGTTGGTTTGGTTTAGTTTGGTTTGGTTCAA 

GACAGGGTTTCTCTGTGTTGCCCTGGATGTC 

CTGGAACTCTCTTTGTAGACCAGGGTGGCCT 

TTAACTCACAGAAATGCGCCTGCTAGGATTA 

AAGCTGTGTCCCACCACTATATATATATGTGT 

G 


P000665 


R 


— 


IM000641 


GTCACAGTGTTAGAGCCACAGACGGGGGAA 
CCTACTGGCTGTCCTGGGTTCCTGTAAACTA
GGGGACAAAGCTGCCACAGCCAGACTTAGC 
TGCGATC 


P000666 


D 




IM000642 


GATCGCTGCTTCTGTAAATCCGCAACGACAA 
TTGTTATCTTCTCCTTTTCTTTCTTTTATTTGT
TTTATTCTATTTTATTTTTCAGATGAACTCTCA
TGTAGCCCAGGCTGGTCTCAAACTCCCTCTG 
TAGCTGACGGCAACCTTGAAC 


P000668 


R 




IM000643 


TTCCTACACCATAGCATTTAGTTGTAGGCAG 
AAGCGATC 


P000669 


D 




IM000644 


GATCGGCTCAAGGGCTCTAATTTAGTCTAGG
AAGTCCTTAGGAAACATGAAAATCTCCGAGA 
TAAGACCCGGGGTAAAAAGCTTGAGCCACG 

ft AftTTA ft AC ATft C C C A ft ft ft Tft ft AftTP ATftTT 

CAGAGGTTCAAGACCCGAATCAGCTACGTAA 

ATAAAGCATTTGAGGCCTACCTGGGCTACAA 

GAGAGTATCTTTAAATAAATAAGATGATTTAA 

AAAAAACTGTTTTCCCCTTAGATGGATTAAAA 

AAACAAGACAAAACAAAACAAAACAAAAACC 

CGTCTTTCCTTCTTAA 


P000672 


D 




IM000645 


CTGTCCGTGTGGGAAACGTTTAGCAAGTCCG 
AGCGTGTTCGATC


P000673 


K 


Nmyc 


IM000646 


ATGCGTTCGTATGACAGTTCTCCAAATGACT 

GTCCCAAAGTCCCAGATTCCTGGAAACAGTA 

AAGACTGCCTCAAACTGTAGTCACTAGTCTA 

TTATCTTAATCATAGTAACCATTTGGGTTTGA 

CTTGAAAACCTGTGACAGGGAGATAAATTTC 

TGCCACTGTAGGTGAAGCTTGGAAGGGCTA 

ACCCAATGAATATGCTCAGTCGATC 


P000676 


C 
IM000647


AGATGAAGCTATCCCCAGTCCCTAAGCTGAG 
TTCTGCCTGAGACTATTTGAAACAGGGTACC 
CCTGGGTCCCAGTTCAGTTGACAGGTAGTG 
GACGCATGAGAACGCCATACCTGGTGGCCG 

TGTGTTCTCCAGAGCAGCTTTCCAATCTGCC 

TGCTCCTGTCTCCCCTGCCTGTTGGCACCAG 

GCAGCCAGAATTCCATTTGTTTGTTTGCTTCG 

CGATAGGCTCTTGCCATGTAGTCCTTCCTGG 

CCTAGAACTTGATATGTAGACTTCCCCCCTT 

GGATC 


p000678 


c 




IM000648 


CCGTGTCCGTGGGCATGTGCGTGTACAGAC 
AGACATACATGCCCCCGCATGAGTGTGAACA 
CCAGAGGTCAACCTCAGGTGTCCTTTTGATG
TTATCTACCTTGTTTTTTGAAGCAAGGTCTAG
GATTGACCAATGAGCCCCAAGTAGGGATC 


P000679 


D 




IM000649 


GATCCATAGGCAGAGAAGGCAGTAATAGGA
CATTGGTCATTGTACCTCATTTGTGAGGGGT 
CACCTTGGAAATGTGCTGAGACTAGGTTCTA 
GGAGAAGCTCGCCA 


P000682 


D 


- 


IM000650 


CTGGCACTGTGTGGCAGAAACAGTGAACAG 
TGTAGCGGTGCAGAATGTGTGTGCTGTGGG 
TTTTAGCACCAGGGCTGCATGAGACTGCAGA 
OA 1 (jO 1 1 A i oAUoOAvjIjAAVjoU i OAoLjAUA 
CAGCACACATGTGTGCTAACATACATGTTTC
ACCTCAGACTCAGCTCCCATTTGACTTTTAAT
TAATTTTTGGCCATTCCACAACAGAACCTTTT
CTTGCTCCCI MM 1 CAATCTTATGTATATATC 
TCCTACATTTAGTTACAGGACTGTGACCTAC 
AGTTTAAAACTCGGGGATC 


P000684 


D 




IM000651 


GATCCCTCCCCTCCCTTCTTTTTCCCGCCAA 

GCGTCGGCGAAGCCCTGCCCTTCAGGAGGC 

AGGAGGGGAGCTGAGTGAGGCGAGTCGGA 

CCCAGCAGCTGAGAGCAGCGCAGCCCAGG 

GGTCCTCGGCCGCGCAGACCCCCGGAATAA 


P000685 


K 


Myc 


IM000652 


CTAGTAGCCAGGGCTGGCAGCTTGGTTTGG 
GCCAGCATCTCACTATGTAGCCTAGTTGTCC
TGGAATTTGCTATGTAAATGTGGCTACCCTC 
AAACTCATAGAGAGCCTCCCACCTCTCCTGA 
GATTATAGGCACATGCTACCATGCCCTAAGT 
GGATC 


P000686 


D 
IM000653


GGAGCAGGCCCTTCTGAATCAACTTGGCAG 

AGTGAAGGAGGCACTCTCCACACAAACAGG 

AAAAGGGCAGTGGTGACTTTCTAGGCAGGG 

AACTGGTTACATTTTGTTTATTTGAAGGTGAA 

GAGTCGTGACATTCTGGGAAATAGGCAAGAT

GGCCGTTTCCCCTCAGCTACAACCAGCCATG 

ACTGGAACCAGAAAGGCACGCCCTGCTTTA 

GCCTCAGGCAGAACGATAATAACAGCGTGCT 

AGCTCAGTAGTCTGTGTGCTGGAAGGGTTTA 

TGAGGAGGAAGTCCGCAATTACATATTTCTG 

GGCAAACATTAACCAAGATTGAAACCTAGAT 

TTGAAGAGAAGTAGCAGGCTGGGATC 


P000687 


D 


- 


IM000654 


AGATGAACTTATAAATGCATCTGCAGTCCTC 
AAATAAAGATGAATAGTAACCCAGAGGCGTG 
GTAGTGCGCTCTTCAAACCCAGTGCTCAGAA 
GGTGCAAACAAAAGGACCGGGAGTCCAAGG 

AGAACAACAACCAAGAGCTGCTTATGGAGGT 

CAGTCTGTGTTCCCAGGGGGACAGCATCAG 

TCTAAGTTGGCGGTTGTTGTTGGCTGAGCAT 

GCACAAATCCCTAACAGCACATAAAGCAAGT 

TGTGTCACACACTCACAGTGCCCAGATTCAC 

TGGATC 


P000688 


B 


Mm.131336


IM000655 


GTCCATTGTGTACTGAGAGAGGAGTTAGGTT 
TAGAAAGCCTTCCTCAGATGTCCCTCAAAGA 
AGCTGCTACAACTGCCCTCATCCCACGTTGC 
CAAGGATC 


P000689 


D 




IM000656 


AGCTGTAGGGAAGCCCAAAGCACAGACGAC 

TGCTGCTGCTGCTGCGGTTCCCACTCTGGGT 

TGACCTTAGAAACGGGGGTTCATCTCCTCCA 

AACCATGATGAGCTTTGCACACTGGACTCAG 

GGCCTGAGAAGGGGCTGGAGGGAGGCAGG 

GAAGACATCCCTCACGGTAGTGTTACACAGC 

TGGCGTTGCGGGCTGCCAGGGTACTCATTT 

GAGTCCATGCTGAGTCTCTCGGTGCCTCCTG 

AGATC 


P000694 


K 


Gfi1 
IM000657


GATCGCCCCAGTTACCTCAAATTGTGTGAGT 

GTGTGTGTGTGTGTGTATGCATATATGCATA

CAAGCATATACATGCATGCATATATATAATAC 

ACATAGACATATATACACACATATAGACGCAT 

ACATGCATTTGTATGCATGCATCTATGTATGT

ACATATCCACAACCAAATATACCAAACACGC 

AGACACAGCACACATAGGACAATAGTAATTG 

TGAATCTAACTGGTGGGGTTTATGGGTCAAG

AGCCAGGGTAGAGGAAACTGGCTAAGGCTC 

TAACCATCCTAGAGCAGGCACATCTACCAGG 

AAAAGAAACAAGGAAAAGAGCAGAGTTGAG 

GGTTACTTAACATG 


p000695 


D 




IM000658 


ACAGAATCTGTGGGTCATTATTACGTTTATAG 
GAACAGGATTTTCTTTCCTTTCTGACTCTACC
TTCTAGAAAGGCCGACTTTTAAATCCTCATG 
CTCTTGTCTATTGACAGGAAAAGATGGGCTT 
CCACACTGATC 


P000700 


D 


- 


IM000659 


GATCAGGCTGGCCTTGAACTCACAGAGACC 

CACCTGCCTCTGCCTCCTGCATGCTGGGATT 

AAAGGTGTGTGCCACCACTGCCCAGCTCAC 

AAAGTAGTAGTAGGACTAGTACTAGTACTAA 

TAATAACAAACATTACAACAATCTTAATTATTT 

TTGTTTCTACCTTTAAAATCTCCCAACTGTCT 

TTTTATATTGCCTCAAGTCTTCCCTCAGTCCC 

TGGCCTTCATAGCTTGAC 1 ill! 1 GCTAGAG 

GTTATCAGTGGCTCATCTCTCTCCTGAGATT 

GAGCTGGCTAAGACCACTATTCAGAGGGAG 

AATGTAATGTCTCAGACATCATAGCCAGTCC 

TCAGTTCTCCTTTTGCTGACTGACCACTTTGC 

CAAACTAGTTTTCCTAAGCCATACCTTTTCTT

TTTAAAAAATAGTCTTTCTTATAGTGGGTGCT 

GGCTTTGAACTTCTGTCCTCTTGCCTCACCTT 

GCACTGGTAGTAGAGGCTTGCAATTTCACCG 


P000702 


C 


- 


IM000660 


GATCAAGAACGAAACCCCTGAAAACATAAAA

CAGTAAGATAACAATAGCGTGCCTGATTTTG 

TCCAAACCTTCTTGTCACCTGTCACTGAGATT 

GTCAACTCCTTTTCACCACCCTACATACGTTA

GTTAGCTCAGTTTACGAGAGTTTGCAAAGGC 

CCCCACCAGTACCCTGCAACTTTACCCACCC 

CTGCATGGGACTGTGAGAAAATGGGACTGG 

AGAGTAACCCTCTTCAGGCTCACAATCTGAG 

CTAGTCAGAGCATCTCACGGGTCCCGGGAC 

TTTCAGTGTGCTTTCCTCTTGGGTATTGGACT 

TTAAACAATGTGTACCGATATGGGTGAATAAT 

ACAACATCCATGGAGAAATAAGCCAAATCAA 

GACACTTCTTCAGAGG 


P000703 


D 
IM000661


GATCAAAAACATCAACGTAAGGAGCCCTTAA 

TGACGCTTTGTGACGGTTTAGAATGGTCTAC 

CCAAACCTAGCCAAGTCTAACTATGTTATGG 

AGGTRRTAAAARrAfiTTAACCTAAACATCTG 

GGACACTCACAGAATGATAGGTAGGTAGGTA 

GATAGATAGATAGATAGATAGATAGATAGAT 

AGACAGACAGACAGACAGATGTTGAATAAAA 

AGTGACGTTTACAGTGATGTTAGCTCAAGGC

AGGGCTTTTCAGGCCATTTCCCCTGGTCTCA 

CCC 


p000704 


D 




IM000662 


CTACTAAGTCCAGAGCAGAGAAGGAGGCGC 
CGCCTGTGTGCACAGCGGAGTCTGGGAGAG 
ACCACCGGCCCAAACCAGTAAACACAGGGC 
ACCCACCGTGCTCCGATC 


P000706 


D 


- 


IM000663 


ACAGTAATCTGATTATCTTGCAGTAGATAATT 
TGTCTACCTGTTAATGACTCTGCTTCTTGAAC 
TACGTCCCAGTAGATGCCATGCTTTCAGCCT 
GGTAAGTGACACTAATACTACCTCCAAACTG 
TCACTTGGATTGTCAGGGTTTTGGTGTGGTG 
ATGATACAGGAGAAATGTAAAACACGGAGTT 

d ATf5 AT A d. A A AfVTP APT A ATAP ATTTTPT 
Un 1 On 1 MOrNAVMVjVjMO 1 unv/ 1 MM 1 MOM 1 1 t IUI 

TAGGAAAAGTCAAGTGACACACAGCAGAATC 

TAGCTGAAGGAGCTCCGCCAATAGGGCTGG 

AAGATAACTCTCGCACTAACCTGCTTTATTAG 

GAACTGTAGGAAAGGCAGGTCTGCAGCACA 

GTTGAAGTTTAGGTTGCTGAGAAAGTTTCTG 

CTCATATTTATTCACCAGTGATGATC 


P000708 


D 




IM000664 


GTTTAGCAAGTCCGAGCGTGTTCGATC 


p000709 


K 


Nmyc 


IM000665 


AGGCAAACCCATGTGAGGCCTTCTCACATCT 

TTCCTTGGATGCCTGCACACACCTGACTTGA 

CAGACTTCAAATCAGACTTATCAACTCACCTC 

TTCAGTCCTGGGCCTCTTCCTGTATTTCAATC

TTAGATAGAAAATTGGTTCCACTGTCTACCA 

GCCTTGAACCAGGAATGCAGAGCCAACCAC 

CCCTGGGGTGTCCCAGGCAGCTGGGCTGGA 

TGCTACCTGTCATGCTCTTGATC 


P000710 


C 
IM000666


ATGTATGAGTGTGGGGCTGGGTTTGAACCTG 

TGTCACCTTAGGACTCTCTGAACCTCGGTTT 

CCTATTAGACGGAGGGGCTATTCGGAGTCCT

CATCTAATGGAGACACTTTGTGGGTATCAGA 

GGGCAACACTGTGGTATTGGGGGTGGGGGG 

TTGCTGCTTAGAGCTCAGAGAAGAGGAGTTT 

GGCTTGCTCTACAGAACATGCAGGCTGAGG 

TGTGGGTGCAGGGTTTCCCTGAGGCCCCGG 

CTCTGACCCTCTCCCCACTCCATTTCCTGCG 

CAGGTGAGCGACAAACGTTCCAACAGCTTCC

GCCAGGCCATCCTTCAGGGAAACCGCAGGC 

TGAGCAGCAAGGCCCTGCTGGAGGAGAAGG 

GGCTGAGCCTCTCTCAGCGGCTCATCCGCC 

ACGTGGCCTACGAGACTCTGCCCCGGGAGA 

TTGACCGCAAGTGGTACTATGACAGCTACAC

CTGCTGCCTCCGCCCGGTTCATGATC 


P000711 


C 




IM000667 


GATCATTTTTCTCTCGAGATGGATTAAAGCTA
TGCTGCAGAAGGACCCGTGTGTGTCCTGTGT 
GTGTGTGTCCTCGCCGGCGAGACTCCTTATC 
ACACATGACAGCTTCAAAGCCCCCAGATTCA 
ATAGGTTCCAGGAGTTCACATTTAACACTCAT
GGGGTCAAAGTGCAGGCAGATGGTGGAGCC 
TGTGGAAGGTCATCAGACAAACAACCTGGTG 
GTTGCAGCAGAAATCACCAGGCAAGTAG 


p000712 


R 




IM000668 


GATCTGGCTAGCAGGGAGCCATTTACAGCTC
AGACATCTATCATCCTTA 


P000713 


D 




IM000669 


GATCATTGTACCTCACCTGTCAGTTTGACAG 

GTGGGAGGTGATATCTCTTTTCATTCATGTAT 

TCTTTGAAAGTTTGTTCATGCATATAATACAT

TCTGGTTCAATTCACCACTCCACCCTTTTGTA 

TCCCCTGCGTACCGAGCCCCCAI F 1 fCTCAC 

CAAGTCTTACTGTTATCTCAGTTTTGGGGCTT 

AG! 1 1 1 I IGI 1 IGICI IGI ! 1 IGriGI 1 1 1 IGA 

AACAGGGTCCCGTTATGCAGCCCTGGCCCT 

GAACTTGCTAAATAAACCAGGTTGGCTTTGA 

ATTCAGAGTTCTGCACACCTCTGTTACCCAA 

GTGCTCAGATTAAAGGCGTATACTACCAC 


p000714 


C 




IM000670


GATCAATTCAATCTATTGCAATAACCTGGTTT 

TTTTTTTCCGCAACTCCAAGATGGGGGGGGG 

GGGGCCCAGTCAGGAGAGGTTTCAACACAA

ACGCACTAGTATTTACACACAGAATCTCCTC 

CACTGTTCTTCTTCTTTGCTTTAAAAGTCTTT 

GTTCCGGAATCTATAGATAGGGAGACAGATG 

GCTAGCTCCCCAAGGCTGAGAGCAGAGGAG 

AGTATAAACAGGGAAGTCAAGGGGTCTGGG 

AGGGCAAGGTAAGGAAGCCACAG 


p000715 


D 
IM000671


CAATGCCTTCCCCGCGAGATGGAGTGGCTG 
TTTATCCCTAAGTGGCTCTCCAAGTATACGT 
GGCAGTGAGTTGCCGAGCAATTTTAATAAAA 
TTCCAGACATCGTTTTTCCTGCATAGACCTCA
TCTGCGGTTGATC 


P000716 


K 


Myc 


IM000672 


TAGTATTCAGGAAAAAGGAAACTGGGAAATT 
AATGTATAAATTAAAATCAGCTTTTAATTAGC
TTAACACACACATACGAAGGCAAAAATGTAA 
CGTTACTTTGATC 


P000718 


K 


Myc 


IM000673 


GATCAGAAAAACAGCCCATTATTCAAGATTC 
AGGT 


P000719 


D 




IM000674 


TAACTTCAATTTAATAATTATCACATGCTAGG 

AACTAAAGAGGTGCACAAAACAAACCAACAG 

TGGTTCCTATCCTGTCTAACAGAAGAAACTA

CAATTGTGGTTTGGGATGCCACATAAATGAC 

AGCAACGGGACCTACAGAAAATTAAGTCACA 

GAGAGAATGGACCATTTCTGCAGAGACCTG 

GAAAACAGACAAGGGAAGAAACATGGTGTGT 

CTAAGTGATGGGGCAGGTGGTGCAAACGCT 

AGAGGCAAGCAGAGGGGATATGAAACTGTG 

CTGCACAGCTGGACAGAAGGGAGGCTGGAA 

GGGAAGAGAGGACCCTCTGTTTTGACTCAAT

GGCTAGATGCCATGTGCCAAATAAGAAAGCA 

AGGGACTGGAATCAAACCTCAACGTTCCTTG 
CATACTCCAGATAAGAACCAGGCTTTGAGCC 
AGGGCCTGGGAAGAGGGCTGGCCTACATAT 
CTCATTTTAGAGATGAGCAAACAGGACTGGG 
AGCTCTAGGTCTTCAGTGACACGCTTGCTTG 
GCCCGCAGGAGACCCTGGGTTTGATC 


P000720 


D 




IM000675 


GATCATGTCATGGGTCAACAGAAATAATTCT 

GAAAGGCTAAGTCATTTCTTCTACCCCCAAG 

AAAAATCAAGAACACCCCACATTACAAACCTT 

CCGTAGTAAACTGAGAATGGAGCCATGGCC 

AGAGCCCCTCTGCTCTCCCATCCCCCAACCA 

AGAACCAAAC 


P000721 


D 




IM000676 


ATATAACTTC 1 1 1 H 1 1 1 1 AAAAAAG AATTATT 

TATTTTATGTATATAAGTTCCTTATAGCTGTAT 

TCAGAGACGCCAGAAGAGAGCATCTGATC 


P000722 


R 


-- 


IM000677 


GATCATAGCACACTGGGGTGCCATCTGTCAC 

CCCTAGACAAACATCTTTAACCNGCATCTCTT

CCTGAAGCCCACTTGGACCACCCTTTGGAAA 

ACCATCACCAAGGCCAGTAAGGTACCCGTG 

GTGACTCACCTCAGCCTAGCCCACCATAGAC

GCTTAGCAGAGCAGGTGTGTGTAAGTCAGA 

GCCAGACAATCAGAACACTCTCCCTGCTCCA 

AAGTAGCAATGTAAAAAATTGAACCCAAAGTT 

G 


P000724 


D 
IM000678


GATCAAAGTAACGTTACATTTTTGCCTTCGTA 
TGTGTGTGCTAAGCTAATTAAAAGCTGATTTT 

AAT 1 1 ATAPATTAA 1 1 1 PPPAfi 1 1 1 Pr 1 1 1 1 1 P 

CTGAATACTACGCTGTGCATTCTGTACAATC 

CCCAAACGTATACATACACACTTTATATATAC 

ACGATAATCTAGCTTATTAACCAACCAGAAAC 

ATGAGTCTTTTGCTCTGTGCATTGGTTCTAGA 

TTTATTATATAATGCATATTCCCTCGGGATTT 

GCTTATCC 


P000727 


K 


Myc 


IM000679 


GATCATTTGATGCTTCAGATAAATATGTAAAT 
GGTGAC 


P000728 


B 


Mm.127881


IM000680 


GATCAAGATAATCCCCCACAGGCATGCCCAG 
AGGCCCATTTCCTAGGTGAGACTATAGTCTG
TCAAGTTGACAATGCTAACCATTGCAGTGAG 
GGAGAGAAAGAAGGCCAGGATGGTGCCTCT
CTGTTACTCTGCTTACCCACGGGGTGCAAGG 
ACAGTGGGGGATGGGCCTGAGCTTCCTCAT
GAACACACACATGAGAGCAGTCAGCACATG 
GCCTCTTCCTCTAAGCTTCACAGTGGCAGCC 
GCACCTCTGCTGTTAAGACCTAACATGTGGC
CGGGCAGTGGTGGCACACGCCTTTAATCCC 

CTGAGTTCGAGGCCAGCCTGGTCTCCAGAG 
TGAGTTCCAGGACAGCCAGGGCTACACAGA 
GAAACCCTGTCTTGAAAAACCAAAACCAAAA 
CCAACCAACCAACCAACCAAACAAACCATCT 
AACATGTACATCCTATCCATGTGCACGAATC
ATAC 


P000729 


R 




IM000681 


AGACCAGTGCCGGAGCCGTTCCTGGCTGAG 
GCAGCCCAAGTCCTTGAAGAGCTTGAAGAG 
GTCGCTGCGGAACTTGACGCCGATGAAGGC 

ATAPAAnAAA^^f^TT^APfirAnr'AfiPrifiAP 

GGAGGCCAGGCTGTAGGTGACGTCATAGGC 
AATGTTGAGCTGCTTGCTGGTTTCGCAGCTG
CTATTGGTGATGTTGAAGTTGGCCACCGTCT
GAGCCAGGACCACCCCATTGTAGGGCAGCT 
GGAAGACTATGAAGACTACCACCACGGCAAT 
GATC 


P000730 


A 


Cmkbr7 


IM000682 


CCCTCTCAAGCCTTCCTTGTTACTTAGCCTCT 
ATAGGTCTGTGCATTATACCATCATTCTTTTA 
ATTTACAGCTAATATCCATTTATATATGATTAT 
GTACCATATTTGCCTTTTGGGGTCTGGATTG 
CCCTACTCAGGATGACCTTTTCTAGTTTGATC


P000731 


D 




IM000683 


GATCATGATGTTTGTTGAAGCAACAGAAACT 

ATAAGACAGTGCCCAAGAGCCTCTCTGGAGA 

TAGCC 


P000732 


D 





IM000684


GATCGTGTTAGACACAAGTAAGAAATGAATG 
AGTCTTCCTGATTTTTTAAATTAACTTCTCCC
CATATTGGCTGTCACTACTTTTTAAATCAGAA 
AGGAGAATCTGGACGGTTCCAGGCCTGCAG 
CGCCATGCTTGCAAAAGGTTTACAGAATCGC 
TCTGGACAACT 


p000734 


D 




IM000685 


CTACCACAGCATCTTTTGAGTGTATATAGTCA 

GTGTGCTACATGTTATCTATGAACATATGCAA 

ATGAGGTTTGAGAATTAAAGTTGCTGATAGA 

CTCATGGGTTAGGGGTTTGATTGCCTGCTAA 

TGATC 


P000735 


D 


- 


IM000686 


GATCACGAAACGGTTGACTAAAGCAAGACTG 

AACCACAGGCAGATACCAAACCCAAAGCTCT 

ATGTCTAGTGTCTAGAATACATAGGTTTGGG 

TAGCCATGCCCCTGTGACCCTGCCACCTGCA 

C1C AC AC AT AAClACAAT ACT AT AGACAACC AC 

TTCTGAGTCAGAATTGCAATGATGTCTTTGG 

CAAACTACTCTAGTCTCCTTTGGCCAGGAGC 

TGCTAAGTGGTTCAGGCTGAGGTACAATCAA 

CCTAGGTAGGTGGGACTGTGTGCCCCTGTG 

CTCCTGGGTGGCCTTCATGTCTGCTATGCTT 

GCCCTTT 


P000736 


D 




IM000687 


GATCATGTCAACTATACCTGGACACGGACCT 

TCATCCTTGCTGGTTTCACTACCTCTGGCAC

CCTGCAACATCTTGCAGTTTTTGGAACCCTG

TGCATCTATCTCCTCACACTGGCAGGGAACT 

TGTTCATCATTGTCTTGGTCCAGGCAGATTC 

AGGGCTGTCCACTCCCATGTACTTCTTTATC 

AGTGTCCTCTCCTTCCTGGAACTCTGGTATG 

TCAGCACCACAGTGCCCACCTTGCTGCATAC 

CTTGCTCCATGGGCCTTCACCCATCCCCTCG 

TCTGCATGCTTTGTCCAGCTGTATGTCTTCCA

CTCCTTGGGCATGACCGAGTGCTACCTGCTA 

GGTGTCATGGCTCTGGACCGCTACCTTGCTA 

TCTGTCGTCCACTGCACTACCATGCACTCAT

GAGCAGACAGGTACAGAAACAGTTAGTTGG 

GGTTACATGGTTGGCTGGTTTTTCAGCTGCC

TGGTGCCTGCAGGTCTCACTGCCTCTTTACC 

TTATTGTTTGAAAGAAGTGGCCCATTACTT 


p000737 


C 


- 


IM000688


CTGTCAATTCATCCAGCTCTAGGCCGCTGTC 

TGGCTCGATGCTTATTGGTTTAACAGTGCCG 

ATGCATAGGATTCTACAGTCAGAGTGGCCTA 

AGCAACAGCTAAATATTGTTTTCTTGCTGTTC

TGGGAACTAGATGTTCAAGGTCAAGGCGTCA 

GTAGCTCTGTTATGAGACCTCTCTGCTGTCG 

GGCTGTGTCTTCAAGTTTTTTCCCCCCTCTGT

GCATGTGTGTTCCTATTTCCTCTGCATGAAA

GACCAGTAGAGCCAAGTGGTGGCACACACC 

TTTGATC 


P000738 


m 
ls 
IM000689


GATCATGAGAGGCGAGAAACCCAGACATCT

CTAACTCTTCTTGCCAACTCAGGAGCCACCT 

RTr:r;rrrrAf;rTf;f?cc:ACCAGCCGTTCCTC 

CCTCAGAGGCCTCCATTTCCACAAAAGGCCT 

TCCTGGTTGTTCAGGACAGAGCCTGGTTTCC 

CTGATACCCCTTCTCTCAGTGGCCACTGAAG 

TTACAGGGATGCAGCCAGCCGTGGTTGCCA 

TGTCTGTATATGCTAATCTCCGAATTCCACTT 

CCTGTTTAGATTCTCAG 


P000739 


D 


- 


IM000690 


GTTTGTCCGCATGAGTCCCAGGGACCACTCA 
GAGTGGCTGGCAGGCATTGTGGAGTGGAAT 
GTGGGAAGACACATTCCCAGCCTTGTTTGCA 
GCTTGGGACTGTCTGTGTTTTGGGATGATC 


p000740 


D 




IM000691 


GATCACCTGGGAAGGGGGAAAAGGACAAGT 
CTGAGCTCCCAGCCCACATTCTCCTAGGGTA 
GCAGCTCCCTCACTTAGTGT 


P000741 


D 


- 


IM000692 


GATCAGTTCTTATTAAACAATACAGACTTAGG 
CAAAATGAGTCAGAAATAAGGATATCGCATA 
TCCCGAGACCATTTGAACTCTAAGAAGTATTT 

TGGCACACACCTTTAATCCCAGCACTCGGGA 

GGCAGAGGCAGGTGGATTTCTCAGTTTGAG

GCCAGCCTGGTCTACAGAGTGAGTTCCAGG 

ACAGCCAGGGCTACACAGAGAAACCCTGTC 

TGGACAAACCAAAAAAAAAAAAAAAAAAAAA 

G 


P000744 


R 


- 


IM000693 


GATCATCACAGATGACATAGAACCAAACTGT 
AACTTTCTAGACTACATGTAGCAGACATTT 


P000745 


D 




IM000694 


GATCATACATGAATACAAGCAGGCTTCTGGT

ATACTCTTAAGTTGAATTCTGTTTTCTGTAGT 

CGTAGTCTTGTCTTTTCCAGTTTTAAATTCTA 

GAACAGGTATACTGTAGAGCACCCGCCTCC 

CCTTGCTCTGGAGGTAGGGTAGAGTGGGAG 

TTAAGGTCAGTTCC 


P000746 


B 


AA657028 


IM000695 


CATTTTGTCTGTGTCCTTACTAAATTATTTCTA 

TATAGGAATCTTTGTATCTTCTGATATAAGCT 

AGCGCATGGGTACCACCAGCACCCAAGTCA 

TCTGCTGAGGTGCTTCTAACCTTGCTTGATT 

CAGTGTCTTCAACAGAAGGTGGAGTAAACAG 

GTCATTTTTTACCCTAGAGAGTTCAGATC


P000748 


D 




IM000696 


GATCTCCGGGTGCCAGACTTGCCCAGCAAG 
CACTCTTACCTGCTGAGCCATCCTGAGGGCC 
TGGATTTAAAAAAAAAAAATATTGACATATTG 
TTC 


p000749 


D 
IM000697


GATCTCCTAAAACTCCCTGTGTCAGGAAACT 

TTCTGTGCTTTTGTATTGCGTTCCTGTGTTCG 

TGGAAGGCCCCCACGCCTTCATCCTTGCTAA 

TTCTTTTTGGATAGCTTGTTGCTTTAACTAGA

TTGGCCCTTTCTTGGCTAGTATTTTCTGCTGT

ACCTATGAGTGGTGTGGGAGAACTGTGCAG

ACTTCCAGGAAGCGCAGCCATGAAGCTACAT 

GTGCCTATGTGTAGACACATCATGGATTTTCT

TACTAGTTTACTAGTGGGTGATAATCTGTCCT 

TTTGAGCTCTCCAGAACGTTCTAGAAGCTTA 

AGGAGAGAAATCACTTAAGAGAG 


P000752 


D 


- 


IM000698 


ATCTGATAGTAAGTAAAAGGACAGCTAAAGA 
TftAAftttnAAAfiCAfiGAGAGTCCTGGAAGAA 
GAAACTAGTGTTTCTAAGAGTTCATCATTGAT 
AAAATGCAAAAGAAGTCAATTACATACATGTT 
TAGGAAACTGAATCCTCTTGTTTTGGGGGAT 
GTTTGTTTTGAGGCAAAGGCTCTCTTACAGA 
GCCCTGGCTGTTCTGGAGTTCTGTATATCAG 
GCTCTGGCCTCAAACTCAAGAGATC 


P000753 


D 




IM000699 


ACATCAAGAGGAAGTTGGAAATGTCATCTTT 

AGCTATCTTATATCCTGGTAGCTTTAAGATTT

CCTTTGTGTGACTTTATAGTTCTCAAAATATT 

TTTAAGGGTCAGGGGAGGAAGCACTTTCAAG 

AAATGAGATGGGAGAGGGAATGTCTTTGTGT 

TGGCCTGGAGATC 


P000755 


D 




IM000700 


AGCTATACCTGAAATTTGGCCAAGAACAGAA 
GCTCAGGAAATAGTGTGATTTAAAAACCAAA 
ACCAATTTACAAAAGGAAGACTGTGGTGTAG 
ATC 


P000756 


D 




IM000701 


rrAPAArTttAAAfiCAACACACACAGTATTTT 

TCTGTGGGTTTTAGGATGTATCCACACTCCC 

GAACTTCCTTTCCCTGAAGCACCCCTCAGTT 

TACTCTGAAGCATGGTTTGAGTCCCAAGGCC 

AGTGTCAACTTTCTGCCAAGTCTCAATGGCA 

AAAGTCTGTTTTAATCTGCTCAGGCTAATGTA 

GATC 


P000757 


D 


— 


IM000702 


CI ICCAGICI 1 1 1 IAGCIAI 1 IAI IGAIAIGA 

ATTCCCTGCCTTATGTATCATCCAAGATTCTA 

CCTAAAATACTTCCAATAAGTATCAAGGACCA 

CTCAAATATTCACTATTGGACTTAGAAGCTCC 

ACTCTTAAAAATAGATTCTATAGAAAGAGCCT

GAAATGGGGGCATGAAATGGGTCCATCTCC 

ACCATCACGCACACATGAACAAAGAAAAGGA 

GGAAATGGTGTTAAGAAAACTTACATCATACT 

ATTTAAAAATAAGGAGGAAGGAGGGAGGGA 

GAGAAAGAGAGAAAGCTCAATGCTTAGGCAA 

GAGTGCTTAAGAAAATTACAGTTAACAGATC 


P000758 


D 
IM000703


GATCTCCTAAAACTCCCTGTGTCAGGAAACT 
TTCTGTGCTTTTGTATTGCGTTCCTGTGTTCG
TGGAAGGCCCCCACGCCTTCATCCTTGCTAA 
TTCTTTTTGGATAGCTTGTTGCTTTAACTAGA
TTGGCCCTTTCTTGGCTAGTATTTTCTGCTGT 
ACCTATGAGTGGTGTGGGAGAACTGTGCAG
ACTTCCAGGAAGCGCAGCCATGAAGCTACAT 
GTGCCTATGT 


P000759 


D 




IM000704 


GATCTGAGTGCTGGGAACCAAACCTGGGTC 
CTCTGCAACAGTTTGTGCTCTTAGCTGCCGA 
GCTTT 


P000760 


R 




IM000705 


GTACGGCGATGGGCACAGGCTTCGGGACAG 

TCCGCGCGACGCTCAGGCGGACAACGGGA 

GGCGGGCGGGGAAGGCAGGGGCTGCAGTG 

TTCACTGCCTCTGCGCATCCGGCATGGCCC 

CTCCCACTCGGACTTCGTCAAAAAACCGCCA 

CCGTGGAGTGTCCCAGTATGTGCGGTGTGG 

GACAAACTATCGCACTGTTGCCCTGGCTCTT 

CTCCTAGACCCCCTTTGTGAGCCAAAAGAGA 

AACGCTGGGCAGATC 


P000761 


B 


Mm.27393 


IM000706 


GATCTCGTTACGGATGGTTGTGAGCCACCAT 
GTGGTTGCTGGGATAA 


P000762 


R 




IM000707 


CTGGGTTGACCTTAGAAACGGGAGTTCATCT 
C CTC C AfiP A dCTC Pft f5f$ A A & ft AA ft ftTft A A ft 

GGGACTAACCATGATGAGCTTTGCACACTGG 

ACTCAGGGCCTGAGAAGGGGCTGGAGGGA 

GGCAGGGAAGACATCCCTCACGGTAGTGTT 

ACACAGCTGGTGTTGCGGGCTGCCAGGGTA 

CTCATTTGAGTCCATGCTGAGTCTCTCGGTG 

CCTCCTGAGATC 


P000763 


K 




IM000708 


GATCTCAGGAGGCACCGAGAGACTCAGCAT 

GGACTCAAATGAGTACCCTGGCAGCCCGCA 

ACACCAGCTGCGTAACACTACCGTGAGGGA 

TGTCTTCCCTGCCTCCCTCCAGCCCCTTCTC 

AGGCCCTGAGTCCAGTGTGCAAAGCTCATCA 

TGGTTAGTCCCCTTCACCTTCCTTCCCGGAG 

CTGCTGGAGGAGATGAACTCCCGTTTCTAAG 

GTCAACCCAGAGTGGGAACCGCAGCAGCAG 

CAGCAGTCGTCTGTGCTTTGGGCTTCCCTA 


P000764 


K 


Gfi1 
IM000709


GGAAGAAGTGTGTGCAGGCCATGGTCAAGT 

CCTGCATGGCTCCCATCTGGGTCCAGCAGC 

ACCCAGCCTCCAGTGCTTGCTCCTGATGTCC 

CAGTGAACTCAGGTCCTGAGCAGCAAATCCC 

AGGGGCCAGTCCTAGGGAGAAAAAGAACAC 

ACTGCCATCTCAGTGCCTCAACAGAAGCAAA 

CCTAGGCGTCAGGTCATGTCCTTGTTACCCA 

CATCACACCTAGACTTCCCTGGGTATCATGC 

TCTGTGTGAGATC 


p000765 


B 


Mm.153512


IM000710 


GATCTAAGGATATATCATTCCTAGGAGAAAAT 

GMTATTTATGACCTTGGATTTGTCAATGTTT 

TTTTAAATATGGCATTAAGCCACAGAGATAAA 

AATAAGAAAATAGATACATCGAATTTCAGTAA 

AATGAGGAAGTTCTTGTGATTCAACAGAAAC 


P000766 


A 


Mtm1 


IM000711 


GAGGTAAGTCTGTTCAGTGTAGCTATCCTTA 
GCAGCTAACAGTCCTCAAAACTTTTTAGAGA 
TC 


P000767 


D 




IM000712 


CTACAGATGCATTATTAATATTACTTTTTAAAA 

AAACCCAGTATACTGCTTGAAAACAGTGAAT 

GCAATGGGTTCTCATTCACCTTCCTGCTCTC 

AATCAATCTCCATCTCTAAAGCAAGAAGTGG

GGGCCCTTCTGGCTGAGCGAGGGGTGAAGG 

GAGGGGAAGAGATC 


P000768 


D 




IM000713 


GATCTGGAGAAGATGTCAAGTTTTAAAATGA 
GGCAG 


P000769 


D 




IM000714 


GAGTGAAGCAAGAATTTGGAGCCCAGCTGC 

CGCAGCCTTTTTCCTTTCAGCAAAGCTCGGG 

AGTGATAGATATGCATGAACCAAAGCAAAGC 

CTTGAGAGTGCCACTTGGCCCTGCCTCCTGA 

GGGTCTCAGGGCATCAGCTGGAGACCACCC 

TGTGACCCACACATCACCGACTATGAAAACA 

GCTCATCAGAGTAATAAAGATC 


P000770 


D 




IM000715 


CAATGAACAGGACACATGCTTCACACGACAG 
TCCAAAAATGCAAAGTGTGGAAGAATTCCAC 
AGCCATAGCCTTCATTACTAGATC 


P000771 


D 


- 


IM000716 


ATGCCTTCCTGGTAGAAGAGGGCCATGCTGT 

GGCGGGGAGGGGCCACTCAATTTTTCCTGC

TCCCTTTCCCTGTCCCATATTCTCAGGAGCTT 

CTAGAAGCGTAGCCTGCATCTCATGCCCTGA 

CTTGGCACCAAATGCTTGCTTTGTATCAACA 

CCGCTTTCTCTTCTGCTCTTTCCAGCTCGCA 

GCCATTCAAATAATACCACCCGGTACCCGTG 

GAATCAGGAGCAGAGATTCCAAATTGAGTCC

TAAAATCAAATCCAAATGGGCCCGTCAGCTA 

GATC 


P000773 


D
IM000717


AGGCGAGCGGATTACTAAGGACTGAAAGAC 

TCCTAAGACTTGTCTCCTGCTCCCTGGCCAG 

CGGTGGAGCTCAAGCAGAATTGCAAGCTCA 

GCTCAGGTCTCAGTGATGCAAAGCACCCTC 

GTTACTCCAATGTGTGTTACTCCTACAGGTG 

GGCTGCCTTCCACTTTCAAACACCCGCACAA 

ACAGACCTCCCACCGTATGCCAGAGCATCTG 

TTCGATGCTTTCTGGAAACTATGCAAGCCCA 

AATTTAATATCCAATCAGATC 


P000774 


D 


- 


IM000718 


GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

TGTTACAAGGTCTCATACAGAATCCAGGCTG 

GTCTCAAACTACTGGAGTCAAGCCATCTTCT 

CACCTGGCTTAGCTGGGGTCACAGACTTGTG 

CCATCATGCCCAATGGAATGCTGTTCCTTTT 

GGAAAGCCTGCTACTGTCATATACTGTCATA 

GGAGTTAGCGACTGCTGGCTTATTCCTTCGC 

TTTGCTTGGAGATC 


P000776 


R 




IM000719 


CCCCCTTCCTGTCACCTCCTGACCCCTTGCG 

CAAAGGAGGCTCGTGGCCCGCTGTCCCACT 

GGGGGATGGGGCTGGGGTTGAGAAGGCTA 

GTGAGCGCCTCTAACGCTCAGGAAGTGAAG 

TTTGTGGTTTTGGGGGCTGAGCTCCGAAGGA

GATTAAAAAAAAAAAAAAAAAGTCAGAGAGA 

CAGATC 


P000777 


D 


- 


IM000720 


CTTTGTATAAGCAGCAAACAAAAAGCCAGAG 
GCAGTCCACAGATC 


P000778 


D 




IM000721 


ATACAACAGGAGCAAAGCTGGAGGGGAACA 
GATATAGAGGACAGTTCAGGGCATCTGCAGA
GGTGCTGTGGAATGGGGAGGGGACAGTGGA 
TAAGGGGACTTACCCTGAGCATCTCGGTAAT
AAGCATGGGTCACACTGCGGAAGCGCTCCT 
GTCCTGCAGTGTCCCAGATC


P000780 


A 


Rab37 


IM000722 


GATCTATGTCATCTTCCAGGACTCAGAGTTA

AGAGAGTTACCAAGTGAGAGCTCTCATCACC 

TTCTGAAGCAGTTGAGAATTGGAACCCAGAA 

AGATGCACATGCACGGGCACACACACACCC 

ACGGGCACACACCCACCCACCCATGCAGAG 

AGAGAGAGAGAGAGAGAGAGAGAGAGAACT 

CACACTGGTACTGCAGTAAACGGGAGCTTGT 

TT 


P000781 


D 


— 


IM000723 


GATCTTCTTTCTCTGCTCAATTAGTTCACTTC
TGCTTTCATCTCCTTTTCTTTTGATAAACCAT
GAGTTTCATTAGGGCTATTACAATCACATGC 
AGTTTTTCCTTATAGTA


P000782 


D 




IM000724 


GAATTAGGCCTAGAAACATTAGAATCCAGAC 
CACGGAGCTCCCCAGATC 


P000783 


D 
IM000725


GATCTTGTTCTAGAACGACCCTGAAGGCAGC 
AGAACAGAGCAGGACTGAAGGCCACCAAGG 
GGATTTCAACTCTTCAGAAAAAATAAGTGACT 
CACCTTCTCACAAAGAGCAAGAATCACAGAG 
GTCAGATTGTCTCCTCCTGCCCATCAGGGAC 
AGAGTCCCCCATCTTTGCCTTGCTCCATCTG 
GCAGGTAAGAGATGGGAAGTCTCCTTTCCCT 
CGGTCTGCAGCATCCCTGGCATCCCTGGGG 
AGTGTTGGCACAGAACCCCCCTCCCAA


P000784 


C 




IM000726 


GATCTGTGTGGGCAAAGCCCATGTGCTGCA
GTGTGTCTGGGTAGAAATGAGTTGTGTGGTG
CTCAAATGTAAATGAAGTCCCTGTGTT 


P000785 


D 




IM000727 


GATCTCATTACAGATGGATGTGAGCCACCAT 

GTGGTTGCTGGGAATTGAACTCAGGACCTTT 

GGAAGAGCAGTCAGTGCCCTTAACTGCTGA 

GCCATCTCTCCAGCCCCCCACC I I I I I I i I lA 

AAAG ATTTA 1 1 1 IAIAGI 1 1 1 IGCI Mil IAAC 

AGTACTGGAACATCTCAGTAATTGCTAAGTT 

GTCCTTGCTCCAGGTGAGCAGTCATATTTTC

TCCAATTCTGGTTTCCTTACTTGTGTCAGAGA 

CCAAAATAGCTTGTTTAATCAGTTAGAGCTCT 

TTAGTTACCCATATCTGTGTAGTAA 


P000787 


R 




IM000728 


TAAGAACATAAAAGCAAAATTTGGAGGCTCA 
AGATTCAGTTTAGTTGCTAGAGGGCTCACAT 
AGCATGCCCTCCCCACCCGGGATTCCATTCT 
CATTTATCGAGGCATAAGGCCAGGTGTGGTG
GGATATGTGCTGGGATGCATAAGATC 


P000788 


D 


- 


IM000729 


GAAAGGCACACTGGTGAAGGCTGAGGACCA 

CCAAAGCTGCATTTCTGCTAGGCTAGGTAGA 

ACAAGAATGGTGCTCCACTAAGAACTCAAAA 

AGCCACAGCCCACCCCTGAGGCCCTCCATC 

TGACACATGCCGGTCACCTGTCCTCCCACAG 

CCCAGCACAGAGAAGCCACCATCCCTCCCC 

TTCCCACCTCCTGCAGCTGACAGTGTGCATC 

TTTCCGCACATTCCTCTCTCCTCAATCAGGTC 

AGAATGTATTCCAAAGATC 


P000789 


D 


- 


IM000730 


CACTGAAAATGGCTAGAATTCTGGTGATGGG
TGAGCCGATC 


P000793 


D 




IM000731 


GATCGGAGTCCCTCGTTTCAGAGGCCCCACT 
TCTATGGCTCCTGCCTTCCTTGGCTACATCC 
trmrTCxcTn a a ptppt^ a a apptyvtrt at 

M 1 1 1 Ox-/ 1 OMou 1 Ub 1 UOnnnUb lOIOlnl 

CAAGTCTTTTCCAGTTAGTGCGTTCTGAGTG 

GCTCTAGAAACCGCTTCCCATTACAGCGAAA 

GACCCGTATAAACCATGTTCTCTTCCTCTGT 

GACAAGAGACAACAGACACCGCACAAAGGA

CTGTCTGGCCTGGGGGGGGGTCCCTGGTTC 

ACAGCTTCAGTCCTGA 


P000794 


D 
IM000732


GATCGCTCAATATAACAGCAACATGCCAAGT

GCCACTTGTAAAATTTGTTGTTGAGCAGTCTC 

ATTATCAACTGAAGCACAATGTCAGGCTAGC 

AAGAGGCAGGTTCAGTTGTTGATTAGCGATA 

GCACACACAAGCCAGCACATGCTTTTTCTGT 

GAGTTCTAT 


P000795 


D 




IM000733 


GATCGCTGAGTTTGTTTACAGAGCAGGGACG 
CCTCAGCTCGGATGCCAAAGCTACCAAGAG 
CTGCAAACGCAAACTTAGCAGAAGCACACGT 
ACTCCC 


P000796 


A 


Cited2 


IM000734 


GATCGCACAGGTAAAATGGGGACTCACTTTA 
GCTAAAACAACAACAACAAACAGCCTGATGA 
GTCGAAAGTCTCTTTAGGTTGCCCTCTGTTC 

CTCCCACAGCAGTCTCAAAATAACCATAGTG 

CTCAAGTCCCCTGTATCAAATGGTGGTATCT 

GCATCCACCCTACAGGTGTTCTTTGATTCTTT

CTTTTCTTTGTAAGTGTGTCTGGGTGTTTTGC 

CTGAGCGTATGTATGCGCCTAGTACCTGCAG 

AGGCCAGAATAAGGTGTCAG 


P000797 


D 




IM000735 


GATCGTGAGAGGCGAGAAACCCAGACATCT 
CTAACCCTTCTTGCCAACTCAGGAGCCACCT 

CCTCAGAGGCCTCCATTTCCACAAAAGGCCT 
TCCTGGTTGTTCAGGACAGAGCCTGGTTTCC 
CTGATACCCCTTCTCTCAGTGGCCACTGAAG 
TTACAGGGATGCAGCCAGCCGTGGTTGCCA 
TGTCTGTATATGCTAATCTCCGAATTCCACTT 
CCTGTTTAGATTCTCGG


P000798 


D 




IM000736 


ACTGTCCGTGTGGGAAACGTTTAGCAAGTCC 
GAGCGTGTTCGATC 


p000799 


K 


Nmyc 


IM000737 


ATTTCTTTTTGAGTACTTCATATAAGAGCTTC 
GCATCTACACCACTCTTGCTCGCCACTCCTC
TTTTCTTCTTTCATTAACTACTGTCCACTCTC 
CAAACTTCATAATCTCTCTAATTACTATTGTTA 

TTTAPAPAPAPAPAPAPAPAPAPAPAPAPAP 

ACACACACACACGTATATGTAACCTACTGAA 

TCTTACTAAATAGCTTTACTATCTTCCAAGTA 

ACAGGCACTTGATAAATCTTCTGTCAATCTCC 

CAGAACAGAAGCCTTAAGAGTCATTTAAGTT 

CTTTTATCTCAGGCTGTTCTGTTCTATGCCTT

TTGCTTTTAATCCATCACCGATC 


p000801 


D 




IM000738 


GAATGTCTAGATGGAGACTGGACAGAGTTG 
GATTCCTAGACACCTAACAGAAGCGAAAGCA 
GGGGATGGATAAGGTGGGTGCCTCGTCCTA 
CAGCAGGTTCTGAGTGTCCGCAGAGACTCC
CATGGCTTGGCACCATGGTTGAAGCTTTCCA
TCGATC 


P000803 


C 
IM000739


CTATTTTCGTTCTCTCCGATC 


p000804 


D 




IM000740 


GATCCTCATGTCAAGGCAGGGGCAGACCAG 

GGTCAAGGGAAAAACACCTGCTTTCCTGGGT 

TGTAAATGCCAGAAAGGGAAGGCACGGGGT 

GGGTAGGGTGGAGAACATGGCCCAGACCCC 

TGTCTCTTCTCT 


P000806 


D 


- 


IM000741 


GCACCTGACTTCCTCATATAAGACACAAACA 
TCTTGAGTGCTGCGCAGGTGTACCAGGATAC 
AGGTGAATCCAATCTGGTGGAGATTTGCCCC 
TGCTGCCCTGATTAGCTGAAGCTGCGTGCCT 
rifiTcmczczmcz c ATf? a ccTd CT(vrcx c a Tan a 

OO I UMVJO 1 OOl/M 1 OUwV/ 1 OO 1 O 1 OUO 1 OOM 

TGGGAACTGAGAGTATAAAAGAGCGAGAGG 

CCCGGGTTAGAGGAGGATTATTATTCGAGAG 

AGGATTGTTATTATTGGGAGATATGAACAAG 

GGAGATATAAACAGGGGAGATATAAACAAGG 

GAGATATATGGAGAAAGAAGAAACAGGACTG 

AATAAATGTGTGCAGAAGGATC 


P000808 


R 




IM000742 


GATCCTTCTCCTGTCTTCTCTTCTGGAAGGC 
TGGGCTACATGCCAACATGTCAGAGTTTTAC 
CTGGGTTCCTTCCAGAGGTTTGAACTCAGGT 
CCTTGTACTTACACAGCAGCTACTTTGCCTAT
TGAGTCAATATTTTGTGTGTGTTTGTGTAGGT
GTGTTCATGTCTGTATACTTG 


p000809 


D 




IM000743 


GATCGTGCATGCATGGGTGTGTTTTGGGGA 
GAGGTTCTGTCCTTGCTAAG 


P000811 


D 




IM000744 


AGCTCAGCTTGTCAGGCCTGATTGTGAACAC 
TTCACCAACCGAGCCATCTCGTCAGCACAGC 
CCTGI II II IAI ICCCAI 1 1 1 CTTTTCTGTATT 
TCTGTTGAATTTCTCACATACTCTCCTTTCTC 
TTCTGCCTTCTTCTGGTTTCTGCATCATTTCT 
ATATTGACATTTAAACAACCCCCAAAATTCAA 

nATAPATPAAP'AAAAATTTATTPAAPTAfTIY^T 

TTCTTACTTCCATATCAATAATGAAAGAAAAT 

TAAAACCTTTCAAATTCAACAAATCCCTACAC 

TACATATAATCACTTTCCTCTATGCTAAATCC 

AACTTGAAATTATATCCTCAATACCCTGCTGG 

TATTTTTACTGTCTACATCACTGCCTAGTCTT

CGATC 


P000812 


D 


- 


IM000745 


CTGGTATATGAACGAAGTTGGTCTCTAAAGG 

CCGTCTAGAACAACGGTTCTCAACCCGAGG 

GTCGCACCGGGGTCACCTAAGACTACTGGG 

AAAGCACAAATATTTACATTACGACTCATAAC 

AGTAGCAAAATTACAGTTATGAACTAGCAAC 

AAAAAATAGTTTTATGGTTGGGGATTACCACA 

ACATGAGGAACTGTATTCAAGGGTCGCAGCA 

TTAGGAAGGTTGAGAACCACCGATC 


P000815 


R 
IM000746


TTCTMCCTGCTAGGGTTTTCTCACGTGGGT 
TCTTCTTTGAGGGCTCTCTGGCTTCCCTACT
GAGCTGTAGCTGCCAAAGTTGAAGGGCTGC 
GTCTCCCTTGCGTCTCCCCAGTCTTTACAGC 
TCCTGAAACACACTAAGGTATTTATTCAAATC 
CCTGTTTTGTGTGCGATC 


P000819 


D 


— 


IM000747 


AGGGCCCTTCCACCTCTTCTAGAATTCGGTA 
AGCTAAAAGTACATGTATCCGATTAATCTGAA 
ATAATTTTGTAGACAGTTTGGTGACGGGTGG 
AGGGTGTGTGGTTGCGCGATC 


P000820 


C 




IM000748 


GATCGGCGAGACCACGATTCGGATGCAACA 
GCAAAAGGCTTTATTGGATACACGGGTACCC 
GGGCGACTCAGTCTATCGGAGGACTGGCGC 
GCCGAGTGTGGGGTTCGGACCAA 


P000823 


R 





IM000749 


TTGGCTGTGGAGATGAACGTGGGAACCGTG 

GAAATGACCCTAGAATGGGGCTCAAATGTGA 

AAGGCATGCCAGAGGTTGCTCTGTTGTTTTA 

AGTCCCTGCCGAACATTAGAATTTAGCCTCA 

GTTTTAAAAGCTGTTACTGCCTAGTTGGGTG 

CTTCTTTCTTAAAAAGCAACCAAAAAAAAAAA 

AGCCGTTTTCACTCTGAAATGTATTAGAAATT 

TGCATTAGCCCAATGGCTAATAAGCGATC 


P000824 


D 




IM000750 


GTTATAAGGATTGCATACAAATGGCATCAGG 

ACTGGATGTGGTGGCACATGTCTTGTATCAC 

AGCACTTGGTGAACAGAGGCAGGGGAATCT 

CTTTGAGTTACAGGCTAGCCAGCATGACACG 

GTGAGACTCTGTCTTAAACAAACAAACAAAC 

AAAAAAACAAACAAAGGTAGCATAAGAGCGA 

TC 


p000825 


D 


— 


IM000751 


ACCrGAAICI IGAAIAAIGGGCIGI 1 1 1 ICCG 
ATC 


P000827 


D 




IM000752 


AACTAATACCTTTCCTTCCGCTGCGATGTTTC 
ATGAGACTCTGGGTTAGTGCATGGTCAGGG 
GCCCAGGCAAACAGTGGCAGTTCTGCCCAG 
GATC 


P000831 


D 


- 


IM000753 


GTTTAAAGAGCCGGTTCGACCCGCTTTCCGT 

TTCGCTCCGGGTCAGCTAGTACTGTGAACCG 

CTCGGTCGGGTCCGGCGCTGCTGCGCACCT 

ACTCGCCGGGACCCTGAAGCCCCCCAACTA 

CATATAGGGGTCTTCCCGGAAAGTACGCAG 

GAAGTCGCGTTCGGCCCCCTCCCCCCAGCA 

CCACACCCAGTCCCTTCCACCCCCCGGGAT 

C 


P000832 


D 
IM000754


GATCCCAGTAGAGACAGAAACAGTGCCTTTG 

GTTAAGAATTCCAGGCAGGATGGTACAGGAT 

TGCAATCTCAGCATGGGAGACAGAGGCAGG 

ATTTCCAGGCCAGCCTGGGCTACAGTATAAA 

TGGGACCCTGTCTCAAGTAATTGAAAAAAAA 

ACAGAGAAAGAATTTGGAGACTGTGACTATA 

GCTTGGTGATGGAGTCCGTTTGCCTAGCAGA 

GTGAAGCAGCTGTGCTCCTGTGTTCACACCA 

CAAAATAA 


p000833 


D 




IM000755 


GATCCAGTGAATCTGGGCATTGTGAGTGTGT 
GACACAACTTGCTCTATGTGCTGTTAGGGAT 
TTGTGCATGCTCAGCCAACAACAACCGCCAA 
CTTAGACTGATGCTGTCCCCCTGAGAACACA 
GACTGACAA 


P000834 


B 


Mm.131336


IM000756 


G ATCCTCCCTAC.C G GTClfTTnGGGr AG APCT 

CCAGCCCTTCCCCAGACACTGTTGGAAAGCA 

GGCACGCCTTCCACAGTATGGTCTGAGGTTA 

ACCCATGACAGCACTCTGGGTGCCTGGTGG 

TGTTCCTGGTGGGGACGTCAGTAGCTGTAG 

CTCTGTCATTGGTCCTTGCAGCGTCTCATTC 

CAACTATTCTTCCCATCACTCCTCT 


p000835 


D 


- 


IM000757 


ATATGTGTTTGTGCGTGTGTGTACATGTGCA 
TGCATGGCATGTATGTACCCATATAAATATGT 
GTATGTGTGTGAAGTGCTGATGTATTTCACA 
CAGCATTTTGGATTTAATGGAGAAGGTAGCT
CAGATGTCAAGTGTGCCCTCCTGTCAGGAGA 
GGAAGCCTGATGTGCCTGCTGTCATAACTCT 
GGTTTTGATAAATACAGCACGAGTGA 1 H 11 G 
GCTGTTGGGTTTGCCGTGTATGGATC 


p000837 


D 




IM000758 


GTTTGCTTGCAACATTGTCATAGCTTAGTGAA 
CAGTATAGCATTGTTCTGGCTCAAGAAGCCC 
TGGTTCTTCAAAGCTCCTACTTAGATGAAATT 
ATTTGCATCACAAACAAAAATTGTTTTGCATT
1 1 i f AGATAATGAAGGATC 


P000838 


C 




IM000759 


GATCCTAGGCCAGTCAGGGCTACCAATAAGA 

ACCTGCCACACACACAAAAGGAAAGCAAATT 

TTTGCAAAAACTCTAGTCTCATGGTGTCACG 

GTCTTTAAACATCTTGAGGGGCTCGAACTGG 

TGAGGTGGCTCGGAGGTAAAAGGGCTTTGA 

TGCACAACCTGAGTTCAACCCCGTGTTTTAA 

AGACTTTCTGCAATGATTCTGGTCTGCAGTC 

CTAGCCCAAGCACAGTCAAGGAGAGATTGA 

GGCTGAAACGGAAGAATGGAAGTTTGCATAA 

CAGCTCAGTGGCAGAAATAACAGGAGAGAC 

CTGACCTTAAAAACAGGGTGTAAGGTGAGAA 

ATGATGACAAATGACATCCACTTCAACTGTG 

CTACGAACAGCTACCTGTTTGCACACCCCAA 

ACACACACACACACACA 


D000839 


D 
IM000760


GTAAGAGGGAATGTACTCTCTGCCATCGGGA 
CACCCAGTGGAACTGCTCACCTGGAGTCTTG 
CCTCCACGAAGACTAGGATC 


P000840 


D 




IM000761 


GGGACTTCAGGGCATAGAGCTTAGTTCCAGA 
CAAAACCAAAo 1 1 AbUAb f OoL/O 1 0 1 U 1 OT 1 
AAAGACGTTCTCTCTAGCCGCAGATGACCTC
AGAAGGGGCTCTGGGAGCCGACTCCCACCC 
TTCCTTCTCTGTTTACAGAATCTGGTTGGGCT 
GTGAGGAGCGACCCACGAGACGGGCTCCCT 
GTAGTGAGTTAGGCCAGTGGGAACCAACGA
GGATC 


P000842 


D 




IM000762 


ACACACACTAACACACACTCACTCACACATA 
CTCACACACACTCACACACACTGTCACACAC 
ACACACACACACACACACACACACACACTTT 
TCCACCAGGATC 


P000843 


R 




IM000763 


GATCCCTGGATATGGCAGTCTCTACATGGTC 

CATCCTTTAGTCTCAGCTCCAAACTTTGTCTC 

TGTAACTCCTTCCATGGGTGTTTTGTTCCCAC 

TTCTAAGGAGGGGCATAGTGTCCACACTTCA 

GTCTTCATTTTTCTTGAGTTTCATGTGTTTAG

CAAATTGTATCTTATATCTTGGGTATCCTAGG 

TTTTGGGCTAATATCCACTTATCAGTGAGTAC 

ATATTGTGTGAGTTCCTTTGTTCAAATTTCAT 

TTCTATCACCATTGTGTGTATATGTGTGTGTT 

GTGTGTGTATGTATATGACGTGTGTATGTTGT 

GTGTGTATAIATAAGGTGTG 1 ATGTTGGGGG 

TCAAAGGCATGCTCATGCCACAGTGAATGAG 

TAGACATCAGAGGACAACTTTCAGGACTCAG 

TTCTCTTGTTCTACCCTGTGGTTCCAGGACA 

CTAACCCAGGTCATCAGGCATGGTGACAAAG 

GTTTTGACTCAAGGAGCCATTTTACATGCCT 

CATAAGAAGGGCC 


P000844 


R 


— 


IM000764 


GCACTAGGAAGGAAATTGACCCGTGTTGTTG 
GTTTGTGTTCTGGTTTTGTTGGTGGTGCTTTT 
TGI mm 1 GTTTGTTTG 1 1 1 1 1 1 1 GTATCAG 
GATC 


p000845 


R 




IM000765 


GAICC IGCI 1 ICICIi 1 1 GACACAGAACAC 1 I 
CTCCTGATTGACTCTGGTCCAGACATTTCTTT
CAAAGGCAGAGGACTCTGGCTTAGCTGTGG 
ATGACTTCTCAGATGAAGTTCATTGGTTGCG 
ATTGGAAACGTAATCAGAGCAGG 


P000847 


D 
IM000766


GAICGCA! TAGGGI MINI FAIGGI 1 ICICA 

TCTTCTCTTCAAATTAGCATAGAAGCCTCTTC 

CTAAAGAATGGATACTTAATTCTTAACTTGAA

AATATCTTTTCTCTGTGTGTTTTCCTCTCCATT 

GACTGTTCGCTCTATCTATCTATCTATCTATC 

CATCTATCTACTGAAATTAAAAATAAGGGAAC 

GCCTTCTTCTCTTCATTCTTGTTTGTTGTTTG

TTTGTTTGTTTGTTTTTGAGACAGGGTTTCTC 

TGTGTAGCCCTGGCTGTCCTGGAACTCACTT 

TGTAGACCAGGCTGGTCTTGAACTCAGAAAT 

CTGCCTGCCTCTGCCTCCCAAGTGCTGGGAT 

TAAAGGCGTGCACCACCACCACCTGGCTCT 

PTTP ATTPTTTTTAAAACGAT I I I 1 GAAACCTT 

TTTAGTGAGGTCAACATTGTGTACTCCAGTC 

CCACTCATCTTCCTGTCCCTTCCCTCTTAGG 

CCTGCCTGTCTGGTACCTCACTCATGTTTGT

GTATTCTCTGTGCTGAGCCTCTTCTGTGCTTT 

CCCAGCACATGGCTGCTGGCTCCAGTTTCAT 

TCCAGTCCCTTGTGATGTGAGCCTAGTTCAG 


P000852 


R 


- 


IM000767 


CTCTCATGGCATGGGTCTCAAGGTCCTGCCA 

TTTCTGCTCCATCTTTACCCCAGCACATCCTG 

TAGACAGGACAAATTGTAGGCCGGAGGTTTT

GTGGCTGGGTTAGAGACCCAGTTTCTCCACT 

GGAAGCCCTGCCCGGTTACAGGAGGTGACC 

AGTTTCTGGCTCCATGTCCCCCATTGCTAGG

AGTCTTAGCTGGGGTCATTCTCACAGATTCC 

TGGGAGATTACTCTATTTTATCTCCTTGTTCA 

AAGTGTTCCATCAGATATTAATTATTCTCAAG 

ATTCAATATTCTCAAATATTATTCTCAAGCTAT 

GGACCCTTCAAATTACAGATAGATTTTATGAA 

TGAAAAGTTGTGTGGTTTGAATATGTAGTTGA 

GGGTGACTTTGAACTTCTGGTTTTCCTGTGT 

CTACCTTCCAAGTGCTGGGG7TACAGGTATG 

AGCCATCACGCCAGTTTCTGTAGCACTGAGG 

CTCAAACACAGGGCTTCTGTCTGCTAGGCAA 

GCACTCCACCTACCAAGCCAAATCCCCGGG 

CTTTACTGCATCTTTGTGTGTATATGTATGGT 

ATGTGCGTGTGTATGTAAGGATATATGTACC 

TGTGT 


P000854 


R 


- 


IM000768 


GATCAACACCTGAAAAGTCGCGCCGCCTATA 

CACATCCCTAATTGAGAAGTATGTGGAAGAT 

TCCATCCGTGAAATTCAATTATCATGCAAGC 

CAAGTGGAAGCGCTTCCCTGGGGAAGGAAC 

CCAGCAGCCGCATCAAAACGACCCCACCTG 

TCTATTTTCATGTCAAAAGAGTGAGAAGTCTG 

GGTGATGAAATAGAGAGCATACATCAGCTTA 

ATGAAAATTTCCAGGGGTCCCTGCCTGTAAT

GGGAGTCCCAT 


P000858 


D 
IM000769


GATCACCACCAGGGTGTTGAGAAAAAAAAAA 
AGCAAGTTAGTAGATGTTAG 


P000860 


D 




IM000770 


GATCTGACAAAACCTACCTGTTTTTGAACACA 
TGTGGGACAGCAGTCTGAGAGAATCTATGAA 
TAAAATTCCTTTCTGAGTCTGGCACATTGGTA 
CAC 


P000861 


D 




IM000771 


GATCATTATACCCCAAATGGTACTGTATCTAT
ATATACCTCAAACATGTCATGTTAAAGAAAAT 
ACTCTGTTGAACTAATTCACTTGTTT 


P000863 


D 




IM000772 


GATCACAGGACTGAATCACATTTATGCCAT 


P000864 


D 


- 


IM000773 


GAICAI I IAI I IACTTGI ! I IGGIGI I ICAIGI 
TTGTGGCTCCTTATGTAGTCTAGATATTAACT
TGAAGTCTGAAGTGGAACTACCAAAGATTTT 
CTTCCATCCTCATCT 


P000865 


D 




IM000774 


GATCAACCGCAGATGAGGTCTATGCAGGAAA
AACGATGTCTGGAATTTTATTAAAATTGCTCA 
GC 


P000866 


K 


Myc 


IM000775 


GATCATCATGTCAAACCTGACACGTGACGAG 
ACAAATCTGTGTGCACAGAGGTGTGACATCC
TAAAAGTACTAACAATACCGCTGGGCAGGGA 
CACACGCGGCAATTCCAGTCCTGGTATCCAT
GGCTCAAGCTCTGCACGGAGAGCCCGGCAC 
ACGGCAGGAGGGAGAGCCACAGGCTAAGGA 
GAGCAATGCTAACTAACATGGCACCCGTGTT 
AG 


P000867 


D 




IM000776 


GATCTGGCTTCCAAGGGCCTGTACTCATGTC 
TACAATGCTCCTACACAGATATAT 


P000868 


D 


- 


IM000777 


GATCAGCCTTCCTCCAAAGCTACCTGCATAG

AAGAGACCTCTGCTCTCACCTACTCTCCTCT

ACAGTTCAGCCCATATGGCTTCACCTGCATC 

CCCTACACACACACACACAGACACACACACA 

CACACACACAAACACGCACACAGCACACACA 

ACACACACAACACGCACACTCACAACACAAA 

CACACACAACACACACTCACAACACACTCAC 

ACACACACACAACACACACACACAACACACA 

CTCACAAACACACTAGTACACAAAGACTCCA 

ACACACACATTCCCATGCACTACTCCCTCAG 

TATCCGCCGCATTTGTGTTCACACTCATCCA 

CACTCTCACACATGTAGCACACACACATCAT 

TCCTACACAGGCATGGACACACACATGCTCC

TATACAGGCATGCCCAGTACTCTCACATGCA 

TGTTTGCACGTTCCCAAACAGGTTCCCACAA 

GGGTTTGGCAAAGTACATGCATCCTCACACG 

CTAATGCAAGCCGTCACACCCCATACCACAA 

GCATGCAC 


P000870 


R 
IM000778


GATCAGATGTGGAAATTAGAGAGAAGTTTTT 

AACGGCTCATGCACATTTCTGAAAACTCTTT 

GCGAGGTATACTGGTAGATAAATGAACATTG 

GTCAGACTCCTCTAGTTTAAACCACTCTCTTC 

CCCGCTATGGGGGGAGGCGAGAGGCATTTC 

TAAAGCTTATATGTAGTTGCAAAGTGTGTGT

GGTGTGTGTGCATGTATGTGCATGTGGTGTG 

TGTGTGTGTGCATGTGGTGTGTGTGCATGTA 

TGTGCATGTGGTATGTGTGTGAGTGGTGTGT 

GTGCATGTGTGTGCATGTATGTGCACCGTGT 

TGTGTGTGTATGTGTGCATGTGGTGTGTGTG 

CATGTATGTGCATGTGGTGT 


p000871 


R 


- 


IM000779 


CTAACATCTACTAACTTGCTTTTTTTTTTTTCT
CAACACCCTGGTGGTGATC 


P000872 


D 




IM000780 


GATCATAAGGACTGTTAGCAGGCAAAGGCG 
CGTGCCCAATTAAAAGATGGCTTTCGTTCCA 
AGAGGAATACTCTGGCAAAGTCCCAAGCGCT
TCGGAAGCCCCTCCCTTCGCTCTCCCACCCC 
AGCTTTGATGCTCTGATTATCCTAA 


p000874 


D 




IM000781 


GATCAGGCTGGCCTTAAACTCAGGGAGATTC 
ATATGGCCCTGCCTTCAGGGTGCTGG 


P000875 


B 


Mm.83635 


IM000782 


CI 1 ICI I ICI 1 ICI 1 ICI 1 IC! I 1 1 1 1 1 ICIGA 

GACAGGGTTTCTCTGTATAGCCCTGGCTGTC 

CTGGAATTCACTGTAGGCCAGGATGGCTCA 

GTCTGCTTTCTTATAGAACTCAGGACCACCA

GCCCAGAGATAACACCACTCACAGTGGGCT 

GGTCCTCCCCACATTGATC 


P000876 


R 




IM000783 


GATCACACACTTCACTGTGGCTTGTCAACTG 
TGATTTGCTGATACAAGGGCTGTTTACAAGT 
CAGCTATAGCTCCGCATTGCAGCTGCAAC


P000877 


D 




IM000784 


GATCACTAATTGAGAAAATGCCCCACAGCTG 

GATTTCGTGGAGGTACTTCCCCAACTGAAGC 

TCCTTTCTCTGTGATAATTCCATCCTGTGTCA

AGTTGACAGAAAACCAGCCAGTACACAAGTC

GACACAAAACTAGCCAGTACACAAGTCAACA 

CAGAACGCGCAf AAGPTGAAGGCAAAGAGA 

ACCAAGCATCTACCAGGCCTCAGTTGCTATG 

TCCACTTCTGCAGCCACTCCAAAACACCTGT 

CAGAAATTCGGTTTGATAGAGAACTCACCGA 

GGGATTTCCCTAACACCAGGTCAACCAGGG 

CACCTCAAACCTGGAGGCACGACTGGCACA 

ATACAACCTAA 


p000878 


A 


Ccf5 


IM000785


GATCACTTGATAAAGATGCTCTGAGCAGAGG 
CTCACAGGAACCCAGCCCTGTGTGCTCCCC 
AGGAGCGAGATTCAGCAGTCAACAGTGCAG 
TGTTCACGTGACCGTGCGCAGGCCATGAGC 

AOTAP 

t » W 1 # *w 




Q 


r \iv • vw v ■ 
IM000786


CTCCTTTTCAGCAAGCTCCTCACATCACAGG 

CCTTCTCTTGGGATGGCAGCCGCCTTCTATC 

TGGAAAGTATGTGACAGCTCACACAATCCTG

TAAGTCTTCCATGTAATCACATTCCACTGCCT 

CTCTCTGAACGTGCTCCATGCCAGGGCCATG 

TGGAGGGAGCAGCAAGACTTGAGCTCAGCT 

AGTCTATGAAGATGGTGGCAGAACAGGCTCT 

GCTGCCTTGATC 


P000881 


B 


MMU7675 
4 


IM000787 


GATCAAGAGTTCAAAGTCATCTTCAGCTACA 

AATGAAGTTGGAGACCAATCCAGACCCTCTC 

TCAGAAAAAAAGGAAAAAGGAGAAAGCAAAA 

GGAAAGGAGGGGGAGACCGAGAAAGAGAA 

GAGGGAAGGAAAGGGAAGTCAACAGAACTC 

AAGGTCAGCCTGGGAGGGTGAATGAGGCAT 

TGTTGTCT 


P000882 


B 


Mm.13880
9 


IM000788 


GATCACCTCCACTTTATGGTGGACAGAGGAT

GGCAGTAGTAACTGCCCCAAGGAAACAGAA 

ACAACAACAACAACAACAACAACACCTCCAA 

AAAGACCAAAGCAGTAAGCTGTAGAACAAAT 

GCAAAGAGCCAAAC 


P000883 


R 




IM000789 


GTTCCACCTATAAGGTTGCAGACCCCTTTAG 

CTCCTTGGGTACTTTCTCTAGCTCCTCCATTG 

GGGGCCCTGTGATC 


P000884 


R 




IM000790 


GATCACATGGACCGATTGCCGCGGGACATC 

GCACAGGAGCGTATGCACCACGATATCGTG 

CGGCTTTTGGATGAGTACAACCTGGTGCGCA 

GCCCACAGCTGCATGGCACTGCCCTGGGTG 

GCACACCCACTCTGTCTCCCACACTCTGCTC 

GCCCAATGGCTACCTGGGCAATCTCAAGTCT 

GCCACACAGGGCAAGAAGGCCCGCAAGCCC 

AGCACCAAAGGGCTGGCTTGTGGTAGCAAG 

GAAGCTAAGGACCTCAAGGCACGGAGGAAG 

AAGTCTCAGGATGGCAAGGGCTGCCTGTTG 

GACAGCTCGAGCATGCTGTCGCCTGTGGAC 

TCCCTCGAGTCACCCCATGGCTACTTGTCAG 

ATGTGGNCTCGCCACCCCTTCTTCCCTCTTC 

ATTCCAG 


P000885 


K 


Notch 1 


IM000791 


GATCATACGCAATGATTTCTTACCTTATGATA 
TAATTATGTTTAGAGGGAAAACTTTTTTTTAA
ATTGAAGTTCATTTATTGTATGAAATTATTTCA 
TAA 


P000886 


C 
IM000792


GATCAGCATGGTCTACAGAGTAAGTTACAGG
ACAGCCAGGGCTCCGTGGAGAGACCCTTTG 
TCAGAAAACAAACAAACAAAAAATTAGAAAGA 

fJACPPTPTrTPTfSATTTfSAPPAATPAPPPfiT 

GTCAAATCTTGCCACAACCGAATCACCACCA 
AATTGCCAGACAAGCGGCTATGCTGGGTTTC 
TGAGGTTGGACTCCTCAGGTAGCCCGTGTCT 
AGGCAGAATGATGCCAGCAGCTACACTTTTG
AGAACAAGGTCAGGTCAGGACTTGCCGCCA
AACCTAGGAATGCAGC 


P000887 


R 


- 


IM000793 


GATCAGTCATGTCCTTTAGACGTTTACTTTCA 
TCCCAACTTGGAACATTTCAAGC 


P000888 


D 




IM000794 


tt a r* a a a nnr* Afs a a at atp a 1*2 a a a a AfiP pth 

AAGTAGCAGCTGTTAACCTGTACCAGGAACT 

GGCCGAAGTACACACGCGTTAACTCAGCCC 

TAATTATTCTCGGGAGATACAGTTGATTATCA 

TACACATGTCAAAATGGAAAATAAATGGGTA 

ACTAAAAATTGAGGAAAATAAGATTAACACTT 

AAACAACCTAGTTCATTATGCCACGGTGATC 


p000890 


D 




IM000795 


GATCACAGTGGGACAGATTAAATGTTA 


P000891 


D 




IM000796 


AAAPAAATAPAAA f5 Tfi ATA ATTflTf5TPi ATP 

TGAACTTGTCAATGAGATAGGTAATTATCTCT 

GGGCAATGGGTAAATGTGCTGGCCAGCAAA 

CCTCACAGCCAGAGTTCAATCTCCAGGAACT 

TAGGTGGGGAAGGAGATAACTGACTTCCAAA 

TGCTCACCCCCAAATATACAATTAAAATAAAA 

ATCTTCCTTTTATGAGTAGCAACTGATC 


P000892 


D 




IM000797 


TACCCCTGGTCCTCCAACACTCCGATC 


P000893 


D 




IM000798 


GATCATGACATAGACTTGAGTCACTTCTCTG 
CAGTTTGTCAATmAAAGCCCCTAAGGGACAG 
TGTGGACTTTAGAGATAAC 


P000894 


D 


- 


IM000799 


AATGCCAGCCATAGTGGCACACACTTTTAAT 

frPAAPA PTP A A ft A A {VTTAAn 1 1 1 PTPTT 

AGCTCAAGGCCAAGTAGCTTGGTCTACTCCG

TGAATTCCAGCCCAACTACATAGTAAAACTA 

GCCTTAAAAAAAAAGGCACAGGCAGAGGGA 

GATAACAAAAATGCCCAACTCCTAGCTACAG 

TAACTGTAGGAATTAAGATAGAATCTGTAGTT 

TGTTTATCATTATCGTGATGATC 


P000895 


A 


li 


IM000800 


GATCATGGCTTGATTGTAACATTATCAAAGCT 

TCCTTGGCACACTGCAGGGCTGTCTTCGGG 

AAACTGCGTATTGTGCTCTTCAGGTACAAAG 

CATAGAGCCCTTACATGACAAACGCTGGGGT 

TAACTTCTTCTAGTTCCCTCTGCCCCACTTGT 

GGCGCTTCCCACTCATGACTTCTTCAGTGTG 

TATTCACTT 


P000896 


D 
IM000801


GATCATGCTGAACTCTTGAAAGTATTCTAGC

AAAATGTGGCTTAAAAGAAAGAACAAACATTA 

ACTAGGTATGCTTTGAAAAATTACCTGTGGTA 

AAATTTCCACAAGCATGAGAAGTTGTTTCTTT 

TGTTGAACCTTCAGAC 


P000897 


D 




IM000802 


GATCATATATCAA 1 I 1 IAI 1 1 1 IAACI 1 IGTI J 
GTTTGTTTGTTTGTTTGTTTGTTCGAGACAGG 
GTTTCTCTGTGTAGCCCTGG


P000898 


R 


- 


IM000803 


ATTGTGTATCCAGAGTGTGACAAGGTATATA 
TGGTTGTGTGATC 


P000899 


D 




IM000804 


GATCTTCTGTCTGGAAGAGTGCTTGCTGGTT 
CCGACTAC 1 1 I 1 1 1 1! 1 1 1 1 1 1 1 1 M 1 1 11 1 1 NG 
CTTGGGTTTCANATTGGCTTCAGGTTCTGGG
CCCTTTCGTGGGTTGTGCTGCANAGCCCCAN 
ACAATGTCTTGGG 


P000900 


R 


- 


IM000805 


CAGGAAACCAGGGGAAATGGGACACAGTGA 
CATCTGAGTCCTTAGAAGAGGTCCCACAAAG

CXTCT AT ATCX A CCTAC^C A A P f2TP A PTTPTP. A 

TTATTTCTCAGACACAGTGGATGTTTGTCACA 

GCACACTGTAGGACATCCCAGAACAGCACC 

ATGGGAGACCATGGTTGGTGCAACAGAGAA 

CATGCACACTGAGACAGTACAAGAGTTCCCA 

AGCAAGCAGACACAAACAATGGACTCAATAC 

ACATACAGTGGCAGATC 


p000902 


c 




IM000806 


GATCTGCTCACCAAAAATCTTGTCCTAGGGA 
AGTTGAGTTTGAACTGCGTGCTTACTGGCAA 
ACACGCGGTGCCCAAATTTAAA 


P000903 


D 




IM000807 


ACAGTTCCCCCTGGAAATGGTCCCTGTACCA 
GAGGAGCAGATC 


P000904 


D 


- 


IM000808 


CTGGGGCCCAGACTCCAATCCCGAAATATCA 

TT A PTH nCZ C d P APTTPTP PP; A P;P, A A rVTTT 

ACACCAGTACCCTAAGTTCAAGTCTCAGAAG 

CCTCCAAATCCTCGTTGCACCCCTATATTTCA

CTTGGTCATCCGACTGTAACTCACTCACCGA 

CAAGACAAAGAATATCTTAGGCTCCGTCGTA 

AAAGAACGAGCCCGGTTCACCGCAGCTCCT 

TTTATAGTCTCCTTTGTGCGAGATC 


p000905 


B 


Mm.21798 


IM000809 


GATCTGAAGATATTTTGACAACAGCTAAAAAA 

AAAAAAACCAAAAAAAACCCCTTATTACTAAC

CAAGGGAAAATGCAAAAATAATTAAAAGTTC 

CTCAATTTTAAGTAAATATCCAAAAAGATTGG 

TTGTATAACAAAGTTGAAGAGTCAAACAGTAT 

TTGAATAA 


P000906 


D 




IM000810


AGCTCATTGCCGTTAATTTTCCTCAGCCTAAT 
GAGAATCTAAGCCTTGATTTGTATGTACCATA
IM000811


CCTTGAACCTAGTTCAGGGAATAGGCCACCT 

GGGTGGGACTAGTGCTGGTTGGGGATGAAA 

AGACAGTTGGCTCAGGTGAACCCTGCTCGC 

ACCCTGGTCATCCTCTGAGACTGCTTTGATT 

GCTGACCCCAGTGCTCCAGCAAGAACTTGC 

GTTCTTGTTCTCTCCACTCAAGCCGGAAGAA 

ATCTGAGGAGAGGGTGTGAATCCTGAGCCA

GGATGTCCAAAACAACGGAGTTGAGCCAGA 

AGGACGTCTAGTTGGGCAGAGTTAGCTCAGT 

CCCCTGACCCCCAGTCCGTGCAAGCTCGAG 

GGTGTTATATAGTGATACAGATC 


p000909 


D 




IM000812 


PATPTPTTPTTATPTPTAPP Mil nGGGCACA 
ATCTTATCTGGGGACACCACAGAGCCCAAGA
ATTGTCCTGTATCAGAAATTTGGACCTTTTCT
GTGGCTATCTGTAAACCCCACTGACTTAAAG 
TTTTAAGTAGAAAAGGATATGCCTTTTGTAGC 
ATGGTAAGGTCTTTATGGCACAGGAGGATGT 
CATCCATGT 


P000912 


R 


- 


IM000813 


CI ICCI 1 ICC! 1 1 II IGAAACAGGGI 1 ICICI 
GTGTAGCCCTGGCTGTCCTGGACCTCAATCT 
GTAGACCAGGCTGGCCTCGAACTCAGAGAT

C 


P000913 


R 


- 


IM000814 


GATCTGCTCCACTTTACACAGCTGACCATGA
GACCATGTNCACATAG 


p000914 


D 




IM000815 


ACATGACATATCACCCTCATTCAGAGTTCAG 
AGTCTTCAGAAAACTGGGCGCCTGAAAAACC 
TGACCTTTTAAAI 1 1 1 CGTCCATAGTTTCTTCT 
GTTGAATGAATATTCATTTAAAAGCTTCATAA 
ATGCCAAGATC 


p000915 


D 




IM000816 


GATCTTCACAGCGCACCCAGGGATC 


P000916 


D 


- 


IM000817 


CI 1 1 1 ICI IGGIAI 1 1 AGGGAG 1 CAGGAAAA 
GAAAAACCATTGGG 1 1 1 1 1 ACATTAGCTTTCA 
GGTAGGGTTGTGGCTTTTGAGCAACAATAAC 
GTATGACCTTGTGGTCGGTTCTAGATC 


p000917 


D 


-- 


IM000818 


GATCTTCTTATATCTGGTTTCCTGGGCGCTTC 
CTGGTAT 


p000919 


D 




IM000819 


GATCTCTGACAGGGTTTCAAAGAACTGTTAC 

TGATGTTTAGATTGCCTCTGAAGACATCACAT 

ATACTGTGCTACTCTGCCTTGTCAGAGTCCC 

GGGCCCTGGGCACCCCAGACGGCAGCAGA 

GGAAGAGCGGGGTATCACTTTCTATACTTCG 

GTAAAGTCATTGGGATATGTGCCCT


P000920 


C 




IM000820 


GATCTCCTCTATCATTTATCTTTCTTCCTTCCT 
TCCATCTGTTTGTTT 


P000921 


D 
IM000821


GATCTGCTCACCAAAAATCTTGTCCTAGGGA 

AGTTGAGTTTGAACTGCGTGCTTACTGGCAA 

ACACGCGGTGCCCAAATTTAAGGAGTGCCAA 

CGACTTCGCGGGCCAGCAAGGTGAAACCGG 

AGCGCGCACGAGTGAGCAGTGGCCAGGAG 

GCCTGGCCAAGAGGCCAGGGTCCCTGAGCA 

TGACCGAGAGCTGGCGTGCTCTCTGTAACC 

CCCAATCAGTTCACCTAATCTCGGGTCGAAA 

CCTGAGCCCTGCAGGAGGCGGGGCTGAGA 

CTGCATCCCAGCTCCTGGCCCGCTCCAGGG 

GCGACCC 


p000922 


D


- 


IM000822 


CCAGGCATCTCCATTCTTAATCCAGATC


P000923 


D 




IM000823 


CATAGACTCTTTCATTTAGAATAAAGTGTTCC 
ACCTAACATCCTGTAGGAAGTGATGAAACTA 
AAAAGAAAAATAAACGCATTTTCTCTTTCTCT 
CGTTACTTTTTCCATTCACTAAACAAAATTGA 
CTTTl 1 1 1 1 1 1 CCATGAGAGTTCACACTGGGT 
CTGCCTCAGTAAGAGTCACACTGTTCAGCCC 
ACACACGCTGTGATATGTTATTTACTCATTCT
CTTCTCAGGAACCACTCTCACATGTGAACCC 
TGAATACCAGCTCCCTCCCTCTTCAGATC 


P000925 


D 


- 


IM000824 


ATAGGTTCTGTCTCAAAACAAACAAAAAACCA 

AAACATGTCCACAGGGTCCAACAGACACAGT 

CTCCGCCACTCACAACTAATGGGTACACAAA 

TACACACCTCAGCCTTACATGGTTACAGAGA

GAAGCAGGACCACAAGGTAGGCAGGCACCT 

AACACTTGCTTCTTGGAAGTTGGAGCACACA 

CACACACACAGAAACACACACACACTTTCTC 

ACACTCACACACACATTCTCTCTCTCTCACAC 

ACACACACATGCACACATGGTCTTGTACAAG 

CTCCTCCTGGGATGGGCACACACAGGGGTA 

AGAGGACTCCAGATC 


P000926 


D 


- 


IM000825 


GATCGAACACNCTNGGACTTGNTAAACGNTT 
CCCACACNGACAGA 


P000928 


D 




IM000826 


GATCGTCTGGCCCGACCGCGCCTCAGTAGA 
TTTGGGTCCTGGTCTGAGCAGCCGGGCTGG 
TGCGGGTGTCCTCACTAGGATAATGAATACA 

GCTCCAl. 1 AOO 1 A 1 AO 1 AUUOAAuAUoAUUU 

CTCACACGCTCTGCGAGGAAACCGGTCTTC 
GGAC 


P000930 


D 




IM000827 


GATCGACCGCAGATGAGGTCTATGCAGGAA 
AAACGATGTCTGGAATTTTATTAAAATTGCTC
AGC 


P000933 


K 


Myc 


IM000828 


AGTAGACTGAGATTTGTGAGCGCTAAGATAA 
AGATGAGCAAAGCTTTGGCAGCTCTTAGGTA 
TCTGAGGGCCACCGTCCTCTACAAAGCAAC 
GAGAGGCACGGCGGATTAGGATAGACTGGT
TGCATCCAAACACTACCTTGCTGCCTCAAAG 
GCTTATTGGACACCACAGAAAGACCTCTGCT 
GGAGGCAGAAGTCACAGGACTCCTCGTCAC
AGACGATC 


P000934 


D 


- 


IM000829 


GATCGGCCTTCCTCCAAAGCTACCTGCATAG 

AAGAGACCTCTGCTCTCACCTACTCTCCTCT 

ACAGTTCAGCCCATATGGCTTCACCTGCATC 

CCCTACACACACACACACAGACACACACACA 

CACACACAAACACACACACAACACACACAAC 

ACACACAACACACACTCACAACACAAACACA 

CACAACACACACTCACAACACACTCACACAC 

ACACACACAACACACACACACACAACACACA 

CTCACAAACACACTAGTACACAAAGACTCCA 

ACACACACATTCCCATGCACTACTCCCTCAG 

TATCCGCCGCATTTGTGCTCACACTCATCCA 

CACTCTCACACTTGTAGCACACACACATCAT 

TCCTACACAGGCATGGACACACATGCTCCTA 

TACAGGCATGCCCAGTACTCTCACATGCATG 

TTTGCACGTTCCCAAACAGGTTCCCACAAGG 

GTTTGGCAAAGTACATGCATCCTCACACGCA 

AATGCAAGCCGTCACACCCCATACCACAAGC 

ATGCAC 


P000937 


R 






ACACCACATGCACATACATGCACACACACCA 

CATGCACACATACACACACAACACATGCACA 

TACATGCATACACATGCACACACACCACTCA 

CACACATACCACATGCACATACATGCACACA 

CACCACATGCACACACACACACACCACATGC 

ACATACATGCACACACACCACACACACTTTG

CAACTACATATAAGCTTTAGAAATGCCTCTCG 

CCTCCCCCCATAGCGGGGAAGAGAGTGGTT 

TAAACTAGAGGAGTCTGACCAATGTTCATTTA

TCTACCAGTATACCTCGCAAAGAGTTTTCAG 

AAATGTGCATGAGCTGTTAAAAACTTCTCTCT 

AATTTCCACATCTGATC




R 


Ljc 1 704^1 
rI5. 1 * U*t>> c t 


IM000831 


GCTGGACCCCGGTGACAGACTGTGCAGATG 
GATC 


P000939 


K 


P/>77t 


IM000832


TTAGCAAGTCCGAGCGTGTTCGATC 


P000941 


K 


Nmyc 


IM000833


Ml/ 1 oLrAuAUA 1 1 butob 1 loi L»oA 1 O 


puuuy^o 


ft. 


NOXCnl 


IM000834 


CAAGTGTAGACATTGCAGGAAAAAAATATGG 

TGACAGTGAACAAAGCCCGTGAAGGTGACA 

AAAGCCAGTTAAAGTAGGACAAGGCAGAGC 

GAGGCCCATGACCGGGACCAGGCCCAAGAA 

AATAAACGAAGGCCACGATC 


P000944 


B 


AW321468 
IM000835


GTCGGAGGAGCTGGCTGGACCGGTACATGC 

CCTGGCCATCCAGGCGAAGACCCCCGCCCA 

GTGGAGAGAAAACCCACAGTTGGACATTAGT 

CCCCCCTGCCTAGGTGGGAGCAAGAAAACT 

CGAGGGACCTCTTAATAAATACCTGGATTGG 

GAGAACGATC 


P000946 


R




IM000836


GATCGCGGGGCTATCTATAGAGTCCCCGGG 
ATGTCTGAGAAATCAGCCCTAGAAATGACTA 
GAAAGAAAATCGAAGTATTCTTGGCTCCTGG
AGACTTCCGCAGCGAGAAGTCACAGATTCAG
GACACAGATTGACAGGAGCTGCGGGCGCTG 


p000950 


D 




IM000837 


GATCCCAGGATTTGGGAGGCAGAGGCAGTT 
GGCCCCA 


P000953 


R 




IM000838 


CAGGCTGGCCTCAAACCTGCAGAGATGCTC 
CTGTCTCTGAGTGTTAGATTTAATAAAGGGG 
TTCACGATC 


P000954 


K 


Lck


IM000839 


GTTGCTGGGCCCTAAGCGCCCACATTTCACA 

CaO i UlfuA \ oLf I OA 1 UAbUA 1 oAt 1 U 1 1 oA 

GCACATTATCTGGTGGTGGCTGACACTCTCT 
TCAGTACCCCCCCCCCTCCCAAAAAAGAAAA 
AAGAAAAAAAGGACTGGTTGCTAAAAGAAGT 
AAAAGTCAAGTCATCAAAAACAATGTAATATC 
CTGTGTGAAAGTCACGAAGCCTTGCGGTTTG 
AGTCCCTCGATC 


P000955 


D 




IM000840 


bATCboOOuoU 1 o 1 ULAotuALUuoAOAAA 

GGAGAGCACTCGAATCGCAGAAGCTATCAG 

GTGAGTCCGACCTCTCTCTGAATGAACGCTT 

TGGGGAGCCTGCCAACGGTGACCAAATTTA 

GCCAGTTAAAAGTACAGGCTGCCCAGCTGTA 

AACGTACATCAAACAATGTGCGATTTTATTTT

TAGTGTGAA 


P000956 


D 


- 


IM000841 


ATAGTAACACTTGGGAGGAGCCATTCCCAGT 

GAGGCTCGTATAGCATAGCCCTGTCCAATAG 

AGCCTCTGTTGCACTCTGTGTACACTTAGCT 

CCTTGCTTAGGGA 1 Mill! 1 AC ATGGGTG AC 

TACAGCACCCCAATTTCACATTGGACAGACT 

CCAGGACACCCCTCGGTGTCCTGTGACGCA 

TACAACAGCCCCCCACGGGGCTGCACCGAA 

AACGCCACAGTACTGAGGCTGCACCTCACTC 

ACTCACACACACCTCTATGGCTCAACGTCCT 

GGAGAAAAGGCTGCGACAGATTCCCACATCT 

GGGAATGCAGTGAAAAAGCACTCACACTGG 

GGGTGGGGTGGGGCTGGGGGGGCACCCTG 

TCTTCCCGTCTTCCCATGACCCTCTTCCCTTC 

CAGGAGACCATAGCCAGAGCTGACAGGAGA 

TTCAGTCGCAGCTGCACACGCTGCTGCCTTG 

CCGATC


p000957


P 
IM000842


GATCGGGCAGGACACACATTGGGGAGGCCC 

ATCAAGCCCGAGCCTGCCTTGTGAGCCCCC 

GGATTGGCAGGGCAGAGAGGAAAGCTGCTG 

CGTGCTTTATAGACTTTGGGGAAGTCACAGG 

CTCCGCTTGCTTGGGGGAGGCAGGAAACCC 

CCTCCACCTAGGCGTCTGCCAGAGCACCCG 

CAGGCTTCCTCTTGTCTCTGTCCCCCTCCCC 

AGCACCTCTTCCCCTGAACAGCTTCCCTCTC 

CTGGCCCTGCTGTCCCTTTAAAGGAACTTGA 

ATCAGAGTTGAGAATGATGGTGACTCAGGGT 

GGAAGGGGTGGTCACTTG 


P000959 


D 




IM000843 


CCAGGGCTACACAGAGAAACCCTGTCTCGA 
ACAAACAAACAAACAAACAAACAAACAAACA 
AAGTTAAAAATAAAATTGATATACGATC 


p000960 


R 


- 


IM000844 


GATCCAGGACATGGCAGAATATGGTCATCTT 

CTTTGCTTGCATGTCACACGAATGGCCTCTG

GCTCCACCCCTGATTGCTTGCTCCCCTTGGA 

AGCCTCTTGAGCCTAGCTAACTTTTCCTGTTC 

ACCTTTGTATTATGTGCTCCCACCATGGCCC 

ACCAGGCTCTGCTTGCAGCACTGCAGCCTG 

CAGCTCCAGCGGCCTTTACATGGCTCCTGTA 

AACAAGTCCCAGAGGCCTCAGTGTCATCATT 

TCAGCAACCGCCTCACTTCTTGGTGCCGCCT 

TCCTTTATTACTTTCATATTTCTGTGACCGAA 

ATACCCCCAAAGAAGCTACTCAAGGAAAGCA 

GTATGTGTGGGCTCACCATTAGAGGTCAGTC 

CCCTGCAGCAGTGGAAGCATGTGCTGGTGA 

CGC 


P000976 


D 




IM000845 


GATCGCTACTTTTTCAGAGACGCCTTCATTAA 

GGGGAGAATGGAAAGATGCTGGTTGACTTG 

AAAGATTTCTCTCTGATTTGTTTTACAGGAAG 

TGCATTCTGTACACATGAGAGACTCCGGGTG 

GAGAGGCATTGTGGCGGTTGAGATGCACCT 

GGGAGTGCCAACTGCCCCCGCTTCTACCAC 

AGCTCTGCATAGCAGGCTGGAGCAAGCAGC 

CAGCCAACCATTGTGCCCTAGCCTCATCTCC 

TCCAGAAGAGGTTATCTGGGCTCTGTGTAAC 

CTCTGCTCTTTGGCTATGGTATTCCTTCTTGG 

TGCTTTCTGTGGTCAACCTCCAGGTACACTT 

AGGGCCTATCCTAGACAGACTGGGAAGAAA 

GAATGACATTCCCATTGACCTCTGTTTTTATT 

TCCTGGAAATCCAGACCTTGTTCCAGTTAGT 

GGAGCATGGGGTTAGACCAACCACACTGCT 

AAGAGTTTTGGCCTGTAGACATATCTGG 


P000983 


c 
IM000846


TAGCAAGGTAAGTACTTGTCTCAATTTCCAG 

GTAGTATAGAAGAAACATATATGTTACAGCTT 

TAACACCAGAACAATCACACAGTGTTGTATTT 

TAGCTAAAATATGACTCTGTGGTTTTCAAATG 

GCATAGTTGTGGACAACTTAATTAAGCACGC

TCTTATAAGACGTGATAGAGTATGTGCCATC 

CAGATACTAAGAACTGTGTCCAAAGAGCTTG 

GGACACACACTAAGGGGCCTGCCTCTTTCAT 

AACGGGGATGAAAATGACTGAGGCTTCACAT

TTGCACAGTACGATC 


P000988 


D 




IM000847 


AAGCCATCTGGGTCTCAAGTTGCTAAAACTT 

AATAACTCCCTCCCTGTGTTTGTCCTTTATCT 

AATGGTAAAATATGACCTAATGAAATAGGTTC 

CTAAGGCTTTCATATAAGGCATGATGTTGAA 

GGATGGAGGACAGAGTGGGATGGAAAATCA

GAGCCTGCACAGAAAACCACAAGCAGCTAA 

CAAAAGTCCACAACCAAAGCCTGTGCCTGAA 

ATGTCACCTACAATGCAGTGGACTATTCATAT 

GCCAGCCTGGTCCTCATGCGATC 


P000991 


D 


- 


IM000848 


ACCAAGAACAGAGCCCCAAACTAATAGGATG 

GTTTGTTGCACGTGTACATGTGTATGCATGC 

GTGCATATACGTGTGTGTGTGTGTCTGTGTG 

TGTACACCCACACGTGTGCATGTGTGTTGTG 

TGI 1 1 1 1 I AAGCAAACCTC AGTGTGTCATACA 

TACTCTCCTATACTTCCCCTCCCTTGTTCCAT 

ATGAGGGTGCCTTCTTATCTCACAGGGTTGT 

TTTGI 1 1 1 1 1 1 1 CTATAACAGAATGCCGCTGA 

TGCTCI llll 1 CTATATGAACCCTACATTTAAT 

ACTTATCCATAAGCAAAGGAACAGTATCTTAT 

CTTGCGGATC 


P000992 


R 




IM000849 


CTGGGGGCTCTGCTACGCGTCAAACGCCTG 

GAGAACCCCTCGCCCCAGGCGCCGGCACG 

CCGCCTCCTGCCTCCCTGAGCGCTGCTGCA 

TCCTGCACGCCCTGGAACCCAGGAGCGCCC 

CAGCGACCCTGACTCCCTGCCAGCACGTCC 

AAGGCTGCTTACCCCAGCAACCTCCCATCCC 

CTGAGCCCTCAGTAAATGCCATCTGTAGCAG 

CTGTTTGTCTGAGCGCCCTGTACTAGGGGG 

CCGGTGGGCTGGGTGACAATGATAATGGAA 

TAGTGGCTGTCCTACTGAGGACAGCACAGTA

CTGTTTGGGACCTGTACTGGTAAGGAATACA

TGCCTGCTTCCTCTGGACTTTGCGGGTCTCA 

CCGGGTGCCTGGGCTACCCTTCTAGGCTTC 

ACTGAGGCGGGTTCCCTGGGAGGCTCTGAG 

GTTACTTTCAGCGTCTGCCAGGGGTCCACAG 

CACTTAGCCAAGGGGCTATGGATTCACTCGT 

GGTCTGCCAGGACCAGGCTTGTTGTGAGGG 

CCCCAGGTGGATC 


p000993 


A 


Saas 
IM000850


GIGTnCMIICIIICIIIIIICTHniCMII 

CTTTTCTTTTC I I 1 1 1 I 1 1 1 AAATCTAAGTAAG 

GTGCAACAATGTAATTCGAAGGGGCAGTGTC 

TTCCCTTCCTGTAGTCTCTGCTTAATTCCTGA 

AGTTTGCCAAACCAGGAGTTAGGAAAAGTTG 

GAAACCTGCAGAGAGAGCGTTTGAGAGGTTT 

GAGATG7TATACGAGAGGGTTTGGCAATGTG 

TGGAGTACAGGTAACTTGCGGTTATTGTTTT 

CTTGGCCCTCTATCTTCATCCTTTGTGCTTGC 

TATTTACCTTGCTGTCGGATC 


p000994 


R 


- 


IM000851 


GATCCTTGAGTCTGTACTTAGCCTGAGAGCG 

CTATAACACTATATACAAAGTACCGACTAGAA 

ACTCCACACACATTTGTTGACTGACTTAATGT 

GTAGCCCTGCAATGGTTGACAGTTGGGGGT 

CAGGGGGCTCTTGCACTGAGGGTAGTGTAT 

AGCCTAAAGAGATAATCAAGATGATAAGTAC 

ATCCACACTAGGACAGGAGCTTTAACAAGAG 

CTTTTAGTGAAGGGAACTTTCTGGGAGCCTC 

AAGGAAGGCATAT 


P000995 


D 


- 


IM000852 


AGCAACACCTCATGTGGGAATTCATACATTG

TAGGTAATCAGTCTACTAGCTGAACTATATCT 

CCAACCCAGGAGGTCAGGTTTGTTTGTTTGT 

TTAACAATCTAGTTTTGAAACAGTCATATCCT 

AGGCTGGCCTCAAGTTATGTAGTCAAAGATG

GCCTTAAAAGATGACTCTTGGTTA 1 1 1 ICCAA 

GTGCTGGGATTATAGATATGCACACCACCAC 

ACCTCATTTGTCTCGGGGCTGGACTCAAATC 

CAGAGCTTCATGCATGTGAGGCAAGCACTGT 

ACCAACTCGACTTTTGCATACTCCATTGAAA 

GTCATTTTTATAACAGGATC 


p000996 


D 




IM000853 


CTACTTATCTATCATCTATATGTCTATCATCTA 

TCTATCTATCTATCTATCTATCTATCTATCTAT 

CTATCATCTATCATCTATCATCTATCATCAAT 

CATCTATCTAGCATCTATCTTCCAGAGCTCAT 

GTTGTGGCTTGGGCTTCTCATTTCACCATCA 

TCGAAGGTAGTTGCA 1 1 1 I 1 1 CTATTGGCTTC 

TTAGAAGCAGGAGGCACATGAAACAACTTGC 

TAACCCTTTCCTGGTCTTTTGTTGTTGTTGGT 

GGTGGTGGTGGTGATGGTGGTGCTGGTGGT 

GGTGGTTGATGTGCACAGGAGACCTGTCCG

GTATGGAGATATGGAGAGCGTCTACGTCCTC 

ATGGGATC 


P000997 


R 
IM000854


GTGGGACGCGGAGGGTGGAGATGAATTGAG 

AAGCAGTTGTCGATTTCCTCCTTCTTCCAAAC 

ATCAAAGGCAGCGGTGGATGACAAACTGAA 

GGACAGAGGGTTTGATGATGCAAGAGGAGC 

CAGCAGCAACCAAGGCCAGCCTCTTGCGGG 

TGTGGGCAGGGCCTTCTTTACAATGAGTTCA 

CACACACACACACACACACAGAGAGAGAGA 

GAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 

GAGAGAGAGACTGCTCTTTCAGAACAGCCCT 

AGGAGGTTAGCTTCAGACTAAGACAGGAGA 

CAGAGAGTCCTTGATTTTGCCAAGGTTGCAC 

AGCTGGGGAGAAACCCAGCTATGGCTTCAC 

CTTGGCCCTTGTTAGGACTCCTTCCTAGTCC

GGTTGCAGTCTCCTGGATC 


P000998 


R 




IM000855 


GTATTAGAGGCCAGGCCATTCAGAAGATGTG 

GCAAGATTGTCATGTGGAAAATATTTGAAAC 

CATTCTAACCTAGTCATTCCATCATCAATAAT 

AATAATAATAATAATACTACTAAAATGAAAAA 

ACCTAGATA 1 1 I 1 GAGACTGTACTGCTGTATT 

TTAAGAAATACACGGAAATTTAGCACTGAAAT 

TTAGTGCTAGTTTTAAGAATACTTTGTACCGT 

TACTTGGACCCACAATTGCTTAGAGCAAGGG 

ATC 


P000999 


C 


- 


IM000856 


GATCCTGAGACAGTACAGGAACTAAGAAGCC 

CTGGGCAATTTGCAGTGTGCACACCCAGCCT 

GAATTTGCCTGGTTCTCACCAGCCTACCAAT 

AGAGCATTGTAGTGGCAGGGATGTCTGCTG 

GTGTCTCGCAGACAACTTTTGAGGTCCTGCT

TCTCCAGAAGTGTGCAGCTGGCAATTAGCAG 

CCTGGTCTTTTCCTGTCCCCAAGACCAGTGC 

TTCCACCAACCTGGTCTCTTCCCACAGCCCA 

GCCCTTTCTCTTCCTCTTTGACACCCACTTCC 

TCTAAATGGTGGTCACATGCTTTGTCTCTTGA 

AAAAAAGTTGTATGAGTCAGGGTATTTTCAAC 

GCCGGGACAGAAAAATTGACTCAACCTGGCT 

TTTTCAATTAACCACTAATGGGTTTCACTTAC

AGTCCTGACAAATACCAGGCACAATTCATCC

AGGACAATAGTGAAGAATTTCATCTCTTCCC 

CCCAAGCCAGTCAGTCTGGTTTTAATATGCA 

CGGTGGATAGCCCATAGCATGCAATGAACTG 

TGAGCACCCCTCTGGGAGTCAGCAGAGACA 

CACACACAGGCACCCATACCACACACTGTGC 

TTTGTATCA 


p001000 


D 
IM000857


GATCAAAACAATATTCAAATAATGACATCAGT 

CAAAGTATGATTTGATGGCCATCACTCATGT

CAATAGGCAACACATAAGCCTGAGAGTAAGT 

TAAGGAGAAATTCAGCAATAAACAAATTGAC 

ATACTATGTCCACTATGAGTAAAACCTGCCT 

CTCTTAAAACGTTTTACTGTACTCCATGGCTC 

TCCCCCAATGTGCGTTCGTGAGAGTCCCCAC 

CCCTGTGACTCCATCTGTGTGTGGGTTCAGG 

AGAGACTCCTGTGTGTATTCAAAAGAGCCCC 

CCATGTGTGTACACACAAGAGACCCAGTGTG 

TPTAP ATf^ Af2 ACtfZCrCC A P PP P ATCZTCZTCVTT 

CATGAGAGACCCAACCCCTGTGCGTGTACAT 
GACTCTCCCCATGTGTGTTCATAAGAGACTT 
GTGTGTATGGGAGACTCCACCCTGTGTGTGT
ACATGAGAGACTCCTGCCTCTCCTGTGTATA
TGGAATACCTTCAGAGTATCAAATATTTTCAC 
CCACTGAGCCATCTTAGAACTTCTCTCCCTT 


P001001 


c 




IM000858 


ATACATATGTACACACACACTCACAAACACA 
CATATATACACATACATACATACTCACACATA 
TATATACACACTAGTACACACATACGCAAATA
CACACATGCATATACACGTACTCACACATAC 
ATACCCATACTCACACAAACACATATATACAC 

APATAPTPAPATATAPATTPATAPATAPAPAP 

ACATATATACATACACACACTTGCATACACAC 

AGCACACACTCACACACAGAGACACACAGAC 

ACACAGACACACACACAGAGGAACCCAAAG 

GATTGGAAGAATAATTTCCTGTGCTCAGTGG 

GAAAGTTTACCAGAAAGACAAGTGGTCATGT 

GGGATGATC 


P001005 


c 




IM000859 


GATCAGGGACCCTGTACCCTCCCCCGTGCA 
GCCTGTGATTC 


P001006 


c 




IM000860 


GGACTGTAACCAACTCGGAGAGGAAAGGGC 
TTATTTCATTTTAGTCTTTACAGTCCATCATTG 

A P A f^P,TT A A Af^P A P, Ci AP PTf^PTT APTfi 

ACTTAGCTCCCCGTTGCTTTATCAGCTACTTT 

CTTAATACAACGCCACCCCCGCGGCCGCCA 

CCTCCCTAGGCAAGACCCACAGGTCAATCCA 

ACAGAGAGGATTCCTCAAGTGACACTCCTAT 

GTCAACGCTATCAATGGCAAAGGTATATTGA

GCTAAGAATTGATC 


p001007


U 




IM000861 


GATCTCAGGCTGCCCGTGGGCGGGGCTGAC 

GGAGGGAAGCAGACTAGGCCTCTACCATAT 

CCGTGGGAGGGACTTCCAAGGACCGAGACT 

GAAGAAACAGCGCGAAACAGGAGACACTGG 

GAGGAGAGGCGGAGACCGACACTTAGTAG 


P001009 


B 


Mm.76753 
IM000862


AGAGAAAAGACTATCTTGACCTTTGGATATG

CGGGTGCAAAAATGAGAAGACCACAGTGCA 

GCTGTGTGCCCTGCACGGGGCAGCGAGAG 

GAGAAAGAAGCATTTTACATGAAGCACAGAA 

CACGCCTGACAGTTCTCAACAGCAGCACGTC

AGACCACCGCAGCACTGCTCGI 1 1 1 ICTCAG 

CAGACCCCCAGGAAGCACCACCCAGGATGG 

ACATGTAGGGGTGCATCCGAGAGAATCAAAA 

TCACACAGGGGCCATCC 1 1 1 1 GGTTCGGCAT 

GAATGATGGGGGCCGCCTGCACTGGCCTCC 

ACCTTCTATGGTTGTTCTTCCTTGTATCAATG 

TTTCAAAAAAAATCCTTGGGCTCACAACTGC 

CTAATGACATCTTCAGGAGTCAAGTCAAGAA 

ArsAnAAAAGTAGPCGArrTGGrACGTGGTA 

GATAAGACTCAAGGGTGCAATAAGCAGATGA 

ACTGGCTTAGTTGGGCTTTCTATTGCTGTGA 

TAAAACACCATGACCAAAGCAACTGGGGCG 

GGGGGCGGGGGGTGTCATCTTACACTTCCA 

TATCACAGTCTATCACTGAGGAAGTCAGGGC

AGGATTCAGGCAGGAACC 


p001011 


R 




IM000863 


GATCGGCCAACACAGGATAGATACCACACA 

GGATAGGAGGTACAGTGTCTGGAAGATTATT 

ATCGAGCCCCTGAACGTAGTAGAAGCTGGC 

TGTCGTTCCAGTGCAAGCTGAGCAGATGGTC 

C 


P001013 


D 




IM000864 


GATCCACATGAAAGCCAAGCTGCACATTTGC 
TTCATATGTATGGAGAGGCCTAGGTCTAGCC
CATGTATGTTCTTTGGTTGGTGGTTCAGACT 
CTAAGAGTCCCAAGGGTCCAGGTTAGTTGAC 
TTTGTTGGTCTTCCTGTGAAGTTCCTATTCCC 
TTTGGTGCCGTCAATCCTTCCTCCTATTCTTC
AATAAGAGCCCGCAAGCTCCATCCACTGTTT 
GCTTGTGGGTATCTGTAA 


P001015 


R 




IM000865 


CCCTCAGCTACATAGTCAATTCCCATCTAGC 
CTGGGTATGCGAGATGGCAGTAAAGACACTA
GCTGCAAAGCCTTACTGCCTGAGTTTGATC 


P001018 


D 


- 


IM000866 


GATCCAGTCACAGGAGAGCAACTGGGGGAG 

GGAGCAGGACAGAAGCACACCATAGCCCTT 

TCAGGGGGCCGGGGGCGAGGGGTGGACAA 

GAGAAGACAGATAATGACTCACAGGATGAAG

AAGCCTCCCACAGCCCCTCCCTGAACTGGC 

CATCTGTTCTGGGGCCCCAGAGCAGGCGAG 

TACCGTGAAGCTTGGGGACTAGCAGCCGGA 

CCACTGAACAAGGTCAACCAGCCAGTTGTCC 

CACGAGGGGAGAAGCTACCATTGAACTGTC 

ACTTTGGAAAGTAGCCAGAGCCCATCCCTGG 

TCACCACCCAAC 


P001019 


D 
IM000867


GATCCCTAGAGCTGCTGGTCAGCTGGCCTG 

GCTGAAACTACTTCTGTGCAGTGAGAGACCC 

TGCCTCAAAACACAGATAATGGAGACAGATA 

AATGACATCGTCCGCTGTGTCTGCGTGTGTA 

TATGTAACACAACACACAGTATACACACATAC 

ACACCACACTCATACCGTCACACATGCACTC 

TCAGTGCATGTGCAACACAACACAGTGTACA 

CACATACATACACACCACACACATACACATA 

CCACCACACACGCGCACACACACACATAA 


P001020 


R 


- 


IM000868 


GATCCTTGTGCATCACTGAGCCATCTCCCCA 

GCCTACAGTGTAAGTATTCTATACATATTAAT 

TTAATCCTGCCGGGTGGTGGTGGCGCACGC 

CCTTAATCCCAGCACTCAGGAGGCAGAGGA 

AGGTAAATTTCTGAGTTTGAGGCCAGCCTGG 

TCTACAGAGTGAGTTCCAGGACAGCCAGAG 

CTACACAGAGAAACCCTGTCTCAAAAAACCA 

AAAAAACAAAACAAAACAAAACAAAACAAAAA 

TCCTATGGAGTATTCTAAAAGTAAAACCGTAT

CATTAGCACTGCCAAATAACAGAAAGGAAGA 

CCAAAGCAAA 


P001021 


R 


— 


IM000869 


GATCCTCTGAAAATGGAGTTACAGATGGTTG

TGAGCTGCCATGTGAGTGCTGGGAACTGAA 

CTCGGGACCTTTGGAAGAGCTGCTGGTGCT 

CTTAACAGCTGAGGTGTCTCTCCAGCCCCTT 

TGGGTGTGTTTTGTTTTGTTTG 1 1 1 IGI 1 IIG 

CTTTTTCAAGACAGGGTTTCTCTGTGTAGCC 

CTGGCTGTCCTGGAACTCACTCTGTTAGACC 

AGGCTGGCCTCGAACTCAGAAATCTGCTTCC 

CAAGTGCTGGGATTAAAGGCGTGCGCAACC 

ACTGCC 


P001022 


R 




IM000870 


GATCCAATATATTCATATGGAGATACATGTAT 
ATACATAA 


P001023 


D 




IM000871 


GATCCAGGTCCTTTCCCCCTTATGGTCCTAT 

ACACCCCTGGGTACTTAGAGGCTTTCAGCTC 

TGACTGGTGGTGTGGGGAGAAGTGAGGGGT 

TACACATGTGACACAGGTCCTAAAAGCTGTC 

GCCATTGGCACATGACCATCCTAAGTCTGTG 

GCAGAAGGCTGCTCAGAGCCTCTGTCCAGG 

AACAACCCAACACATTGCAGAAATAACTGTG 

CATCTGGGCAATGGGGCAACTACTACCTGTC 

CATCCAGATAGCTCTTCTAGAGGCATTCGAA 

ATAACACGTAAAGTGGGGTGGTGATGAACAC 

ATATAATCTCAGCCCCTGGGAACCGGAGACA 

GGGGAGTCACAAG 


P001024 


D 
IM000872


GTCACAGTACTTGCTCACTTGCCTCTCTCAT 

GGTTTACTCGCCCCTCCTTCTCGTACCCCCT 

TTCCTCCTACAATCCTCCTCGTCTACTTTCAT 

GCCGTATATGTCAAACACCGTCATATATAAC

AATGTATGCATGCAGCATTTCTTTTTCTTTCC 

CATCAGCCTCCCTTGCTCCCCATCCTCCCGC 

CCTTCCTCCTTCCTCCCAGGATC 


p001026 


D 


— 


IM000873 


AGTTATGCTTGCAGACAGGAATGTAGCATGG 

CTATCCTCTGAGAGGTTCCACCCAGCAGCTG 

ACTCAGACAGATACAGATACCCACAAGCAAA 

CAGTGGATGGAGCTTGCGGGCTCTTATTGAA

GAATAGGAGGAAGGATTGACGGCACCAAAG 

GGAATAGGAACTTCACAGGAAGACCAACAAA 

GTCAACTAACCTGGACCCTTGGGACTCTTAG

AGTCTGAACCACCAACCAAAGAACATACATG 

GGCTAGACCTAGGCCTCTCCATACATATGAA 

GCAAATGTGCAGCTTGGTTTTCATGTGGATC 


p001027 


R 




IM000874 


GAICGIGGICICI ICICI II 1 1 ICCCICIACI 

TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCT 

TCTACTGTCTTCTTCTTCTTCTTCTTCTTCTTC 

TTCTTCTTCTTTCTTCCTCTCTCTCTCTGTCTT 

TCTCTGTCTGTCTGTCTCTGNCTCTCTGTCTC 

TCTCTATCTCTGTCTTTCTCTGTCTCTCTGTC 

TCTGTCTCTCTTTCTCTGNGNCTCTCCCTGTC

TGTCTGTCTCTCTCTTTCTCTCTCTGTCTCTC 

TCTCTCTGNCTCTCTNTCTCTGNCTCTCTCTG 

NCNCTCTGNCTCTGTCTCTGTCTNTGTNTNT 

CTCTCGCTCTCTNACACACACACAGATGTAC 

ATGCAC 


P001028 


R 


- 


IM000875 


GATCGGCGGTATCATATTTTATGTGTTTTATT 

TCTGTGTCAGAAAGTTTAAAAGGCCTCAGAT 

TGGAAGTCTGGTTTGCATGGAATGCATATGA 

GCTTTTTCATCTTATTGCCCAACAGATTTAGT 

CTAAGAACCACCTCTATTATATAGGGTATGAT 

AAGTAATATAGGTAAGGGAATGCATCCCATT 

TGATAAGTGAAAGTTGAACACACATAGAGTT 

GGCTCACCCCGGGGTCTAGGCTCTAATCCC 

CTGGGGATACCCAGGCCAACTAAACGCTATA 

GCAACAGGCATTGGGGCATGAAGATACTTTT

TGTTGTTTGTCTTGAATTTATATAGGGGCTTA 

TATCTCATTACAATTAATCATGAGTTGCAGTC 

AATAAATCTTCATTGCTCAACATATTTGTACC 

CTCAAATAI 1 1 1 1 1 IC 1 1 1 1 1 1 1 GTGTGATAT 


P001029 


C 
IM000876


CTTGTAAACACGATTATTTTAAAGATATAAAT 

GGCTCTTTACTCTGTTTAAAAATTGTTTCTTTA 

CCAGTTCTTCGTGTACATTGGTCTCCATTTCA 

CATGAAATAAAATATTTTGTTTAATGTTAGATT 

TTCAATACCAGCTGAGTGTTCGATGTGTGCC 

TTTTGGACATATATTTGTTGTAAAGTGGTCAT 

TTGGGATC 


P001031 


D 




IM000877 


GATCAGATTCAACTCCCGCATTTCTAGCCCC 

AGCATCGTGGAAGGGCTACTGTGTCTTTTCA

AGCACTATGGTGGATACACATAATGCCAGCT 

TCCCTCATTACTGGTGATGTGAGCTGTTTGC 

CTAAGGTCCTTTCTGCCAGGCTTCTCTGCTG 

CCAAGGCTCTGAATTTCCCTTTGTAGCTAAT 

GCGTAGCCCTATTGGCAGACTCTTCCCGTGG 

CTGACTTCTGCCTCCCGTCACACAGCAGTAC

CTTGTTTGTTCTCACCTTGATGTTTCTTATAT 

GCATTGATGATGGTGAACAGCCCAGCAAGT 

GCGCCTGTTTCTTCCCTTCCTCCCACTTTTGT 

TCTCAGTTGTACATGGCAAGGAAAACCAATT 

CCTTCTTTCATATTTCTCCCAGAAAAAAAATC 

CTCTTTATAAGAGTTCACATCCTTGAGCACAC 

ATGATAGGAGCTGGTAGCCAG 


P001032 


D 


- 


IM000878 


GATCATGATATTGTACTGCTGAAGACAAACA 

TATTTAAGATATAAGACTTGGAGAAATCAAGT 

TGGTATTGACATTGGAGATTAATCTC 1 ! 1 IGG 

CTAGCTTTTGTAGAGCTAGAAGTTGGTATGT

AAGCTATAAGGAAGAGAAGTATTCATAAGAC 

TTACCCAGTTGTCTCTCCTGTAAGCTAAGAC 

CAGCCTAAGAAGCTAAAATTATCTTTAATGTA 

GAACCACAGAGAAAGAAATTGTGGTATGAAT

TTTGCTTGTTCGTGGACATTAACCATTAACTC 

AATGATAATCAAATGACAATACATAGAGACAA 

AGATATGCATACTAGTAAAATAGTGATAA 


P001033 


D 


- 


IM000879 


GATCGTGCTAGAGAATGGTACACTTGGGTTA 

TATTAAGAAATCTTGGTTGAGTGGTGGTGGC 

ACCCTCCTTTAATTCCAGCACTCAGGAGTCA 

AAGGCAGGCAGACATTTGAGTTTAAGGCCTG 

CCTGGTCTACAAAGTGAGTTCCAGGAAAGAC 

AGGGCTATAAAGAGAAATCTTGTCTTGAAAA 

AAACAAAAAAACAAAAAACGAAACAGTAACT 

GAAACCGAAAAAAAAAAGAAAGAAAGAGAGA 

AAGAAAGAAAATCTTACAATGTGGGAGCTGG 

AGAGCTGGCTCAGTGGTTAAGAGCATTGGCT 

GCTCTTCCAGAAGACCCAGGTTCAATTTCTA 

GCACCCACATGGTGGGTCACACCTGCCTGT 

GGCTTCAGTTCTAGAGTTTCTGACACTCACA 

CACAAACATACATTCAAGT 


P001034 


R 
IM000880


GATCTTGTATTTCTTCTTGGCTTGTCTCCATA 

GGAACAGGCAGCACAGCAGAGGTCTGGGAG 

ATGGCTCCGAGGGTAAGGGACCAAGCAAGG 

TCACCTGCGCTCACTCCCTGGAACCCACACA 

GTGGACAAGAGAGAAAGACTCTATGGCCTC

CACGTGCGTGCGTGCGTGCTGTGGTGTGCA

CGTGCCCCTCCCCCAAATAAAGAAAACTTAA 

CGAAAAAAAATTAAAAGTAAAAAAACAGCACT 

GCAGTAGCTCCAGGAATCAACTGGTCAATCA

GTGTATCACATTTGACTATCCGATGATGGTTT 

TATTTTACATGTATGCACGTGTTTGCATGTAT 

GTGGGTGCACATGTACAAACACATGTGCCAA 

AGGAGTCATCGACCTTATTTTCTGAGACAGG 

GCCTCTCACTGGAATCTGACTGGCCAGCAG 

CCTCCCAAGGATGCTCCCCAACCTCAGAAG 

GATGCGCCTGTCTCTGCCTCCCAGCCCCGG 

GGGTTACACTGGTGGACCACTGGGCTCI 1 1 ! 

CACCTGGGTG 


P001035


B 


Mm.138834


IM000881 


GATCTCTTCTTAAAATTACATTACAGTAGAAA 

ATGTTTATGAGGCCGTTTTTATCTCTAATATT

ATTTATTACCACTCTCCTACCCCCAGAGTCTT 

ACAGGCATCAGGGAGTGGACAAAGGCCGGC 

GGTACTGAATGGTGATGTTATTTTTGAAATAA 

TGAAAAG 


P001036


D 




IM000882 


TACCTGTTGCTCCAACATGGTCAGAAATCAG 
TTTGTTTCAATTTTAAGATACAATGAGAGTAA 
CACCCTAAAGACTTCACATTTTATGCATATTT 
GCTACTCTGTGAGCACATGAACGCTTCTCCT 
TGGGCACGATC 


P001066 


D 




IM000883 


GATCGCAGATACTGCAGGTATGTAGTAATGA 
AGTCTGTAAACATACAGAATGGAGAAGGCCA 
GAGAGGAAAGTGCAGGCATTGGGTAGTCAG 
TAGGTAAAATAT 


P001067 


D 
IM000884


GATCGCAGCTCTTCCTTGGTGCTTTTCCCCT 

CAGTTCAAGTGCTGTGGCGGGGAGGACTAC 

AGAGACTGGAGCAAAAACCAGTACCATGACT

GCAGCGCCCCCGGGCCCCTGGCCTGCGGG 

GTGCCCTACACCTGCTGCATCAGGAACACG 

GTAACTGCATGGGTGCTGGATGTGAGGGTC 

ACCCAGTTTGCCAAACACTGCCCTCACTCTG 

CCCAAGTGGAGCAGGCAGTGGGAGTGGGTG 

GGACGTGGTGGCCGGGGCTGAGCTTGCCTT 

AGACCAGGGGCCCTAGCAATGGGAGATGAG 

TGGGCAGCTTCCTCTGGGAGTGTGTCAGTG 

AGCGTGTGCGTGTGTGGGCCTGGCCCAGGC 

GCTTTGGTTGTAGTTACTTGGTTCTTACAACA 

GCTTTGGAGGGTGTGAATTGGGGTAGTGTTG 

CTTTAGCCACTTAGGGGGACTTGCCCAAGGT 

TGGCAGGGCTCTTCCCAGCAACAGAGAGCC 

AGAGTGCCCGGCAGGTGCAGCAGGCTCTAC 

CCAGTCACTGGAGGCAGAGTACAGTGCAGG 

TGCTGTGAGCACTGGCAGCAGAGCCCTGGG 

CAGCGGCATGCGGTAATGTAAATG 


P001069 


B 


Mm.28112 


IM000885 


CCATGTCAGGTGATTAACCTGTGAGTCTAAC 

TTCCAGGAATGCAATGCCTCTGGCATCTACA 

GGCATAAACATACTTGTGGCTTACACTCAAA 

CTGACACACCAACACATATGTGCACGCGCAC

ACACACACACACCAAATTAAAAATAAAATAAC 

CCTTTTTAAAAAAAATATAGAACCTATAGATA 

ATTGCTTTACTGCACTCACAAACATTTTAGGA 

TC 


P001070 


D 




IM000886 


GGGGCACATAGTGAGTTCTAGGATAGCCAG 

GGTTATAGAAGCTATAGTGTGAGACCCTATC 

TCAAAAAAACAAAACAAAACAAAAAAACAAAA 

AAAACCTAAGCCCGTGTGGTGGTGTGTCTCA 

GTCTGAGCGCTTGGAAGACAGAGGGAGGTG 

CATCTCTGAGCTTGAGGCTAGCCTGGTCTAC

ATAGAGAGCTCCAAACCAGTCAAAGTAACAA 

AATGAAACTGTCTCAACAATGACAACAACAA 

ACAAACAAGCACTAGAATAAAAAGAAGCCAG 

CATGGTGTCATGTGCCCGTCATCCTACCACT 

TGGAAGGAGAGAAGCCAGTGCAGGAAAATT 

AGGGATC 


P001072 


D 
IM000887


GATCCCAGGCTTCCTGTAGGCTAGGCAAGC 

CCTCTCCCCACCCTGTCCTGGTAGAATTCAT 

CCCGAATGTCAGCATTCCTTCAGTTAAAGGA 

ATGTGCTCCCTCAGGCTCTCTCCCATGGTGC

ATTGCTTCAGCACGCAGGCAGACACTTGTCC 

AAGCTAGGCTCCCTGTCTCCCATCTGTAGGA 

AATGCTTGGTATGAAGGCCCTGGTGGACCT 

GGCTAGATGGGCAGCGCCCAGTGAAGGGCT 

GTGTCTGGAGCCTGGGCTGTAATTAGTGGTT 

TGAACTGGGTGCTCTGGGGAGAGGCAAGTA 

AGAATTTGCTTTCTGTTTTTAGAGCAGGAGG

CCTCGAAGAGCATTTGAGGTGTTCGCCATCT 
TAATGGGTTAAGACTCTCCTGTGCTAATCTG
GTGGGTTGCTTTTAGGCACGGTGGTCCCACT 
GTGGTTGTGTGAACAGTACCTTAATGCCAAC 
ACTTTGGAGGCCTAAGGTATCCCCATCTGCA
GGAACTGGGGTGCACA 


P001075 


D 




IM000888 


GATCCTCACACAAATTGAGTAGTACTAACAA 
GAGTGTGATTCACATAGTCAATAAAGGTATA 
ftftrCATCTGTGnCCTGGCTTGACCTCCGCA 
GACCAGAAGCTAACAAAACCAAAACAGACTC 
AGTTTCTGCATGCTAACTTAACCATGA 1 IMC 
C AG ACTATTTC I T f rATCCTGTGAAAAATATA 
TTAATCTCTATTCTGCAGAGTATCCCTTCTTT
AAGAGAAC ATG ATTTCACTG ! 1 ! \ T GACAATA 
TGCCTAGACACAGAAAAAAATCATTTAGTTT 


P001078 


B 


AA793356 


IM000889 


TTTTGAGTGCTCAGTGAACTACTTAGGGCAG 
CCTAAGGAATACAGTGACCCACCAGGAAATG 
CCTTGTGTTTTGGCAGTCTGATAGCATCACT 
CACAGCTGTCGGTCGTGACTTCATTGGATC 


P001079 


A 


Edar 


IM000890 


GATCCAGGGACAAAGAGCCCATTCTCCTGTT 
CCTTCGTAT 


P001081 


D 
IM000891


ACTTTCAGGCAAGCTCTTTGCTCAGTGAACC 
TGCTACCACACACAGACTCCTCCTCCCTGTT
CCCGTCGTTAAAAAAAGTTTTATTTGAGGTTT
AGAGCAATGGCTCAGTGCTCAAGACTACTTG 
CTGTTCTTACAAAGGACCTGGGGCTAGTTCC 
AACACCCACATGATGGCTTACAATTCTCCAG 
TCTCAGGGGTTCCAGAACTCTTTTTCTGGCA
TTGAATGCACATGATGCATATATAGACAAGC 
AGGCACACACACACACATAAAATAAAACAAA 
TCT 1 i TGAATGTAATTTTAAAAAGATTTATTAA 
ITTTAAI 1 1 IAIGIGI AIGAAI Gl 1 1 IGCCTG 
CATGTATGTCTATGCACTGCATGTGTGCCTG 
GTGCTCAAGGTGTCTGATAGCCTGGTGCTG 
nn rTr^^TTr a ptp a a p a rs ptp p,p p pt aty^ a 

AGGCCAGCCGTGAGGACACCTATCCATGCT 

GACAGACACAGATGCTCAAATGAGACAGCC 

CCTTCTCTATGAATGCCCTCTTGAGAATGAA 

CAACCTCCCTGCAGCAGACCTCCTTCTGGAT

ACCCTGCCCTTCCATACTTTCTGGGTGTCTA 

GTTCTCTTCC 


P001082 


R 




IM000892 


GATCACACGCTTCACCTAATTACAAATGATTT 

CTTTAGAGGGGTCTGTATATAACAGAGATGA

TAAAATTCAACGGCAGCCCTCCAACTGCATT 

GATATACAGGAAGTACTCATGAAATTGGAGA 

CACTGATTATCTCTTTGTGTGGTGTCCACATA 

TGTGCCATCATATCATATTATTATTATTACATG 

GCTAAAAAATGGGGTCATAGGTTTCATGACC

AGAACCAAAATATTCCCCTGTAATTTACACAG 

GATTGATGGTAAGAAATGAAAACAGTTTACAT

TTTTGATAATTTACTTACTTGACATAAAATGTG

ACTTTCATTTCCTTGCATTCCTTTTCACAGGT 

AAGGCTACGACAATAGATTCTCAGTTCTCCA 

CCTCTCTCTATCTTGTCTACTCTATCAGCAGC 

AATAGCAACAGTTTTCCATGGTCCTTCCATCT 

GTAAAAGCAATAAAAAATAACAAAGAAAACCA 

TACAAACCATTAGAATATGAGTTGGTATTCAC 

AACTCTCCTCTCAATACTTCATATTTTAAAAAT 

TACTAGAAATATTCATCAATAATATTTCATTTG 

TTAGCTCTAGATAATGTTTCCAGG 


P001083 


D 
GAICAIGGI IA1 1 1 1 IGIAGGGI 1 IAI 1 IAIA

CATGTCTACATGAATTTATGTGCACCAGATGT 

GTGCAGGTGCCCATAGAGGCCTGCGAGGAT 

GCCAGATACAGATAGTTATGAGCCACCTAAT 

ATAGATGTTGGGAATTGAACCCATGTACTCT 

GCAAGAGCAGCAAGTACTCTTAACTACTGAG 

TCATCTGTTTAGCCCTCCTGTTGGGATTTAAT

GGTCAGTGTGAAATACTATGAAGATAGAAGG 

GTTTCCTAGACTCTGGTGTGTAGGGGTGGG 

GTATCTGTGAGATGGGTAAGCTCTGTTGGCT 

TTCTAAGAAGGAGAATGAGCAGAAGGCACAC

ATAGACATTCACACTTTCACACACATGCATG 

PPAAAPAPPAPAPATfSPAPAPPAPATAPPAP 

ACGCGCCCTCCTGTTTCTTACTATGTAATAAT 
GTTCTTGTAATAACTTAGTACTCTGCTAATGA 
AAAGGTCACCACTAACTAGATGCTAGCCTTC 
AACTTTGGACCAGAACTATGAGCCCAAATAA 
ACCTCTTGCATTTATAATTTAGCCAGCATGTA 
GAACTGTGTCAATAACAATGGAATAGTGTTG 


P001085 


R 




IM000894 


GATCATCTGGCTAAAATTTTATAATATGACTC 

TTTAAATTCCTTAAGAATTCACAAGGACCTTT 

ATGTTGAAATTACTCATATGTAAGCTTACTGG 

AATGAGATGGCTCCCCAGTTGAAAACACCAT 

TCTTAAAATACTCAGAAAATAAGAACGAGGC 

CAGCCCGGTCTACAAAGTGAGTTCCAGGAC 

AACCAGAGCTATACAGAGAAACCCTGTCTCA 

AAAPAAAAAPAAAAAPAAAAAPPAAAAAAAAA 

ACAAAAAAGAAAAAACAAAACAAAACAAAAA 

GAATGTAGATATAAAGAAAGAATAGTGTTTG 

CTGGAAATAAATAGTAATATAAACTTAACAGC 

AGCCTGTCAATTGCAGGGTTTTTGCACTTGC

AGCTCAGAAAGAAGTGACCCTCCTCAGGAA 

GTAG 


P001086 


R 




IM000895 


GTGGGTTGTGTGACTCAGAGAGCAAGCTTCT 

ACCTCCACAGGCAAGGATGCCTGTGCACAC 

AGAAATGAGATGAAGTCATATGTGGGGACTG 

GAGTTGCAGTGGCTCCCAGAAGGAGGTGTG 

CAGAGTTCAGGCTGGAGTCCAGATGAGGAA 

CATCAAATAGAGAGGCCTTTGGAGGGAGTG 

GGTTCTCTTGATAAGTAGGACTGCCACCCAT 

ATCAAGTATAAGACTGCCAATCATACTGAATC 

TCAGGTTATTTCCCATGTAGCATTGGGAACA 

TATAGCATTTGTCACACTGCTATAGCAAAGAA 

TCTGTGATGAGGTTGGGAGTGGAGGGGAAC 

GCCTTTGGTCCTAGAAAAAGAACCAAAGGTA 

GGCTGATC 


P001087 


D 
IM000896


CCTGCCCTTGCCAGACCCGACCGCAGCTCA 
TCGAGGAGGTACCCTCTAAAGTCGTCACCTT 
GAGGAGACAAGCTCTGTCATAGTGCTCGCA 
GCCCCGCGGCCCCTGCGCCAGGTTGCGGA 

CCTCCTCCTCCTCCTCCACCACCTCCCCCTC 
ACCTGCCACTGAACCTTTCCCCCAGCTTGGA 
AGCCACGCCTTAAGGAAGCAGAGTCGGTCG 
GACACCCGCTCCTCCTCAGAGCAGCGGCCA 
CCAGAGTCAGGAAGGGGGGGTCCAATCACG
TGATC 


P001088 


R 




IM000897 


GCTCAATTAGTTTAI 1 IAAAI IUAAAAuAAAIj 

CTAAAAGCCTGATGTGTCAGTTGCCTTCAGC 

AGAGCTGTTTGGGGCCCATTGTTAATGTTGT 

GAATTAAGTTCTGATGTAAGTAACCAAGCCA 

CTCCCCACACTCTTACTTGCAAGAGTTCCAG 

GCAGATGTTAAGGTCAACCCACCTGACTCTG 

ATC 


P001089 


D 


- 


IM000898 


GATCACAGTGTTTATCTCAGCAACAGAAAGC 
AAATGAGGACACACCTGGGTCTCACTGATAT 

A PTTP r*~Tr* ATATrTr*TArTTATTATrTrTr'AP 

ALr 1 \ k?\d I IjA lAlblbi Ao 1 1 A 1 \ A 1 o 1 U 1 UAU 

AGTAATTGGACAAGGAAGAGAGTTCATTGTT 

TTAGAATGTTGTAACTGGCATTGTTCTTCTCT

CTCTTGTTTTCATAAAATCTCACAATATCTAC 

AGCTGTGAGGTCCAAGGGGCTCATTGGTGA 

TACCCACTCTTTCTACTTTGTGTGACCAACCT 

CTTI 1 GGATGTCAAGGGT 


P001091 


C 


- 


IM000899 


GATCAGTTGCTATTGCTTGATTGATTGCGAG 

A PTTTrTT AAPAA^A PTPTTTPTPTr fT^TO A 

AUTTTCTTAAUAAoAoTOT 1 I \y I U 1 I U 1 OA 

CTCCCTAGCTTCATCTTAGAACTTAAACCCAC 

AGCCCAAATGAGTAGTTGTATGTCATATGCC

TCGGCCAAAGCACGACTGAAAGGAAAAGAA 

AGGCAGACACTGGAGTGCAGGAAGAAGACA 

CAAGGCAAAGCCCAGAATTCAAAAGTAGAAG

CACAGATTGTTTTCTTTGTTT 


P001092 


C 




IM000900 


GTACCCTGCATCCCCGGTGTGGCCTTGGAG 
TCTGATGCCAGCACTACAGAGCCAAGCCATA 
ATACAAACCAAATAGAATTAACAAGAGCTCC 
ATATGATC 


P001093 


D 


- 


IM000901 


GATCACCTTCCTAGGATGAACGAAGAAGGAT 

GGCTGGAGGTTAGGGACCCAAGGGACTTCC 

CCCTAGAGCTGGCTGTGTACCCTAGGCATGT 

GTGACTGCAGCTGTACAAGCAGGGTATTCTG 

GGATTCACAGTCCTCAGGATAAGATGACACT 

ACAGATTCTAAGCTTTATACCCAACATGGTG 

GAACCCCATGGTCACACTCTTTCACAGATGG 

TCACTCCCATTGCCCGAAGCCCAGCCTTTAT 

CCAAG 


P001094 


G 
IM000902


GATCAATAACAGCAAAAGAAAAAAAGAAGTT 
TACTTTTCATGTAGCAATGTGGATAATTCCCA
TCCAGAGAAACAAAACCAGTTCCAG 


P001095


C


— 


IM000903 


GATCAGGGAAGATGTCACCTCCAACCCAGC 
CTAGACATGGTGCTGTGACCA 


P001096 


D 




IM000904 


GATCAAGGAGCAACCCAATAGCTTCTATTCC

CCCCCTACTAAAATATGACCCACTGATGGAT

TCTGGGGATGCACAGATGTTCTCAGAAGTTA 

CTGATGAACACACCATGCTCTAACAAATAGT 

ATCAAACCCACAGTCACAGATGGCCCTAGTT 

AAGCACAGTGCATCACAAAGCAAAGCAAAGA

GCCTTGACTGTGGGAAAGGTACTTGTGGTGA 

GGACTAGTGGGGTATGAAAGAAATTAGAGAG 

GATGAAGGTAGTGATATTCAGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

GTGTAAGACTATTAAAGAACACCCTTTTTTAA 

AGAAAGGCTTTCTTGAGTGTCACC


P001097


R 




IM000905 


GGTTAATAAGCTAGATTATCGTGTATATATAA 

AGTGTGTATGTATACGTTTGGGGATTGTACA 

GAATGCACAGCGTAGTATTCAGGAAAAAGGA 

GACTGGGAAATTAATGTATAAATTAAAATCAG 

CTTTTAATTAGCTTAACACACACATACGAAGG 

CAAAAATGTAACGTTACTTTGATC 


P001098 


K 


Myc 


IM000906 


GTGAACGACAGCAGAATCGGGTTGTACCTCA 
AAGCACTTACCTTTCCCAATACACCTGATC 


P001099 


D 




IM000907 


GATCAGTGACAATGTAGCTTTGCCTGGAAGG 
ATACTTGAGTC 


P001100 


D 




IM000908 


f5 ATP A f5 P AA A ATfi fi £5 AP ATP R A AfTTTfi AAP P 

AAAGTCATAATAAAACATCCTGAGGTACATAA 

ACACTCTGTAATAGACTAATACAGTTCCTCCA 

GGCACCAACAGAAACCTTGACTACTTCCCTT 

GACTACTTCAGTCAAATCTTCTGATAAAACCA 

GACCCAACTTGGAAACGTCCATGTATACAAT 

G 


P001101 


D 




IM000909 


GATCATCTGCTTCTACCCCCAATTAAAAGAC 
GGACTAAGAACATAAAAAGAATCCAGGCACC
TAGGTTTGCAGAAATCTAAAGGTTGAGTTCC 
TTT 


P001102


V 




IM000910 


GATCACAAGTTATAGTTGAATAACAAGTCCT 

GTGTGTGTCTATGTATCCGTATATCATATTTT 

CTTTATCTGTTACTCTATTCATGGAAACTAGG 

TGGATGTGTTAACTTGGCTATTATGAGTTTTG 

CTGCTAT 


P001103 


D 
IM000911


CTACAATGGTTCAGGCTTTGGAATATCACTCT 
ATAGGCTGTCTGCCGGCCACCACCCTTCAG 

CCGAGAGGCAGTCCCCATCAGCCTGTCTCC 

TACACCCACACACTCTGTGTGGAGACCACAG 

GCGCCCAAAGGGTATGCTAGTCTCTGCTCTA 

CCGCGTACCCTCTCCTGAAGGCAGGCATTTC 

AGAGATTCCAGTTTCACCAGGAAGCTCAGAT 

C 


P001104 


C




IM000912


GATCTTTTCCCCCTTTGTAGTATCAGAGAGAA 
AAGCCATGGCATGCATGGCACATGCTAGGC 
AAACACTCAAGCATCCTACTCTGTGATGCAG 
TTTGAAACAAAC 1 1 II I ! I 1 I C 1 1 1 1 1 CTTTCT 
1 1! 1 1 ICI 1 II 1 ICI 1 MCI 1 1 1 ICI 1 1 MCTTT 
TCI 1 1 1 II 1 II 1 1 II 1 IGAGT 


P001105 


R 


-


IM000913 


GATCTCTCCCCATCCTCCTGTTGCCTCTTGT 

CTGTCATACCTCTACTACTCCATCAGTTTGCT 

GCCTCTGAGTCCCTCTTCTTCCTCTCCTATC 

CCTCCTCCCATCTTCCTCATCTCCAGGTCTC 

TCCAGGTCTTCCTTCTTCCCTCTTTTCTTCCC

CTTTTCCTCTTTCCACTGTCTTGTATTCCCTT

CCTTTCTCTGTTGGTCCCTTCCCTCGCACCT 

CTTTCCTCCTGTCCCTCCTTTTCATGTACCAT

ATTTCTCTTCCTCTTTCTGTGTCTCCTCTTTC 

CTTCCTCCTTTACTTTCCTTCTAACCTTCCTC 

TTTCTCCTCCTCCGGCAAGCCTTTGCTT 


P001106 


A 


Gata1


IM000914 


GGTTGTTCCAGTTAAATTGGCTCTCTACAGG 

AACATGGCTTAGTTCTCCCTTAGCCTTTCATG 

ACCCTACACCTCAGACACTAGTCAAAGTCTA 

GCTTAATAAAGTGTTCAGGATGTTGGTGGAG 

GGGGGGAGATTGTTAATACAGATC 


P001107 


D 




IM000915 


PPA^OAPTT I ArTATr^^T^ATATOTTPTA A 

IjoAOUAO 1 1 1 Ao 1 A 1 ow 1 LrA 1 A I o 1 1 O I AA 

CTTTCTTTCATTTTCTAATTCTTTCCATCTGCA 

TTGATTGTGCCCAGTTATCATTAGTGACTTAT 

TTTAGTAACTTAAGGGAAAGTTGTCTATGCTC 

TACTTAGTGTCGATTTAACTTACTCTCCAGAC 

ATGGGAGTGCTTATTTTTGTTTGCCTTACCTC 

ATCCAGGAGCTTGTAGATC 


P001108 


D 




IM000916 


GATCCGATTATGAAACCGGTTTTGAAC 


P001109 


D 


- 


IM000917 


GATCTGTGGAATGCTATCCAGCTCTTCCAAC 
AAATAC 


P001110


D 




IM000918 


TTAGTATCTGCATCTGACTCTTTCAGCTGTTC 

GTTAGGCCTTTCGGAGGGCAGCCATGCTAG 

GCTCCTGTCTGCAAGCACACCACAACATCAG 

TAACAGTCTCAGGGGTCTGAGCCTCCCCTTG 

AGCTAGATC 


P001111 


R 
IM000919


GATCTGTGGTAATGATTCTGTAAATACAGATA 
AACAACGTACACATGGGAATTGTTCCCTGTG 
TGAAAGTGTTCATCATAAGGTG I I I I IATTTT 
ATCTACAATATCTTTGGGTTTT 1 AG 


P001112 


D 




IM000920 


AO 1 bUUAL/A 1 1 V/LU 1 AAOAUO 1 OA 1 UMAnuA 

AAACAACACCACAGGTCTCAGGCTGCCACTC 
TAGACCTCCGAGTTGACTCTGGCTCCTGCTC 
TCTGCAAGCAAACACGCATCCCTCAAGTCTT 
CATGCTGGTTCTCTCAAGTCTTCATGCTGGC 
TCTCTGTAGTTCTGTAAGCTTACCCTTTCAGT 
GGTGATTTGGGGAGATC 


P001113 


D 




IM000921 


GATCTCCTGGCTTTGTAGATAAATGAAGAGA 

GTTCGTTACCAACTGAACTAAAGAGCGGCAC 

AGGAAATTAAAAAAAACAAACAAACTGATAGT 

TAACTCAATTGAGTAAGTATGGAGTTTTGGG 

ACCAAGACATATTAGGCAAACAGACAGTTTA 

AGGCCTAG 


P001114 


D 




IM000922 


GTTCCTGTACTTTATCATGTCTTACCCCTACC 

TCCCTCCATTTTAATCATCTTTACTGGGATGT 

AATGCATTCCTTTGTCCATTCCAGGATGCTAT

AACAAGATACCTTCAGCCTGTAAGCTATAGA 

ACAGTGTGGTCCTCAACCTTCCTAACTTTGT 

GACCCTATAATATAGATC 


P001117 


D 


- 


IM000923 


CCANCGTGCCANACTCANAANGGAATTTTAT 
TCATAGATTCTNTCANACTGCTGTCCCACAT 
GTGTTCAAAANCAGGTAGGTCTTGTCANAT 


P001119 


D 






GATCTCATTGCACAGAAGAGTTAGAAGAAAG 
AAAGAAAAGCAGACTGGGAAAAA I 1 ! I IGCA 
GCGAGCATTCAGAGATTGAACATCTATCTAA 
CTTATGCAAAATTCCTATCAAAAGAAAAAAAA 
AGCTTCAACAGCTGGGTAAGTTAAAATGTAA 

Lr \ A 1 AAooOAAOAUAAljoOAAAo 1 o I I \j 1 1 O 








IM000924 


Mill GCTTGTTTCCGAGATGAGCTCAATTAA 

AATATCAATAGCGACAACAATTCTGAGCTGG 

ACTAACAAAGAGTAGAACAATACTACCCAAC 

GCTTGTGGTTAGGTAACCTTACACAATATTTT 

CCTAATGCTATTCGGCAATAATTGTCAAGAAA

A 


P001121


D 




IM000925 


GATCTTTTCCTACAAGACTTCTGGGTGACCTT

GCCAAGCCCAGCCACTGGCTGTGGTACCTC 

ACCAGGACACTCGGTGGACATTAGGTAGTG 

CTCCCCAAGTGCTAGGTGACAGTTTATGCTT 

CAAAGTGACTCCTGCAC 


P001122 


D 
IM000926


GTGCTGACGCGCCCTTGCATTTGGGAGAGC 

AGTCAAGCTATCTGTACCTTCACCGTAAGAC 

TACATTGTCACTGCTGGCTTCCCTCCTGTGC 

AAGGGACGCATTTGGGTCAGACTATGCATGA 

AACAGGACAACAAAGGTAGGGCCATTGGTA 

GATC 


P001123 


D 




IM000927 


GATCTCACTGAATATAAAAAGACATCAGTCC 

AAGGGTGGAAATTTAACCAAAATAATACAATT 

GTTGTTG 


P001124 


D 




IM000928 


GATCCTCCAGGAACTAGAGTTACAGACAATG 
CCCGCCTTGTATT 


P001125 


D 




IM000929 


GTGGCAGTGACTGTCCGTGTGGGAAACGTTT 
AGCAAGTCCGAGCGTGTTCGATC 


P001127 


K 


Nmyc1


IM000930 


CAGGAGAGTGTCTCAAAAAGCAGCAAAGCA 

CCCAGCACCTTAGGGTGAAGGACCACTTCT 

GGAATGTATCCTCCCAGTTGCAAATGTACAC 

TGTCTCATTCACTCCTGTGACATACTTTGTTT 

GTGAATGCTAATATCACATAGTTCGATC 


P001129


C 




IM000931 


CCAGCAGAGACCAAGCATCCAAAACATGAG 

CCCATTTCAGGCTTCAACCATAGCAGCTCCC 

ATCTCAATCCTGTTCACCCCCCACCCCACCC 

CCCGCTTCTCTATTTAAATCACCACTCTCAGT 

GACCAAAAAGATGCTCATGGCAAATGGACTC 

TTGGCTCTC l I M ACCTAATACTGAAGGTAAC 

AAGATAATCAACTGTTTCCTCTCCTTCCCGG 

GGACCTCATCATACAACATTCTCCCACATGA 

AATTATCACCACGTCCAATACCCACATCCTC 

CCCGTCCTGTAGAGAAACCACATGCCTAGCA 

GCAGTGGTTTCCCACCTCTGTGCTCCCTTCC 

ACCTCGATC 


P001131


D 




IM000932 


GATCGCTGTGGTTGGTGTCTGTGTATATGCA 

CTGTACATACTAACCAGGTACACACATAAATA 

TTTAATATATAAAAAATAAAGT^mTTCTAAR 

AGGCCCCTAGGCAGGGACGTTATAAAACATT 

TCACAAAGCAGCAAAACAAAATTGATACAAT

CAAAAAAACAACACTATAACCAACATAGGTG 

AAAACAGCCAAACACATAATGTACAATCTGG 

TGTTCCAGGACAAACATCTGTCATATACATG 

GTATATACATACATACT TTT 1 CACTCAATAA 


P001132 


B 


Mm.36692 


IM000933 


GATCGCTAAGTGTGCGCGGCCGCCGTCTGG 

AGAATGAATGGAGGGAATGAATGAGGGTGC 

GCGCGCCCGAGGCCCGGCTTGCGTCAGCC 

ATGCGTGCCCGGCATGGACACGGCCTGGCC 

TTCCTGGGAGGATGGGACCGGATGCAGTTA 

GTCCAGGCGTTCAGCATCCCAGGGCCCTTC 

CTCTGTTGCGTGGTCTGAGTAATCTGTCTCG 

GAGAAGATAGCGT 


P001133


g 


Mm.15152 

IM000934


GGAGGTCTCTGTAGGTGCTTAGACTCACGTT 
ACAGTCATTCCAGAGGAGGGAGCTGCAGCT 
GCTAGTTTCTGTGCACACCGATC


P001136


D 




IM000935 


GATCGGCTGTCAAGACTGGGGAAGGGTCCT 
CCTAG 


P001138 


D 




IM000936 


AAGCAAGAGGTAATAAAATACATGTGGATGG 
ATGACTCAGGGGTTCAGAGCATACACCGATC


P001139 


D




IM000937 


GATCGGGGACCTTGCATAAAGGGGTCCAGG 
GCTCTCAGTCCTTGGGAAGG 


P001140 


B


AA709647 


IM000938 


GATCGTGATGACTTCATAACCATCACGTGTG
AAAAGACTTAATGGCGCTGAATTCACATGAC
ACTTAAAATGCACAAAGTAACAAATTTTATGT
CACATGTATTAAACTACAGCTAAGTACATGG 
GGAAAAAGTTAGACTTAGAATAACTCATCCA 
GAGTCATATGGTAG 


P001141 


C 




IM000939 


GATCGAGGAGTAACCCAATAGCTCCTATTCC 

CCCCTTACTAAAATATGACCCACTGATGGAT

TCTGGGGATGCACAGATGTTCTCAGAAGTTA

CTGATGAACACACCATGCTCTAACAAACAGT 

ATCAAACCCACAGTCACAGATGGCCCTAGTT 

AAGCACAGTGCATCACAAAGCAAAGCAAAGA

GCCTTGACTGTGGGAAAGGTACTTGTGGTGA 

GGACTAGTGGGGTATGAAAGAAATTAGAGAG

GATGAAGGTAGTGATATTCAGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

GTGTGTAAGACTATTAAAGAACACCCTTTTTT 

AAAGAAAGGCTTTCTTGAGTGTCACC 


P001144 


R 


- 


IM000940 


GATCGGGCCACATCTCAGACACTCCTATAGC 
TACAGAGAGATACCGTTTCCTGTTATCTTTGC 
AGACAACTTTATCTGTTACTCAGAGAAAACCT 
CCAGGTGCCCCTAAAGAAACTGGGCCCTAC 

A Tf* A AT A f*f*r* ATA/"*/"* AAAA* l\ Tf~* A A /"* A T 

ATCACATACCCATACCACACACATGCAACAT 

GCAAAACATACACACATACATAGACACACAC 

ACCACACGCACACAGACACATACAGACACAC 

ACACATACTATACATACAGACACATATGCTAC 

ACACATACAGACACACACAAGCACACATACT 

TCACACACAGAGACACACACACCACACACAC 

ACAC 


P001149 


R 




IM000941 


GCCTGCCTCTGCCTCTCGAGTGCTGGGAATA 
AAGGCGTGCTAGAGCCTTCACTTGGCTCTCT 
CTCTCTCTCTCTCTCTTTTAACCTCC 1 1 1 1 IC 
CTTTAATGAGTTATTTA 1 1 1 1 IATTTTATGTGC 
ATTTGTGTTTTGCCTGTATCCGATC 


P001151 


R 




IM000942 


GCTTCAATATTCGAAAAGAATTAGTAAGAAAG 
GCTGTTCGATC 


P001152 


D 
IM000943


CACACTTGGCTTCCTCTGCACAGAGGGACCT 

CATACCAGTGAGATGGTGATATGCTCCCTTG 

TTCCTGAGCCTCAGTGGAAGCGACTTTCTAT 

GGATACTCCCTCCCTCGTGCCTCTCCTTCTT 

TCCCTCTCTGCTCTCCCCCCCCCCCCTCGCC 

CTCACGATC 


P001154


D 




IM000944 


ATACACACCATCAGATATACCTCATTCTGATA 

TACCTACAGGTACACCAATCACACACACACA 

TTTACTCACATGTACATGCACACACCACATC 

GGTTAGAACCAAAGACCTCACACACACCCCT 

CACACATGTTTCATCTCCATTATCAGTGCCGA

TC 


P001155


D 




IM000945 


GATCGTCAGGTTATGAATGCCAT 


P001156 


C 


- 


IM000946 


AGTTCTCAGAACCAGCTACTGTTTACACAGG 
GCCTCATGCAGCCTTGCTGTCCTCCATTCTG 
CAAGCACAGGATACACACCCCTGAAGGCCA 
GATTGTCAGGTCAGCCCGATC 


P001157 


C 




IM000947 


CTTCAAACCGGTCCTGCGAGGAGTCCACAA 
CCTCTGCCTGCCGATC 


P001158 


D 




IM000948 


GATCGAGGCCAGCCTGGTCTACAAAGTGAG 

TCCCAGGACAGCCAGGGCGATACAGAGAAA 

CCCTGTCTCAAAACAAACAAACAAACAAGAT 

TCCATTGAGGAACACCCAGATGGAGACATG 

GGTGTTCTCCATAGAAGGGTTAGGGGCTTCC 

ACACCGTTGACAC 


P001159 


B 


Mm.81366 


IM000949 


GATCGGTGTGCTTTCTGCAGTTTCAGCGAGG 

ACTCTGGGCCCAAAATGTTTTAAAGCAGAAA 

ATTGGTAACACTAGAGATATTGTCAAAATACG 

ATTTCCTCTGGTTCAGAAATGGCGAGAGGGA 

GGGCTGGAAGGGTGGAGTGGGAAGGAATTG 

TCATCAAAGCATTGTTGATAC 


P001160 


B 


AA408945 


IM000950 


CTGTCTCAGGCATGAAAACACTAAAAGATGA 
CCAATTTCAATAAAGATGACCTGAATGTCTAC 
TCAATTCCCACCATAAGGTCTACAAGATGTA 
AATGGGCCGATC 


P001161 


D 




IM000951 


GATCGTGGAAACAGAGCCTTGAATATAATGA 
AGAAACAGAGGGCAGGCAGCAGCCGCAGCA 
CAGCAGGGGCACTGTGAGCAGGCAGCAACA 
GGGGG 


P001162


D 




IM000952 


CTCCCTACTACCTTCCCTTCCTGGACNTCCA 
CTGAGATGAGGCAGGATAAAGGGTCAAAAG
AGACCTGACCTTCTCTGCCAAAGCCAGGGAT 
TTCTGGAAGAATAGAAATGGTTCTGGAATTC 
ACAGATGCAGTGGTCTAGGATC 


P001163 


C 
IM000953


GATCCATAGGTCTCTGCTTTCCCCATTCAGG 
GCTGGAGTTATAGATATCTGTCTATCACCCA 
GCTTTTATGTAGGTTCCAGG 


P001164 


D 




IM000954 


TATGTATCTACAAGCCAGAAGAGGGCATTGG
ATC 


P001166 


D 




IM000955 


GATCCGAGTTCTCTCCGGCCACGTACCTTCA 
CATCCCATGCACCCTGGTATGTAAGAAGAGC 
CCAGCTCAC 


P001167 


D 




IM000956 


TCCCATAATATTTCCTCAGAAGGATC 


P001168


D 




IM000957 


TATAGTTCTGCCTGTGGAGTGTGAGCAGAAA 
TGTGTATCGTTTCTGGGTCAGAGCTTTCAGG 
AACTGAGCATGACTGCTCTACAGTGTCTTTC
TCCTTCTGCCTGCTGAAGCCCTAGGGGACAA 
TAGAACCACAGGATGAAAGGACTCGGGATC 


P001169 


D 


-- 


IM000958 


GATCCAATGGCAGCTAGCAGAGTCAGAGAG

CCCTCACTCCAGTTAACTAGGGGACCCACAT 

GAAGTTCAAGCTACATATCTGCTACAAATGTT 

TGAGGGACCTCCTAGCTCCACGCCACATGC 

TCTTTGGTTGGTGGTTCAGTCTCTGTGAGCC 

CCACTGGGCTCAGGTTAGTTGACCTACAGTC 

TTCTTGTGGTATCCTTGACCCCTCTGACCCC 

AGAGTTTAACAATAGGCCTTCTGACTCTAGA 

AATCTACCTACA 1 II 1 1 I CCACTTTAAATTCCT 

CGGCTCACATAATACCAATGAACT 


P001171 


R 




IM000959 


GATCCATCTGCACAGTCTGTCACCGGGGTCC 

AGCAAGTAGCAGCCTTTCTGCTGCTGTCTGT 

CAGACCCTCCAGGGAGGGAGAGCTTGTCTT 

CTGGCCTCCCAACAGGACCCTGCGTGACGA 

TGCAGGGACAGCAATGACAACTCATTCCAGA 

CTCCAGGTCCCTGGAGGAGCCTCCCACAAG 

GGAAAGAGACTACTTCACTGGTCCTGGGCC 

CCTCTTTGCGCGCCCCGCCCCCAGACTCAG 

CGTCTAGTGTTGCTGGGCTCCCCT 


P001172 


K 


Pim1 


IM000960 


AGGGTAACAGGCTTAGTTTGGGGCCTTTCTG 

TTACAGGAAAACCATGAAATGTCCTGAAGTG 

CTCAACAAACAGGGAATATAGAAAATCATAAT 

GGTTCCTCCCTAGCACAAGGAAGCATGTTTA 

AAAATTGCAGCAAAATAAAAAAGAACAGATTC 

TTAAGATTGAGGGATTTTACGGGGTGGTACT 

TTTTCTTTCTCTTATAAACATTTATTTACTTTT 

GTTATTCAAGACAGGATC 


P001173


D 
IM000961


GATCCAGCTGTTTGCTAACATACGTAAAGGT 

ATGGATGCTGAGAGAGTATCTATCGAAAGCG 

AAGGCACCCTCCCCAAATTCAAGAAAGCAGC 

TGTTTCTAGAACCAAAGACACCACCGCCGCC 

GCCGCCACCACCACCCGCGAGCGCCCGGA 

CCCTGTTACAGAGTGTC 


P001174


C 




IM000962 


GATCCTGAAATTATCACATTTGAATCAAATCA 
TGCCCTGCCGAGGATAAATAACCCAAACGAC 
CGAGAAAACCGAGAAAAAGAACATTTACTGA 
CCATCCTTC 


P001175 


D 




IM000963 


GATCCAGTCCAGAGCAATGTTCACGTCTGTG 
ATGGTAT 


P001176 


D 


- 


IM000964 


AAAGGTGCTCTCAATACTTAACAATCCATAAG 

CTTGTGCTCTCTTAGTCGTAAAGGTGGGGTC

CATCAAAATCCCATGACACCACAGCGAGACC 

AAACTCCTTTTCTCTTACTCCGAATCACCCAT 

CCCATGTGGGAGACGAATAAGAACACAAACT

ACATCTTCAGTGACATAGAGTAGCATCTGCA 

ACAGAGGAAGTGGATGGAGACCTTGTCTCT 

GGTCAAAGACAAAGCATGTGACAGCTGAGC 

CTGGCACTTCCTACTTGGGTCACAGCTCAAA 

CCCACCTGAACCAACAGCAGAGCCCCACAG 

GGATGGGACTCACATGTTTCCCTCTTGCCCT 

GGAGCTTCGTGCATGTTGTTAGAAGCTAACT 

GGCTAACACGCACGGGAACAGGCAATGTAG 

TTGGAGTATGAATCGAAGTCACTGGGCATGG 

TCCTCAGTCAGCCAGGATC 


P001177


C 


- 


IM000965 


CTAGACTAGTATGGCAGAACCTATCTTCTTCT 

AATCATTTAGATGAATACTCCACATGAGAGA 

GCCCTGAGAATATCTGTAAAAAGTAATCCAG 

GTTCTGTTACTTCTAGCTAATCTTATCTAGGT 

AATAATAGATAAGGAATCGGGATTCACGAAC

ACAAATACCTGTACAAAGCATGTTGTCTCAC 

ACGGGACGAACACTGTTTCTGCTGTGCTTTA 

TAACGCTGGGACATACAAAACTAGACTCTGC 

CTAAGAAGTGTTTGGAAACATTTGGGTTAAAT 

TATAGTCAGATAAAACAACAACCATGAGTAAA 

TCGAAGAATATAAAACTAGGGATC 


P001178 


C 




IM000966 


TTTCCTGGACAATAATGTTTTCTTCATTAAATT 

I AU ALr I 1 AoAuLA 1 1 o 1 U 1 1 AA 1 ULr A 1 uAA 1 A 

ATTCCCAGCTCCTAGCTCATTACCTGTGACA 

CAGCAGGGATTCATACATTTATTGAATGAAT 

GGATGAGTGAATGAATAAAAGAATGAGCATA 

TCAAGAGGATC 


P001179 


D 




IM000967 


GATCCCTTCTGTCTTTGGTTATCTC 


P001181


D 
IM000968


GATCCACCACTGAGCCACTTCTTCAGCCTGT 

GACTGTCATTCTTAATCATCCACACAGACTTC 

TCCTTGGCAGATTTTGCCCACCTCTTAAGAC 

TTTCACAAAGGTTTTTTTCTTCTGCAGGGCAC 

ATGAGAAAACAACTCTGTCATAAAGAAACCC 

AGGAAGAAAACCAGCAGAGGCAGGTGAGTT 

AAGCCTGTGGTGGACATTCCTTCTGGGGATG 

ACCAGATGGGAACAGTAATTCACAGAGGCA 

GAGGGGTCTGCAGTCACTCTGCATGCCACA 

TGTGTAACCCTTAAGAAGTGAGGAATGCTCT 

CAACAGGAAAAACACAGCAGCAAATGCTATG 

ATACCAAAGCCACAACTCCATGGGTCCCTGG 

AGCCTCTCGAACTAAGCTGCCAGCTAGGGA

GCTAACACTAGCTTTGGATGAAACACAGCTC 

TGGTAGAGTT 


P001182 


C 


- 


IM000969 


GCTGGGATTTGAACTCAGGGCCTTCAGAAGA 

GCAGTCTGCTCTTACCCGCTGAACCATCTCA 

CCAGCCCCCTTCCGTTCTTCCTTTCTTCCTTC 

0 f T 1 I I I I 1 1 1 CCACATTGTTTTCAGACTGCA 

CCTTGTTTAGTAGTCTAGGCTGGCTTCCAATT 

CCCCAATGATTGAGCTATGGGTATACTCTCT

TCACCTACTTTGA 1 1 1 1 1 1 GTTTGTTTATTTGT 

II 1 1 11 G 1 1 1 M 1 1 GAG AC AG GGTTTCTCTGT 

ATAGCCCTGGCTGTTCTGGAACTCACTTTGT 

AGACCAGGCTGGCCTTGAACTCAGAAATCTG 

CCTGCCTCTGCCTTCAAAGTGCTGGGATC 


P001183


R 




IM000970 


GCTTCATTTAATATACATCATTTACCAGAAAC
CACAGACATCTTTGTACCAACATATAGTAATA 
TTAATCACAATAGCCATCACTCTTATGTAAGG 
ATGAGAAGACTCCCAGCTAATATGCTAATGT 
GTAGAAGATGCCAGATGGATC


P001184 


D 


-- 


IM000971 


GATCCCTGCTTCTGTAAATCCGCAACGACAA 
TTGTTATCTTCTCCTTTTCTTTCTTTTATTTGT 
TTTATTCTATTTTA 1 1 1 ! I CAGATGAAAA 


P001185 


C 




IM000972 


GATCCTCCTGCCTCTGCCTCCTTCAGCAAAT 
CCTACCGGCGTGCGCCACCACTACCGGCGA 
AAAA 


P001186 


R 
IM000973


GATCCCCCTTTCTCTCTGTCTACGGCCTCTG 

TCCTGTGTTAGCTGTAGGCCTACTCTGTATG 

AACAGACCTCAGCGGAGGGGTTTGGACTTG 

GGCTTGTGTTTCTTAAGAGAATGGGGCTTCC 

ATGACTGTCCCTCTGTCCCTTTCATCCTAACC 

CTGCCTCCCGCTAACAGGCAGCCTGTATGTT 

TCTTGCACTGTTCCTTCCTCCTGACGGTCTG 

AGTCGTTTCCCTCAGAGACTGTTGCTGCTGC 

TTCAGCTTTCTCTCAGCTTCTCTCAGGGCTTC 

CGCTCTGGAGTTTCTCCTGCTTCTCTGTTTAC 

TTTTCAAAGCTCAGCCTCCATCTTCTGCACCT 

GCGGAGTCATCACTGATTCCCAGCTGTGGC 

CTGTCACCCTTCCCTTTCTTTCTTCCTCCTGT 

GCCACCACCATGCACCCTCCCCTTCTGTCTG 

TTGTGTTGTCCTAACCTTTCTTCTCCCCATGC 

ACCCTCCCCTTCTGTCTGTTGTGTTGTCCTAA 

CCTTTCTTCTCCTCTCTGTGCTCTGCAGGTTT 

TAGGGTCTCTGTATGATTTGTACCTGCATTTA 

TTTGAACCTCCACTCTTCTCTTTCCCTCTCTT 

ATC 


P001187


D 




IM000974 


GATCCTGCAATACCTCTCCTGGGCATATATC 

TAGAAGATGTTTCAACTGGTAATAAGAACAC 

ATGCTCTACTATGTTCATAGCAGCCTTATTTA 

TAATAGCCAGAAGCTGGAAAGAATCCAGATG 

TCCCTCAACAGAGGAATGGGTACAGAAAATG

TGATACATTTACAA 


P001188


R 




IM000975 


ATCTAAACTATAATAGTTGCAGGGCTAGTTCA 
TTGTCAGGTG^CGTGGCGAAAGAGTGCAAAT 
CCCGGGGGTTCTTTCTTCAGAATCAACGAGG 
CAATACACTTGAACATGTATG 1 1 1 1 1 GTAATC 
TGCGGGGCATCACCCGTCCTCCAGGATC 


P001190 


D 




IM000976 


GATCCCCCAGAAGTGATAGTTTAACAGTGAG 
GTGAATGCAAGCAATAAGCTACCTAAATCAT 
TAAAACTTCCTAI ! 1 1 ATTAGCATCTATTAGTT 
GCACACAGCAGTGATGGGTTTCATT 


P001192 


K 


m 


IM000977 


GGACCTCTGTACAAATGTCGGGAGATAAGG 
GAAGAAAAAGACGACAGAGATAGCAGTCAG 
GATGTAATGTGTACTAGATGAGTGGTTCAAG 
CAATAGGATGGAAAGGGCTTAGCAGGAGAG 

ATTTTT A A ftfi ATf5 A f5 C± C" A C1T A rt. ATT A P ATP 

TGGGAAATGTCACTGGAACTGGATC 


P001194 


D 




IM000978 


GATCACCAGGCTGGGCAGGCCACCTAAGGA 
AGTGGCACGGGCACGGGCACTTCCCCAGAG 
CACCCTCTGGGCACTCTGAGAGGGGCACAG 
ATGTACTGCACTAGGCTGGGCCCGGAGGAG 


P001196 


D 
IM000979


ATATAAAATATCGAACGTCCTCTGGCTTGTAA 

ATATCATGTTAACCTTCAAAGCGTTCGAAAG 

CGCAGGAAATCTGAGTCAACAGAATAGTATG 

TAAGTTTATTTTTATAGAACCTGCCTGAACTG

CAAGGGAGGGGCGGGGCGTGGACCCAGGC 

CTGCCTGCCAATCTGCGCTGCCAGTGAACTA 

AGCCTGATC 


P001197 


D 




IM000980 


GATCAAGTCCTGGTCAGTACCAAGTTAAAAA 

AAAAACTATATAAAAGCTATATTAGGGGACA 

GCTGTGGCTTTTGTAGAAAAGAAGGTCCTGG 

TGCTATGACCTGCAGATGCCCATGTGGAAGT 

CTTCAGATGAAGACTTTCTCATGGAGTAAAC 

ATACTCTGTTGTTTGACCATGTGGACTTGGTT 

CAAAATGCCCATGGATGCTCCTTTGGGTACC

AGGCTTCAGTGGGAGTCCCAAGCCCATGTC 

TTTATTTGAGCATGAGCAGTACTGATGCTTAC 

CTAGTCTTATTCTTTCCTTGCCCCCTGCCTG 

GACCGTCTCTGGTTACAAGGATGCTGCAGTG 

GGAAGCGGTATGACCGTTACCTTTATGGGAC

TGAGACCAACTAAGGGGAGGCTGAGGAGGC 

TGCAGTGAAGTTATTGTTGGGACTGTGGGCT 

AAGATGGAAGATAACATGTTAACAAACTCAA 

GTGCGGAGGTCTCAGAAGTAAAATTGCCTG 

GTTAGTA 


P001200 


D 




IM000981 


GATCAATTGGTAACCAAGCCTTGAACTGAAG 
AGTCGTGAGGTGGGGGACTTTATAT 


P001201 


D 




IM000982 


GTATCTCCCACCTGGCTCAATATAGGCTCTT 
TTCAAAGGCTAAATTAAGACCAAGGACACAG 
AAGGGTAGCTCGCTGGGCAAACGTGATCCC 
TGCTGATAGTGTAG 


P001202 


D 




IM000983 


CTCTCGTGTGGAGATATTAAAGGTGTGAACC 
ACTAAGCCCTGATC 


P001203 


A 


Scp2 


IM000984 


GATCAAGCAGAGGGGTAAAATAAGGGCAAG 
CTCAGTGTTAGACAAGCTCATAAGCCAAAGC 
TGTGAACTCTCCAACGCCT 


P001205 


D 




IM000985 


GATCACTTCAACATCAAGAAGTTACCCAGCC 

CCGGGAAGAAGTACATTTCCAGGAAGCAGT 

G I f I I CAM 1 1 I I GAGTCTGCTCCCATCCCGT 

TTCTCTGCAGCTGGGTAAACTTGAAGCTGGG 

CTAGCCTCTGGGTAGAAGGCAGCTAATGACA

ACTACCTTGCCTGTCCCACGGAGCCCGGAC 

AGAACCTGAGATAACACACCTAGCTTGCTGA 

GTAAAGGCAGGTTACTGTGTGAATGACTCTG 

AGCTGTTCCAGCTCTGCAGAGCAGGAAGTCT 

GACTGTGGAGATAAGAGATAT 


P001207 


D 
IM000986


GTCATGATTTGTAATTCCCTGTCCAACTCTCA 
TTGCTTAGGTCAAAATGGCTTAACTCCTAGC 
CTACTTCAGTGTAAAAGTCATGCGTAATGAT 
C 


P001209


0 




IM000987 


GATCAGGCTGGCCTCAAACTCAGAAATCCAC
CTGCCTCTGCCTCCTGAGTGCCGGGATTAAA 
GGCGTGCGCCACCACTGCCTGGCTGCTTTC 
I 1 1 I I I I ICI I I I I CTTTGTGTGTGTGGGGTA 
GTGGTGGTGGTGGTGGTGTTCGAACC 


P001210 


A 


Hsc70t 


IM000988 


ATGTGTGTGTGTGGCATGTGTGTGCCATTGT 

GTGTGTGTGAGTGAGTGTGTGTGTGTGTCTG 

TGTATGTTGTGGAACAGATTCCTGTGTATGTT 

TCCTTCTTCACACATGTTTTCAGAAGTGAAAC 

CAGGCTATGAAGACCGCCAGGCAGCTCTGC 

AAAGCAGTACTGAGAAGGTGGGACACTGCG 

GGGGTGAGAACAGTATGCATGATC 


P001212 


R 




IM000989 


GATCACACTCCATGAAGCTTCTCTTCTGCAA 

CAGGAAACAAATAGCAAGCAAAACCACTGGT 

AATCATTATGTGGTGTCTAACAGAGAGCGGT 

GACAGGGGTGGAAAACTGAATGACATTTAAA 

AGGAGCTGGAGATGTTGGTTTAAGGCGTGT

GGGGGCAGCCTACAGCATGGAATTGGTCCA 

TAA 


P001213 


D 




IM000990 


AACCATCATGGTAGCTTCTGCTTCTCTCCAC 
GAAGATGGTTGTTTCCACAGTTGCCCTCTCT
ACAGAGTGGTCCTGTATTAAGTCACAGGTGC
CATCCTGGTGATC 


P001214 


D 




IM000991 


GATCTAACCACCCGTTTCCTGCCCGGTCTTA 

GATAGACCTCTTGGCCCCCACGCACCTAGA 

CAATGGAGTAGACAAGACTTCGAGGGGAAA 

GAGGCTTCCCAAGATGACCCAGCTCATTGG 

CTTGACTCCCAACGCCACCCACTTACACAGT 

GAGTATCTCTGGTCTTTGCTGT 


P001215 


A 


Farp 


IM000992 


GATCTATGTCATCTTCCAGGACTCAGAGTTA
AGAGAGTTACCAAGTGAGAGCTCTCATCACC 
TTCTGAAGCAGTTGAGAATTGGAACCCAGAA 
AGATGCACATGCACGGGCACACACACACCC 
ACGGGCACACACCCACCCACCCATGCAGAG 
AGAGAGAGAGAG 


P001216 


D 
IM000993


TAGGTTGTGCCTGGCCTGTGCAGGACATGC 

CTATGGGGTCTTCATCCCTCTCACTTACTCTA

ATGTTCACTACTGACAAGCACTAGTAAGAAA 

GTAGGTGCCTGTAAGAGACTGGAGCAGCCT 

GCTGCTGACTTCAGCACCTGGGAGGCCTCA 

GTAGCAAAGCTTAGGGTTAGCAATCCTTGGG 

GCTGTGGCTGGCTGAGCTCTGGGGTACCGT 

TTAAGAGGAAAGCTGGAGTCCAGGTTCTCCA 

GGCCCTGGGTGCATCCCACAACCTCTCTCTC 

TCTCCTTTACCACTCGCAGCCTTGGCTAAGG 

ATGAGGACCGGGACCTGGAGTTATCTGAGA 

TC 


P001217 


A 


Snn 


IM000994 


GATCTCTCCCCATCCTCCTGTTGCCTCTTGT 

CTGTCATACCTCTACTACTCCATCAGTTTGCT 

GCCTCTGAGTCCCTCTTCTTCCTCTCCTATC 

CCTCCTCCCATCTTCCTCATCTCCAGGTCTC 

TCCAGGTCTTCCTTCTTCCCTCTTTTCTTCCC 

CI 1 1 I CCTCTTTCC ACTGTCTTGTATTCCCTT 

CCTTTCTCTGTTGGTCCCTTCCCTCGCACCT 

CTTTCCTCCTGTCCCTCCTTTTCATGTACCAT 

ATTTCTCTTCCTCTTTCTGTGTCTC 


P001218 


A 


Gata1


IM000995 


GATCTTAGATGGCCAAATGTTGTGAACGTTT 
CCTAGATGTGTCGTGAGCACTCAGGGTTGA 
GAGCCCTGGTTATTTAGCAAGTGAAGTGGAT
GTATACACAAGCAGAAGGCTGAAACTAGACC
CCGGTCTCTAATCCTATATAAAAACCAACTCC 
AAATGGACAATAGAAATAAGTGCAAGACTAA 
CTCCAGGGTCACTGGAGGGATACAAAGGGA 
GATGC 


P001219 


D




IM000996 


GAATGAATATATATATGGGACTAAATGCCAT 

/~\ A "T" A A ^% A A /"> A ^> A A /*» 1 T AAA r\ A A f\ AAA /SV 

GCCATAACCAAGAGAACTTAAAGAAGAAAGT 
GTTTAGTTATGCTTACTCTTTCAAAGAGTCCA 
GCTGCCAAAGGGATGCTGTCAGGAGTAGCT 
GAGAGCATACATCTGGACCCATTAACAAAGA
AGGGATGCTTCCCCAGCAAGATC 


P001220 


D 


- 


IM000997 


GGAGGAGGGGCACCTTCTCAGAGATC 


P001221 


D 




IM000998 


GATCTTAAAGCTAATAGGTGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGGTCAGTGG 

TAAAATTGTCTACCAAGCTCTAGGTTCACCC 

CTCACAGAGCCGGAGAGAAAAGGAGAAATC 

AACTCAAGTCAACCCAAACAAAACAAAGGAC 

TCAACA 


P001222 


R 
IM000999


GATCTGTTCCCAAATCCTCAGTTACTCTCTG 

GGAAATGGCTTCTGTATGTACACATGTTCTCT 

AGCTATGTAATAAAAGACCTCTCTTCCTTGG 

CAAAACTTAACTCTACCTTAGAAAACTCTGAT 

GAGTACTAGAAAGATGACATGTTCCACAAAC 

GTCTTAAGTGATTCAGGGTTCACAACAAAGA 

AGGAGATGCTATATTGTCTTTCATGACATAG 

CGTCTAAGTCCCATAGCATAACTTCTATAACA 

CACAAGTGGGT 


P001223 


D 




IM001000 


ACACTAGCTTCGAAACTTCTTAGTTGTCTGTC 

CCTGAGCCCTTTGTGGTACTTCCTCCTCAGA 

GCCCAGCTCCAGCAGTCCCCTTAGCGGCTG 

TTTTTAGCAACCACACCCTCTGACTGTGGGT 

TTGCTCTGCAGTGGCTTTAAGGTTTGAATAC 

GAAATGCCTTCCACAAACAGACACTACAGAA 

TCTTAGGTGTCGAGACAATGGGCATTTGAGA 

AGGAATTGGAACCTTCAGATC 


P001224 


D 


- 


IM001001 


GATCTAAAGGGAAACCCTTGTCTTTTTGAATC 

TGAGCCAGCACAATATTGTATTTCCTTCAATA 

CGTGGTGAATGTTGTATTAGCAACAATAAAT 

GGAAGCAGGGAATCTCTCATCTCATGAGTGA 

TATTTACAATGTCTGTCTGG AAACAAACG GC 

TAATCAAGTTAGTCACTTACTGTTCTTTAGAA 

AACACAGTACTTTGAAATGCATACCTAGCAG 

AGAATATAAAGTATTTACTGTTGGACTAGACT 

GGGCCCCCGGGTGTGAGGG 


P001225 


D 




IM001002 


GATCTATCTCATCCTGTTATAGCCGGAAACA 

TGATAGCAGGATTGGGCAACTCTCCAGTCCC 

TTTCTCTTGGGTAAAGTCTGAAAGCAAATCG 

CCCGGACCCATCTCCTGTCTCTGCAGCCTGT 

CCCAGTTGCCTCTGCCACTCACTAACTTCAC 

TCCTTAATTTAAAAAGCCAGCACATTTATTGA 

CCGTCT 


P001226 


C 




IM001003 


GCATGTCTCCAGACTCTCAGCTGCTTCCTGT 
CTGCTCCTGCTGGATGCTTCATGAAGATGGA 

TGTTCTATGTGCATGTGTGCACTTGCTGGAG 
CTTATGTGCACCACAAGCACGCAGGTGCACA 
CAGAAGCCAGAGATC 


P001227 


D 




IM001004 


GATCGAACACGCTCGGACTTGCTAAACGTTT 
CCCACACGGACAGTCACTGCCAA 


P001229 


K 


Nmyc
IM001005


GATCGTGAGTTCAAGACCAGCCTAAAATACA 

CAGTGAGCCTCTGTCTTTAAGAAACAAACAA 

ACAACAACAGCAAAAACAAAAATATTGCTCAA 

GACC CAATGTTCCTCGGACTATTTATAGGAA 

TCAGAGTTGCTGTTCTTCTCAGGGCATGCCA 

GTTAATTTGAAAGACAAGGTGTAGAGGCAAA 

GGAAAAGTGATTTTACTTGGATAACCACCTC 

ATGGAGCAGTCAGGGGAACTCTAGCCTCAA

AGCTCTTGCAGAAGTTATAT 


p001230 


D 




IM001006 


GTAGAAGC I I 1 1 I AGAAAIACGI I ICI IATCT 
ATCTATCCATCTATCCACCCATTATCATCTAT 
TATCTATATTTAACATCTATCTAAGTATCTGTT 
TATCTATCTACCTGTCTATACCTACCTATCTA 
CCTACCTACCTATAGCGATC 


P001233 


R 




IM001007 


GATCGTGCATGCATGGGTGTGTTTTGGGGA 
GAGGTTCTGT 


P001235 


D 




IM001008 


GTTACTATTCATCTGAGGTTCTC MM GTTGT 

ATTTGAACAGGAGGAAGGAACCAGGAGCTC 

AAGGATGTAGCTGGAAATGCTATAAAACTGG 

GATGCCCTAGAGAATCACACGGACAATCCTG 

CTAACCCATGGATTGTACACTCCAATATACAA 

GATAACATGTTTGTGCAGGCATGCCACCATG 

ATGTTCGATC 


P001239 


D 




IM001009 


GATCGACCGCAGATGAGGTCTATGCAGGAA 
AAACGATGTCTGGAATTTTATTAAAATTGCTC 
AGCAACTCACTGCCACGTATACTTGGAGAGC 
CACTTAGGGAT 


P001240 


K 


Myc 


IM001010 


CCAAGTATACGTGGCAGTGAGTTGCTGAGCA 
ATTTTAATAAAATTCCAGACATCGI 1 1 1 fCCT 
GCATAGACCTCATCTGCGGTCGATC 


P001242 


K 


Myc 


IM001011 


GATCGTAGAGAGATGGACCCAAATATCAGCC
AGAGAATTAGACCAGAAAATGGAACCAAAGT
ACCTGTCAGTCCAAGGATGTAGTGGCACTAC 


p001244 


D 




IM001012 


GTCCCCAAATGTAAACAAAACTATCAAAAGA 

AATTGGGCATGCCAGAATTTTGTTCTTCACAT 

TAAGGGAATTCTGAAATTGAAATCTTGCTAAG 

GGAAGGGTGGCTTGAGAATATTAACAGAATC 

CTAGGTTGAAGGAGCAGGAATAGAGGATC 


p001246 


D 
IM001013


CAGCTAGCCCATGGAGCTGCTGGGACACGA

GGCCGCAGGCTGAGCATAATGGGGAAGAGA 

TGGCAGATTCATTCACCCACTTGAGGAGACC 

ACAATTAGTCAGAGGCATGCTGGGCCTGGT 

CAGAGTGCTCAAATAAACATTCACAGGACCA 

AAGTAATAAGCATTGGTGTTACAGAGATAAAT 

CCTTTAGCAGGGACACGGGACCCCAGAAAA 

CCGGAAGGACATCGTTCCCATCATGAGAACA 

AGGACAGCAAACAGTCACTGAGGGTATACTA 

TGAACGCTGGATATATCATGAGCTCTGACCT 

AAATATTCTGAGTATTCCCCATGTTTGAATGG 

ACTGAATACTCACATTTTCTAAATGCTGAATA 

CTGAATTTTCATAGCAACCATCATAAGGCAT 

GGTGGCAGAATAATATCTCTCACTCAGAAAG 

CAAACTATTCTAAGTTGGGGATC 


p001247 


D 




IM001014 


GATCCCGTGGGGACTGAGCCTGCAGCTCAG 
TGGTAAAGCAGATGTCTAACGTGGTACAGGG 
TCCCAGATGAGATGACACAAGTACCTGTCAG 
TACTCCGGGAACACTGGGTGGGACTTTTATA 
TGTTTATTTGTATTCTTAA 


p001248 


D 




IM001015 


AGTCCATTGTGTACTGAGAGAGGAGTTAGGT 
TTAGAAAGCCTTCCTCAGATGTCCCTCAAAG 
AAGCTGCTACAACTGCCCTCATCCCAAGTTG 
CCAAGGATC 


P001249 


D 




IM001016 


AGATTGCGTGAGTTCTGATGCATGCTGGCCA 

TGATGTGAGGCAGGGGCAGTGGTTGGATTC 

GGAGTCAGAAAACTTTCCCGTCTACTGCCGT 

AATTCCCAGCTAAATTCCTATCCTCGTTGTAG 

CTGTTGGTGAGGATC 


p001250 


D 




IM001017 


GATC CTTCCGAATCTGCCATTTATTGAATATT 
TAAAACACACCTCACTGCAGACTAAACACAT 
TGCAAGCACTGGGAGCAGAGGTGGCTAGTG 
AGCACCACTCTAGATGGTCCTTC 


P001253 


D 




IM001018 


GATCCTCCTGCGTCTACCTTCGGGTGGGATT 

GCAGGCATGCACCAC CATGCTTGGCTTTGTG 

TGGTACTGGACATTGAACCCAGAACTCTTTG 

AGCACTAGGCAAGCACATCCTGAACACCAGT 

AAAACATTTTCAAAGAGAAAAGAAAATTTAAA 

ACATACACCTATCTACATCCATTTCCACCATG 

TTAGTAAACCAGGGACAI 1 1 IGAAGTGTGGT 

CTTTATAAAAACACCCGGGTGCTTATCTCCC 

ACGCTCT 


P001254 


R 




IM001019 


CCAGCGGTGCTCACTACTGCATGTAACCAGC 
TCCAGGATC 


P001255 


D 
IM001020


GTCTCAAAGAACAAAAATAAAAGAGGAAATT 

AGTAACGAGTCCTGAGAGATAGAAGAGTATT 

CAGCCTGGGACCAGAGCTCTGTCTTACAGTC 

TTGCCATTCTGTGGGGCCTGGGACACAGCAT 

CCTTGGTCTTTAGAATGCCATAGGCCTCCTG 

AGGGAGCCTTTTCTGTAGGCACTTCTCCCAC 

ATTCTTGGATGGATGCGATTTATTCTGTGTCA 

GGGGACTAGGGTGCTGGATGTGTGGGTCGA 

ATGACTGTTGTTCTGTCACTTGGGAATTTGG 

GATAGGAGAATTCTGAGTGCAAGGCTAGTCT 

GCACTTGAACGTACATATCGGG 1 1 I TAAGCC 

AGCCTCTGAGCTACCACAGTGAGACTCTCTC 

TTAACTAAAATCAACATAAATAGTCTTAGTAT 

GGAGAGGTTAGGGGATC 


P001257 


C 




IM001021 


CGTTTTCCTCGGAAAATGTGAAAAGAAGAAG 

CACGAGACGAAACCCCCTCGAGAATGAGAA 

AATTAAATCTAGAACCCAAATGGCGTCCAAC

AAGAACATTAGCTCTTGAAAATGAATATTGCG 

CCTGCGCAGCCACCGCCCGGCCAGCTGCTC 

AACTGCAGCTAGAGCCCGACCCCAAGCGAT 

C 


P001260 


C




IM001022 


GTGTCACATGTATGAACAGCATCACATGGTA
TGAATGGTATCATATGGTATGACGTGAATGT 
GTGCACCGGCACTGATC 


P001262 


D 


— 


IM001023 


ATACCACCCACTCC CTTAAGAAATGATC 


P001263 


D 




IM001024 


GACTGATATTAGTAGGTTGTTCTCTAAGGGC 

CGTGAAA I 1 1 1 IAGCTAGAAGTTCTTGCTTTC 

ATTAACAGTGCCAAGTATGAGTTCCATCTCAT 

GGGGTGGGTCTTGAATACAATCAGAAGGTG

GTGAGTTATCGCCATAACATCTGTGCCGCTA 

TTGTACCAGTGGACATAGTTGCCAGGCAGG 

CCATTACTGTAGCTCTTAGGTCATTCCTGAA 

GCTCTCTGGGGTCTGTTAGGTGAGACTGATG 

ATAACTCTTCTCTTCCGTTAGTGTACACAGCA 

CCTTTTAGCACTATGAAAGCGAGG CAGTATT 

GATC 


p001264 


D 




IM001025 


GTTCCGATGTTTGTATCTCGTTTGAATTATCC 

ATCAGTTGATTAAGTTGATGGTCATCTAGGCT 

GATTCCCCTACATGGCCATCTCAATATTGCTT 

CTTTAATAAGACCTGGACAATTAACAGCACC 

AGTTGACATGCCAACTTGGATTGGGGGAGG 

GGTCTTAAAGGGCCCCGCCCTTAGATGAAG 

AGCTATACGCAATTAATGACTGTCAGAAAGG 

GAGAATGGCTTTCCCAGAGATGAACCCCCTA 

ATGGATTAC CCAGTACC AAGTGATC 


P001265 


A 


Rad52 
IM001026


ATTCAACCTATGGGGCCGTTAGACCCCTGGT 

CTTGGGTGGGGTGGATATGTTATTCTTTTTTG 

CTGTGGTGGCAGCAATTTTGTTTGCTTTCTTG 

I I I II I GATACAGTTTCTCGTCATGTATTCCT 

GGTTGCCTGGAATTCACTTCTATAGACCAGA 

ATGGCCTCAAATTTACAGTGAACCCCCTGCC 

TCTGGCTTCAGATTACTGGAATTACAGGTTT 

GTGCTATCTCACTAGTTGGTGTGTGATC 


p001266 


C




IM001027 


GATCAAGTCCCCAGTTAAATGCTTTCTTTGAT 

AGGTTGCCTTGGTGATGTCTCTTCATAGTAAT 

AGAAAAGCAACCTAAGACAAGAGGAGAGAG 

TGGGTTTAAGAACGAGGAGAGAGAGGAACT 

CAGAGGGTCCTGGAGGTCCCGGGAA 


P001267 


C




IM001028 


CTCACACATACATTCATACATACACACACATA 

TATACATACACACACTTGCATACACACAGCA

CACACTCACACACAGAGACACACAGACACAC 

AGACACACACACAGAGGAACCCAAAGGATT 

GGAAGAATAATTTCCCGTGCTCAGCGGGAAA 

GTTTACCAGAAAGACAAGTGGTCATGTGGGA 

TGATC 


p001270 


C




IM001029 


GATCATCACCAGTGTAGTGTTGGCTTTAACG 
GTGCACGCCTTTAATCCTAGCACTTGGGAGG 
TGGAAACAGGTAGGTGTGCTTACTTCAGTGA
GTGAATTCCAGGCCAGGCAGGGATACAGAG 
TGAGAACCTGTTATCTAAATAAATAAATAAA 


P001271 


C




IM001030 


CACCCACGGCI IGCI ICI 1 1 ICICIAIGIGIA 
ATTGAAGCACATACCCGGTGGGAGCCATGTA 
AAGCCTGTGTCCATGATC 


P001272 


D 




IM001031 


GATCATGTGTTAATGAAACTGTCAGGGGTTG 

GGTAAGATGGCTCAGTAGGTAAAGGCACTTG 

CCTCCTAGCCTGGAGACCTGAGGTTCCTCCT 

GGGGCCCACAGGGAAAAGGAGATAACCAGC 

TCTCTGTCCTCTGACCTCCTGGGCCCCTCCC 

TCACAAACAAACAAACAAACACACACACAAA 

CGACCAGACCATTTCCCACAGTAGCTGTGGT 

GCGTTACACTGTAACGGGCACCATGTGAGG 

GTTTGGGCTTTATCACATCTCCGCTAGTCATA 

CTTGGTGTTTCCTGCGTCTTGCTTACAGTTGT 

TCTAATGG GTGGGCGGTGATATCGAATTGTG 

GTTTTAGCATGTATTTCCTGTGCTCTGCTAAG 

ACCACTTACAATTACAG 


p001274 


R 
IM001032


CCTTAACGCTCCCTTGATGTCCACTCCCGTT 

TTCTCTGCAGCGATTTATTGCTTAGTCTATCT 

ATAAGGTGTATGCAAGCTGCAAAGTCAAGTA 

TTTCCTTTGTACTTGAGCAAGTCTCCTAAGTA 

TTATGCTTCATAACGTTGTGATATGCTTGAGC 

AAATTTGAGTCTATTTCATAATTAAGCCACTG 

TTCTGATAAAAGACCCTAGAGTGCTATATCT 

GATC 


P001277 


D 


- 


IM001033 


AAAAGAGTGTCAGATGTCAGAACTGACTAGC 
TGGGCTGACACTGAGGAATGAAGGTTGGGG 
ATATATGC ACCTC CTGAAAACAGGAAGCCTT 
TTGTTGGTTGATC 


P001279 


D 




IM001034 


GATCAACCTTAGTACACAGCAGAGTGTTTTC 
TGGGAAGCTCATGG AGACC C ACTTTTGTCAT 
CCCATAGAGGTTACTACAAATCTGAGCATGA 
GAATAACTACTTG CTGTTTAATAC AAAGAACC 
ATTAGCAGTCAATGCCCCAAGTTCTAAGGGC 
ACAGACTTCATACGAGAAAAAAAAACAAAGC 
AAAAC AAAAACTATC AC ATG CTACTATCTGTA 
CTGGGGAATGCATACAA 1 1 1 IGTAGGTAT 


P001281 


D 




IM001035 


GATCAGTAGAGAGCAGAGGGGTCTATGAGG 

GAGGTAGAGCAGCCTGGGAGGCCTGAGGAA 

GGAGGGACAAGGGCAGAGTCTTGGTCACTC 

TTTGGTCTAATTGCCTTCAGAAGGCTTGCAG 

ACTCTGGTTTGGAGTTCCAGGTGGGTGGCT 

G 


P001282 


C 




IM001036 


CAAGTAGGGTTTGTGTGTGTGTGTGTGTGTG 
TAGC CAGTGTCTTTCTCAATCACTCTCCACCT 
TAATAI 1 1 1 1 1 1 1 1 GAGACAGAATCTCTCACT 
GAACCTGTATGCTGTCAATTTGTCATGGCTG 
ACTGGCCAAGGAGCCCGAAGAATTTATCTCT 
ATGCTCAATCCAACCCCCAGATC 


P001285 


R 
IM001037


GATCACATGGACCGATTGCC GCGGGACATC 

GCACAGGAGCGTATGCACCACGATATCGTG 

CGGCTTTTGGATGAGTACAACCTGGTGCGCA 

GCCCACAGCTGCATGGCACTGCCCTGGGTG 

GCACACCCACTCTGTCTCCCACACTCTGCTC 

GCCCAATGGCTACCTGGGCAATCTCAAGTCC 

GCCACACAGGGCAAGAAGGCCCGCAAGCCC 

AGCACCAAAGGGCTGGCTTGTGGTAGCAAG 

GAAGCTAAGGACCTCAAGGCACGGAGGAAG 

AAGTCCCAGGATGGCAAGGGCTGCCTGTTG 

G ACAGCTCGAGC ATGCTGTCGC CTGTGGAC 

TCCCTCGAGTCACCCCATGGCTACTTGTCAG 

ATGTGGCCTCGCCACCCCTCCTCCCCTCCC 

CATTCCAGCAGTCTCCATCCATGCCTCTCAG 

CCACCTGCCTGGTATGCCTGACACTCACCTG 

GGCATCAGCCACTTGAATGTGGCAGCCAAG

CCTGAGATGGCAGCACTGG CTGG AGGTAGC 

CGGTTGGCCTTTGAGCCACCCCCGCCACGC 

CTCTCCCACCTGCCTGTAGCCTCCAGTGCCA 

GCACAGTG CTGAGTACCAATGG C 


P001289 


K 


Notch1


IM001038


GATCTAACTCAGGCTGTTCAGCTTGGCCAAC
AAGCTCAAATATCCATTCCGCTGTCACATCG
GGCCCCATGTGATGCTTTATATACTAAATAG
AACAAGCAAATTGATACTAGATGGGACAGTC 
TGCTTACCCAGTTTGGTGTTTGGTGGGGGAG 
GTGAGACATATCCCACAGTCCCAGAGCAACT 
GTCACTGCAGGGTCCCAGGGGAGGAGCCAG 
GTGTGAAGCTG GCAGTGTGTGAGGTACCCT 
GGGGAAAATGAATGGTTACT 


p001292 


D 




IM001039 


AGGCCTGGTAGTGACCAGCAAGTACTGAAC 
GCTCGCTCTATGC CAG ACACAGACCCTCTTC 
TTC CTTCGTCTTATCCTATTATCCATACTGAA 
CAGACAAGGAAATGAAGGCTTAGATGAGTCA 
CCCGACTTGCTGAGATC 


P001293 


D 




IM001040


AGTGGGGCCTGAAAATCACATCTGGGCAAA 

CCCTGAGGCCTGCCAAGTCCTCATCAGAGG 

GATGCCCTCTTCATCC CAG GTGCTTTCTGAC 

TATAAAATAAGGTGAATACTACCTCC CCTGA 

GGTTACACCTCCAGGGTTAAGCTGGTTAGAG 

AACCCAGGGACACACTGGGAAACAGCCCAC 

AACAGCAGGAGCTGGAGCACTCACCCACGG 

ATGTCCATGGGGTCCAGCTCCCTGCGCTGG 

CGCCCACCACTGGTACCAGGAAGCAGTGAA 

GAGGTGGCCCAACCCACTGTAGAGCGCTTG 

ATTGGGTGCTTGCGCAGCTCTTCCTCGTGGC 

CATAGTACGGGAAGATC 


p001297 


K 


Notch1
IM001041


AGTGGAACCAGATTCCTCCTACGCTTTGCAC 

TCCACTTTCGTTTTCTCTTCTGTACCATTCTA 

ATGGAGGCCAGAGTAGCAACTGTATAGACAA 

ATCAAATCGTTTACTCTTC C AGTCTTGCCCCT 

TAACAGTCTTTCCTTTGTTCTTCCTCTTAGCC 

TCATTTTCTCCTTTCTCAGATC 


P001298 


B 


AI604147 


IM001042 


GATCTTCTG CTTCATCTGAGTAGGCTTAGAC 

TGGTTTGTATTATTATTATTATTACTTGTTGTT 

GTTGTTATTTTGGTGGGAGTAGTAGTAGCAG 

TAGGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

GTGTGTAGATGTCACAGCATGTATATGGAGG 

CCAGAGAACAGCTTCTAGCGGTTGCTTCTCT 

CCTTCCTTCCACTGTGGTCCAGGGAATAGAA 

CTC AGGTCATCAGGCTGGG C AGCTGTCACC 

TTTAATGCTCTGAGTTATCTCACCAACGTTAA 

TAAAAGGC I I I I CAAACAGCAGTTTGGGCTG 

GGCCTGGTTGTGCAGACCTGGAATTGCAGC 

TTCTTAGGATGCTGAGGCAGGAGGACTGGA 

AGCTCAAGTTGTGTCGG GG AAACTTAGTAAG 

TCCCTATTCTCGTCCCGCACGCCCCCAAAAA 

GCCAAGACCAAGACCAAGCAGTTTGGTACA 

GCAGAAAAAGCACGAGAGTCTCCTCCTCCTC 

CTG CTCCTCTTTAATG ATGC AG AACCC 


P001300 


R 




IM001043 


GATCTGTGCATTATTCTGTTGGAAATGTGACA 

AGATTCTGTTGAGAATCTCATACTCTATGAAC 

TCTTAAAAAAAAAAGGTTTCTGCTGTTTTGAG 

ACAAAATTACTTATAAAGGTTTATGATGTAGT 

TAAGGCCCTGAATGTCCCCCAAAGACATGTG 

TGTTGAGGGTTTGGTCTCCACTCCGTGGTCT 

TTTGGGAGGTGTTTTATGTTAGCTGGTGAGG 

CATAGTGGCAGGGGAGGAGAGTTGGGTCAT 

AGTCCTTTTGAAGAGGCTATTCAGGCTCTGG 

TGCCTAA 


P001303 


D 


- 


IM001044 


GATCTGACTGTGATAGGAGGGTCCTGGGGC 
CACCCTGACATAGGCCTGGTCTATGAATGCT 
CTCATGGACTGGGCCTGTTTGTCA 


P001305 


D 




IM001045 


CTGCCTCTCTCCCTGGTCCCTCTCTGAGGTT 

CTGGACCCTCAAAAGGCCCTTTCCCACCCCA 

GCCTTCAGGCCTGTAACCCAGCCTCGGTTTC 

TCTCCCATTGCCAAAGCACAATGGCTGTTAT 

AATTAACGGATTATCTCAGCGCGACAGCTGC 

PrrPrTTTPAA A ATTAP/ITT^AATAArAAOA 

oUUUO I 1 1 oAAAA 1 I Aoo 1 1 bAA 1 AAUAAoA 

TC 


P001306 


C 




IM001046 


GATCTTGGACCACCACGTCAAGCCTCTTGTA 
CATTTCTTTGAAAAACAAAGCTTGGTTCCCCC 
TAGTCACCACGGTGAAAAAAACCCAGGACAG 
TAAAGGTCCCAA 


P001307 


D 
SEQUENCE


CLONE 


CLASS. 


GENE 


IM001047 


TTAGTACCTCTGGTGGAATCACCATGCCTGA 

CCTAAAGCTTTACTACAGAGCAATTGTGATAA 

AAACTGCATGGTACTGGTATAGTGACAGACA 

AGTAGACCAATGGAATAGAATTGAAGACCCA 

GAAATGAACCC AC ATACCTATGGTC ATC GAT 

C 


p001308 


R 




IM001048 


GATCGCACCGATTGCCAGTATAGTACCTAGA 
GTGTCAAGTTGGCCTCTCAGGGAAGAGAGA 
ACATGTATTAGGGTAAGACGCAAGCCCCAGT 
AAAAACATGTGAG 


P001311 


D 


- 


IM001049 


GATCGCTTCACCAAGTGTGAACTGTTGGTAG 

GGACAGAGCAGACCACAAGCCCCTCTTTGC 

ATTTACATGGGGGCGTCCTAGTGTAGGTGGC 

TAGGGATGGTGGACAGGAGAGGAGGGAAGA 

CAGTATCACATAAGAACAATAGTGGAGGGCA 

GGGGAGGAAGCCTTCTCATGGCTGGGGTGA 

AGTCACTTCCGTAGCCAGAGCTGACTGAGAA 

TATCACTGCTTTCCTAGTAAGGAAACACCGG 

AAGTCGGAAGATGATAAACGCGAAACTCACT 

ACATCATAGACACCATTCTGTCTTCATCAACA 

GAGAAATTTATAA 


p001313 


D 




IM001050 


GATCGTCCACTTCTGTGTTTGCTAGGCCCCG 
GCATAGTCTCACAGGAGAGAGCTATATCTGG 
GTCCTTTCAGCAAAATCTTGCTAGTGTATGCA 
ATGGTG 


P001316 


R 


-- 


IM001051 


AGGGTACAGCGAAGCTTGAAAAAAGCAAGG 

AGTGCTCTGGGACCGGGAGTGATGGAGAAA 

GTCTGAAGCCCCTTTGCACACCCCTACAATG 

GGTTTGCGCCAAGAGAGGCGCCGGCAACTC 

TACGCGGCGTGGGGCTCTCCCCAGCGCTCT 

AGGTTCTACTGTGCTGAGCCACACTAGTTTC 

TCTCCCTAGACCTGAAGAGACCCCAGAAGTC 

TGAGAGTCCCTTTGGTTCTCCATCTCTCACC 

ACCCCCCACTCTCGTGCTTTAACTCTGAGGA 

GGGCCACTCAAGTTCATTCATAAGAACAAGG 

GCTTTGCTCTTAAAGGAGCCGCATACCGAAA 

GAGCTCCGCGTGTCTCGACATCCTCTCTCTC 
CGATC 


P001317 


D 




IM001052 


ATCTCAGGAAACTCCTAGCAGCTTTAGTACG 
CATCGTGCTGTTTCCAGCTGTCGGTATTTTA 
CACAGGTTTTGAGCGATC 


P001318 


D 
IM001053 


CCTTCAGGATTACTTTGGATGATTCATTAGAG 

AATCTTGTCTTTAGACTATAAAGCACTTGTTG 

AACAAGGTTACAATGTAGCAAG CAACCTTGT 

TTTG GAATGTATTTTGCTACATTGTGCTCTTC 

CCTGGTCTGGTGCTTTCATTTCACATATTTTG 

CTCTTAATAGAAGTAGGGTTCAGTGCTGGGG 

ATTTCATTTGCTGTTTTCTCCATTGACCTCTT 

GAGCTGAAGTTATTCTTATTAGAAAGTCAGG 

GTAGGCGATC 


p001319 


D 




IM001054 


CCAGCAGGCAGCGAGACGCATTTTCGCGTG 

GCGGTGGTGAGCTCTCGTTTCGAGGGGATG 

AGCCCCTTGCAACGGCACCGGTTGGTCCAC 

GAGGCACTGTCGGAGGAGCTGGCTGGACCG 

GTACATGCCCTGGCCATC CAGGCGAAGACC 

CCCGCCCAGTGGAGAGAAAACCCACAGTTG 

GACATTAGTCCCCCCTGCCTAGGTGGGAGC 

AAGAAAACTCGAGGGACCTCTTAATAAATAC 

CTGGATTGGGAGAACGATC 


P001321 


B 


Mm.104531


IM001055 


GTTTTTCCTGCATAGACCTCATCTGCGGTCG 
ATC 


P001322 


K 


Myc 


IM001056 


AAACTAGGAAAGGGTATAG C ATTTG AAATGT 

AAATAAAGAAAATATCTAATTTAAAAACAAAA 

AAGAAAGACAAAGGAAAATTAAAAAAAAAAAA 

AAAAGAAACAAAAGCCACTGCAGGACTGCCC 

AACAGTCTACTGAAAACTGTGAGCCTTATTC 

CTAG ATGAGCCTCTGATG CCTCC ACTTACAA 

GCTACCTTCACTCCTCCATCTATCTCCTTTTG 

TTATGTCCCGCGATC 


p001324 


R 




IM001057 


GATCGGACTCGAAGAGCAGAAGAAACAAAA 
CTCAAAGCAGG G ATTAGGTC AAAATTAAAAA 
GGGTTTGCACACAAAAGGAAACCATCCGAAG 
AGACAACCTACAAAGTGAGAGAAACTTGTTT 
TGAAC 


P001325 


D 




IM001058 


GTCTGAGAAATTGTCTTTAATGTAGTGACTGT 
GGAGCCTTGCAGGGATACCCACGATGGGGG 
TGTCATTCATATGTCACTGCACCTGGAAGAC 
CGATC 


P001326 


D 


— 


IM001059 


GATCGCACAGC CTGCTTTCTCAACAGTAGGT 
AGGACCAACAGCCTAGGTGGCACCACCCAC 
AGTGAGCTGGGCCTTCCACATCAATCATCAA 
TCAAGAAAAATAGCACAAAACCCTTTCCCGA 
AGGCCAATCTGCTGGAGGC A 1 1 1 1CTCAGTT 
GAGATTCCCTCTTCCCAAATGACTGCATAAA 
ACTTGTGTCATGTTGAC ATG AAACTAG CCAG 
CACAGGGTGT 


P001327 


K 


Pvt1 
IM001060 


GATCGGGTAATTTAGTAATAGTTCATGATATT 
CATTACTCGGCGTAAATCAGGAAAAACATTT 
CTAGATGAATGTGGTATTCTCAGTGCACAGT 
TTGTTTAGTTTAGAAAACAAAT 


P001328 


D 


- 


IM001061 


GATCG AGGAG GGGAAGTCCTTCCTTCCTTCC 
TTCCTTCCTTC 


P001329 


R 


—


IM001062 


GATCGGGGGTTCAAGGTCCTCCTCGGGGTA 
CCTATTAGGAGGGCAGCCCAGGCTACGTGA 
GACTCTGTCTCAATAAAAATAAAAATAAAAAG 
CTGGGTGGTGGTGGCGCACGC 


P001330 


R 





IM001063 


GATCGAC CTGCCTCTGTCTTAAGCAAGAAG G 
GAGATAGATATGCATAGTATTTAGTGTAATGA 
AAGTTACGTTGTATTACGCTGAGGTTTATCAC 
A 


P001331 


D 


— 


IM001064 


ATCTAAGTAGTATAATGTTTAAGACGATC 


P001332 


D 




IM001065 


G ATC GTCGTCTAACTTAGCTGGCTTTATAGT 

GATATAACAAAATATTAGAGGATGCTTTGGTT 

GAAAAAGAAGTTTATTTGCATCACAGTTC 


P001333 


D 


— 


IM001066 


GATCGAACACGCTCGGACTTGCTAAACGTTT 
CC 


P001334 


K 


Nmyc 


IM001067 


GATC GTC ATC ATTTTTATAACAGTAGTGAGG A 
GATGTCCCCTGGGGCCGCCCTGGCTCTGGA 
GAGGGAAGCCACATGCTCCAAGGGGCTATG 
GTGAGGACCACAGCCTTTACATTTGGCTT 


p001338 


D 




IM001068 


GATCATGCACTGTCTGGGATAGTGATGGGCT 
GTGTCCTTTGTTGGCCAAGAGGAAGTGGCAA 
AAGGCAAAGTTGCTGTTGGCTCCAGGAGTCA 
GTCTGGGGACGGGGCTGAGATGCTGTGGGA 
CAGACTCTGGAAAGGGCAG 


P001339 


D 


-


IM001069 


GATCGTGGCCACTGAGAGACCTTCTTCTGGC 
CACCAG ATG CACACAGCTGC ATG AAC ATCTG 
CATACACATTTAACACATACAAAGTTGAAGAG 
AAGCACGTGTGTCTTGTGGTCTGACCACTTC 
CTG GG C ACC AC CAAG CTGCTCTGAC AACGG 
ATTCCCACTGGGTTCGGCCATCTTGCTTCCT 
p p pptp a a a fSTTTrsp p p at(z tp rTr*T^Tr*rr 

TTCATAGCCACAGCCTTGCCCAAGATAAGAT 
ACATCCAACTGTACAGTGCTCCAT 


P001341 


D 




IM001070 


GATCGTACCAGGAGCTCCAAGCGTACCCCT 
GATGCTACAACCTCATTCCTGAGCCTTGATT 
CTGTGGACTCTAG 


P001342 


C 
IM001071 


GAACAAGGAAGGAAATAAAGAATAAAGGACA 

TCTGACACTACCAAAGTTAGGTCAGGATGTG 

TCTTACAGATGGCCACTCAACAGCCTATAGA 

AAbOACCGCACAGACUAGCACGGTC II 1 1 IC 

TCCCAGGTGTCTCTGAGGTACTGCTTTCTTT 

CCAGGGATC 


p001344 


D




IM001072 


GATCCCGAGTCCTTTCATCCTGTGGTTCTATT 
GTCCCCTAGGGCTTCAGCAGGCAGAAGGAG 
AAAGACACTGTAGAGCAGC CCCCAAA 


P001345 


D 


- 


IM001073 


GATC CTGGGATTTTCTGGG CAATTGGAGGC C 

ACAATTTAGATAGTTTCCGGAATCGATGTCC 

CTTAAAGACCAGCGCCTGGACTCTACTGAGT 

AAACTCCCATTTCAACTTCCTCCTCTTCCTCT 

ATTTGAACAACGTGTATCATTAAATTATAAAA 

TTGTTGTTGTTGTTGTTGTTGTTTCAAAAATTA 

ACTTTATTGGGGGAGGGGCAGTTGCCCGAG 

GACAACTTGTGAGAACCAGGTTTTGCCTTCC 

ACACTTAGGGGTCC CTGGAATGGAACTTATG 

T 


p001346


R 




IM001074 


AGAGGAGAAATGGGGGTGCGAGAGGACAAA 
GTCTGTGCCCCACAGCGCTGGGGCCAGAGC 
CCAGGAGGGCCTCATGGGAGAGGTTGCCTG 
AAGGCAGTAAGAGAGGCAGAGGATGCTTGG 
GCCAGAGAGGTTCCCCACAATTGCTTGGATC 


P001348 


D 


- 


IM001075 


GATCCCAAACAACTGGAACAGGGGTTATCCC 
AAAAGCTGTTGCCTG 


p001349 


D 




IM001076 


GATCC AACTC CTCTTCACAAAGAG ACTATGT 
GCAGGATGGAGAAGAAGATGTATCCAAGCAT 
ATCCTGTGAAATTTATGTCAATGCTGTGAAAT 
TTGTCCCAGCACTCACAATCCAGATTTCTGC 
Nil 1 AGGTGGCTTTTTCTATTTCATTTCTTCT 
GGCTTC ATAGAAGTTTG AG GTG AC ATTTTTAA 
GACCTGTGCCACTAAAATTTCAGACCCTATTT 
G 


p001350 


B 


Mm.123802


IM001077 


GATCGGTTAGTTTG ACCAG CCATACTATAAC 
TTTAGTGCAACCCTTTACTTGGTGGGTGGTA 
CTAGGAATTAAACCC AGG AC CTTCACATATA 
CTACTATCATTGAGTTACATTTCTAGCCCTTT 
TA ACC AATTTPrPTTTAA rCCTTTTT ATP rTTT 

G 


P001351 


D 




IM001078 


CTCAAGATTCTGTTGTCTGAGAATCTCTCCCT 
CTGCTTGGGGACC CATTTATAATGAGGTGAT 
ACTTCATCTGAAGTAATGGCCAGGCCACGGT 
GTGAGACTCTTGAATGTCACATGCTGGATC 


P001352 


D 
Breast



TABLE 2 



SAGRES #


SEQUENCE 


SEQ ID # 


CLASS. 


GENE 


IM000127 


CATGTGAGACTTGTTAATTTAGATTTATT 
CTGTAGTGTTTTTGATATGAGTATAAATA 
AGACAATTAAATTCTATATTAGAAAGTGG 
CTTTTTACATTGAATATGCTTTCAGGATA 
TGCGTGAGAATTTGGCGATGTGTAATC 


1 


D 


-


IM000128 


CCTTACTGCAGAGATGACTCGGCCAACGG 
CTNCGAGCTCCTGACCACTTCCTCAGGTT 
TGGTTTTGTTAGTTTTTTCTCACAGCAAT 
czncz a ar:rATA aTra atapa APTTPPPAGA 

ATGCGACCTGTGACAAGACCAATGAGCAG 
ACTCAAGGCTGGGCACATAAAAGCACCAA 
AAAAAAAAAAAAATTCCCTTGCAATTATT 
GTTCATG 


2 


D 




IM000129 


GCTGCTCATCACCAAAGGAAGTCAGGACT 
GGAACTCAAGCAGGTCAGGAAGCAGGAGT 
TGATGCAGAGGCCATG 


3 


R 




IM000130


CATGGCAAGATGGAGACTTTGTCTACCAG 


4


K


Fgf3/Fgf4


IM000131 


GTGAAAGGGCAGAAATAATTCCTGAAGGT
TGTCCTCTGCCTTCTACATG 


5 


C 




IM000132 


CATGACTATGTTTCTTTTAGGTATATCTG 
AATAGTATGGATCTAAATGATGAAGTTAC 
ACCATTTT CTACAAATGGGCACAGAACAC 
AGGGCATAGATACAAATGGCAAGGTGAAC 
CCAGATCTCTGTGCTTATCTGCAATATAA 
CAACACTAAGAAATATTAGGTCTCTCTGT 
GGTTTTCCTTAAATCTA 


6 


D 


- 


IM000133 


GTATTTC CTGTCAGAGGAAAAGAGTTTTC 
AAAAAACTTTTAAAATTTTTATTTGTTAG 
CCTGGACCAGTTTCATAGCAACCTGTCAT 
CCATATCCTCAGATTCACTTATGAGTTTG 
TCTGCCCATTAAGATCTTTAAAATGGTTC 
TAACAGCTTACTTCATTGTTCATTAGTAA 
AGGGTTTATATCTACACTTTGATATTTGC 
TTACTCCATACATG 


7 


D 




IM000134 


CATGAGATGAAAAAGAACCTTTTGGACTT 
GAATTTTGTTGCTTCAAATGCGTACTGCA 
GTTGATGGAAATT 


8 


D 




IM000135 


AGGGTCCCTTCAACTTCCTCAGAGCCAAG 
GCTGACTTACTACCGTTCCCCAAGATCTC 
ATG 


9 


D 




IM000136


CATGCCTCTGGAAAGTACCTTAAACATAG 
AATCCCCTCCCTAGTG 


10 


K 


Myb 




WO 03/008583 



PCT/US01/51291



IM000137 


CCAGATCCCATTAACAGATGGTTGTGAGT 
CACCATG 


11 


K 


Wnt1 


IM000138 


CATGACTTCTTTCATTTCTTCTGTGTGTC 
TGTCTTCCTGTGTTTGCCTGCCCCTCTCT 
TTCTCTTCTAACAGCCCCCTTGAACCAAC 
TGATGCGCTGTCTTCGGAAATACCAATCC 
CGGACTCCCAGCCCCCTCCTCCATTCTGT 
CCCCAGT 


12 


K 


Braf 


IM000139 


CATGGGAATGTAATGTATTAATGAATATT 
ATATAAAAGAGGCTAAATAG CTTGGCTTT 
AATTTCTCACTTTGCCTACTCAATTGAGA 
AGTTTATGGATCACCAAAAGT 


13 


D 




IM000140 


CATGTCCTTATTCTAGGAAGCCCCCTTTT 
TTACCCCTGC CTCTGAGAGAAACAG 


14 


D




IM000141


CATGAACACCCAAATCCATATGAATACAC
ACATAAAATATTTTATTTTCTCTATAATT 
TATGCCCACC 


15 


D 


- 


IM000142 


GAAAGCATTGAAATATACTGGCCTTATTA 
ATGG CACATG 


16 


D 




IM000143 


GATGTCAGTGGTACTGAAACTGAGAAACT 
GATGATAGAGCCAGTAAAAATACTGAAAG 
TGCCTGTTTTGAGAGTTTATATTTTACAA 
TACTTTAAT AT CTAACT AC ACACACAT AC 
ACCTGAAAAGGGCTCAGAATACACAGGCC 
TGAGATGGCTCTCAAGAACCAGCCTC 


17 


D 


— 


IM000144 


GGCCTTCCACTGCTCAAAGCTCAGACTGC 
AGAAAAGGTTGATAGCCTCCCAGGGGCAA 
TGACACCCTTTCTGCTTGAGCTTCCCCCC 
CCCCCCTCTCAGGATGTAGTCATG 


18 


K 


Wnt1 


IM000145 


CATGCCAGTC CACATCTGCTTCTATGACA 
AATGCCACATCCCAACGACAAACTCACTC 
ATTCTTCCTGTATCAATTTACGCATACAC 
ATAATACTTTTGCTCAAGGTACATTCATA 
TTTCCGGCAAACAGACAGCTATAG 


19 


D 




IM000146 


CATGTCACTCACTTGGAGAAAGAGTTCTA 
ATTATTTATCACGGCATTTTT CACAACTA 
TAGAAATAAAGTTAATTTCTTTGGAAATA 
AAGTTGAAGTTGTAATTTCCAGATGGGCT 
CAGGTTGCTGTT 


20 


B 


Mm.60552


IM000147 


CTCCTCCTAAAAGAAAAAAGGAAAAGAAA 
AGTTAAACCTGCAACAGCAT CAGCAGAGC 
TCACCCCTCCTCACCTGCAGCCCTGGTTG 
CCTCTCTTCCTTTCATG 


21 


D 





IM000148 


GAAAACACTGTTCTGGGTTCAGGGGTTAC 
TTAGCCTTGGAATCAGAGTCTACCCAGAG 
TCTACCTGCTTCTACCCAAAGCAGGTGGA 
AGAAGCTGCCCAGGACGGGGCTCAGAGTC 
TACATTTGAACTCCCTGTGCCAAGAAGTC 
TGGATAGAGTATAGTGTCTGTATATTCTA 
AACTTTCTGGAACAACCCCTGCTTACAAT 
ACTCTTTCCAACTCTCAGGCCATG 


22 


D 




IM000149


ACCTCTGTGCCAGCTTCTCGGACATTTAA 
CAACTCTGGATCATG 


23 


K 


Fgf3/Fgf4


IM000150


CTGGCAGTAACACACTTAAACTGCTAGCA 
CCTGGGAAGTGGAAATAAGATCAGGAGCT
CAATCAAGGTCATCCTCAGCTAAACAAGA 
CCCCCCCCAAAAAAAAAGAAGAAGATGGC 
CTAGAAAGAGAACTCAGCAGCTGCTGATC 
TTACAGATGACTAGAGTTTGGTTACCAGC 
ACCCACATG 


24 


D 




IM000151 


CATGCCTGGTCCCTGCTGAGTGCAGAAGA 
GGGTGTCAGATTCCTTGGAACTGGAGTTA
TATACAGTCGTGTGTCACTGTGGGTGCTG
GGAACTGAACCTGTGTCCTCTGCAAAAAC 
AAGAGGTCTTGGTTGTTGTTGTTTTGTTT 
G AAACAGGGTTTCTCTATGTGGCC CTG 


25 


C 




IM000152 


GCAGGAGCCCTTGTGCAGGCCACAACCTG 
CACAGCTGTACAAGGCCTGCCTGACTGCC 
TGAACAGATGTGTGGGATCTTGCCCCCCT 
TGTGCAGGCGTACAGATGCAGACTGCTCA 
GAGACACACATG 


26 


K 


Fgf3/Fgf4 


IM000153 


CATGGGCTAGACCTACACTGAGTTGTGCT 
AAAGAAGTGAC 


27 


D 




IM000154 


CATGTCCTCCACAGCTGAGCACCCTCAAC 
TGTCTCCCAGGGCCTCTGTTCTATCCAGG 
GTCTGCAGGGTCTCTGCCCCACGCCTAGC 
CCCTGAGAAATCTTAAGCAGTCTGAAAAC 
TACGCCACTGAACTGCTAAAACCCTGGAG 
TCACTGATGGAA 


28 


K 


Fgf3/Fgf4


IM000155 


TAGTGCTAGACTCTGCCTTTTCACCTGGC 
ATAGATTCACCTTTTTCCAGATATCCAGG 
GCACTTGCAAAGAAGCCAGGCATCATCAG 
GGGTTTGGACTTCCAGCCAGAGTCTGAGT 
TGTCACTTGAATGTGCTGCATTTTGTTGG 
ATTCAGCCCCAGTCTCCCGACTCTTTGTG 
AGTTTAGGATAATAATCACAACAGCACCC 
CTTCTTATTTGATGGCTAATAAGCTCTAG 
GCCAGTGTCTTAGCTCCATTCATG 


29 


D 





IM000156 


CATGTATTCTGAGAGTAGAATTTATACCC 
AGAGAATACCTAAGAAGTGAACTGACGCC 
GGGCGTGGTGCCGCACGCCTTTAATCCCA 
GCAGTTGGGAGGCAGAGGCAGGTGAATTT 
CTGAGTTTGAGGCCAGCCTGGTCTACAAA 
GTGAGTTCCAGGACAGCCAGG 


30 


D 


- 


IM000157 


GCCTGGTGTGGTAGCTCACACCTTTAATC 
CCAGCACTCATCTCTGTGATTTGCTAGGC 
CAGCCTGGTATACACAGTGAGTTACACAT 
CAGCCATG 


31 


K 


Fgf3/Fgf4


IM000158 


CGACATCCAACTT CTGGAAGGAGAGATGG 
GAAGGGGCATTTGGGGTGCTAGGAAGGGA 
TGGGAGGTGTCCCTAGAGCAGTGCTCATG 


32 


K 


Wnt3 


IM000159 


CATGAAATAATGCCTTCAGAACTGCATTA 
GAAATCACAAATAGCCCTGAATGCCCTCT 
AGATGCTTTTCTTGAGAACAATTATGTGT 
TAAAGTCCTAAGGCCCTTGTCAGCCCACC 
ATATGGAAAGGGAGAACTAACTGAAATGG 
GAGTT 


33 


D 


- 


IM000160 


ACTGACAAGAATAGAGAGAAGTTCAGTCA 
TG 


34 


D 




IM000161 


GTGTCCTGCTCCTGTCTGGGTCAAGGTCA 
TAAAAGATGAGCCAAGGCTGACTTCAGTG 
CCCACCTGGGGAGACTGATGTCTTCACAG 
GAATGCTCACCTGGAAGGTGTCCTCTGGG 
TGCATCTGTGTCACATTCGGTATAGAAGG 
AAGAATGCCAACAATACTCTAAAAATATT 
AGAGGCCTTGAGAGTC CTCAGTGGTATTC 
CACCAACATCAAAGCTGCATCGTAATATG 
CCAGCCTGGTCCTCACCTTTCCTGCCCTT 
CCCAGGAAAACATCAGCCTTTAACCTCAG 
CCCATAGGGGACATG 


35 


D 




IM000162 


AGGATCTTATAAAAATAACAGTGACCCAA
AACATAATTTTTGCCATCAAGAATCTCAA
AATCAAGTCTCATCCAAGTCTACTCTTCT 
TTATTGTATCTTAAACACACACACACGCA 
CACATCACACAAGCACACACACAAGAATT 
CACACACATACATG 


36 


K 


Wnt1 


IM000163 


CATGGTATTCTGATGATAGTACCAACATA 
v_ 1 1 l»(_A(j\- 1 A(*L I (j 1 A 1 v_ 1 GGAAATCC 

CAACCTCAGCCAAGTATTTGTGGTTGAAA
TAACCTATACTTCTCACATCAAACAC 


37 


D 




IM000164 


ACTGTGACCTGAGCACTTCTTGTCTTATC 
AATAGCTCACGTGCCCAGGCCGGGTGACC 
AGTCTCTAGGATGTTCTCCATG 


38 


K 


Fgf3/Fgf4



IM000165 


CATGCACACAAACTGGCCCTGAACTTTTG 
ACTTCCAGGCCTCTGCCTCTCTGCGCGCA 
CACACACACTCGCACTCCTGTATATGAAG
CGTATATGTGTTTCTCTGGGAACTGTTTT 
TATCAGGTGAAGCACTTCCTTTGTTCTTG 
CTACCCACCTCCAGGGCTCCAGGATCTCC 
AGAGAGCCAACCCTAAGACAGGCCCAGCT 
TCCTCTGTATCTCTGTGATGAGAACCTTG 
GCATAGAGCTGCCCTCACCCTCGGGATAG 
GGCTTATGTTCCCCGGAACGAGCCAGGCA 
CCTCAACAGCTCCTGGGGAGGAATAGGGG 
ACT 


39 


K 


Fgf3/Fgf4


IM000166 


CATGGCACTATGAAGGAAATGAAGATACA 
AAAGATTTCCCATACAAAGGGTCAACTGT 
TCAATTTGGCATTTATT 


40 


D 




IM000167 


CATGATAGAAGACCACGTCTGGGATGGGG 
TAAGGGTTTCTCAGAGTACCTTGCCCTGG 
GGCCACATCCTAAATCTACAACAAAGCT 


41 


D 


- 


IM000168 


CATGCAAAAGAATTCCAAATGATTTTACA 
GATCTTAGCCCTCTAAGAGATAGATATAG 
CACAAGTCCTGACTCCTGAGGTAGGTACA 
CACTGACTTCCTTCCACAAGCACTGCCTC 
AGCC CGGAGATGAAGGTCACATCAATAGA 
GACAAGTCAGGTTAACCGTGAGCAACCTC 
AAGACAAGGAGGAGCACAGCATAGGTCGG 
TGGAAGTGTTTGCATAAGCCTAAGGCCTG 
GGCCCAGTCACCAGCATTGCAGAGGAAAA 
GGAAAAACAGATAGTAGGTGCCTTGGTGT 
GT 


42 


C 




IM000169 


CATGGAGTTTACCAATCTTTTTCCACTCT 
TTAAAAAGACAAAAAATATTAGAATACTG 
GGCTGAGGAATGGCTCATCAGTTAAGAGC 
GCTGCTCTTTTGAAGGACTCCCGTTCTGT 
TCCAAATGCCCACCTGGAGGCTATCCTGT 
AGCTAGAGGT 


43 


D 




IM000170 


AGGAAGTGCTGAATAGAGAGGTTTGGGGA 
GAGCCCAACAATCTGACCTATTTATACCC 
TGCCAGGCCCTGCCCATG 


44 


K 


S100a4 


IM000171 


CATGGTGCTGGAGGATCATCCATCCTGAC 
ATTCTGGGA 


45 


R 


- 


IM000172 


Ui 1 lAALLCAl 1 lATGGTGTUACCAGAAA 
CCACAGATCTTACCTAGGCTTCAGACACA 
TCACCCGAGGAAAGCTCCATTAAAATCCT 
CATTCATG 


46 


D 




IM000173 


CATGTATTCATAAGTGGATATTAGCAAGA 
AAGTACAGGCTAAT 


47 


D 





IM000174 


CCTCTGGAAGTCAAGTGCAGCTTTGCTTA 
TTTGTTTAAGCCATCCACCATCCAGTTAT 
TAGATCTGAATTCATCTTTTAGGGTCAGC 
TTTGTTGTAGATTTAGGATGTGGCCCCAG 
GGCAAGGTACTCTGAGAAACCCTTACCCC 
ATCCCAGACGTGGTCTTCTATCATG 


48 


D 




IM000175 


GTTTTCTTTCTTTTTTTTTTAAAAGAAAC 
AGTCTCAAGTAGCCCAGGCAGTCCCTAAA 
CTTATTATATAGCCCAGGACAGTCTTGAA 
TTCCTGAACCTCCCTCCTCTACCTCGTAG 
TCCTGAGACCGATTGCATG 


49 


D 


- 


IM000176 


AGAGACCCAGAAATAC CAAGGTGATTTCC 
AACTGCCTGACCTGGGAGGCAAGCATG 


50 


D 




IM000177 


CATGTAAGATCTTCACTTTTCCAGTGTCT 
GTTTGTGCTGCCTTCAAACTGTTGACCTG 
ATGTAAAAATGTTTGCATCAGCTCAGGTG 
TATAGAATTGGACTGATTC CAGGAGAGT C 
AAATATACAGAATATCTAGTGTCCAAGAT 


51 


D 


- 


IM000178 


CATGCT AATGGAGTTT ATT CTT AGGACTG 
CCTCCTGCATCCATTGATTGACTTAAATA 
TGTGCACACT 


52 


D 




IM000179 


ACTAGGTGACTGTCTCAGGGTCTCACTGT 
GTAGTCCTGGCCTAGAACTCTCTATGGAG 
ACCAGCCAGACCTCACACTCAGATCCAGA 
TGCCTCAGCCTCCTAAGTGCTGGGATTAA 
AGGCCAGTCCCACCATACCCTGCCCCTGT 
TTCTGACATTTGAACCCCTCCTTTAGACA 
GTAGGGAAACTGAGGCCCTGAGATATGAC 
ACTTTTAGGGGCATG 


53 


R 




IM000180 


AAACTTCAGAAAGCGGGGGCTACCAAGGA 
GACTCAATTAAGATCTCTCCTCGATCTTG 
AAACCATCCCCAGCCCTTCGCAAAGCACA 
TTTGACGGACAGGGTTCTCTTGTCTTGGG 
CAACACATCCCGGCTACGCTCTGCAGGGT 
GAAGCTGTTAAGAACGTTCCATG 


54 


D 


- 


IM000181 


GATAAGCCTCTACAAAGCTGGAGAGGGCA 
GTCCAAAGAAACTTGAAAAGATTAAAAGA 
CAGTGCCTAAGGACACAAACGTTTTTCCA 
TAAAGAGCCTATGACATATTTTACTGCTG 
CT AATG AAACTGAC CTTGAAGGAACAAGT 
GTTTAGGGTTAGCCTAAACTTTGGAATTG 
GTGAAGGCAATGTGTCAGCTAGACAAATT 
AGAGAAAGAACTCAACAGATGAGTCAATG 
AATTGTTCTAAACTAGCTTGACTTAGGAT 
TTTCAGCACAGGAACAAAAGCACATACTG 
TCCCTCTGGTTGGCATG 


55 


D 
IM000182


CATGGAAAATGATAAAAACCACACTCTAG 
AACATATTAGAGGAGTGAGTTACCCTGAA 
GAACACATTCGTTGGAAACGGATATTGTG 
TAA 


56 


R 




IM000183 


CATGCCCGGCTCTATTACTATTTCTTTCT 
TTCTTTTTTGTTTCAGGATCCAGTTTCCT 
TGATAAATTTTTCTTGAATGTTGTTGTTG 
TTTTTTCTTTTGCTGAGTTTTTCTTCAAT 
ACTGCTGCTTTTTCTCTCCAGGTTCAGGA 
TGAGA 


57 


D 




IM000184 


CATGCTGTCACTAAGCTGTGCTCTTCCAA 
GGAGATGAAGAGACTAGCTGGTACCCTTG 
CTATGCCAGGCTTTCTTCTTGTTTATACA 
CACCTAATG 


58 


D 




IM000185 


CATGATCTAATCTGAACTTGTATCCCAAC 
CCTTTAT AAACAAGTGAATGTGTAAT CTA 
AACTAGTATAAGCTCTTGAATAATAGCTG 
AGTGAATTGCCTTTGATACACGTTTCCAA 
ATTAGTAGCC 


59 


D 





IM000186


GTCAACCACAGCAGTACTGTTACTTTCTG 
TGGGGGAGACGTCTCCCCTCCTCATG 


60 


D 




IM000187 


GGCAGTGAGCTTGCCCACTCTGCTACAGGACCTCGG
TGACCCACTATATACAGCCCTCTTCACTACGGCTCA
CAATCGGAGTTTAAGACCCAGTGAAGTAAACCCAGC
AGGACCCTTTACAAAGCCAGGACATG 


61 


D 


- 


IM000188


CTTGTCCAAACCAGCTTAGTCAACAGCCT 
CCTATCTGGGCTCCATCTTACCCTCCTCA 
TCTAGCTGATGAATGTACCTGCCTTCTGT 
TCCCTTCCTCCTGGTCTGAGCTGAGCCTT 
CTTGGGACTGAGAGCCTTCATCCACCACA 
GGCAGACTATCTTTAGATCATCATAG CCC 
CAGGTCTTCATTGCAGTGCAAAAGTGCAG 
ACCTTACATTTCCATTTTTATGCTCCCTT 
TGTAACGGCT C CTTACCGG ACTG CAGCAT 
AAGTGGCTGAGTATCCAATCACAATAGAA 
CACTTAGTTGTTTGCTTGTCTAACTCTCT 
CAGTTACACCATTGAGTATGTTACACAGG 
GCTGCTTTGTAGCTGTCACTGAGGCCACA 
AGGCAAGGGGACTAAGGCAGGACTCAGAT 
GAGCCTGTTTTTACTTCCCGTTGTCCCTT
TCACTTTGGGTTGAGCATG 


62 


D 




IM000189 


ATATAGACTCAATCAAGGTATTATTCTGG 
AACAAACAACTAGTAACAAAAATAGTGCA 
ATTGCAAGTATGATAACACAAGGCAGCCT 
TTACCAGCTTTGTCGGAAGGAAATTGTTC 
TTTGAAATCTGAATTCCAGAGAAAAAGTC 
AAATGTAAACTAGAAGTGTTTGCATG 


63 


D




IM000190


CATGTATGTGCGTGTGTGAGTGCATCAAC 
ACAAGTGCATAGATGCGTGTGTGTTTGTG 
TGTCTGACTGTTTAAGTAGGTGGCATCTG 
TCCTAGTCCTGACTTTTGATAAGTCTACA 
CGTTTGATAAGAGGATCTCTCTCACCACT 
CAGGTTCCTCCCCCCACCTCCACCCCAGT 
ACACAGCCATAACTATAAACTCCCCACGC 
AGATGAAGCCCCTCTGATCCCATTTTAGG 
GACATAACACCCCCCTCCCAGACTGAGCT 
AATGCCTTGGACCCTCCAAAACTGATCTG 
AACCCTCTCTGACCCTGCCCTCCTCCCAG 
CACAGGGCAA 


64 


B 


BF163810


IM000191 


CATGATTTTCAGTTTTCTTGCCATATTCC 
ACGTCCTACAGTGGACATTTCTAAATTTT 
CCACCTTTTTCAGTTTTCGTCGCCATATT 
TCACGTCCTAAAGTG 


65 


R 




IM000192 


AAGTATGTCTGCTATGAGTCAAAAGTCTT 
ATTTTTGCATCACATG 


66 


D 


- 


IM000193 


CATGCCGCAGTGGCCAGCAGCCCTGGTTC 
CAGCATTCTCAGAGATAACAAGGAGCCAG 
TGACCCTTTCTTCAAGCACCAAAGAAAAG 
CTAACCGACCCCACAAAGACCTGAGTATG 
AATGGTTTCTGCAGCTAAGGCACTTCCTT
TGAGGTCAGCGCAGTTCGGGGCTGAGAAA 
AGAGCTTGCCCTGGCTTAGAGCCTTTCTC 
TGGCTCACTGTCCCAGCCAGGACCCATCC 
ATCAGCCCACAGTGGGGTGGCATAGTGCA 
ATCCTAGAGAGATGTTCAAAGGGACATAT 
C 


67 


K 


Fgf3/Fgf4


IM000194 


ATTCTCTGGGTTTTCCTGTGGTGCTCTGG 
ACCCCTCTCGCTCCTACAATCCTTCCTCC 
CCATCTTCCACTGCTCTGCCTAGTATTTG 
GCTGTGAGTCTCTGCATCTGTTTCCATG 


68 


R 


- 


IM000195 


CATGCCCCTCTCGACCCTGGGAGCATTCA 
CCATCTTTATAAACTGATTCTTTCTGGGA 
AGATGATG 


69 


D 




IM000196 


CATGAAACACACTTTTAACTTTCCACATA 
CTTTTTAAAAGTGTACCTTCCCATTTTTT 
CGCCCCTAGACCCAAATTGGATGTTTCTG 
GCTCCCTCTCGTTCGTAGCTTTCCTGTGA 
TGTAGAAACCTCTTAGAAACCACACC 


70 


D 




IM000197 


GTTTCCCACGGTGGAAGAGGCAAACAAGA 
TCCCTTGGGCCTGCCTTCTTGTGGCACTA 
ATCTTACTCATG 


71 


D




IM000198


ATGTGGTGTTTAAATGAGAATGTGGCCCA 
TAGGCTCATATGTTGAATACNTATTTTCC 
AGTACTTGGAAGTATTTGGGGAGGACTAG 
AGGTGTGACTTTTTGAAGGGGGTGTATTA 
TGTGGATGTACTAAGAACCTTTAAATCCC 
TCTGACCATG 



72 


D 




IM000199 


GCATCATAGTTGTACCATG 


73 


D 




IM000200 


CATGGGTTAACAGTGGGCCCTAAACTTGA 
ACTAGAAAACTTAAAGATG 


74 


K 


Wnt1 


IM000201 


CAAGTCTGTCTGTCTCCTTACTAGCCTTT 
TGCTGTTCTGACTCTCAAATGGTTCCTTA 
ATTGGCCATTTGTCCCCTAAATTAGGGGC
GATTAGGATCAACACTCAAGCAATGTTCC 
AGATGGGGTCTGACGTTCCTCACTGGGGT 
CCCAGGGCTCCTCTGACTTGGTCACAGAA 
AGGTCAGCCCTCTGACCTGGCATAGATGT 
CTGGATGACCTCTGACCTCAGCTCATAAA 
CCTGACTGTGGAGATTGAGACTGGAGGGA 
CTCAGGGCAGTGGCTCACTGGACAGTGCC 
AGGGTGTGCAGTGGTAGGCAGACTTCTAT 
GTCAGGTCCTCCTGTGCCTCCATG 


75 


K 


Fgf3/Fgf4


IM000202 


GCACATATCTGAGCATCTCAAGAAGCTGA 
AGCAGCAGAATCATCCGCTCGAAGCAAGT 
GTAAGCCAATAAGAAGACTCTGTCTCAGA 
AGAAACTGAAACGAAGAGAGACAAAAACA 
ACTTCTGGGGCTGAAGAGATGGCTCAGCA 
ATTAAAAGCCCATTCTGCTCACTCAGAGG 
CCCTCTGTGAGCTGTCTCCAGATGTTTAA 
CAAGCACAGCTAACATTTGGCATG


76 


R 




IM000203 


CACATTCATTAAAGAGACTTTATTAAAGC 
TCAAAGCACATATTGCACCTCACACAATA 
ATTGTGGGAGACTTCAACACACCACTTTC
ATCAATGGACAGATCATG 


77 


R 


- 


IM000204 


GGGGAGAGGCTTCAATGAGCCCCCTCACA 
TTTGCATTTAAATAGCAGCATCAAGCGCT 
TCGCGTGCCACACACCAGTGGGCTCCCAG 
ATGTCAAGCCGGAGTCAGTCAGATGGCCA 
GTGCCCAGCTGTCCTCCCTATGTCGTGCC 
GGAGCAGGCAGTGACCTTAAAGAGACAGC 
GCTCACCGCTCCTGGAGCCCGACTCTGGG 
TCCCTCATG 


78 


D 




IM000205


CTTGTCCGCCACCCCGCCTGCCTCATTAC 
CTGGCTCACTCACTAACGTGAAAGCCTTA 
CAGAAATCTCCAGGTCCTCAGCGGGAAAG 
GAAGTCATCTTCTTCCTCATCCTCGGAGG 
ACAGAAGTCGGATGGTAAGCATCTGTGCT 
GTGCTCCTCTAACTGTGACGCCGGGTTCC 
CATCACATG 


79 


K 


Braf


IM000206


ATATAGTATGACTGCCTCAAAACAAAACA 
ACAACAAGAAAACCCCAAGATATCTAAAG 
GAGGAACATTCCAAAAGACAGAAATGTCC 
ATAGACCTTGACAAAGGAACATG 


80 


C 


- 


IM000207 


GTCAAGTGGATGTTTCTCATTTTCAATGA 
TTTTCAGTTTTCTTGACATATTTCACGTC 
CTACAGTGGACATTTCTAAATATTCCACA 
TTTTTCAGTTTTCCTCGCCATATTTCACG
TCCTAAAGTGTGTATTTCTCATTTTCCGT 
GATTTTCAGTTTTCTCGCCATATTCCAGG 
TCCTTTAGTGTGCATTTCGCATTTTTCAC 
GTTTTTTAGTGATTTTGTCATTTTTCAAG 
TTGTCAAGTGGATGTTTCTCATTTTCCAT 
G 


81 


R 




IM000208 


CATGAAGTTAGAATAATTGGGATAAAGCT 
TTTATCATTATCAATTGGTTTTGAAATTA 
TTGTATTGATATCTTGTAAACTGAATATT 
TATTGGTACATAAGTCTGGTTATGGTTGA 
CTACTTTAAGTTTTAAGAGTTTTGATTCT 
TCCAGGTAAATGGGTGTTGTAATG 


82 


R 




IM000209 


CATGCAGCCGGGGTGGGATTTGAAGATTA 
TGCCTAGTGAATATTTAATATTAAACACG 
GTGTGATCGAATTGATAGCTGTTGAAAAC 
TAGAGCGAAACC 


83 


D 




IM000210 


GGACAGGGTCTCTCTCTCTTGTTGTTCAT 
TGTTTCATATATCATCGTCGGCCTGCTTA 
CAGACTGCATTGTGTTCCCCTGTCTCTGC 
CTCCCATCTCACTGTAGAAGTAATGGGAT 
TACAGATAGATGCTACTGTGTCTGAAAGT 
TAAATTCCTAGGCCCCATG 


84


D 


- 


IM000211 


AGTGGGAGGGAGCGCCACTCTTGGAGCTA 
GGCAGGAACTGTTGTTACTTCAAAAACTA 
ACAAGACAATCTCACATTCCTGAGCTGAA
GACCAGATGCAGCCAGGGACAGGGTTCTG 
CCCTGGCCACTAGATGGGCTCTCTGGCCC 
TGCTAAAGCACTGCACAAAACTGGACGAG 
GTGCACCAAGAGTCCCGTGTTTGGCCCTC 
AGGGCAGACTAGAGAGCAGGACTTTCTCC 
TGGGAGCAGAAACTGAGCCTGGGGTCTTC 
ATG 


85 


K 


Fgf3/Fgf4 


IM000212 


CATGCTCATAATTCTGCAGTGCCTTCTCA 
TAACACAGGATAAAACACTCTAACCTTTA 
ACATTATACTTGAAAACTTATGTGGTTTT 
TTCCTACCAGAGTCATATCAAACCAGTCT 
CCCTCTCCACTCACAAGGATCCAGTCACA 
ATGGCCTTTTA 


86 


D




IM000213


CTGTAGGACCTGGAATATGGTGAGAAAAC 
TGAAAATCACGGAAAATGAGAAATACACA 
CTTTAGGACGTGAAATATGGCGAGGAAAA
CTGAAAAAAGTGGAAAATATAGAAATGTT 
CACTGTAGGACATG 


87 


R 




IM000214 


CATGGCGAGATTCTGTGTCCAAGCTGCCT 
CTACTCGTGACATTCCAAGATGCCTCTGA 
GGTGGGAACTGTGAAATAGGACAGAGCCC 
CACAGTCCCCTCTT 


88 


K 


Wnt3 


IM000215 


CATGGGGGGGGGTACCAAGAAGGGACTGC 
TGTGATTGGGATGTAAATAAATAAATAAA 
TAGAATAAACAAAACCCAAAAACAAACAG 
a a a pr*T a a a r*T r a at a 2V PTdf* ah a a atcj a 

CTCTTGCTCTTTTCTGGTAAGGTTAGAAG 
CAGGTTACAAATCTATATTAGAGATGGAG
GCATTTCACACCAGCATAGGTATAGGAAG 
TAGATGAAATGAGGACTACACTAGAGTCT 
GTTTGTCACAACCAATTCTGAGTGATTTC 
ACTGAGATAT 


89 


D 




IM000216 


CTCTGAGAAACCTACCCCATTCTCCCTCC 
TTTCTCCCATAAGCAACCACCTCCACAGC 
ATTATCAAAAGACTGCTGACAGATTGGTG 
GCTCAGCAGGGAGAGTCAGAGCTGTTTCT
TAGGTCTAAGTTGTAGCTCCACAGTAGTA 
TGTTCTCCATG 


90 


D 




IM000217 


CATGGAACACTCAAAGCTGGCCAGGGCCC 
ATTTACCAGGTATCCTTTGCCTTCTCAGC 
TGATGGGCATCAACACATTAATTCACATA 
TGACTCGTTTGTGTCATATCAATAGTAT 


91 


D 


- 


IM000218 


GTGGTTTTTGTGGTAGAGAGACACAGAAG 
AAACTGAAGTCCTTGGAACATAATTATCA 
CTGTGGTTGAATGTTTGTGTTCCTATAAC 
ATCCTATGTAGGAACTGAACCTATAAAAG 
TAGTGGCTCCGAAGGTGGTGTCCTTAAAT 
f^TnaArTnnnrTAPAAnATTTTnrTrTTr! 
TGAATGGCTTTATGGAAGAGGCTGTCACT
TTTCTGTCTCTTCCTCCATTATCTTGGAA 
GACACAACAGTTCAAGGTCTCATCTGGGA 
AACAGAGACCTTTACCAGACCCTAAATCT 
GCCAGTGGTGTCTTGATCCTGGTCTTTCT 
GTCCTTAGGAGCTATAATGCATG 


92 


D 


- 


IM000219 


GGCCACAGCCAGTCCACCTGTATGCAGCT 
GGGTGCTTGGAGTGGCCCTGGTAGACAAA 
GTCTCCATCTTGCCATG 


93 


K 


Fgf3/Fgf4 


IM000220 


CCTTAGGGCCCAAAATCCTTCCTCCCATT 
CTTCCATAAGAGTCCCCAATCTCCATCCA 
CTGTTCACCTGTGGGTGTGTGTATCTGTC 
TAAGTCAGCTGCTAGGTGGAGATGCTCAA
AGGACAACATG 


94 


R




IM000221


GACAGTAAAGAAGACAAAGAAGTGAGTAG 
AGCTGGATGAAAACTAGGAAGTTCAGACA 
AAGACTGCGGGAATGANGTGTAGAGTCTA 
GAGCCCAAACAGTTAAACATG 


95 


D 


- 


IM000222 


CTGCTACATTCTTAGCTCTAGCTAACTAG 
CATCAATTGTCCCAACCCCTTCTATGTAT
GACTCCAAAGCCAGTGTCACATG 


96 


R 




IM000223 


CATGGTCTCTAGAGCTAAGAGATACCAAT 
GCTGCGGCAGGCAGTTTTTATTACAATCA 
TTACAGTTTTGACAGTGTCTGGCCGTGTG 
CCAAGGCTGGCCTTCATCCCTGAGCTCGG 
TGATGCTTCTGTCCTGGTCTTCTGGCTCG 
TCACAGCTTAAGAAAGTAGCTGCTTCTC 


97 


D 




IM000224 


CATGGAAAATGATAAAAACCACACTGTAG
AACATATTAGATGAGTGAGTTACACTGAA 
AAACACATTCGTTGGAAACGGGATTTGTG 
TATATCAATGAGTAGTTA 


98 


R 


- 


IM000225 


CATGGAAAGATAATGTGTAAATTTGGGTT 
TGCCGTGGAAAACTTTGGTTTCTCCATCA 
ATGGTAATTGAGAGTTTGGCTGGGTATAG 
TAGCCTGGGCTGGCATTTTTGTTCTCTTA 
AGGTCTGTATGAAGTCTGTCCAGGATCTT 
CTGACTCTCATAATGTCTGGTGTAAAGTC 
TGGTGTAATTCTGACAGGCCTGCCTTTAT 
ATGTTACTTGACCTTTTTCCCTTACTGCT 
TTTAATATTCTA


99 


R 


- 


IM000226 


GGTAAGAGTGGGAGAAAATGGGGGTGGGG 
GGTGGGGACACTGCAGAAACCTGGGAGAA 
AAAAAATCCAACTAAAATCAGGAAACACA 
TG 


100 


D 




IM000227 


CACCCCCATCCCGCAGTTCCCAGAGGGAA 
CAGTCCCAGCAAAAATACATG 


101 


D 


- 


IM000228 


CATGGAGATGCAATGAAAGCACACAATAT 
TGCTGAACCAAACAGAAAGCTCAAAACTA 
GGCACAGAAAAGAGATACAAACACAAATC 
TGAACAAATTGACCTTCTCCCTATAGCAT
AACTAATATCTCAGAGATAAAAGTGGTCT 
TTATATACCAGGGCGAAAGAGGTCTAAAA 
AGAGAGGAATAAAAAATATGGCATATTTC
CTGTCATATGCAGAACCTATATGAGTCTT 
TTTGTTTGTTTCTTTCAATACAGCCTATG 
lAbt 1 C I AtsC 1 Cj I CC1 AbAACl TACTTTG. 
TAGACCAGGCT 


102 


R 




IM000229 


CTGTTCTACAATGCCGGTTTCCAACGTAT
GTGTTTTTCAGTGTAACTCACTCATCTAA
TATGTTCTACAGTGTGGTTTTTATCATTT 
TCCATG 


103 


R




IM000230


GACAGGCTCCAATCAGATATACCAAGGGC 
AGGAAGCACGTGACAAAATCAGATGCCTG 
GAGACAAGTGTAATAAAAGAAGCAACAGA 
AAACAAGGTTACTTGGCATTGTCACAACC 
CAACTCTCCCACCATAGCAAGTGATGGAT 
ACACCATCACACCAGAAAAGCAAGATATG 
GATCTAAAGTCACTTCTCATG 


104 


R 


- 


IM000231 


CATGGGTCCCTGAAGGGTCTCTCCTTTAG
CAAACCCCTGTACAGTTGAAGTGANTTTT 
CAGGTACCCATTGGTCTTAGC


105 


D 




IM000232 


CCCCACTCCTCACAGGGCTCCCCACATCT 
GCCCTGGGACACCCCACTCCTCACAGGGC 
TCCCCACATCTGCCCTGGCACCCCTCCAT 
TTTTCAGGCACCTGAAGTCCCTACTTTCT 
AAAGGCCATTCTTCTACCTCAGGTCTTGC 
TCTAGGACTGTCAACATG 


106 


K 


Fgf3/Fgf4


IM000233 


CAGGACAGCCAGGGCTACACAGAGAAACC 
CTGTCTCAAAAAACAAACAAACAAAAAAA 
AGACCATTATGCATTCCTGCGGCTCTGAC 
ATG 


107 


R 




IM000234 


CATGGGCAGCACCTCGTGGAACACTATTA 
TAAGTGTCCTCCAGTCAGGTCAACAGCGT 
AAGAT 


108 


D 




IM000235 


CCTGTACATTCTGTGTTAAGGACAGAGGG 
CCTCCTGCATG 


109 


K 


Fgf3/Fgf4 


IM000236 


CATGGAGGCGCAGGAGTTATTGTCTAAAG 
TTGTGAAGATGAAGCCTAGATTGTATTGG 
AGATCCGGGTAT 


110 


D 




IM000237 


GCAGATATTTCCACCTCTGCCTTCCACAG 
TCCTTCCTCCCATG 


111 


C 




IM000238 


CATACGCTTACAATGTGTTGTTATTTCTG 
GTTCTCGTCTGCCTTCTTTATAAAAACAA 
ATCCACTAAGGTGGAGTAGCCAGCCTTTA
CTCAGGGACTGTCACCATG 


112 


D 


- 


IM000239 


TTCTGTATATATTGTGTGGTCAGAAAACC 
GTGGTTTTCCTGGTGTCAAGAGTTAACAC 
TTTCAGTAATCACTCATTCTAAACCAGAC 
AAACCTTTAATCTTTCATCTGGAAAGGTA 
CTCATTCAAACCAATGCTCTCTTAAAACC 
AGAGTATTTAAACAGCCAACTGCATCTTC 
AGGGTTTCATAGAAAATCAGCTTGATCTA 
AAATAGTCACTGAATTCTGATATCATAGA 
CATG 


113 


D




IM000240


TCCACCCACCCACCCACCTGCCCACCCAG 
ACAAATGTTCACTGAGCATTCATATACTC 
CATTCACTTCTAAGTACAGAGCCTAAGAA 
TATGAGAAAATCCTCATAGCAAAGAAATG 
CCTCTTGCAACTCGAGTAAAAACTCGAGT 
ATGGGATGGAAGAGTTGAGAAAACAGATG 
ATAGTATGAGAGCCTATG 


114 


D 




IM000241 


AGGAGCCTAGCAGAATTGCCCTCTGAGAA 
GCTCCACCCAGCAGAAACAAATGCAGAGA 
CCCATCGATAAACACTGGACAGAGCACAG
AGTCTTGTGGAAGAGTTGGGGGAAGAATT 
GAGGAACCCAAATGGGATAGGGACTCCAC 
AAGAAGAAAAAGAGAGTCAACTAACATG 


115 


R 




IM000242 


CATGTCCTACAGTGGATATTTCTAAATTT 
TCCTCCTTTTTCAGTTTTCCTCGCCATAT 
TTGAAGTCCNAAAGTGTGTATTTCTCATA 
TTCTGTGATTTTCAGTTTTCTCGCCATAT 
TCCAGGTCCTACAGTGTGC 


116 


R 


- 


IM000243 


CATGTGGAGGCCAGAAGTCAACATATAGT 
CTCCTTCCCAATTACTTGTCACTGGAGAG 
C 


117 


D 




IM000244 


GTTCAGTAGCCAGCAGGGGGGATAGGACC 
AGCCCAAATTCTCCCTTTGCTTGGCCTTG 
ACTACTAGTCTGGGAAGGGATAAGTGGGC 
TAACCAGAAGTCTTCCACATCTCTAAGTG
ATTAAAAATGGAAGACGTGATCTCTGGTC 
ATTCATAAACAGGCATTTCTCAAAGTTGG
TCTGTGCAGTTTGTGGGAAAAAATGAAAT 
GTACTCATG 


118 


D 




IM000245 


CTACAGAGTGAGGTCAAGCTCGAGGATAG 
CCAGGCAGGGATGCACAGGGAAACCCTGT 
CTCAAAAATCAAAACCAACCCAACAAACA 
AAAACAAAAATGGAAGGATAGAAGAGAGA 
TAATCCATG 


119 


D 




IM000246


CATGTACTGAATCCCTGAAGTTGATGCTG 
AGCACCATCTTGTGCTGTTCTACCGCATT 
TACTGGGG 


120 


D 


- 


IM000247 


CATGTGTCACTCAAAGGCTGCTGAGAATC
AGGCTGTACCTGTATTCCTAAGCCATCCA
CAGCCATCCTGACCCACAGCAAATGCTGG 
CAGTCGCCCCACAGCTGGACTCCGTTCCT 
CCCTCCACTCCTATAGCCGAGGCTATCCA 
CACAGGCTATTTCAGTGCCCTAAGCCTTG 
CTACCGTTATGTATACATTGAGGACAATG
AT 


121 


D




IM000248


AGAAACCACTGCCAAATCAATACATTTTA 
ATTGGAAGTGTTTATGAAGCCCAGGAGAG 
ATCCCTAAATGTATTAATTGCTTCCTGAG 
GAAATATAAAACTCACAGTTACTAAAGCC 
ATG 


122 


C 


- 


IM000249 


ATCTTCTACACAGATGAAACTGACAAAGT 

7\ r>1\. 7V AT* 3V A BP ATTaTaTarnZi TV A 7\T , r*Zi R. zv 
ACAAA1 AAAvjAI 1 Al Al AUL-Ai\iW\lLrAftA 

AAAAGTAAACAGCACACATTTATAGATGC
ATCTAGCATCCCCCAAAGCTCAACACCAT 
CCATACTTGAAGACTGCAGTGGTCCCTCT 
AGACAGTATGCTCCAGGTCAGCCCTCAGC 
ACTTGAGAATAAACAGCTTCATTTACTCA 
GCCTGTTGTCAGGATCCATG 


123 


K 


Fgf3/Fgf4


IM000250


ACTGCCTCAAAACAAAACAACAACAACAA 
AACCCCAAGATATCTAAAGGAGGAACATT 
CCAAAAGACAGAAATGTCCATG 


124 


C 




IM000251 


CATGAGCTGTCX3ATAGTGACCTGCAGTCAAGGAAAT 
CTGAGGGCTTCCTAATTAACAGAGGAGCTCTAAATG
AGAGTAACGCGCTCCACAAACCCCCTCACACTCGGT 
AAGTGTCACGGTGCAGATAAT 


125 


C 




IM000252 


GCCGCGTATGTGTTTCTTTTTCATAGAAGAATTAGC
ACATAATGGAATGTGCGTATCTGAAGTGCACAACTG 
AGGAGTATTTATTATTACATACCTTTACAAGATATC 
TTTTCTCAGGGAGCAACCTGAAAACATAAGGAGAAA
AACATAAGAACTGCCACTCTAAGGGTTGGTGAAATG
GCACAGCCTGGCGGTAGGACACACACATG


126 


D 


- 


IM000253 


CATGGAGAAACCTGGGCTTATTCAAGCAGTTTCCTT 
TGTTTACCCTGCCCAGGGTTGCCAGTGAAGGGGCTC 
CTCCATCACTAACTAAAGGTCTTATCCTATGCTGGT 
TCCTCTCCACCCCACCAT 


127 


D 


- 


IM000254 


TATAGGAATAGAAATTCAGAACTTATCAGTTTGTTT 
TGCTTCAAATGTCAACACATAATTTAAATTTACAAA 
CCCCTTGCACATTTGCATG 


128 


C 




IM000255 


GAAGACAAAAGATGTGTCAAATACCTGGGCAAAAGG
GGGTGGTGGTGCTCTCTTTCCAACTCCTGAAAGACA 
CCTCTGCTCAGCACACTAGTTTCCAGGTTCCTGGGT 
TAGGATTTGGGTGAGATTGGTCGGCGATGGTTTGGT
TCCTCCATTCTGCTGCTTCTCCCTGATACATTGAGT 
TACAGCAGCCCACGCGTACACACTCTCGCACATG


129 


K 


Wnt1 


IM000256 


GAAGAGGAAATAAGGCAATAGCTAGACTGGAAAAAC
GAGCCAGCCTAAGAAGCTGCAGAGTAGTCTGTGGGG 
TTCTGCTTTGGTTAGCTGCCTTTAGTGCTCATG 


130 


D 




IM000257 


CATGGATAGAGGATGGAAGTTGAAAACCT 
GCTATTAAGAACATAGCCCTGTCCATTAG 
TGAGAGTG 


131 


D 





IM000258


CATGTGGCCCAGGGGCACTTGGAGCCTTAGATAGCT 

X I Xf\ 1 \ 1 v_ x\j\j x i3VJk.c X x vjxjJ^. X\j x l vj\j 

GTGACAGGAAACAGGAAGAGCTGGATAGTGGGGGGT
CCCCAGGAGGAGCTAGCTGTGCTCTCTATCACTTTT
GCTCTCCTGGGGCTACCCCCGTCTCAGGGGAAGGCC 
TGTGACTGGCTAAGCAACAAGTGTGGGCTGAGACCT 
TTCTCTGTGACACTCTGGTGCTACTCTGGCCATAGC 
ACAGATCTCTAGGAACGCACTCT 


132 


K 


Fgf3/Fgf4


IM000259 


TATATGGATATGTTTATGTGAGGGTAGGCACTCCTG 
GAGGGTGGAGGCATTAATTAGATCCTCTGCAGGTGA 
GCCACCTGACATG 


133 


D 




IM000260 


ATATGTGGACTGTAGTCATCTTGAACATCTGTAACA 
AAATATATAGATTAGGAGGTTTAGACAGCAGACATG


134 


D 


- 


IM000261 


GTGCCTCTTGTCTGCCAAGCTGGTATTGTAGCATG


135 


D 




IM000262


ATTTGTGACATCTTAv^^ACjt-TTAtjlj 1 1 vso ILlX ULjA 
GACACAGGGCTGTCCCCTGTAAAGCAGGTTCCATCA 
GTGACTCCAGGGTTTTAGCAGTTCAGTGGCGTAGTT
TTCAGACTGCTTAAGATTTCTCAGGGGCTAGGCGTG
GGGCAGAGACCCTGCAGACCCTGGCTAGAACAGAGG
CCCTGGGAGACAGTTGAGGGTGCTCAGCTGTGGAGG 
ACATG 


136 


K 


Fgf3/Fgf4


IM000263


CATGACGACTTGAAAAATGACGAAATCACTAAAAAA
CAA 


137 


R 




IM000264 


CCTAAGTCTGACCGTGCCACTTCCCAGTCTTCCCTA 
CACTTC^TGCTTTTAGGCACMC^AATTTGTACCC 
CTCATG 


138 


B 


Mm.102899


IM000265 



CCCCCCAGCCTGCTCCCTCCCCGGAGGGAGTCCCCA 
GTGTGACATG 


139 


D 




IM000266 


GTTTAGGTGATAGGGTACTTGCCCAGCAGTAGGTGG
TGCCCAGGATTCTATCCTCAAAATTGCACAAACAGA 
ACATG 


140 


D 


- 


IM000267 


CATGTTGTGTAGATACCTACATAATTATAATTCATA 
ACTGTAATTTGCTAC 


141 


D 




IM000268 


CATGGGTTTGAGCCTTGTCCTGAGCTGGAGGAAGAG
AGTGACCCAAAGGGACCTTGGTAGCAGCCAGGGATG
TGTTGGGGAGCAGAGAAACTTTTATGAACTTCAGTT 
TCAGTACTGAAACTTCCCTTTCCCTAGACTTCCTTT 
G 


142 


D 


- 


IM000269 


CATGGGACAACTCCTTTTTCCTTCTGGGTCAGGGGA 
GAGAGACCTCCTATCTAAACTGTATAGGCCATTGCT
GTAGCCCTTAGCTCACTTCCGGGGCGGGGAGGAGGA 
GGTTAAGACCCTAT 


143 


D 




IM000270 


CATGAAATGAAAGAACAGAGTAGCAATTTGGGGAGA 
AAAGCCTGCCGAGCGGACTTAATCTTTCCCAAGTGC
TATCAGT 


144 


D 





IM000271


ATGCTTGTCTTTCCCGCCCATTACCTGCTTTTGTTT 
GAGATAATAGTTTTGTTACTTTATCAACTAGTAGCG 
ACTAGTTTACATTTGGTTTCATAAATAAGATCCATT 

^*V^. A AAA AAA VJ^J AAA A ****** A ""W* * * * * 

TTAATCTGAGTTTTCCATCCTTGATTTATTTTGATT 
CATATTTTAATTGTCTAGTTCCCATCCCTGGGCAGG 
ACTTTTTGGGAAAGTCTTGCAGGTGACTATGTTGAG 
AATGATTTATGTTGTATTAGCACAGGTACATTCGAC 
AGTGCTGGTTCCTTCTGGAGCGCCTCGGGTGTGGGT 
CCTTTTCCTCAGC 


145 


D 




IM000272 


CATGAGTTTGATTATTTCCTGAATTCTACCTCTCTT
GGGTCTATTTTCTTCTTTTTGTTCTAGAG


146 


R 


- 


IM000273 


GGGATAAGACTGGATAGTAAGCCGGGCGTGGTGGTG 
CATG 


147 


D 




IM000274 


CAATTGTTTTACACTAAACCATATAGGAATTCACCC
TGAGAGGAGTTCGAAAGCCTTTCAAAACCTGTACTG 
ATATAAAGCAAATCTCTTTTGGATTCCCAATCAAAA
TGATTTGGCAGAACTTTAAGGCCACAAAAATTGTGT 
CTGAACAACCCCTCTGAGCCCAGTTTTGTTAGCTTA 
AATTAAGGGCCATG 


148 


D 




IM000275 


CCTCAAACTAAGAAGCATCCATTTCGAAGCTGCTGG
GATTAAGGGAGTATGCCACCACCACCAGCTATGGCA
TTTTTTTCTTTAATTTTACTATTTTTTTGCTTGTAT 
ATTATGGTTTCCAGTTTTGTGGGTTTTATAAGCTTT
GAGTGTGTTTCTGCATG 


149 


D 


- 


IM000276 


GTCCACTTTAGGACGTGGAATATGGTAAGAAAACTG
AAAATCATG 


150 


R 




IM000277 


CATGGTCAGCTCTCACTGCCCCATCCCCTGTCTCCA 
GTTCACGCACTGTATCCTGTGTCTTTCTCTGTGGCT 
AGACTCTTCTCTTGGGGGAGGGGAGTCTTGTATATC 
GATGTGTGCTCACGCACATAGAGGCTAAAGATTAAT
CTAGGTGTATTCATTCATCGTCTCATTGC


151 


D 


- 


IM000278 


CATGTGTTTCCTGATTTTAGTTGGATTTTTTTTCTC 
CCAGGTTTCTGCAGTGTCCCCACCCCCCAC 


152 


D 




IM000279


ATGGTGTCTGTTCATAGCAGTAAAACCTTAACTAAG
ACACTGATATAACTCACCTTTCCCAGCCTCAAAGTC
TCTACCATCTCAGGATCCACTCACTCATTCACCAAA 
CTTCATCAAATGCCCACTGTGCTATCATCAGTACAG 
AATAAATCATG


153 


R 




IM000280 


CATGAGACTGTCACAAGCTCCTGGGATGGGGACCTT 
ACCAGAAAGCCACCAAATCAGAGGCATCCCTGTTTG 
GTGAGGGTACATTTGTTTTTCCCCAGGCCCTGAGTG
CCAGGCAGGAGCAGGCAAAGTTCACCTGGGAGGATG
CCCTGGAT 


154 


K 


Fgf3/Fgf4


IM000281


GTTTTGGTTCTTTTCAAAGAAAAACAAAGGTCATTG 
CAGCTTTTTGTACCATTGAGGTGATGGTAGGAATTG
AGATATATAATCTACTTGAAGATATATATTATGGCA


155 


D 








IM000282 


COSCTGCrcrCTC^CCAACCCAGTGTGTCTGCTTTT 
AGCCCAGACGGGGGAGGGGGTAAGGGGGTGGTCTGT 
CTCATG 


156 


K 


Wnt1 


IM000283 


GTGTCCCTCCTGTCGTTAGGCAGTACTTCCAAATCA 
AACCATG 


157 


C 


- 


IM000284 


AGCTGGTACAATGCTTAGAGCAGAGCTGCAGAAGCA
ATACAAGAGATCCTGGCTCAGCTAGGTGCAAGCTGG
AATAGACTCCTGACAGTTGTCCTATGAACTCCATAC
ACAGGCATG 


158 


D 




IM000285 


ATGGATCCCTGGGGGGCAGTCTCTGGATGGTCCTTC 
CTTCTGTCTCAGCACCAAACATTGTCTCTGTAACTC 
CTTCCATG 


159 


R 


- 


IM000286 


CATGATX3CACTTAGCAATTCCTCAATTGAGACTCAA 
GTGAGCCTAGGCTGTGACAAAATGACTGTTAAAACT 


160 


K 


Fgf3/Fgf4


IM000287 


CATGTAAAGCTAGTTCAAAACATACTAAATAATTCA 
GTTGTAGAAGAGGTGAGGTTATCTCACTGCCAGGAT
AAGCTATTGAACAAGCAAGGGTTCTCACTTACTGTT 
TAAGTGGAAGTGTTTTCTTACTTCAAAAAGTCATTA
ATGAATTTTAAGCTGCATAAATATTTAGTTATT


161 


C 




IM000288 


TAAGCTTTTCTCTTACACAATCCCCCGGAAACCCAC 
AGTTTAGGTCACAAAGACCCAGGCACCTATTCCTAG
GCCTGGTAAGTGGGCACCC^CCATTTACAAAGAGCT 
CAGCATTTGGCTCACACATG 


162 


D 




IM000289 


CATGAAGATGAACCGGGCTTGTTTCTCTGGCAACTA 
GGCTCAGAAAGGATAGGACCACCAGCCGAGTAGCTG
TCAGATGGAGCTGAAGACCTGAGGGAAAGAATGCTT
GTGGGAAGAAGCTGGCTCCTTTTGGTTTTGTTGTTG
CTGGTTTTGTGACCGGATCTTGCTGTGTGACCCTAC
CTAACAT 


163 


K 


Wnt1 


IM000290 


CATGGACTTAATTTTACTGCATTTGAATTATGGAAA 
ATATATATGAAAAGTCTTTAGAAAAAGGCAGAGGAC 
GAAAAAAACCAAAGAACTTTAATTATCTGAGACCAA 
GAAAACTCTTTAAGAAAAAGCAGTAGATTTAAACTA
CGTGTTGTTAAAATAGTCCTGTATAGATATAAAGTC 
CCTCAGAGGGAAGAGATTTGTTGAATAAATTCAGAC 
ACTCAAGAGAA


164 


D 




IM000291 


ATTAAACAGCCCAGTGCACTCAGAAGTGAATGTTGA 
GAAGTGGGTAATCTGGGGACAAACAGAGGGAAGAAT 
AGTGCCCTTGGCACGTGCAAAGGAGTTTGGGAACAA 
ACATG 


165 


K 


Fgf3/Fgf4


IM000292 


CATGTATGACAGTGAGGTCAGGAGTGCCCAGGGAGC
TTGCATTGGCAGAACAGCCTTTCCTGGCCAAGCCTA 
GTGTCATCAAGTATATATTGGACCAGACCTTATAAA 
ACTTGGGTTCCACTCTGGCTGGACCAGCCTCAAGGC 
GTCGCCTCTCCAGGCCTACCTCCCAGACGCAGAGGC 
AGCATTTGGAGGATTGAA 


166 


D 








IM000293 


CATGGGAACTTGTTCCAAGCAAGGGACTCTGCTACA 
CCTTCAAGGGACGCTGCTAATACTGGGTTCAACCTT 
GGGCAGCGTGCACAGCAGGAGTGGGAGGGCTCTGAT
GAGGAGAGCCACCCACACTGTGAGATCTAGGAGATA
AGGTCACATCCAC 


167 


D 




IM000294 


CCCTCCAGCAAATTGAAATACGAAAGACTCAAACAC
ATTAGAACCATTCCAATAAAAACTTGCATTGCCCCA 
GGCCCCTCCCACCACCATG 


168 


D 


- 


IM000295 


CAAGAGTATATATCCAAGAAAAATACAGCTGAGTTG 
ACTGTTAGTTCTGTTTTGGCCTTCATG 


169 


D




IM000296 


GGTAAAAACTCTACCAGTTAAACTACATT 
CCCAGCCTGCCTCCAATGAATTTAATTTG 
TGTTTTTAGGGTTTCTGTTATTGTTGTTT 
TTGAGACAGGGATTCACAAAGATCTGCCT 
GCCTCTGCTTCCTGAGTGCTAAAATTAAA 
GGTATGCATG 


170 


R 


- 


IM000297 


GTTTAGTAACTGTTTTCTGTATTACTTTTGTTGAAA 
ATTAGATTGTTCCTGGTGACTTTGTGTGCTATATTC 
TCTGCATG 


171 


D 


- 


IM000298 


CATGTTTCTGCTTCTACTTTATCCACCCTGCACACA
CTCACTGCTATGTTCCTGTACCTTTTCCATCTCTCC 
ATTGAATATTCACTCCAACAGTGGCATTGGAAATTG
CAGTGGAGATACC


172 


D 




IM000299 


ACGATGGTCTTGCCCTTTCTCACACCATCAATAGTC 
ACTCAGAGCTGTGGTTGTTATCTGAAGTGTGTTGCA 
GTCCAACTTTGCCCCATG 


173 


D 




IM000300


GGAGTGTAAGCGTCGGTGTGTCACCCGTGAGATTAA 
GTCAAAGTGTACATG 


174 


K 


Wnt1 


IM000301 


TAGACCCAGTCTTGCACTGGCCTGGGACT 
CGCTTATTAGGTTTGACTGTTATCTGGCC 
AACAAACACCAGGAAATGGGGTGACAGGT 
GGTTGTGAGCCCTCTGAAATGGGCATTGG 
GACCTGAACCTGGGTCCTCTGTAAGAGAC
ATG


175 


D 




IM000302 


TCACCCCAGCTGGGGCTGTGCTGAAGACTCTGAAGG
GGAAGATAGGCCTATGGTNACATG 


176 


K 


Fgf3/Fgf4 


IM000303 


GTTGGGCTGAGCCACAAGTACACCTCCACTCACTGA 
GCCATCTAGCAGGTCCCAAACAAGGTGACTTTTGTC 
ATCCAGCAAGACATAGCCATCTATGCCAGTCATCCT
TGTCATG 


177 


K 


Fgf3/Fgf4


IM000304 


TAACATATTTGCTTGTTATGAAGGAAAATGTTGGAT 
GTGTGTGCCTGTGGTTGAGTACTGCAAGTAGTGTCA
GGGAAGAGAAACCTAGCTTGAACAGTCCCCTCATCT
CCTTCATATCCTCACTCCTTGTCAGGCCCTGTATTA 
GGTAGTGCTTCCCTACCTCCCTAATGCTGTGACCCT 
TTCTTTAATAGAGTTCCTCATG 


178 


C 





IM000305


CATGTGAGCACAGGTACCTATGGAAACCA 
AAAGTGTAGGATCCCTTAGAACTGGAATT 
ATAGGCAGCTGTACGCTATTGATGTGGGT 
GCTGGAAACTGAACTCCAGGCTTCTTGAA 
GAGCATCAACTGCTCTTAGCTGG 


179 


D 


- 


IM000306


CATGTAGAGACTGCCATATCCAGGGATCCACCCCAT

AGCAAGATTTTATTGAAAGGACCCAGATGTAGCTGT 
CTCTTGTGAGACTATGCCGGGGCCTAGCAAACACAG 
AAGTGGATGCTCACAGTCAGCAAATGGATGGATCAT
AGGGCTCCCAATGGAGGAGCTAGAGAAAGTAGCCAA
GGAGCTAAAGGGATCTGCAACCCTATAGGTGAAACA
A 


180 


R 




IM000307


CATGTCCTAGAGTTGTTCCAGCACAGAAGCTTTTGG
GAGAGACCACCATTACTGAAACGCAGCAGATGCTGC
AGCT 


181 


o 




IM000308


CTGCTTGTTGTGGGGACCAGCCAGACACCCTCCACA 
GGTGCAGTGGTGCAACATG 


182 


K 


Fgf3/Fgf4


IM000309 


CATGATGTTTGTGCAGGAATAGAAACCCTGACTAAG 
ACAGAGGATATTCAAGATCCAAACTAGCAGGTTAGC
TGTGGTTCC 


183 


R 




IM000310 


CATGAAGCACACATTACCCTGTGACTTGCTTTTTTA 
TTAAT 


184 


D 




IM000311 


CATGTGTCCTCTTGTCTTGTAGTCTCTATTCTTTGT 
GATTCCGCAGCTCTCCATAGAGTGCAGTTCTATGTC
CTGCCTGCAAGGTCCATTGGCTTACTAGGGTCTGCC 
CCTCCCAGAAGAGTAGCTCATTTAGAATGCATTACT 
GGTGTGCTGTCTTGCATCTTTTTTACCCAT 


185 


D 




IM000312


ATCTATGTTTATGCACTACTAATTACTGTTTAGTTT 
ATATATGCCCTAATAATTACCCCATTGAAAACTTAA 
ATTTTGTTTCAAAAGTGTGGTCTCATTGGAGGTGTT 
AATGTACAATGTCTTTCTCATG


186 


D 




IM000313 


CATGGCCAGCTGAGCGGGCTGGAACCTGCCCTTCTG 
CTTCCTGTCCCTGCACCTCAGCACCGCTGTGCACTT 
GGTACTAGACCTCAATCACCGCAG


187 


D 




IM000314 


CATGTGCGTCCCCCCCAAACACGCAAGCGCACACCC 
ACAAAGAGAAGAGACAGGG


188 


D 


- 


IM000315 


CATGGCCACTTGGAGAGAAGGGGGAAGGGAATGCGG 
AGAGAGCGGGAGCAAGAG 


189 


C 




IM000316 


CTTAAGCACTGATCAATGGCCAAGGTTTGCCGACTT 
GGGATCTGGGGTATAGACATCCACCCACTGAGACCC 
TCTAACAAAACCAGATGTGGAGGTACGAAGCCTGGC
TCAGGGGCCTGTCCTTTGTCATCAGAATTCACCAGC 
TGCAGCTCCTGGGTCAGCTTTGTTTGGCATG


190 


K 


Fgf3/Fgf4


IM000317 


GTGTATTGATATGCAAATGTGTTAAAATATGATTTA
AAATTCCCCATG


191 


D 





IM000318


GCAAAGTGTCCACACTTTGGTCTTCGTTCITCTTGA 
GTTTCATG 


192 


R 




IM000319 


ATAGCAGGTCCTGGATACCCCAACATACCAGAAAAG
CAAGATTCAGATCTAAAATCACTTCTCATG 


193 


C 


- 


IM000320 


CATGTCCTGGCTTTGTAAAGGGTCCTGCTGGGTTTA 
CTTCACTGGGTCTTAAACTCCGATTGTGAGCCGTAG 
TGAAGAGGGCTGTATATAGTGGGTCACCGAGGTCCT 
GTAGCAGAGTGGGCAAGCTCACTGCCTGCTACCAGC
AGTTCACTATGTTTTATGGTCTGCTGCCTGCTGGTG 
GTTTATAGATGCTGTGTCGTAAGAGAAAAGTTCAGG
GTAGCCTGGAGTGAATGGAGTTGGGGTATCAGGGAG 
GTCTTTGTACACTGGGGTGAGCTAGGCCTCTGGAAA 
GCTTCTGGGGGTTCCCC 


194 


D 


- 


IM000321 


CATGCTCCCAGGCACCAGGCTTGCTTTGCATAGGTG
GGACAGGGTCCCAATACTCAGCCTGGGGTGCCAATG 
ActppTPAPpppAPAPAPPPTPTTGGTAGPAPTPAPT 
GTAGTGGGGTCTGTGAGAGCCAGTAACTTGTGAGGG 
TGTGAACTTAGCTCAGGACAGAGGCCAGCAGGAAGC
TTTCCCTACAGAGAGTGTTTTCGTCTTTTCCTTTTT
CTGGTTTGTTTCTTGGGAAGGGAACAATTTTCGCTT 
TTAGTTGGCTTGTATTATTCGCTACTGAAACCTTAA
G 


195 


D 


- 


IM000322 


CATGTATTAAGTCCCTCGTGAGGAAGGGT 


196 


D




IM000323 


CATGAGTCAGAGGCTTCTACTCCAGTTAAAACTGAT
CTGGGTATAGAATTGTGTTCTCAAGAAATAGTAAGT
TATAATCAACTAAGTCATCTCCTGTCTCATTTTTTT 
CTTCCAAATCGGGTCCTCGAATTGTTATAAGAAGAT 
TCAATCAATCAACAGTATCCCTTTCCCAATTTGTGT 
GCTAAGTGGAAACAGGTCTTAGCACATCAATCACAT 
AAAGTTCAATTAAGAAGGAATTTAAAGATCAG


197 


D 




IM000324 


GCTATGAGTCTCCACTTGTAAACAATTAT 
ACTCAAACATATTCAGGACACACTTGGGC 
TTCCTCCATCAAGCCAGGCAGGTTTGTTT 
TCTTGTTTGTTTTGAGATAGATGGATGGG 
CCAGCTTCATG 


198 


C 




IM000325 


CCCACCCCTAGCAACCAGTTCCTCCTCTGAATGGAA 
GACATCTGATACCAACTTTGAGCTTTCACATG


199 


D 




IM000326 


ATCNNCGAATCATTCTAGGCTTGTGGGAC 
CATG 


200 


D 


- 


IM000327 


ACTATTCTCAACAATAAATGAACTTCTGGGGGAATC 
ACCAATCCTGATTTCAAACGGTACTGTAGAGCAATC
ATG 


201 


R 




IM000328 


CCTAGGCACCCACCACAATAGTTAATCCATCTTTGA 
ATTTTTGACCCAGTGTTGCCAAGTATTCATTGCAAC 
AGCTTTTCAAATGTTTTATTCTTTCCCAAATAAATT
CCATG 


202 


D 








IM000329 


AGAGGCTACCCCTTCAAGTGGCTTGCCTAGTATAGC 
TATTACAGACAGAGAACTTCCAGTAATTTCCTCAAG 
CCACATG 


203 


D 


- 


IM000330 


ACTCTGAACTTTGCTTTGCCTGGTATTTTTGCCTCT 
CTTATCCCATTGACCCTGTACAGAAAAGCTGAGGAA 
GCAGGTGCAACCAGGCATCTCAGGCACCCAGTTAAG 
AAGTAGATGAAATACTGTAATGTACATG


204 


D 




IM000331 


CATGATTTTCAGTTTTCTTGCCATATTCCACGTCCT 
ACAGTGGACATTTCTAAATTTTCCACCTTTTTCAGT 
TTTCCTCGCCATATTTCACGTCCTAAAGTGTGT 


205 


R 


- 


IM000332 


CATGAGACAGTCCCAGATCCCTCACCATAAAGAGCT 
ACCATATAC 


206 


D 




IM000333 


CATGCGACCATCCATCAGGAGTTGGAGGTGCCATCG 
GCTCTGCCTTACAGAAAAGGAATCTGAGATTTAGAA 
ACCCCAGGTGACCCACTCAGGGCCACCGGGGCAGTA 
AAAAGAATCTAAGATCTAAAGTCAGTGGAAACTCCT 
CCCAACCAGCAGAGACTCCTCCCAGCCAGCTCTTGA 
T 


207 


K 


Fgf3/Fgf4


IM000334 


GGGAAGCAAGAGGCAGTAAGAAAGGGGAAACTGGGG
AGGTAACCAAAGTCACATG 


208 


D 




IM000335 


CATGCTAACAAAGAATGGGGAAAGCTCTCTAGGCTT 
CCACCTTAAACAATGAGGAAGGGAAGAAGGAAAG 


209 


D 


-


IM000336 


CATGTTGGTGGGACTTTATGGGTATTGCTTCTGATA
TTACTAGGAGGCACAATCTCACAGAAAACTCCCTGA 
TCTTACAATCCTTCTGCCCCCTCTTTTGCAATGTTC
CCTGAGCCTCAAGTATGGAGTTATTTTATAGCTGTA
TTCATTGAGACCAGAATCCACAGGTATGC


210 


R 




IM000337 


CTCACACAGATATGCATG 


211 


D 




IM000338 


AGAAGTGATCTTTCTTCTGTGTGTCCCTGTCACCCT 
GGGAGGCAATCAGACGGTCCCTCATG 


212 







IM000339 


CTTTCCTTTTGTTTTGGACGAATATTATTGAAATAT 
GTAGTGTGCATG


213 


D 


- 


IM000340 


CATGAGATATGATTTTAGATCTGAATCTTGCTTTTC
AGGTGTCTTGGCATATTCAGAACTCGCTGTGGTGGG 
TGAACTGGGTTCTGATGATGCCCATTGGTGCTGGTT 
TC 


214 


B 


AI597062 


IM000341 


CATGGAAAGGTATTTGGAAATAGGCTGTTTTGTGTG
TAACTC 


215 


D 




IM000342 


CCCTAGGACTCACCTGGTAGGAAAGAAGTAATTCTT 
CCAAGTTGTCCCCTGACATCCACAAGCACATAGTGT
CAGGCATG 


216 


D 




IM000343 


CATGCCATTCATACATACTGGCAATGGATATATAGA 
AAATGAGACTCCTTCTAATATTGTGTGATGACAGAT 


217 


D 




IM000344


AGAAACCATTTACACTGCCAGGTTTGGGGCCTGCCT 
ATGCATG 


218 


D 





IM000345


GATCCCTTTAACTTCTTGGATAGTTTCTCTAGCTCC
TCCATTGGGGGCCCTGTGATCCATCCAATAGCTGAC 
TGTGAGCATCCACTTATGTGTTTGCTAGGCCCTGGC
ATAGTCTCATAAGAGACAGCTATATCAGGGTCCTTT 
CAGCAAACTCTTGCTAGTGAATGCAATGGTGTCATC 
ATTTGGAGGCTGATTATGGGATGGATCCCTGGATAT
GGCAGTCTCTAGATGGTCCATCCTTTTGTCTCAGCT
CCAAACTTTGTCTCTGTAACTCCTTCCATG 


219 


R 




IM000346 


AGGGTGGTCTCTGCAACCCAGGCTGGAACCCAGCAC 
AATAAATAGTTTTATTTACATAACCGAACGCGTGGC 
TCTCJCGGCCACATTTCGGTGCAAATTATTTACACAG 
TGATGAGGAGGCAGGACAGGAAGGGGTGGGAGGAGG
CTGAGGGAGGCATG 


220 


K 


Wnt1 


IM000347 


CATGTGTGTTCTTTTGTGATTGGGTTACCTCACTCA 
GGATGATATTTTCT 


221 


R 




IM000348 


CATGAGGCCAAGGGAGAGGCAAATTCCTGTGTGAAT
CAATTATCATCTCACAGAGAACATACC 


222 


D 




IM000349 


AGTAGTATGCCACAGGGAGAAAGGGTATTTATCAAA 
GGGACAGGAGCTAGTTGTGGTGACCTTACCTATCTG 
CTTGCCTCTGCCTCCACGGTGCTGGGATTGAAGGTG 
TGCACCACCACACCCAGCTTCAGATTTTTGTTTTTA 
TTTATTGNGTATTCCTGTTTCACCTGCATG 


223 


R 


- 


IM000350 


CATGCATATACAGGATATAACCTTTGTAAGTAAGAA
TAAAGCACATAAAAAATACTTTCAGTAATATTGTCC 
AAACCACTT 


224 


D 




IM000351 


CATGTGTGTGTTTGTGTTTGCGGAGTGTGGGGGCGG
CAGGGAAAGGTGGCCAGGCTGTCACTCAGAGATCAG
GATGACAGGCGCTCCCTCATCTAGGCGCGGGAGCTC
TGATTGCAGATTCGAGGAAACAAAATAGCAATTG


225 


K 


Fgf8 


IM000352 


CATGAAGATGAACCGGGCTTGTTTCTCTGGCAACTA
GGCTCAGAAAGGATAGGTCCACCAGCCGAGTAGCTG
TCAGATGGAGCTGAAGACCTGAGGGAAAGAATGCTT
GTGGGAAGA 


226 


K 


Wnt1


IM000353 


TCAGTTCCAAGAGATGACACAGCCGCAGT
CATG 


227 


R 




IM000354 


CAGAGACTGAAGGAAAGACCATCCAGTGACTGGCCC 
AACTTGGGATCCATCCCATTTGAAAGCATCAAATCC 
AGACACTATTACTGATACCATG 


228 


R 




IM000355 


CCCTACAGTGACACTTACTCCAATAAGGCCACACAT 
CCTAGTAGTGCCAGTCCCCATG 


229 


R 




IM000356 


GGCCTCTATTCTCGGTTCAGATTAAGTACCTGGCTT
CACTGAGAGCGGCTCTTTCATTCCTAAAATGGTTCT
CATG 


230 


D 




IM000357 


AGTAGATGGCAGAGAATAATCAAACTCAGGGCTGAA 
ATTAACCATG 


231 


R




IM000358


CCAACCCAACAGCTGGGAAGGGTTGGAAGTAGCCCC 
GAGGCTGGTTAGTCCCCTTCCAGATGGGGAGGTTAG 
ACTGGGGCTAGCCAGGCTGCTCCACATAGACTTCCG 
ATTCGCATTAGAAATGAAAAGAGGAGAGGAAAGGGA 
AAAGGAAGAAAGGCTACAAGCATG


232 


C 




IM000359 


CATGGGGTCTGGAGCCAGCTATCAAACCCAGGATTG 
TCTTAACTGTGGTGGCTTGGATGAGAATGGCCGCCA 
TAGGCGCATAGATTTGAATTCTTGGTCCCTAGTT


233 


R 




IM000360 


ACGGTGGGCTGATATTTTCTAGATCTCCTAGTGCCT
ATCCCCTATTATCATG 


234 


C 


-


IM000361 


CATGAATTTTGAGATATTCTCTGAACCAA 
ACAATATT 


235 


D 




IM000362 


GGAGAAATTATGCCTTAAATTAAAAAGCAAATATTG
AAAAATTAAATATAATTTCCATTAAATCATAATGGA 
CCAACAACAGAACACATCTATCTATGTATCTATCTA 
TGTATCTATGTATTTATCTACCTATCTATCTGAAAA
GCAAAAACTACATG 


236 


D 


- 


IM000363 


GCAAGGACAACTGACAGTTTGAAGCAACTATTTTCA 
TCTTGACTCTCACTCGGCTTTTAACGTCCATTCAGG
AAACAGGCATG 


237 


D 




IM000364 


CATGAGAAGTCACAATTCCACCACTTAAA 
ATCAGTGCTTGGAAGGATACTGTAGGCCA 
AGAGGTAAGTAGAGGGGACAGCAGTGCAC 
GTTTTTCAAAGTGTGGGTGTGTGTTTGTG 
GGTGTGTGTCTGTCTGCCTGTGCGTGTAT 
GTGGGTCAGTACAGGAAAAGC


238 


D 




IM000365 


CAAGATAAACTCTTAATGGGATTCTAGGGAGTCATT 
CTGTAGAGAGCACTTGACTAGAAGGTTAAGTCTTAG
ATCCAGATCCCAGCACAAACATAATACATCCTATAC
TCACACACACACACACACACACACACACGCAGTCCT 
CATG 


239 


D 


-


IM000366 


CATGTCTCAAAAAAAAAAAAGAATCACTTGGATTGT 
ACATAGTAGTTAATAATATGTAATTAGTCTAACTGT 
GAAGGGGCACTTATTAGTTTCTACTATGTAGTGTAA
ATGAACTATGTTGCTATTAGAAATTC


240 


D 




IM000367 


GAAGGTTGAAATCTGTAATCTATCTTCTATGGCATC 
ATTCACCTCTCTAATACAGCTGTAGAGAAAAATGTC 
TGAAGATTCGGTTCTACTCTCGTTCTTTGAGGTCTC
CCAACCCATG 


241 


D 





IM000368 


CATGGCTGGACTATAGAGCTCTAGCTTCAGTTGCTG
GGATGTTCAGTGCATCACCACAGAGAGGGTTCTTAA
GTGGTGATGGTGGTAGTGGAAAGGTGGACCCTCCAG 
ACAAAGGAAGCACTCACCACGACCCTGCTCACCTGT 
GAACCTTTCCTTTCAGACTGATTCCTGAGATCAGCC 
AGGCAGGGCTACCAACCAGGGACTCGTAATGAAAAT
TTAGGCATATGG


242 


D




IM000369


CATGGTCTGGTGAGTATGGCACCAGATAGGATGTTA 
TGCCCGTTTCTTATCTCAAGAAACAAGGAATCTTGT 
TTCTTATCATTAATAGGAAGAATAGAGCAGTCCTGG
CTAAATGAAAGGTGGNAAAGTTGGTTTGAGTATCTC 
TTTCC


243 


D 


- 


IM000370 


AAAATCCAATACACATTCATG


244 


D 




IM000371 


CCCTTTGTTGTvjW\I A AAlUJlLAlLLLlui 
TTGGGTCCTGGAACCCTCTTGCTTCCCTGGCATCTA 
GGACTTGCTAGTGGCTACCCCCAGCTCCCCATTCCC 
CATTGCTACACACCTCTGTTCAAATTCCTGACCCTC
TGTATATCATCCCAGTCTCTTCTAATACCTGACCTG
AACCCCCTTTTTCCCCTCCCTCTATTCTCTTCCTTG 
CAAGTCCCTCCCACCTTCTACCTTCCATG 


245 


R 




IM000372 


CATGGGTCATTTCTGATCTTTACCAAGCAACAGTGA 
TGAATCTATAAATAGAACCATCAGTTCAAGAAACAC
AACT T TAG ATTC CTTT C CATAC CT TG C'f rGTTTC 
TTACATCTTCCCCCrcCCCTGTC^TTTTTCTTTTAA 
TCTTGTTTTTACAATCCAAATTGTATCCCCTTCTCT 
GTC 


246 


D 




IM000373 


TTGGGCCTTTGCATACCCTGTTCTGGCTAAGACAAT 
TGTCACCTGACTGGGCATG


247 


D 




IM000374 


AAGTGGATGTTTCTCATTTTCCATG 


248 


R 


- 


IM000375 


TATAAGCAATCCCAAAAATTCTACCTGGGAACTCCT 
AGAGCTGATAACACCTTCAGTGAGCCAAGTATCTGG
fTTiTTinr* nTT& rtitph a a a a Tar* a a a a^ha^ta 

TCTCTCTTACATACAAATAACAAAAGGGCTGAAAAA
GAAATTAAGGAAATAAAACCCTTCACAATAGCCATA
AATAATATAAACTATCTTGGGATAACTCTAACCAGG 
CAAGCAAAAGACCTGTATGATCAAATCTTTGAAGAA 
GAAAATTGAAAAAGGTATCAGAGGAGGTAAAGATCT 
CCCATG 


249 


R 


- 


IM000376 


CATGGGCTCTGCTTAAGAAACCCCGGAG 


250 


C 




IM000377 


CATGCTTTTAGGCCTTTTCACGATCTTANNGGGGAC
CGNGAGAGNTNGCTGCTGGATGATCTCTGAGAGAGC
TTATCGTCCTCAAACTGCTGATATTCAAGCTGTTTC 
GCAGCTGCAGCAGCAAAGTCCCGGTCTTTGTCACCG 
ATCTGTGAACAGCAACAATGAGCACCTTTCATAACA 
GACAGGAAATGGATGCT 


251 


A 


mDall 


IM000378 


GGCGTACCTGTGTATATGCATGCATG 


252 


D 





IM000379


GTGCTAGGCTCACTCAAGATAAAATTTGC 
TATTTCAGCTCCCTGGATAATAAAATCTA 
TCCTCTCACAGCTGTGACTCTCACAGGGG 
TGCAGGCAGGACGACATCAAGAGAGTGAT 

TCTTCCGGGTCAAAGACTAGATCTAGACT 
GGTGGGGCTGTTGATTCACTATGAATGTG 
CCTGACACCATCCCACACTTAGCATCATA 
GACACTTGGGGGACTGGTGATACACTATG 
ATGCCTGACACCATCCCACACTTAACATC 
ATG 


253 


D 


- 


IM000380 


CTATCCCGAGGGTGAGGGCAGTTCTATGC 
CAAGGTTCTCATCACAGAGATACAGAGGA 

GGAGATCCTGGAGCCCTGGAGGTGGGTAG 
CAAGAACAAAGGAAGTACTTCACCTGATA 
AAAACAGTTCCCAGAGAAACACATATACG 
CTTCATATACAGGAGTGCGAGTGTGTGTG 
TGCGCGCAGAGAGGCAGAGGCCTGGAAGT 
CAAAAGTTCAGGGCCAGTTTGTGTGCATG 


254 


K 


Fgf3/Fgf4


IM000381 


GGGGTTGACTAGAAGAAGGAGGCGATTAGGGTGTAT
CATATGAGAGAAGAATAAATAAAGGAAAAAATAAAT 
TTACAAGGATTAAAAAGTAATTACATACATACATAC 
ATACATACATCCATACATACATACATACAAGTTAAA 
CTGTTATGGTAGCATG 


255 


D 




IM000382 


AGGATGATATTTTCTAGTTCCATCCATTTGCCTAAG 
AATTTCTTGAATTCATTGCTTITAATAGCTGAGTAG 
TACTCCATTTTGTAAGTATACCATATTGTCTGTATC
CATTCCTCTGTTGAAGGACATCTGGGTTCTTTCCAG 
CTTCTGGCTATTATAAATGAAGTTGCTATGAACATA 
GTGAAGCATG 


256 


R 


-


IM000383 


CATGCCTGCAGGTCACAGCCTTGCGCGCCTCCAGTG 
CCCAGCGTTCAAAGTGACACAGACTCTGTCAGGATG 

TCTAACCAAGACAGAAAGCTGCCGTCCTGTCCTTCG 
TGTCTGTCCCCATACCCCATATCGGGTAGCTTTTCT
TTCAGCATTGTCCAGACACCATCATATGCCTACATC 
GCACAAGTTCTCTGAGGCCAGATAATTGGCAGCACT
CCTGTTGTGTGCCGAGAGTGCAGAAAAGGGCTATCC
CGAAAAGGTGTGATCTGGAAAGAAGGAAAAAAC


257 


D 




IM000384 


ATCTTTTGGCCAGAGCAAGCAGGGACTGAGTGAGCA
GAGGTGACAGGAGCGAGCAAGGCTGACAAAGTCTTC 
CATATTCCTACTAGGATGACCCATTAAGCCCCATTT 
AAAGCATTCCATTGCTTTCCAAATACAAAGTCCCAA 
AATCCACATTCTTTCAAATAAAAGCATG


258 


C 
IM000385


TTAACATATGGTTTTTAAAAATCCATAATGAGCATA
TGATAGAGAAGTCATCAGAGCTCTTCAGCTCCACAT
CATCTGTCCCCAGAAGTATTACTACTCCTAACTTGC 
TGAGCCAAGGCACAGATATTCTTTGTGTAAGCATCT 
CTTCTTTATCCTGTGTTGCCACGCAGGAGCACGCAC 
ACTGCTTCCTGTCTGAGGTTGTTCCATATCAGCATG 


259 


D 


- 


IM000386 


CATGCCAGGGCTTGAATTAACACAAGTGCCCCAGAT


260 


D 




IM000387 


CCTGTCTGTATATGCACATG


261 


D 




IM000388


CATGGAAAATGAGAAACATCCACTTGACGACTTGAA 
GAATGACGAAATCACTGGAAATCGTGAAAAATGAGA 
AATGCACACTGTAGGACCTGGAATATGGCGAGAAAA
CTGAAAATCACGGAAAATGAGAAATACACACTTTAG 
TACGTGAAATATGGCGAGGAAAACTGAAAAAGGTGG


262


R




IM000389 


CATGAAGGTAAATTATGACCATCAGGGTTCAGACCT
CAGCTCGACCGGAGACCAGCCTGCAANTCCCCACAG
CCCTCCCTAAAGTGGGTTAAAAGACAGAAAAGAATT
AAATATCTGA 


263 


R 




IM000390


CATGCACTAGCAAGATTTTGCTGAAAGGACCCAGAT


264 


R 




IM000391 


GACACATACACACACATG 


265 


D 


- 


IM000392 


GTAAATGTATTAGGTTCAGAACTGGCACTGCTCACT 

rnK PI If f »^<«7i ^TV / fFWlV^iIVYlf I 1^/"^^*"PTV TV 7V TV /*»»p TV f% T\ TV /^*/^T\ J\ TV 

TATGTTCACAGTTGTTTGGGTAAAACTAGAACCAAA 
CACAAAAGCAAAAGAGCCAAGCAGCAGAGCAGGGAG
CAAGGGGCTTGGGGAAAACACTCACCTCTGTTGTGT 
CTTCTTCTAGCTGTCAGGGCATTGAGTGGCAAGGAG 
TGGAAAGGAACTTTGGGCATTCCGAGTCAGGAAAAG 
TGTACCAAAATAACACTATGGAGGTTAGCAAGTGTT 
CTAGAGGGCAGAATAAATACATG


266 


D 




IM000393 


GTTTAGGTCATTGGTGGTACACTCTCCAAGGACAGT
ATAAATTGATTTTTTTCTGTATCCTTCTTTGTTCTT 

ggccataaggcacttggagtgcattaatatgtactt
attattactatgtcttttcttgtctttggcttaaaa
gaaacagggtcaagtgaccatg


267 


C 




IM000394 


agttttctttaaaaaaataaagtaggaatgaaactg 
gaacaaaaatgcaataaattttaaaccatcaccgct 
aaaacatg 


268 


D 




IM000395 


catcattttcagttttcttgccatattccac 


269 


R 




IM000396 


GAGAGGAGCCTGGGGAAATGAAGGTCCAG 
CAACAGGCCCAAAGTGGGATCCAGCTTAA 
GGGGAGGCCCCAAGGCCTGACACTATTAC 
TGAGGCTATGGAGCACTCATAAAAATGGA 
CCCAGCATG 


270 


R 




IM000397 


CATGGCAGCCTTGGAGTATCAGGCTGCTG 
TTCCCAATGTGGGATGCAGAGGGCACTGC 
CAGCCTGGTTATCACGCACCACTGTCACA 
CAGGGAAGCGCCCCCTTCCC 


271 


D 
IM000398


GGAGTTCTTCTCTTCAATAACAGAGTAAATTCTCCC 
TCAGCAGTTCTCCCAGGAAACCCATAACCTAGCCAT 
G 


272 


D 




IM000399


CCTTAGATGTTTGTCTAATCGACAAAATACTTTATA
TGTGAAAAGGAAAGCATG 


273 


D 


- 


IM000400 


AATAATCAGATTTCCAGAGCTCCCAGGAA
CTAAACCAACAACCAACGAATACACATG 


274 


R 




IM000401 


ATCCAGTAATCATTCATCTTATTGTTTCCACACAGG
AAAACCTGTAATAGATGGTTCATCAGCTTTATTTAT 
AACTTTCTATCTTGAAAGCAACTGGAATGCCCTTCA
GTAGGTAAGCAGATACACTAGGCTCACCTCAACTAT
AGGCACAATGAAAGGAATGAAATGTCAACTCACGAA
AGGTAAGTACACATG 


275 


D 




IM000402 


CCTCGCCATATTTCACGTCCTAAAGTGTGTATTACT 
CATTTTCCGTGATTTTCAGTTTTCTCGCCATATTCC
AGGTCCTTCAGTGTTCATTTCTCATTTTTCAAGTTT
TTTAGTGATTTCGTCGTTTTTCAAGTCGTCAAGTGG 
ATGTTTCTCATTTTCCATG 


276 


R 


- 


IM000403 


CATGCAAGAACAGGACAAATGTCTGTGAAGAAAATG 
AGTGAGCGTGAACAGGAGGTCAAGGATCCGGTCCCA 
GGCAGCTCTCAGTCTGGGCAAGCATTTCTAAACTTT 
GCCTTCCTTCCTGTTGGGGGTGAAGGTCTG 


277 


K 


Fgf3/Fgf4 


IM000404 


AATAGGAGTAGATGAGAATGAAGATTTTTCAATTTA 
AAGGACCAGCAAATAGCTTCAGCAAAATTATAGAAG
AAAACTTCCCATACCTAAAGAAAGATGCCCATG 


278 


R 




IM000405 


CATGCAGCCCCATTAGTGATTGATCCTGTTCCATAT 
AA 


279 


D 




IM000406 


CATGGGCTCTCTGCTGATAATGCTGAGGC 
TGTTTGTGCTGTAGTCTGCGCTTTTTGCC 
CCCTCTCAGAAAAACTGTATGTCATAGGA 
GTTGCTGGCTATTGGGTACATAAGCAAAG 
CCACCCTATTGTGCCAGTGCCTTAGACAG 
TGAGACAAGAAAGGCCCCTGGTTAGAAAT 
CTTATCAGGACTGGGAATGTAACTCAGTT 
GATAAGAGTGCTTGCTTAGCGTGCACACA 
GCCCTGGGTTCAACCGCCTAGTACTACAG 
AAACTGAGTGTGGCTTCACACACCTGTAA 
TCCCAGCACTTGGAGAGATAGATGCAGGA 
GGATTAGAAGTTCAAGGTTATCTTTAGTC 
ACATAGTATTGGTAGCCAGCCAGCCTGGA 
ATACTTGAGATACTTACAGGAAGGAAGGA 
AGGAAGGAAAGAAGGAGGGAGAGAGGACA 
GGAGGAAGGAGATAGATATACACAGAAAG 
AGACAGAGAAACAGAGATTCAGGAGACAC 
AAAGACATACGGAGACACAGTGAGA 


280 


R 
IM000407


CATGTGGTTGCTGGGGATTGAACTCAGGACCTCTGG 
AAGAGCAGTCAATGCTCTTAACCGCTGAGCCATCTC
TCCAGCTCCCTTTTAGACTTCTTAGTAGCAGCATTA
ATTCTTGCTTGGTTTCAGTTCTGACAACCACAGCAG 
TCAGGAGTTTGAGTAAGAGG 


281 


R 




IM000408 


CCTCATAATGTTTGTTTGAGCATTTTTTT 
AAAACCTAACTTGTCTTTTGCTTATCTAT 
TGTGGTTTCTTAGTGTGTGTGTGTGTGTG 
TGTGTATGCGCGCGTGTGCTCTGGTCTTC 
GTGCACATG 


282 


D 




IM000409 


ATTTGTGACATCTTAGGAGCTTAGGTTGGTCTTCGA
GACACAGGGCTGTCCCTGTAAAGCAGGTTCCATCAG
TGACTCCAGGGTTTTAGCAGTTCAGTGGCGTAGTTT
TCAGACTGCTTAAGATTTCTCAAGGGCTAGGCGTGG 
GGCAGAGACCCTGCAGACCCTGGCTAGAACAGANGC
CCTGGGAGACAGTTGAGGGTGCTCAACTGTGGAGGA 
CATG 


283 


K 


Fgf3/Fgf4


IM000410 


CATGTATGCACAACCAAAACTTATAAATATGAGAAT
TCACTTATAGTCCTAGTCCTTTAATACAGAATTTAG 
CATTCCGATATAAAACAACAGATTAAACCCCAACAG
TTAGAATAGAGCAG 


284 


D 


- 


IM000411 


AATAGGAGTAGATGAGAATGAAGATTTTCAACTTAA
AGGGCCAGCAAATATCTTCAACAAAATAATAGAAGA
AAACTTCCCCAACCTAAAGAAAGAGATGCCCATG 


285 


R 


- 


IM000412 


CATGCACACCCTACTCCTGGGTGATCGTACCAGCTC 
CAGCCTCTGTTCTGCACGCTGTGCCTTCAACCTGGC 
AACCTCC 


286 


K 


Wnt1 


IM000413 


CATGAAAACCTGTCTCAGAAAACAAAAACACGTTGA 
GAGCCAGCATAGAAGCCATAGGAGGTAATGTGTGTG
TGTCTGTATATATGACAAGAGCAGACCTGTGCTGAA 
CCAGTTAACTACTTTTG 


287 


D 




IM000414 


CATGCTACTAACCAGTTGAGGCAGTACCAGTTGTTG 
AAGATGCTGTCTTTTATCCAATGGATGGTTTTAGCT 
CCTTTGTCAAAGATCAGGTGATCATAGGGTGTGAGT 
TTATTTCTGGGTCTTCAGTTATATTCCATTGATCTA 
CTGGCCTGTAATTGTACCAATAC


288 


R 


- 


IM000415 


GGTTAGGAATTCTGGACAGTTGGTACTTGGTTTGAA 
TATAGTAGGTGACAAGCTGTGCCTTGAGTGGGGTGG 
CAAGCAGGGTTCTCTGCAGCAGGATGCAGTGTACAT 
G 


289 


C 




IM000416 


CATGAAAATGTTAAGTCCTGACAGACAGGGTGCCAT
CTGCCAAGAATTTGAGTAATCTAGAAACAGAAAT


290 


D 




IM000417 


CATGGGGTTTTGTGGATCTG 


291 


D 
IM000418


ACATAACTGCTGTTGGCTTCTTTGTGTACATTTTAA 
ACOTTC CTC TG AAAG AGTG AC CAATG CTTTT AACTG 
CTGAGTTATCTCACCCX3ACrTACTTTCTCTCTCTCT 
CTCTCTTTTCCTTCTTCCTAAAATTAATTGTGTGTG 
TATGTGTGTGTGTGTGTATGATTCAGAAACCTTTTA 
TGTGGTGGTAGAAGACCATCTGCAGGATTCATG 


292 


D 


- 


IM000419 


CATGGTCCCACAAGCCTAGAATGATTCGT 
GGAT


293 


D 




IM000420 


GGGGTCCAGGAGAGAAACTTGAGTCATG 


294 


D 




IM000421 


GGAAAGAGATACTCAAGACCAACTTTACCACCTTTC 
ATTTAGCCAGGACTGCTCTATTCTTCCTATTACTGC
TAAGAAACAAGATTCCTTGTTTCTTGAGATAAGAAA 
CGGGCATAACATCCTATCTGGTGCCATACTCACCAG
ACCATG 


295 


D 


- 


IM000422 


GTCCTTCCCAAAGAATAGTGTTAACTGAGCTCTTTG 
GGTGGCAATAAATGAATTGCTCTGGTGGGACAGGCA 
GTGCACATATGGGGAGGGGGAGACACATG 


296 


D 




IM000423 


CATGTTCTTACTTCTTGTTG 


297 


D 


- 


IM000424 


GGGTATATGAATTATATATATATGTGTGTATATATG 
TATACAGGCATG 


298 


D 




IM000425 


CATGCGCCCTAAGACTCATCTCCACGAATGACGTGA 
CGACCTAATTGCATTCCTTCTAACCCACTGATTAGG 
CAAACCACCCTCCAAAGGGCTCGCTGAGTTCCTCTT 
CGGGAAGAGGTGTGTTGAGTACGCTGGAATGGATAT
TCGAGGGCTGAGG 


299 


R 


- 


IM000426 


CATCTCTCGAGCCCTTGCCCAGCCTTTTTTCTTAAA 
ATTGTATTTTTAAAATTTATTTTCTGTACACAGGTG 
TGTGAGTGTGAACATG 


300 


D 






CATGTGGACCTGGGGGCTAAGTCAGGGTGAAGCTTC 
CACAGCTAAGTGGCTGGAGGCTGCCCTAAAAGCTCA 
GGAGGCACCGCAAGCAAGCCTTGAAAAACCTTACCC 
ACCAGCTTGACCTTAGACTTCTGGCCTTCAGGCTGT 
GACAATACATTCCTGCTGTTTAAAGAACCATATGGT








IM000427 


GGTGTTTTTTGTTTGCGGGGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTGTGTGTTGCAGTGCTAGAG 
ATAAGATCTGA 


301 


K 


Fgf3/Fgf4 


IM000428 


GTCTAAAGTTTTCAAATGATGGATAAGTTGTTAAAC 
CTCCTTTAAGATCTCAAGCACAAAAAGAAAGACATC 
AAATACGAATAGTAGAAAGGAAAGGAGATTTGGAAC 
TAGAGGCCCCAAGAGTCATAAAGAGAAGAATTTAAA
CAACTGTACCCACAAATTCATTAGCATAGATCAAGT
AGTCCATTTCTTCATG 


302 


C 




IM000429 


CATGTATGTTCTCGATGCCTTGGCCTG 


303


D 
IM000430


AAAGACATTAACTCTTGAGAACCAAGGGGTAGGACA 
GTATAGACTGAATTTTGCCTCCCCTCTTCATAAGTT 
GTC^CTGCTAACCTCATTTCAGAACTTAAGCATATA 
ACCTTCATG 


304


D 




IM000431 


CATGGAGAACTAGCAAGAGCAGGATGGCGTTTCTCT 
AGAATGCCGTATAG


305 


D 




IM000432 


CATGGTGACTTTCCATCTTTAGAACCATAATCANGT 
TTAAT 


306 


D 


- 


IM000433 


CATGCTTATATCCCTCAAAAATTTTACAGTTAAACT 
GAAAATGCTTACTTACTTTTTTTTCTTACTTATATC
TAGTATCGATAAGAACTGTCCCAAAGGACAC


307 


0 




IM000434


CTGGGTCTTAGTCCTCTGAGGTCCCTAGCACATCAG 
AGGTTCATCAGTTCCAAGAGATGACACAGCCGCAGT 
CATG 


308 


K 


Fgf3/Fgf4


IM000435 


CATGGAGAATGCACAGTCAAAACGCTTGCATCCT


309 


D 




IM000436 


CACCCCCTCCCGCCTTACATCAATCCTGG 
GTGCACAATGGGACTGTGGATGACTGATG 
TCTGCGCAAACAACTTGCGGGGAAGTCTA 
GCTGACAAACGCTCATG 


310 


K 


Fgf3/Fgf4 


IM000437 


ATGTATCCAATGGCAAAGCACGGGGGAGGCTTCATC
TTGAAGAGAAGAGTGCTCTTGGTAGGCTATCCTTTT 
TTTGAGACAACTAGAAATAGGAGCATTTCAACAATC
TGGACATATGTCCTCCCACAAGAACTTGTTGAGAAT 
GGGTCTGAATTAACTGGAAATAAAAGTGAACACATT 
CTCCTATACACATG 


311 


D 


- 


IM000438 


TCACTCCATTTTAGTTCAAATGCTACAAC 
TCCTTTGAGCACCACTGTCATTTCAAGAC 
CTTATTCTGTGAATACCATG 


312 


C 


- 


IM000439 


CATGCTTAGCCCAGGGAATGACACTATTCGAGGTGT 
GGCCTTATTGGAGCAGGTGTGGCCTTGTTGGAAGAA
GTGTGTCACTCACTGTTGGGGTGGGATTTGAGAGCT 
TCCTCCTAGCTGCTTGAGGATGCCGGTCTT


313 


R 




IM000440 


CATGAGCTGGGTGAACGACAGCAAAGGTTTGTTTCT 
CTTTTAAGGAAGACAATGGTGTGAAATTGGTTGATC 
CTTTGGGGGAAATGTTGGCCCCTT 


314 


B 


Mm.20245



IM000441 


CATGATCTCACTGTGAGGGCTGGCTACCTTGGAGCT 
CACTGTACTGAAATATTCTGGCCGATTGCCTCTTCG
CTGGGTTTATGGGCACACACAGTACTTGTCTATGAG 
TCTTTGTTAGGCTGAGCCTAGTGGTGCAGGCCTGTC
ATCTCCCCTACTTTACTTTAGGCTCTGAGGCAGGAG

uAi 


315 


D 




IM000442 


TCTGGTAACTTGGGGGTCTGATAAAACAG 
TTGGGGGATTTCTTTTCTTTTCGCGTCTG 
AAGCCAATGTTATTACAGGTGTGTGCTTG 
TCTCTCCCACACCCTGCCCCTGTTGCCTA 
ACACACGCGGCACACACATG 


316 


D 





IM000443


CATGACTCTTCCTCCAGAGTTAGAGGTGGAGCCAGG
ACAAACTCTAAAGAAAAGAAACCCCAATCAAAAAGG 
GAAGCTGGTATCATCCAACCTTTAAATTACTCCACA
TCCCTCCAGAG 


317 


D 




IM000444 


CATGTCTGTCCCAAAAGGAAGTTCCTTCCTCTGTCC 
TCCACATCTGACCAGCACCATCATTCAATCTGCAAC 
CCAAACCAGACATTTACATCATCTATGCCTCCT'XTC 
CTCCTTGTCTCCCCTCAACC^GCACCCAGCAAGCTT 
TCAGGTATCCCCTTAGTGTTGTCAGGATCTCTCCAG
TTCTCCAGACCCCAATTCTGTTCTCACTCTACACTG 
CTAGC 


318 



D 




IM000445 


AAAGCTAACTTCTCATCACCTACCTAATAGCCTGAG
AGCCCTGTGTAGAAAAATTAAGGAGTTTAGTTCCTT 
CATG 


319 


C 




IM000446 


CATGCAGACAAAGTAAATAAGAAAACAAATTAAATG 
TAGGCTGGACGGATAGATGGT


320 


D 




IM000447 


CTCAGCTCCTAGGCAACACTTGTAGACCCACAGCCC
CTTC^C^C^CACACACACACACACACACACACACAC 
ACACACGGCTGGGGATCCAACCCATCTCGTCCTTAC 
ACGTGCTCTACCATCACACCACACATTTCCAGCACT 
TTTATCTGAAGTGTTTCCTTTTATTTGTGCATG 


321 


K 


Wnt1 


IM000448 


CATAACCACTATAACCAGCCTGCTTACTTGGCTTTG 
TTTCGAGGGCTTTTGTTTTAGAGCTCTTTCTTTTTA
CCCTTCTCCGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTGTCTGTCTGTCTGTCTGTCTGTCTG 
TCTTAGTGTTTGTACATG 


322 


C 




IM000449 


CATGTGGTCCACGGTTTTACTTTACTAGGGAGCAAC
CTGTACCACAGGGAGAGAGGCCTAAGGACAGGAAAG
GAGCTGACCCAGAACTGAAAAGGCACACACCATTCT 
GCCAGCACTTCCC 


323 


C 




IM000450 


CATGTCCTACAGTGGACATTTCTAAATTTCCCTTCT 
TTTTCAGTTTTCCTCGCCATATTTCACGTCCTAAAG 
TGTGTATCTCTCATTTTCCGTTATTTTCAGGTATCT 
CGCCATATTCCAGTTCCTACAGTGTGCATTTCTCAT 
TCTTCACGTTTTTCAGTGATTTCGTCATTTATCAAG 
TCGTCAAGTGAATTTTTTTCATTTTCTCTGATTTTC 
AGTTTTCTCGCC 


324 


R 




IM000451 


CATGTTGCCTCAAGACAGATCTCCACTTTAAAGACA 
TACCTAAAGGCCTGGAAGCTTAGTCAATTAAGCTTT
CCTGCCCAGACACTCCTCCCTGAAAAAGGTATTTAA 
CCTCAGGCCCACCCTGAGAAGTGGGGTATGATTTTA 
CTCATCCACTTTC 


325 


R 


- 


IM000452 


CATGGTTTCTATTACTGTGTTGAAGCACCL TGACCA 
AAGCCAATTGGGGGACGAAAGGGTTTATTTGGCTTA 
AACTTCCAAATCAGTGTTTATCATTAAAGGAAGTCA
GGGTAG 


326 


R 




IM000453


GCAAGTGTCAGACGGCTCTCAGGGAGATACACATAG 


327 


L/ 
IM000454


CATGTACCTATGTGTGTGTAACATTTGCCTATTTTC
ACACAGTTAAGAAAGCATCGTTATGAAAATCATTAC 
AACTTTCCAGATAAACAGATCCACTCAGCCACAGAT


328 


D 




IM000455 


GCCCTTCTCTCTGAACTTTTCAGTTCCrGGATAAAG 
TCAGTGTTCCACCTCTATACCTGACTAGTTTTCCTA 
AATTCTGAGTCAAGCATATTTCATG


329 


D 


- 


IM000456 


GACCTCGTGGGCGGGCCTGAGGAGACAGTGCAGATG 
AGGTGTCAGTAAGGAGGATGCAAGCAAGAAAGATGC
AGGAGATGATGGAGAAGCTGAAGAAGGCACTGAAGA
AGGCACAGGGAAGAAGAGTGCATG


330 


D 




IM000457 


CTTGCCGTTGAGAGCGTCCAGATCCCCTGACTTGAG
TGGGTCCACCTTGTTTGGTTTGGTTCGCAGTGTCGG 
CTGTGGAGCCCCAGGCCTTGCATG


331 


C 




IM000458 


TTCTTATCCACTGAGCCACACTGCTAATACTGTGAT 
GTCTTTTTTAAGACTCACCATG 


332 


D 


— 


IM000459 


GGGTTCAACACATTTTTGGAGATTGATCAAAATTAA
AACATG 


333 


D 




IM000460 


CATGAAGGAGAGTCTGAGGCTACATCCACCAGGCTC 
TATGATCTCCCTCTGCTGCATCCAGGACATTCTCCT 
TCTGGATGAAGATGATGCTGGCGCTGGCGCTGGCGC
TGACGCTGATGCTGCTCGCTTCTGCGTCCT 


334 


C 


- 


IM000461 


CCTTGTCCTCAAATTACAAAACTCCCTAG 
GGTCTTTTCTCTGGGCTACAAAATTCTGC 
AAATGGACTCAGGAGGAATCAATGTGGAA
ATTTCACTTTGCCTTCCCAATCAGCAAAA 
TAATGTTTGCCAAAATCGTTAGATTTCTT 
TCCCCTAAGTAGGCTACTGCCGACTTGAA 
AGCAGTGGTTCCAGAACCCGAGCCCAGGG 
GCTGCCACTTCCTATGCATG 


335 


B 


AI426908 


IM000462


CCCTTGTCCTCAAATTACAAACTTCCTTA 
GGGT TTTTTTTTTGGCTNCAAAATTTTNC 
AAAGGGCTTCAGGAGGAATAATGGTGGGA 
AATTTACTTTTGCTTTCCAATCAACAAAA 
AAATGGTTGGCCAAATCGGTAGAATTCTT 
TCCCTAAATAAGCTACTGCCGACTTGAAA 
GCAGTGGGTTCAGAACCCGACCCAAGGGC 
TGCCCTTTCTATGCATG 


336 


D 




IM000463 


CATGTATCTTAAGAACAGAGCCAGTGCTCTCCCTCT 
CCCACTTGAT 


337 


D 




IM000464 


CATGCAGANTAAAGTACATATATGTAAAAAATAAAA
ATAAATCTTT 


338 


D 








IM000465 


GTGCTCTCCCTTGCCTCTCCTCTCCTGAG 
TTTCTCTGTAGGTGTAAGGGCTGGAGGTG 
GGCCCAAGAACCAGAGATCAGAGGAGGGA 
ACTTCCGGAGCAGAGGCCCTGGGAGCAGT 
GTTAAGCAGGCTTTGGCCAGGTCTGGAGG 
TGTCCAGGCAGGGAGGTGGAGCTGGAAGA 
GACCAATTAGTCAAACGGCTGCAATTGGC 
CATTTGGAAGCAATTAACAGGGTCTCCAT 
TACCATATTATGCCCCTCCACCCCCTCCA 
CACTCTACTAGGCTCTGCTCTGTATGGAA 
GGGGGAAGGTGGAGGCTCANCTCAAGCCA 
GGGAGACTACAATGGAGGCCCAGTGCTCG 

TGTGAGGAGGGGAGGGCAGGGCAGCATCT 
GAAGCAACCTGTCATTCACAGCCTGANAG 
ANGGTGGGAACAANGGCTTNCAAAGCCAA 
GAANGCANGTGGNTAGAAATGCANGAAAA 
CCTCTCTGGTAAGAAAGGCTGAANGAAGC 
AGCTAGGGTTGTAAAACAAGANCAT 


339 


K 


Fgf3/Fgf4


IM000466 


ACAACTGCAGGGAATCAAGGAAGAGCCTTTTCCTGA 
ACTGTCCTGGATGCCCCAGTCCAACAGCAACTCCCA 
CTTGCCCTGGCTTGGTTTGCTCCACTGTCCTGAAGG 
CACAGTGTGATATCCCAGACCTCCAGCGAGACAGCC 
CAACCTGCAAGCCCTGATGGGAGGGGTGGCCTGAGA 
CAACAGTACCTACATG 


340 


B 


AI550057 


IM000467 


CATGGACTCCAGGGTCAGGGTGTAAGAAAAAGGTGG
AGCCTGCTAGGTGTGGTGACACACACCTTTAACCCC 
AGAACTCAGAAAGCTGAGGCAGGTGACTAGCCAGGA 
GTTCAAGGTCATCTAGTTCATCAGATCTATAGAGTG
AAACAGCCAGGCTACATTTGAGATC 


341 


K 


Fgf3/Fgf4


IM000468 


GCTCAACACTTAAAAGCGCCTGCAGAGGGGTGGGGG
TTTAATTCCCAGCACACACATAGTGGCTCAGGGAAT 
CTGAAGCCCTCTTCTGGCCACTGCGTGAACTGCATG 


342 


D 




IM000469 


rTnnr aar rrATArnAA a^Tia a apap APTfT A AG A 
AAGAGAACAGGCTGCCTGGGAGAGGGAGGTGCCAGG
GGCTTAGACAGGAAGGTAGTTTTCAAAAACTGAAAA
CTTAAGCTATCTGAATGAATGATACAAAATAAAAGA
AGACACAAGAATTTCCAGTCACCTGAGATATCTCAC
ACTCCTGTTCTTTCAACCTTCTAGCTGAAAGGAGAA 
AGAGCCATG 


343 


D 




IM000470 


CATGGAAGGAGTTACAGAGACAATGTTTGGAGCTGA
GACGAAAGGATGGACCATCTAGAGACTGCCATATCC 
AGGGATCCATCTTATAATCAGCCTCCAAACCCTGAC 
ACCATTGCATACACCAGCAAGATTTTGCTGAAAGGA 
CCCTGATATAGCTGTCTCTTGTGAGGCTATGCTGGG 
GCCTAGCAAACACAGT 


344 


R 
IM000471


CATGCTTAGATTGACCGCAATATGTGTGGTACTCTT
CAGACTTTTAAAGATTTGCTGAATATCCTATTCCCC 
TTAAATTGTGATCACCCTAGCTAGATCTAATCTTAG 
ATCTCGAAAGTTCTACAATTTGCCTCAATTTGATTA
CTGTTTTCCTCCTTGAAGAC 


345 


D 




IM000472 


CTTGCCTTGGGAAGTGAGGGGTTCTAATGAAGGTTG
CAAGCCTGTCCACCCAGGGCCCTGCTAAAGAAGGAA
TGGTCCCCAGCCTGTTTTGTCCCCTCTGTGGCTTCT 
TAGTTCTGGACACTGAGCCAGTCTGGGCAGCAGGCA
ATTCACACTGTGAATTTCTGTGGAAAGCATTTTGGG
GGTTCTGAAAGCCCTGTACATTCTGTGTTAAGGACA
GAGGGCCTCCTGCATG 


346 


K 


Fgf3/Fgf4


IM000473 


CATGGGGGCTATGTCCTAGGGTAGACACCCCCTTTA 
TrrpT r acctcctt c cctgtcttagcagtggtgtcc 
CCCACTGTGACTCTACTGCATCTGGGAGCTGTCTCC 
CGGGGGACTTCCTCCTGCTGGAGTGAGTAGGTGGCT
AGGGCGAAGCCTGTGTAAGAGGCAGGAGGTGTTTTG 
CACAACTCCAAAGGGTGCAGATCCTGCTGGCTCCAG 
CTTCCCAGGGCCAGACCCCCAAATACCCTTCACCCA 
GC 


347 


K 


Fgf3/Fgf4


IM000474 


GTGTATGTTCTCTGGTGAAAGTGTTAACCAGCTCAC
TCCGTGAAGAGCACGCTGCTTTCAGATCAGTGTTCA
GAGTCTTGAATAATTGGTTTTTAGAATCATAAAATT 
GCAGTCCTTTACAAAGGACTGGAAGTGACTCATG


348 


D 




IM000475 


CATGTGAATTCTCTATTTGCAATGTGCTTGGTTCAT 
ACTTCCATACTCTACCCAGAGCCTGTTAGAAAAATC
ACTCTTCCCCACCCTATTCTTCACCAGTCAATATGT 
ATCTAGTATTCTAAACTTCCTCCCTCCTAAGGCAGT 
GGGGAAG 


349 


D 


- 


IM000476 


CATGTGTACTCTCACCATCAGAATTATGAGCAACCC 
ACAATTTCTTCACATTTATAACTGACCCAGTCTGAG
GTATTGTGCCTTTAGCAACAGAAACTGAACTCAAAA 
CAATCGGCACAC


350 


C 


- 


IM000477 


CCATATCAGACCAACCTTCCCACACAACAGTAGGCC 
ACCAGGTGGGGGCAAAGTCCTGGGTAAGGTTCTTGG
CACTGTAATTTTGAATCCCAATAATAATGACTGTGT
TATTTGCTCATG


351 


D 




IM000478 


TAAAACCTTTAGGGAGCTGATAAAAATCTATCAAAA 
CAACACTCTGTCTCTCGTATCCAGCCATCCATG 


352 


C 


— 


IM000479 


TCTGCCCAGCCTTTGCTTCCTCCCTGGTAACAGGAT
GCTAATTAGAATTCATG


353 


B 


AA117784


IM000480 


CATGTAAAAAAAAACTTCATTAACAACTA
CAACAAAGCAGAGACCTTGGCCCTTGGAT 
TGGGGCCCCTCTGAGAGCTATAGGCTGGG
ATACTGG 


354 


D 




IM000481


GTGCX5TG ATAACCAGGCTGG CAGTGCCCT CTGCATC 
CCACATTGGGAACAGCAGCCTGATACTCCAAGGCTG 
CCATG 


355


r\ 
IM000482


ATGTCAACATTGAGTCCAGTAAGGACATCGTATATG 
CTGGTCATTATTATAGCTCTAAGGGTTCATACATGA
GACAGACCACCCCCTTACCCCCTCCCCCGTCTGGGC 
TAAAAGCAGACACACTGGGTTGGTGAGAGAGCAGCA
G 


356 


K 


Wnt1 


IM000483 


CATGAGACAGACCACCCCCTTACCCCCTCCCCCGTC 
TGGGCTAAAAGCAGACACACTGGGTTGGTGAGAGAG
CAGCAG


357 


K


Wnt1 


IM000484 


CATGAGAAAAATTTGTCTCTAATTCTCTTTGTTGAA 
TTTTTGTGTGGTTTTGATATCAGGTGATTGTGGCCT
CATACAATGAATGTGG 


358 


R 




IM000485 


CCAGTGAAGTAAACCCAGCAGGACCCTTTACAAAGC
CAGGACATG 


359 


D 




IM000486 


Tpnnnnn a. a AfSTT attttt ataccttctcg rTfTGG 

1 vVA^uuunnnO l Xxxx x X X inin^i iu\.\.u^iv.iu\J 

ATTAAGGGAGGGTAGGAAAGGATTGGATGAAGCTAG
AGACAGAGTGGCAGGAAGGTGGTAGACCTGAAATTG 
TCAGACAACCACTTATCGTTGGGAAGGGTATAAGGT 
GACCACAGCACTAGCAGACTGTTCTGGACGTAGTAA 
GGAGTTCCTGCAGGGGAGGAGTGGGTCAGCCTTTGA 
ATCCCATATGGTGGTTCACAAGTCAGCCTACATG 


340 


D 




IM000487 


CATGTGTTTTTAGCAACTGTGCTCATTTTCTGCTGC
TGCTAGGAATAAAATCAAATCTAGTANAATTGCTTT
AATACAAAGTTATTGTCATCCATCTCTGAAGATCTG
AAGTATTGCTGGGGGdTCTCCAACTCACCCACC 


341 


D




IM000488 


CAAGGGCCTCTCCTCCCACTGATGGTCGACCAGGCC 
ATCCTCTGCTACATATGCAGCTAGAGACACAGCTCT 
GGGGGGGGGGTACTGGTTAGTTCATATTGTTGTCCC 
TCCTATAGGGTTGCAGACCACTTTAGGTCCCTGGGT
ACTTTCTCTAGCTCCTTCATTAGGGGCCCTGTGTTC
CATCCAATAGATGACTGTGAGCTTCTTATAAGCATA 
AACTTTCACTTACCACATG


342 


R 


- 


IM000489 


CATGGTGTTAGCCTCCAGGCAGGAAGCATACCAGAG 
GAGAACTCCACAGGGAAGCCTTTGTTTTCTGCTGTT 
AAAAACAAAGTATGATGGGGCTTAGAAGAGGCTTTA
AGAGGTCCTCTGGAGAAAAGAATCTATTTTCCATT


343 


D 


- 


IM000490 


CATGAGAGGTTTTTAAGTCCTGAAAGACCATCATAC
CTAGAGTCTATACAACAAATAAACTTGGAATACAGT
GAAGCTAGTAAAAATAACTTCCTGAGCTTATGG


344 


D 




IM000491 


CACAGTCAGGAAGCAGAAAGATGAACGTTGACTCTC
AGCTCTCCTTCTCCCTTTAGTTCTATGGAGGTCTCC 
AGCCCATG 


345 


K 


Fgf3/Fgf4


IM000492 


CATGATAAAAGTCTTGGAAAGATCAAGAATTCAAGG
CCCATAAATAAACATAGTACAAGCAATATACAGCAA
ACACAGTAGCCAACATCAAACTAAATAGAGAGAAAC 
TTGAAACAATCCCACTAAAATCAGGGACTAGACAAA
GTTGCCCACTCTCTCTTTAACTGTTCAATAGAGTAC 
TCAAAATCCTAGC 


346 


R 
IM000493


CATGGTAGCTTTCTAGTGAGGTCTCTTCC 


347 


D 




IM000494 


AGTACCCTTAGCCAATAAACCATCCCTCTAGTCCCT
GTTTGTTTTGTTTTTTTTTTAAAGACAGGGTCTCAC 
CATG 


348 


K 


Fgf3/Fgf4


IM000495 


CATGAGCTAGGCCATCTGCAAGCTGGTCTCGTCTTG 
ACCAGGAGTACACAGAAGCCTGGCTCAGGACTTGGT
AAC 


349 


D 




IM000496 


GTTGTTTATGCAGATCTCTCAGCGTTAGCATTCTAT 
GGGATTCTTTGGAAAGACCTTTTCAGTTATCTTCCA 
TTTCTGAGGCTGTTTCTAGGCAACGGAGTGGTACCT
TCCTTTAATCTTCCCCTGACCTTTTCTGCCTATGAA 
GATGTTGACTAGTGAGCCCGTGGGGATGTGTATTAT
CTGTTACATTTATTTATGGCTTGGTAGCGACTCCTT 
GGTTGTTGTTCAGCTTTTCATG 


350 


D 




IM000497 


CATGCCTCCCTCAGCCTCCTCCCACCCCT 
TCCTGTCCTGCCTCCTCATCACTGTGTAA 
ATAATTTGCACCGAAATGTGGCCGCAGAG 
CCACGCGTTCGGTTATGTAAATAAAACTA 
TTTATTGTGCTGGGTTC 


351 


K 


Wnt1 


IM000498


TCTAAGTCCAGTCTTTCACACACACTGACTTTGGTC 
ATCTGTAATCACAACATG 


352 


D 


- 


IM000499 


CATGCACACAAACTGGCCCTGAACTTTTGACTTCCA 
GGCCTCTGCCTCTCTGCGCGCACACACACACTCGCA 
CTCCTGTATATGAAGCGTATATGTGTTTCTCTGGGA 
ACTGTTTTTATCAGGTGAAG 


353 


K 


Fgf3/Fgf4


IM000500 


GGGCTGAAGGAAAATGTTGTGTCATCTTT 
TGTGGCATG 


354 


D 


-


IM000501 


CATGTACCACTTTTGCTAATCCCCTAACC 
GCCCCTTGGTAAGCATCTAAAGTGATATA 
TCTCTTGGTCTACTGAAGTTCTGCCCTGT 
CTCCATCGGGGATTCTCGGGAGGCTAAAA 
TTATAGACTATTTGTGAAAG 


355 


D 




IM000502 


CATGTCCTTATGATATGGAAAAA


356 


D 


- 


IM000503 


CATGTGCCAAGAGCCATTACAGGCTCAGA
CTAACATCTGCCTGTAAACAACGGTTGCT 
AAGTTTCCAGGGAAGCGTAAG


357 


D 




IM000504 


CCAGATGACCTTGAACTCAGAGATCTCCTTGCCTTA
GCCTCCTGGGATTCATAGCCGCTATGCCTCAAGATC
TCCATG 


358 


R 




IM000505 


CATGTAGTTTGCAAACAAGACATCCCTGGTATATCC 
AGAACCTGAGCTATGC 


359 


D 




IM000506 


GGATATAGTGTCAAACAGTCTGATGTATTCATAGGT 
TTGTATCCATAGTTATCAAATCTCTCATG 


360 


D 
IM000507


CATGTACCACACACAGACTTGGTAATAAGTTAGATG 
ATAATTACAAAAGCAACAAATAAAACCAACAAAACA
AAACAAAGCTTGGTAATA 


361 


D 


— 


IM000508 


GTTAGGAGCACGAACTGCTCTTTCAGAGGACCTGGG 
TTTAATTCCCAACACTCACATG 


362 


R 


- 


IM000509 


CATGGTCAATGATAAACATTCCAAAACACCAAAACC
ATCCTCTCTGTACAGGCTATGATGATTCAACTGCTG
CCCTTCCTCATTTCTTGTTCCCAACTCCTACTGAAT 
ATTTCCTGCAT 


363 


D 


- 


IM000510 


CATGATAGAAGACCACGTCTGGGATGGGGTAAGGGT 
TTCTCAGAGTACCTTGCCCTGGGGCCACATCCTAAA
TCTACAACAAAGCTGACCCTA


364 


D 


— 


IM000511 


CAAGTTTTTGTAAGGGAGCTAAGAAAGGCATTGTTG
GTTAGGTTGGAAAGAGGGGGCAGGACCTGGCTCTCG 
CTTCAGCCCACTCCCCTCTGCCCCCCAGCCTCAAAC 
ACTTTTACCCTAGCATAGCAGAAACATG


365 


D 


— 


IM000512 


CATGAACTCAGTGGGCAGATGAAGAGTTTTTGTGTG
AACTGGGGCTTTGCCCTTATCATCCTGTGTGTTCTC 
CTGGTGACCCTCAAGCTTGGCTGCAATGATCCCCAC
TTACAGAT 


366 


K 


Fgf3/Fgf4 


IM000513 


GTTTATTACTCCAATGATTCGCACAGCCGGGTTGCA
AGTCTAAGGCAGGCTGTCTGCCTTCCTGGAGGTACT 
TACCCCACCTCCCCCTCTGGGGGAGCTCCACTTGGC 
CATG 


367 


R 




IM000514 


CATGATTTTCAGTTTTCTTGCCATATTCCACGTTCT 
ACAGTAGACATTTCTAAATTTTCCAACTTTTTCAGT 
TTTCCTCGCCATATTTCACGTCCTAAAGTGTGAATT
TCTCATTTTCCGTGATTTTCAGTTTTCTCGCCATAT
TCCAGGTC 


368 


R 




IM000515 


GTAACCACTCATTTACCTGCCCCAATGATGTCTGGG
CCAAGGCACrTTTAAATTCATATCTACTGTGACTAT 
AGGTGCCCATG 


369 


D 


— 


IM000516 


CATGACACTGCTCACTGTTGCTCTCTAACCTTGGTC 
CAG 


370


D 




IM000517 


GNGCTTGGCAGAGTAGAGAAACTCTTTGGGAAACTT 
GGTTCAGATCCAGACATG 


371 


C 


- 


IM000518 


CACCTCTGCCTCAGTTTCCCTGATTATCA 
ACAAGTGCTCATG 


372 


D 




IM000519 


CATGTAACTCAAGAAAGTCTAGTAGGCGTAGTGGTA 
AATGCCTTTGATCCCAGCACTTGGGAGGTAGAGGCA 

ATATAGTGAGTTCCAGGCCAACCTTCACATTGAAAT
TCATCTCAAAACAATAAAAATAGAGGAAGATATAGT
CAGGCAC 


373 


R




IM000520


GAAGACATTCATTTTTTTCTTGGGAGGGGATAGAAT

CCAAGGCTCCAAAGCAGAGTTCATC


374 


n 
IM000521


GACCACGCTGGCCTCGAACTCAGAAATCTGCCTGCC 
TCTGCCTCCCAAGTGCTGGGATTAAAGGCTGTGCCA 
CCACTGTGCTTACTGATCTCTTTGATGTCCCAGTTA 
TAGCTCTTGGGTTCCCCACCCATTTGTAGGGGGACC 
CAGGACACCTCAGAGCTCTCCCAAGTCTAAAAAGGG
CAGGGTTCCTGGCTCCCTTAATGCCTTATCAAGCAC 
AACAGAACTCAGGGGCAGAAAATGTTCCCAGGAAGA
ACTTAGCTGTGGGGAGAGTCATG


375 


R 


- 


IM000522 


CATTTTTCTTTATAGCTGAGTGTTATTCCACTGCAA
AAATTTGAATATTCCACTATTCTGTTGATGAATGTC 
TAGGCTGGTCACGTTCTCTTGCCTTTGTGAATGGAG 
CAGCAATAAACATAAGTGGGCATG 


376 


D 




IM000523 


CTCCATTGGGCCGAGTGAAGCTGTGGTTCAGAGAAA
CTCTATGGACAAGCTTGACTTCCAGAACATTGACCT 
GGTCTCTGAGATCAACAAGCGTAGGAAAGCCATG 


377 


0 


- 


IM000524 


CATGGGAAAGTAATCCGTGGCTAACACAAAGGGGAA
ATAAAGTAATATT 


378 


D 




IM000525 


CATGTAGGACCCTGAATGCCAGCAATGAACAATACC
AGCTTGGTTTTCCGACTCTTGCTTTCTCCTCCCTCC 
ACTACTAACTAGCCTCACCGTTGCATCTTGTGACTC 
AGAGGTCTTGTTTCCAGGGCTTCCTTCCTTCCAGTG
TTCTTCTAATGCATCTAAAGTGAAGGGGTGG 


379 


D 




IM000526 


CATGCAAAGCCTCTGCAGGGCCGACAGCAAGGAAGG
CCCTTCTAGATCTCCAGCACTCTGTCAAAAGCCATC 
ACTCGGCAGGCAGGCAACCACAATGTAGGGAAGACC 
TGTAAAGCCTTCAGAGAGGAACAGCTGGCAGCCCCT
GGGTCACTCAGAGTGGCCAACAGCTACTCTTGTGGA
GACAGCAGGAGGAGGCCTAGACTATAGAAGGATGGA
GGAC 


380 


D 




IM000527 


CATGCACACAAACTGGCCCTGAACTTTTGACTTCCA
GGCCTCTGCCTCTCTGCGCTCACACACACACTCGCA 
CTCCTGTATATGAAGCGTATATGTGTTTCTCTGGG


381 


K 


Fgf3/Fgf4


IM000528 


CATGAAACATTATTTNTTTTGGAAGTCTG 
CAGGTAAACTTAAATAGGTTAA 


382 


R 




IM000529 


AGCAAGAACAAAGGAAGTACTTCACCTGATAAAAAC
AGTTCCCAGAGAAACACATATACGCTTCATATACAG
GAGTGCGAGTGTGTGTGTGAGCGCAGAGAGGCAGAG
GCCTGGAAGTCAAAAGTTCAGGGCCAGTTTGTGTGC 
ATG 


383 


K 


Fgf3/Fgf4


IM000530 


GATTTTTATTTTCCTTAGCATCCTGATTGGAGATGC
CTGGGTGCACATG 


384 


K 


Fgf3/Fgf4


IM000531


CATGTAGAGACTGCCATATCCAGGGATCCACCCCAT 
AATCAGCATCCAAACACTGACACCATTGCATACACT
AGCAAGATTTTATTGAAAGGACGCAGATGTAG


385 


R 
IM000532


GACCTGTACCCTACCCTCTGATGGAGGCCATCTATT 

CAGACTGTGGGCTCTGGAAAGCTAGCAGGTGACCCC 
GGGGGATGTTCTGAGCAGTGCCTTACTGAAGTTTAT 
CCAGGCCCTAGGGTCCCCTCAACTGCTCACACAGCC 
TAGGGTGGGTCTCTTGAGGAGTCACTTGTCACTTCT 
GTTGCTTCCCAAGAGACCCAGGGAAAAAAGGAAGGA 
AGGCCATG 


386 


D 




IM000533 


ATCTCACTCGTAAAATGAACAAAGGGACTGCAGAGA 
TGGCTCTGAGCTTTTAAGACCATAGCCTGCTTTTCC
AGAGAGCCCAGGCTTCATTTCCCAGCCCACATATGG
CAGTTCACAACCATCTACAACTCTAGTTCCTGGGGA 
TCTCACACTTTTGTCTTCTGTGGGCACTGCGCAAAT
GTGCACAGAAATACACGCAAGGAAAACACCCATG 


387 


K 


Fgf3/Fgf4


IM000534 


AAGAAACACTCTTAGCTGGGCCTGGAAGTGCACATG 


388 


D 




IM000535 


CTAAAGCAGATTATTATACTTATTCTACTGACCATA
ATGCAACCACTATTATATAAACAGAACATACTATAA
AGTGAATAACATTAGGATACAAAATGTATAAAAGGG 
GAGAGAGGATAACCATTGTGAAGTATGTTTAAATAA
AATGTTTGGGATTTGAGGAAATTAATAAATTAGTTA 
CCCTTTTTGCTTTGGGGAAAGAAAGGCAGCATG


389 


D 




IM000536 


CAGCCCCAAACCCATCAGCCTGAGACTGATGCACAG
GAGGCAGGCCAGTTAGTTATTCTCTGGGCCCCTCTA 
TTTTGCCTTCTGTAGGTTAATCCCACCGCTCCCAGT 
GCTGGAAAGTGCAAGCATTGTGGGAAGTTAAAAACG
TGCCACCATG 


390 


D 




IM000537 


CATGGACAATGCACCCCTCAAGCAGTGTCTTCCATA 
CAGACAAGCATATTTATTTTCTATACAGACAGCAAC
TTTGCTGAGGTGTAAGG 


391 


K 


Fgf3/Fgf4


IM000538 


GGATGAAGAAGCCCAAGGTATTAGGTCAGTCTTGCT
CTGACTTCTCACAGTAAAAATACAACTCCCAGGGAC 
TAAAATGACACAGAACAGCTTAGCCTCTGGACATTG
CTTTTGGATTGCAAAGTGATAAGTGAAAAAGTAATA 
AGTCTATCTACATTGGAAAACATTTGGTAACTTCAT 
TTAAACACACTTCCCCATG 


392 


D 




IM000539 


CATGTCCTACATTGGACATTTCTAAATTT 
TCCATCTTTTTCAGTTTTCCTCACCATAT 
TTCACGTCCTAAAGTGTGTATTTCTCACG 
TGTATTCGTTGGTTGTTGGTTTAGTTCCT 
GGGAGCTCTGGAAATCTGATTATT 


393 


R 




IM000540 


TGGAAAATGAGAAACATCCACTTGACGACTTGAAAA 
ATGACGAAATCACTAAAAAACGTGAAAAATGAGAAA
TGCACACTGAGGGACCTGGAATATGGCGAGAAAACT 
GAAAATCACGGAAAATGAGAAATACACACTTTAGGA 
CGTGTAATATGTCGAGGAAAACTGAAAAGGGTGGAG 
AATTTAGAAATGTCCACTGTAGGACGTGGAATATGG 
CAAGAAAACTGAAAATCATG


394 


R 
IM000541


TGACATACAGAAAGAACACAAATACCTGTAGCTGCT 
GTGACAGGACCAACCATTCTAAATATCAAAGCAGCT
GTTGACACCTAAGGACrreGTCTGACTGCTAGATCTA 
GGAGTTTCAACTTGCAAAAGCTGGCTTGATGCTCAT
G 


395 


C 




IM000542 


TTATATATATATATCGTTTTCTCTTACTCCTGAATC 
AGTGACATG


396 


D 




IM000543


CATGTCAGCCCTCAGCTTTACACAGGTGTCAAAAAA 
AAAAAAAAACACTGACTGAGATCTTCCGTCTGCCAT 
TAGCTGTTATTGTGTACATTAAGTAGAATCCACTGC
TTAACCCAGGCTACTGGGCTCACCCCAGTATTCAAG 
GAGGTGCCACAGGACTCAAAGGATACAGAAGTTACA 
TATTAAAACCCAATCTCGTAGAGGATTCAGAGGAAC 
TAAGTTTGGTAGGGGCACAGATTGTAGTACCATTAA
GCCCCTCTGTTCCTCGTGGAGAACCACTACTGTCCA 
GCAAGGCGGGAAGGACCCAAATCAAGCAAATGAGAC 
TTGTTCTGG 


397 


D 


- 


IM000544 


CATGATANATCCCTTTTTGTGAGCATTCC 
ATAGCCTCAGTAATAGTGTCTGACCTTGG 
GACCACGCTGTATCCCACTNTGGGACCTT 
CTTTTCNTCAGGCTACTCTCCATTTCCAT 
TNCTGTAATTCTTTCAACAGAAACATTTA 
TGGGTCANAGGTGTGACTGTGGGAGGACA 
ACCCCATCCCTCACTTGATGTCCTGTCTT 
CCTGCTGGAGGTGGGCTTTATAAGTTCCC 
TNCCCCTACTGNCCAGCATTTCATCAAAG 
ATCCCTCCCTAGGAATCCTGGGAACCTCT 
C 


398 


D 


- 


IM000545 


GATAAGCTTATCTTGAACTTGAATGTATATGGAGAA
GCAGAAACCTTGAAACAGCCCACAGAAACTGAAGAA 
GGATGAAGGTGGAACTCTCAGCTGGAATATTCATG


399 


D 




IM000546 


CATGTTCCCAGCTGGGCAAGGCCTCGGGTTCCTCGG
TGAAGAGTGTGGACCAGCCGATGAGCCCTCCGACGT
GTGGATGAAACGGCTGGCTTTTGTTTAGTTTTGTTT 
TAACCTCCCCAACGAGACTTTGATCAGCTCCACCTC 
GAAAATGTTCGCGAAAGATGCGGAGAGCCTGAGGGA
CTGCGGGGCAGCAACGGGCTCCGGCCTAGCCCGGCC 
CGCCGGCCCCCAGA 


400 


B 


AI413288 


IM000547 


ACCAAGTGTTAATAATGTACTGATGGCTTCTGCCTG 
TGGCAGTACACTTGTCCTCTACACATG


401 


C 




IM000548 


CCTTACTGCAGAGATGACTCGGCCAACGGCTTCGAG
CCCCTCACCACTTCCrCAGGTTlWTTTTGTTAGTT 
TTTTCTCACAGCAATGGGAAGCATAATCAATACAAC 
TTCCCAGAATGCGACCTGTGACAAGGCCAATGAGCA
GACTCAAGGCTGGGCACATAAAAGCACCAAAAAAAA
AAACTCCCTTGCAGTTATTGTTCATG 


402 


D 
IM000549


GACTGAGCCTGCCTGGGGCCGTAGGGAAG 
GGGGGGTTGGACCCTCTGGTATTTGCAGT 
TACCACTGACAGGGTTTTTCCGAGATGCC 
AGTGTCAGGGTGTTCGGTGCTGACCCCCC 
AGGGACCGTGCAGCCCCGATGGCTGTCTC 
GGTCCTCTCANCTTTTCCGCCACCCCTGG 
GATATTTCAGGACTCANTCCCCGCAACAG
CTCTGACTGAGGTCAGCTCTGTGACCAGG 
GNCCCTGTCCCCGGTGTGNNGTGTATTTG
CATG 


403 


K 


Wnt1 


IM000550 


CATGTAGAAGGCAGAGGACAACCTTCAGGGATTATT 
TCTGCCCTTTCAC 


404 


C 




IM000551 


GTTCCTCCATTCTGCTGCTTCTCCCTGATACATTGA 
GTTACAGCAGCCCACGCGTACACACTCTCGCACATG 


405 


K 


Wnt1 


IM000552 


CATG CCAC CAACAAAT AAGT AAG T AAAAAAG AAGG A 
AGGAAGGAAGGAAGGAAAGAAAGAAAACATTTTAAA 
TCTGTAAT 


406 


D 




IM000553 


CGGAGCTTAGG T C T ATCATTT AAAGAT ACAACCAAA 
TAGGCAGAATCATTTCCTGAGGAGCCCATTTTCTTT 
ATCT CAGGTCCTG CAGATTTCTCCCTGGTATTATCA 
GGGAGGAG CAG CAG CTG AG CT AT C CTAT CT CCTTTA 
CTAATAGAAAAAACGCCTTTAGGGCTTGAGCACAGG 
ACCTGTATTTCAGGGGAATG TTGACAATCCATAACT 
CCAGGGTGGACTACTAAGCCCTGCAAGGTGAGTGAA 
CCCCGGCCGAGAATAAGGCCCATG 


407 


R 




IM000554 


CATGGC CTG AG AG TTGG AAAG AG T ATTGT AAG CAGG 
GG T TG TT C CAGAAAG TTT AG AAT ATACAG ACACT AT 
ACTCTATCCAGACTTCTTGGCACAGGGAGTTCAAAT 
GTAGACTCTGAGCCCCGTCCTGGGGCAGCTTCTTCC 
ACCTGCTTTGGGTAGAAG CAGGCAG ACT CTGGGTAG 
ACTCTGATTCCAAGGCTAAGTAACCCCTGAACCCAG 
AACAGTGTTTTC 


408 


D 


-


IM000555 


C CAGAT AT CAT A CTGAG TT CGTAGG TGG TTTTAATT 
AAT CACGGG C C C CTGG CATG 


409 


D 


- 


IM000556 


TTGGTGATCCAAACCCAAAGAGACAAATGCTGAATG 
TTCACTCTCATTTTCTGTTCTTAGCTCCAAATCTTC 
AGATATGAGTAAGCAACACATAAATTATGAAGGGAC 
CATACTGGG ATG T AGGGGG C TT G CATG 


410 


D 




IM000557 


CATGAG CACTGCTCTAGGGACACCTCCCA 
TCCCTTCCTAGCACCCCAAATGCCCCTTC 
CCATCTCTCCTTCCAGAAGTTGGA 


411 


K 


Wnt-3 


IM000558 


ATATAGCTGTCTCCTGAGGGCCTATGCCAGTGCCTG 
GCAAATACAGAAGTGGATGCTCACAATCATCCATTG 
GACAGAG CACAGAG T C CC CAATGAAGGAGCT AGAGA 
AAGTACCCAAGGAGCTGAAGGGGTCTGAAGCCCCAT 
AGG AGGAACAT CAAT ATG AA CTAAC CAG TG C CC C CA 
GAGTTCCTTAGAACTAAACCACCAATCAAAGAAAAC 
ACATG 


412 


R 
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IM000559 


CATGAT AAGGTT AGAGTTTTGTGAG C CTCCTTAACC 
TTGCTCAGCAAGCGTTGGGCTCTTGGCAGCCGAGCT 
G C CATCTTT CTCATC C C CGATAGAGC CAGC CGCCCT 
TGT CGTG T CTTG AAT AAGTTAGAGGAGG CATT ATAG 
AG CGG AC CT AAACATTTG C CTTGGAG C CTG AGGGAT 
GGGGATTGG CTGAATGTGAAT 


413 


D 




IM000560 


CAGAACTGTGCTCTTTAGGAAGCCAGACGCTATGCC 
TTAGGCCCTGTTCCCTCCAGACCTTGCTCTGTGCTA 
CAG TGT AAAAG CGAAGAT CATG 


414 


D 




IM000561 


GAGAATTAGAAAAGAGATAACAAAGG CGAGAAAGAG 
AGG CGTG TGAGAG CATG 


415 


D 




IM000562 


GTTTCCAGATTGTCCTAGTAGCTGGGCTGCAGGAAC 
AGCCAGCATG 


416 


C 


-


IM000563 


GGGGGTGGGGGTGGT AAG AG AAG ATTAATTAGC CT A 

UUilnlnlnnuu 111 1 \j\3f\i. X Lrvil LI 1 LnnL 1 l_ v — t\ 

C CCCTTAAAGAATAAATAAACAAGTAGATAGATTAT 
AGA CAG ACAG CT AGATGG ATAG ACAG AT AGCTACAT 
AGATACATAGATAGATGATAGATAATAGACAGACAG 
ACAGATAAATGATAGATAGATGATAGGAAGTCCCAG 
TT AACAAATGGAAAT AAAAAGACAAAAG T CC C CTT T 
GTCCATG 


417 


D 




IM000564 


G TATATGG AATATGG CAAGAAAACTG AAAAT CATG 


418 


R 




IM000565 


CATGGTAAAGGTCAGGAGTACACCTGTGCTTCTGTG 
TT CTTCT G TG TTGGCTGACAG CTGGG CAGAAGTGAG 
TTCAGGAGGNCAACCCATACGATGAGACAAGCCGGG 
GCAAAGTGGGATATGTGG ACCGCAG CACATCAGAAG 
GGTGTGCCCGACATAAAC 


419 


B 


AA111354


IM000566 


CATGAAGTATATTATTAGAGGGGAACTAGTCTTACT 
GCTGAGCAG CGTGTTGTCTTCTACAGAGGATGTTTG 
T GTT CTGG AATTTAAAAT TACTTAAAG T AATAG TG T 
CAATGAAACG TTGTCCGGTGACTTG CTTCTTTTAAA 
TGATCACTGTTAGACAGGGA 


420 


R 




IM000567 


AATAATCAGATTTCCAGAGCTCCCAGGAA 
CT AAAC CAAC AAC C AACG AAT ACACATG 


421 


R 


- 


IM000568 


CATGATTTGATAGGGTTATTTGGTTCTCTGGAATCT 
AACTTCTTGAGTTCTTTGTGTATATTGGATATTAG C 
CCTCT 


422 


R 




IM000569 


G CAAAT AG TC CTTTG T AC CGAACTT C CACACACT AA 
TGTAGTGAATTATTTAAAATTTATTC CTTAATCTTT 
TTTTAAAGTCCAGACTCTATCCCCCTCCTTGTCCAC 
CCTCTGATTGTTCCACATCCCATACCTCCTTGCCTC 
ATG 


423 


R 
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IM000570 


TTCCATCTCTTGTATTCTGTTGCTGATGCTCACATC 
TATGTTTCCAGATTTCTTTCCTAGTGTTTCTATCTC 
CACTGTTCCCTCACTTTGGGTTTTCrrTTATTGTGTC 
CACTTT C CTTTTTAGGT CTTGGATGGTTTT ATTGAA 
TTCCATCACCTGTTTGGTTGTGTTTTCCTGCAATTC 
TTTAAGGGATTTTTGTGTTTCCTCTTTAATGTCTTC 
TACCTGTTTGGTTATGTTTTCCTGTAATTCTTTAAG 
GGATTTTTGTGTTTCCTCTTTAATGTCTTCTACTTG 
TTTAG CAGTG TTCT CCTG CATTT CT TT AAG TG AGTT 
ATTTAAGTCCITCTTGATGTCCTCTACCATCATCAT 
G 


424 


C 




IM000571 


pafp ZlflTTTT r*T & fTTTTTT AT A A A ATT AT AT A A AC! 
TCATTT AG T AGAAC CT AG CTTTATTTAATTTTACCA 
ATTAATATAAX5GCCACTGATATTATTGACTTTTGTC 
ACTACAAAATACAGCAATGAAATAATCTTTCTTCTA 
GGCTCCTTCCTCATCAAACTAGTTCTTCAGCTCACA 
TTAATACTTTTTT CAAGTTGTAAGGGACCTCAGGGA 
CAGGGGGC 


425 


D 


-


IM000572 


CATGAG CTTATAGTTT CAG TAAG AG AGCATAGAT AG 
AATATAGGTGCCTGTGCGCTGGCTCTTTTGGTTGTA 
TTTAAATCCTTTATCTCTGAGAAGTCGGAACTGTTG 
GCAACAGACAATATGGTAGCC 


426 


D 




IM000573 


CTGACACAGGTATGCCCAGTCCATAGTGTGCAGAGC 
ACAGATGGCCAAGGATAACTAGGAATGAGACCTACT 
TAACCCAAACTCCAAACATTATGAAACTTTAAAAAA 
ATGACTTCAGTTGAACTTTGCAGGTAACCACATCAT 
G 


427 


D 




IM000574 


ATTGTGTCCTTTTAACATTCTTGCTTTAGTAGAACA 
TCCTCTGACCCGTATCTGATTCAGTGAAAAATT CCT 
TCACGAGTCTGCCTTAGCAAAACATCCTTTCAC CTG 
TGTCTGCTTCAGGAAAACACCCCTTCACATG 


428 


R 




IM000575 


CATGTTGGTAACAGATACAACAAGCAGACTTAAACT 
AATAAGAAAACAG CT ATGATTAATATGTTTATAA CT 
TAGCTGAAGAGAATGTATGGAGCTTTGAAGTTAATC 
TTTTCATATACACAGGAATGCCTTCAAAAAGCATTG 
CAGCAGATTTCAAAGGATTAAACTCAT 


429 


D 


- 


IM000576 


CATGTGGCGAACCAGCATCACITTTGCTCTTTCCTT 
ACTAACCCAGGACATCCATCATTATTTTAATAG CAT 
CCACCC TAGTAGATATAAGGTGATACCTTATTGTGA 
TTTCATTTGCCTTTCTCTGAAGATCACTAACAATCA 
AAATCTGGTTCATTTTATTTATGAATTCTCATTTGT 
CTTTTG CT AAATAT ATGTT CACAATT CTTTT CAATT 
TAAAAG CAAATTG TTTTGTTAATAATGAG C TAACTT 
TTCATACATTGAAG 


430 


D 




IM000577 


TTGCTGTGGGCCTAATTCAAGGCTGATAG 
ATCACCACAGAAGGACACTGTTTTCCTCC 
GGGCAGCAGGAAGTACAGGGTAGGGACTC 
TAGAATCACTGCCCTAGGGCATG 


431 


B 


AI663969 
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IM000578 


G TACTTGAAGTTTTAG CT AG AG CAAAAAGA CAATGG 
AAGG AGATCAAGGG AATACAAAG TGGGAAAGAAGTC 
AGAGTATCATTATGTCCAGGTGATATGATAGTATAC 
ATAAATGACCCTATAGATTACACCTAAGACCTCTAC 
AGTGG ATAAATA CTAAAAT ATTTACTACACAGAAAT 
CACCCCATG 


432 


R 


- 


IM000579 


CATG CAAGGT ATG AACT C ACT AAT AAGGG 
GATA 


433 


D 




IM000580 


CATGGTTCACACTCCATAATATCTTGTTCTCACTAA 
TT C CTC T AAT CC CAT AAT AT ACA C CAAT AATTT AAC 
AAGGGAATTTCTACATTGATTTGTAATAAGGGAGAT 
ACTGTGTGAACTTACCCAACAAAAGTCTCCAATAGA 
AGTGTGG ATAC CACAGG AAG T C TTGTG ACAACCATT 
AAAATTTGGGTCTGATAAGAAGATAACCCTTTAAAT 
ATATAGATTTATGTAAAG 


434 


D 




IM000581 


CATGGGCTGGGGAAAGGCAGAGAGAAGAACATCTGG 
ATTGTTCCTAACTTTGCCTTTAAAATGAGACTTCAA 
T AATACTTAG ACGTACCAG CTT CT CACAG T CAG TTA 
AAATGTGACACACACACCTCT CAG CAGACTGAATGG 
GTGAG 


435 


D 


- 


IM000582 


AGAGATGGTTGGGATTTAAGTTACCAGGG 
TAGGGTCACCACAATCAACCCTTGATGCC 
TTTATAGGAAGAAACATG 


436 


D 




IM000583 


CATGGAAGTCTAAAAGACATTAGGTTCTGGATGGAA 
GAAGAGAAAATTAT C TTTAAG TTTTAG AAAAGGGAT 
G ATAAAACAAG T CT T AAAT CTTCT CAATTTTG C CAT 
AATT CATTTGAATTAATATTGGTAAATG CTTTG TGT 
GGTCCCATAAAGTTCAATGTGTTATATCACTAAGTA 
GTTATTTGTAAAATTATAAATAG CCTCTAT 


437 


C 




IM000584 


CTTGT GAATTGTTT AACTGTTTTG AAAAAG T AGATG 
TTTT CTCTATTTAT T TTTGGG ACAATT ATCAG AAT T 
TGAAACAAACTCTGTATCrCTTATTTACTTTCTGCT 
TAACCCCCATG 


438 


D 


- 


IM000585 


CATGGTTGCTATATTCATTAACACAAATCATTTAAA 
AT C CTTAATG TAAAATGGG CACAT TTT CAAAATTAA 
AAT AT ATGAAAAC CAATAAAG AT AG AAAATTTAGG A 
AAAAAAATAATCCAAGCAAGATGTTAACATCCAACC 

m * n rmv rrwT»iv ^"*TV TV i~*f** TV TV TV TV Tl '^TV ^ 'A O 

A CAG CAG CATATTAG CAGCAGGACAAAAATAAGG AC 
AACAACCAAGAAAGGGATTGTGGTTAATGTATGCCT 
CATTGGAAGGGATAATAGGATGTAAAAGTGTGACAA 
TAAAGAGAAAAAAATCT CTTTTAAAATGTAAGTTAA 
AAT AAT AAAAATAATTT AAAAATTGG TG T T CT CAGG 
GCTGGATAATATTACTAACAAAACCAGGGAATTATT 
AATAAAAAATCTCTTATCAGTTAT 


439 


D 




IM000586


AACAAGTTTTAAATGGGGCATAGTGGATCACATTTG 
TGATCC CAG CACTTGGAAGG TAG AAAT AGGTAAATT 
AAGAGTTCAAGGTCATTTCTCAGTTATGTAGTTGTA 
CATTTCTAG CGATGTAGTTGAGTTCAAGG CCATG 


440 


D 
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IM000587 


GTCCTCCAATGTGCATTTCTCATTTTTCACGTTTTT 
CAGGGTTTCTCGCCATATTCCATG 


441 


R 




IM000588


AATTG CATTGAAT CTGTGGATTT CTATTAACAAG AT 
GGCCATTTTTTT CCTATGTTAATCGTACTGATCCAT 
CAGGATGGCAGTCTTTCCATCTTCTGATATCGGCCT 

G 


442 


R 


- 


IM000589 


GGCTAGGTACTCCTAAACCTTCCTCTGCTATCCTAG 
GCCCAATAGAAAAAAAGTGGCCCATG 


443 


D 




IM000590 


AATAATACTTTCACTGTACTTTAAAATATTATCTCC 
TATCTCACTCTAATACTTCTGTGAAAGAAGCAATAT 
CGT CTCTTTGTAG ATAAAAATGG CTG AG AAGGG CAC 
CTT CAAG ACACT AAGTGACT AACTCAG ACT CAG AAG 
TT CAG AG AC CATG 


444


D 




IM000591 


CATG CT CTACTATGTT CACAG CAG T CTTATT T ATAA 
CTTCCAGATACTGGAAGCAACTCAGATGTTCCTCAA 
TGTAAGAATGGATACAGAAAAAATATGG TACATTTA 
CACAATGGGGTACAACTCAGCTATTAAGAACAATGA 
C 


445 


R 


- 


IM000592 


AAAACCCAAGAACAATTAAGCTGTAGTTC 
C CAAGTGT AATTATATTATGGTTGTTT CT 
GCTTGCTTTATATCCCTATATACAATTTA 
TGATTCAAGTATTAGTGGGAATAGACTAA 
TGGCATG 


446 


C 


- 


IM000593 


CATGCCAAG CCTTCTGGTATCAC CCTAAAGGC 


447 


C 




IM000594 


CATGCTCTTCT CTGCTGTTCTTACTG AATTTTTAAT 
AAGAACAATT C CACACAG CTCGAAAG CACTGCT CAA 
TTAAG AG ATATT C CTACCAGG CAT CTTTGG AAT C CT 
GCAAGCACCrCTTCTCTGTTTCCTGATGACCCTCAA 
TTTGGTTGTGT CCAGAGGTTGGTGGGGAGGAGGGGA 
GGGGAAACG AAG CTTATTTTTTTTT AATTG CAAGTT 
CAATTTTACAATGTTCTCGAT 


448 


D 




IM000595 


CATGCTAGGCAAATGCTCCACTGAATGAATTACATT 
TCCAATCCTTTAGATG CATTTTAAAG AGAAAAGATT 
GAGTACTGAAGTTTTGAATAGAATACAGGAATAAGG 
GACTAAACATATATATAGC CTTATATAGAGAAATAT 
TAAGTAAGTAGTAACTTTGCTTGTGTGTGTGTGTGt 
GTTGCACAC 


449 


D 


-


IM000596 


CATGC CATTAGTCTATTCC CACTAATACTTG AAT CA 
TAAATTGT AT AT AGGGAT ATAAAG CAAG CAG AAACA 
ACCATAATATAATTACACTTGGGAACTACAGCTTAA 
TTGTTCTTGGGTTTT 


450 


C 




IM000597 


CATG CACAGCTGG TGAG TG AGTTGT CTTC TGG TACA 
AAAATCTCCTCACAGGCACATTTACAAGTGCCTATA 
TCTTTGCTAG CTTCAAGAACACAAAGAAGGG ACACA 
CAAAAGCTCTTCTGAGTCTCCTTCTCCTGCTGTTAT 
TTTG 


451 


D 
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IM000598 


ATCGTCAAAGTTAGCAAAATTATAAATGTGAAAGTC 
ATG 


452 


D 


- 


IM000599


CATGAATTATGTTTGTTTTATTTCTTTTG 
TACAT CATTCAATGCAGTAATCTAAAGTT 
TGGGGTCTTGGTCTTATATCTTGGAACTT 
CAGTGACTTATTGGTTCTAACG 


453 


D 




IM000600 


AGAGACAG T CACAAAAGGGG CC CATTCTTGTT AAGA 
ATGGGCCAGTGGAGAAGTTCGGGTTAGTGGAGTAGC 
CTGCCTCAGTTTCCTCCTGTCTTCTGTAGTTAAATG 
TGTTAATGGTTAACATG 


454 


K 


Fgf3/Fgf4


IM000601 


CATG TAG CATT TATCTTAG CCAG CAC 


455 


D 




IM000602 


CATGTACAGACTATGAACAGGAAATGTTTTTGCAAA 
TAACTCTGTG CATTAGAATTTTCTT CAGAAATATAA 
CCATTTTGACAGTTGTAGGTTACACTTTTAAAATTA 
CAAAAT CAAT AAAATTGAT CT ACAAAC CG AGG C CT A 
CAAAACCCTTGCTGGATATTGAAGACGGCATAATAT 
TAAAG 


456 


D 




IM000603 


AATT CCCAC CAC CCACAGGGTGGCTC CAT AAC CATC 
TGTAACTCCAGTCTCAGGGACTCCAAGGCCCTCTTT 
TGGCTTGCAAGGGCTTGCACACACACAGCGCACACA 
TG 


457 


K 


Fgf3/Fgf4 


IM000604 


CATGGTGAATGATTGTTTTGATGTGTTCTTGGATTT 
GGTTTCGAGAATTTTATTGACTATTTTGGCATTAAT 
ACTCATAAGGG AAATTGG T CTG AAG T T CTT TC C TTG 
TTGAGTCTTTATGAGGGTATCAATATAATTGTGGAT 
TCATAGAGCAAGTTAG ATTGTGTT C CTTCTGTTTAT 
ATTTTGTGGAATATTTTGAAGAGTATTGGTATTAGA 
TGTTCCTTGAAGGTATGATAGAATTCTGAACTAAAC 
CCATATGGTTCTGGATTTTTTTTGGTTGGAAGACCA 
ATGACTGCTTCTAT TTCTTTAGGTGTTATGGGACTG 
TATAGATGGTTTA rCTI UAAv-LJVjA ill ML ill Vjvj i 
ATTTGTTATCTGTTTAGAAAATTG C C CATTTCATCC 
ATATTTCC CAGTTG TGTTGAGTATAGG CTTTTGTAG 
TAGGATATAATGATTTTTGAATTTCCT CAGTATGTT 
TTCITATATCTCCCTTTCCATTTCTGATTTTGTTAA 
TG TGGATACTATCT CTGTGTCCTCTGTTTAGTCTGG 
CTAAGGGTTTTTCTATCTTGTTGATTTTCTG 


458 


R 


- 


IM000605 


CATGGGTTAACAG TGGGCCCTAAACTTGAACTAGAA 
AACTTAAAG ATG C T CAT AGGG AAG AAG AAAAG AG CA 
G AAAG CTT AG CT T CT AGACAG GGG TAAGG CTT AG AG 
CTCAATAAAAAAGGAACCCC 


459 


K 


Wnt1 


IM000606


CATGGCCTGTCTCAGTTTACTT CACAGCTGAACAAG 
AGG CAG AG AGTG ACAGG TAG 


460 


K 


Wnt1 
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IM000607


CATGCTCGCCAGTCCCAGAACCTGGAAGG 
CTGAGGCAGGAGGATTAAAAAGCCTTGGG 

ATCCAGCACTGGGGAGTTAAGAAGCAAGT 
GAGTCACATCTGTGAGTCTGAGGCTATCT 
TGGTCTACGTAACCAGCTCTAGTATAGCC 
AGCCTGGGATACATAGTAACCAGTTCTAG 
TATAGCCAGCCTpGGATACACAGTAACCA 
GTTCTAGT ATAGCCAGC CTGGGATACACC 


461 


D 




IM000608 


CATATGCGT ATT CACATTTG TGTGGGAACG T CCTTG 
G AGAAAG CAGGAG CAGGAG TTACAG ACAGTTAT AAG 
CTGCCTCACCTGGGTGCTGGGAAACACCTCAGGTCC 
TCTGGAAGAGCAGTAAGTCCCCTTAACCAATGAACC 
ATCTATCCGTCCAGCCTACATTTAATTTGTTTTCTT 
ATTTACTTTGTCTGCATG 


462 


R




IM000609 


CACACACACACACACACX3GCTGGGG AT C CAACC CAT 
CTCGTCCTTACACGTGCTCTACCATCACGCCACACA 
TTTCCAGCACNTTTATCTGAAGTGTTTCCTTTTATT 
TGTGCATG 


463 


K 


Wnt1 


IM000610 


CATGCCTGGTGCCTGCAGAGGTCAGAAAGTGTTGGA 
TG C CCTGG AATTAG AG TAACACATAGTT ATAAG ATG 
CTGCGTGGGTGCTGOOA1 1 lvjlL.Ulv.ik7 
CAAG AG CAG C CAGTG CT CTT AACCAC CGAG C CAT C C 
CTCCAGCCCCTGATTACTCACTCTTCACGGCCTCAA 
TCTTGTAAGGAATATTGAGGCTGCCAAGTGACGCAA 
GAGCACCTAGGAAGGCAGCCACATCGGTGGCACTCT 
GG AAG CACTG CG AGG ATG AC TG CACACATTG C CGG T 
TGTC 


464 


K 


Notch1


IM000611 


CATG CTGGCCIATTTATTTTGATTTAAGTTATACTCT 
AGACCTTTGTAAATATTAGCCATTG CATATTACAGA 
AATTTCTTAGCAGAGATAGTCTCTCACTCTTAGTGA 
TGAGCAAGCTGGAGCTCAGCATTATTCTCCCAGCTA 
AGATACAGAATTACAGACGTTTATGACGGACACATC 
TTGGATGTAGTTACTTAGTCCAC 


465 


D 


-


IM000612 


CCCCCCCCGCCCCTGCCAGACCGCAGCCCCAAGCAC 
AG CATG 


466 


D




IM000613 


CATGCCTCCCTCAGCCTCCTCCACCCCTTCCTGTCC 
TGCCTCCTCATCACTGTGTAAATAATTTGCACCGAA 
ATGTGGCCGCAGAGCCACGCGTTCGGTTATGTAAAT 
AAAACTATTTATTGTGCTGGGTT CCAG CCTGGGTTG 
CAGAGACCACCCT 


467 


K 


Fgf3/Fgf4 


IM000614 


CATG AATTCAATGGTGTG CTTGCTATAAATG CAAAT 
AAACCATATATATCATATTACACTCAATTTTAAATA 
TTTTTCCTAATATTAATAAAGGTGATGGGGAACTT 


468 


D 
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IM000615 


CATGTCTACTTTATTGCATATTAGGATGT 
CAGGTCCTGCTCGTTTCCTGGGACCATTT 
GCCTGGAAGACATTTTTCCATTCTTTTAC 
TCTGAGATAGTTCCTGTCTTTGTTGTTGA 
GGTGTGTTTCTTGTATTCAGCAAAATGCT 
GGATCTTGTTTGCGAATCCAGTCTGTTAG 
CTTATGTCTTTTTACAGGTGAATTGAGTC 
CATTAATATTGAGAGATATTAAAGAGAAA 
TGACTTTTGGTTCCTGATATATTTGTTTT 
TCTAGTTAGTTTTGTGTGCTTGGGACTCT 
CTCCCTTTGACTGTGTTGTGAGATGCTTA 
ATATCTTGTCCTATCTTTGGTGCAGGTGT 

LI i LL 1 lulbl 1 1111 \_/\ 1 1 

TTTCTCTGTAGTGTTATGTTAGAAGACAT 
ATACTGCTTGAATTTAGTTTTGCCTGGAA 
TATTTTGTTTTCTCCATCTATGTTGATTG 
AGAGTTTTTCTGGGTAAAATAGCCTANCC 
TGGCATTTGTGTTCTCTTAAAAGTCTGTA 
TGACCTCTGACTANGCTTTTCTGGCC 


469 


D 


- 


IM000616 


CATGGTGAATGATTGTTTTGATGTGTTCT 
TGGATTTGGTTTCGAGAATTTTATTGACT 
ATTTTGGCATTAATACTCATAAGGGAAAT 
TGGTCTGAAGTTCTTTCCTTGTTGAGTCT 
TTATGAGGGTATCAATATAATTGTGGATT 
CATAGAGCAAGTTGGATTGTGTTCCTTCT 
GTTTATATTTTGTGGAATATTTTGAAGAG 
TATTGGTATTAGATTTTCTTTGAAGGTAT 
GATAGAATTCTGAACTAAACCCATATGGT 
TCTGGATTTTTTTTGGTTGGAAGACCAAT 
GACTGCTTCTATTTCTTTAGGTGTTATGG 
GACTGTATAGATGGTTTATCTGAACCAGA 

111 ArtL 111 VjVj iAl 1 1 V7 1 IHiLlul 1 1 -ttVJ 

AAAATTGCCCATTTCATCCATATTTCCCA 
GTTGTGTTGAGTATAGGCTTTTGTAGTAG 
GATATAATGATTTTTTGAATTTCCTCAGT 
ATGTTTTCTTATATCTCCCTTTCCATTTC 
TGATTTTGTTAATGTGGATACTATCTCCG 
TGTCCCC 


470 


R 




IM000617 


GGAATGCAATGCCTCTGG CATCTACAGG CATAAACA 
TACTTGTGGCTTACACTCAAACTGACACACCAACAC 
ATATGTGCACGCGCACACACACACACACCAAATTAA 
AAATAAAATAACCCTTTTTAAAAAAATATAGAATCT 
AT AGATAATTG CTTT AC TG CACT CACAAACATTTT A 
GGATC 


471 


D 




IM000618 


ACACTAACACAAAGAAGGGGATC 


472 


D 
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Table 3 

MOUSE NOMENCLATURE 
ICSGNM Fscn1
Celera mCG23208

HUMAN NOMENCLATURE 
HGNC SNL 
Celera hCG15970 


MOUSE SEQUENCE - GENOMIC 

C CTCTCAAACTCTTGGT AATTCTCCTG CTTCGG CACGCTGGGGACTGGGACT ACACCTGTGTGCCACCATGG CTGG CTATTTCTCT 
CTCTTGAACACTGGAAGAGTGCTC^GAGTrTACn^ 

AGCTCAG^GAGGAGTTGCCTGAGGGCCAGGCACTTCGAAGGACAGATGGGAC 
GTGGAGCCCTGACTGTCTCTGAGAGGGTAAGCTGGGGTGGCTGCCAGGCGGGCAAGG
CTTCTGGCAGAGATATCTTGGCATGAGCCTGGCTCTGCCCATGACA 
TGGCCACGATTGCCTTGTTTCACAGAATTCIACCTTGGA 

AGGGGAGCTGGG CT CAAGGG CAGGACT C CTGGG CCAGC CCAGGTTTTTCCCT CM AAAC CACAG CATG AATACTGG CATGATGGAG 
CACACCTGTAATCAGCAGAAATTGAGACAGGAAG^^ 
CACACTCCCCCrCCCCCCGCCCCTGGAAATGCTGGAGGGTCTGGGGAAGATGGCT
GGGACCCAAGTTTGATTCCCAGCATCTATAACTTGGTGGCTCACAATTCCAGT^ 
AGGCACCTGCATTCTVTTCTCACGTGCATGCGCACACACATGCXSCGCGCGCACAC^ 
TACAAATAAAATAAATCTTGAAAACAACAGTGACAACAACACATGCTC 

TGGTGGTGCTCCTGACATGGTGATGCTCCTGACATGGTGGTGCTCCTGACATGATGGTAAGCCTGAC^ 

GGTGGTGCTCCTGACATGGTGATGCTCCTGACATGGTGGTGCTCCTG

GTGGTGCTCCTGACATGGTGGTGCTCCCTGCACTGACAGTTCTGAGGAGGTGGAGATAG 

AGATAGACCCTGTCTCAGAATAGGCTGGAGGCTGCCTCTCCCCTTTTG 

(JVTATGTGTATGCATATTTGTAAGTATATGTATATAAATAAAC^ 

ACT CAGAGACCTG CCTGTTTCTGCCTTCCCAGTGCTGGGATTAAAAGCGTGTGCCATGGGCTGGTGAG ATGGCTCAGT 

G T ACCTG ACTG CTCTT CCAAAGGT C CAGTGTTCAAGTC C C AGC AAC CACATGGTGG CTCACAACCAT C C G TAACAAG ATCTGAC G C

CCTCTTCTGGTGTGTCTGAAGACAGCTACAGTGTACTTAAATAAATAAATCTTAAAAA 
CCACCACTGCCCGGCTAGTTAGCTTTATCTTCCGTGGATGTTTTGTCTGCAGTATGTCTGAGTC 
TGGAGTTATAGACAGTTTGTGAGCAGCCATGTAAGTGCTGGGAATTGAATCCGGGITCTCT 
CTGCGCC^TCTCTCCAGCTGTCCTTATTTATTTTTAAGGGTCTCTTGTAG 

TTGAGCTCCAAGTGCTGGGTGACAGGCCCGCATCTCTCCTGGTTTGTGATCTGCTAGGGATG

C^TAAGCATTCCCCTCACCCACCCTCATCCCCACCCAGTCCTGGACAGGCrrCTGGCTGTGTA 
CCTGCCTCAGCCTCCTGAGTGCTGAGGNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNCTGTGGTGGCTCCCGA^ 
TCACAGGTTCAAGGCCAGCTTGGTTTACTTGAGACTCTGTCTCAA 

TTTTCCAGCTAT CGAGGGAACCTGAGTTTCATCCTGGAACTCATACIAAAAAG CTGGCTGTGGTCAGGTGGCGCATGCCTTTAATCC

GATCACTCAGGAGACAGAGGCAGGCAGATTCTT^ 

ACAG AG AAAC CCT AT CTTGG AAAACAAAACAAACAAAAAAAACT AGGTGTGG CT AGCCTGT AAC C C CAGTGCTGGGT AAACAGGGG 
CAGGTGAAT ACATCT CCGGG ATTCGC CAGCCTGTC CT ATCTATGAGCT CAAATC AATG CAAGC 
AGGTGACAACOSAGGAAAGACACCCAAGGTTGATGCACGGGCAAGTC 
CCCAAACCATATTTTGCTGTGGTGGTGGTTGTTAAATTTTATTTATTCTGTGTG
GTGTAGCATATACATGCCGGGTACTTTCGGAGGTCAGAAGAGGAC^^ 
GGCTTCTGGGAATCTAACTTGGATCCTCTAAAGGAC^ 
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TGATGGGCAAGTGGTGCCAGTGGTACCAGGCTGCCCTGAGGGGCCC^ 

GGC CT C CGGTT CACG AGG CTG CC CCTTT ATTG AGGG CTTG AAC CACAGAAAGCTGTGTGGT CTG CAGGG AAG CT AGTTCTG AG CTG 



ATCTATCTATCTATCrATCTATCTATCTTTGTGTCTATGTATCTAATAA 
TGATCTATGTATCTATCAACTGATGTGTATGTATGTATGGATGTACCTACATGCGTACTTGAGGCAGGACTTA
TAGCCITACCACATTTCATAGCAGAGGATGGTCTTGAACTTCAGGTCCTCCTGT 
GTGCCACCACGCCTGGCTTAGGAGGTTCTGTGGTTAGAATCCAGGATTTCACT 

ATG CTTTCTTTTACATTCTT ACTTATTT ACT ACAAATGGGT CTGATGGTGAGTGCTCTTCCCT 
GCTGGCrGACTTATTTGCTCT TlT ' lU ' lU ' l ' TT 'r rT T T T' l"r ' l I TCTAGACAG^KSTTTCTCTGTGTAGCCCrGGCTGTCCTGGAACTCA 
CTTTGTAGACCAGGCTGGCCTCGAACTCAGAAATCTGCCTGCCT

G CCACACGC CCAG CTCHT ATTTGCTC CTTTGAATT AAGGTCTCATGT ACACAAGG CTGAC CG TT CATGCACAAT ACAGCTAAGG CT 
TGCTTTGAATTTCTCAG TTlVlTr CTTCTACC 

G TTTGTTTG TTTGTTTTTTGGTTTTG TTTTG CT CAAC ACAG AG TTT CTCTGTGG AACC CTGG CT AT CCT GG AACT CT G T A 
TACCAGGCTGTCCTTGAACTCAGAGIACCCTCTGCCTCCCTAGTGCT 

GCTTCTCATTTGGTGGCTGGAAATTTGCCTGCCTCTGCCTCTCTCCAGCTGCCTA^

TGAGGTGCCGCTAGGACAAGATCTCATG CAACCGTG AACAGCCCTACTGAGCCATCCTCCTGGG ACAGCCTCTCAGG CCAGGCCTT 
CTGTGTCTTCTGCTATAGTAGC3AGTACAAAGTTGACACAACCTTC 
GTTTCCAGGAAAGCCAAGAGX3TGAGAGACTGAAAAGTTGGAGGGATAAGGGGGTC 
GTTGCGCATCCCTGACATCTCAGCAGGAGGATCAGGAGTTCAAGG^ 

AG^GAGTCACTACTG^TTCCTTTGGAAGTCCCTGTTCTGCATGGAGAT (IATGGAGATTCTGGGGACACTCCAGGAGT CCTCCTTCA 
GGGAACTAAGTTGGGGAGCAGGGCAGCCTATCCTGTGGCCT 

GCG ACTCAGGC CTG ACCTCCGCTAACrGTTCTGTTTTGTGCTG CACCCTCATCTCAGCAGGG ATCAGGTGGCACTTGGGTCCCCAG 
GGGATCCAGGGACACAGTGTCCATGTCCCCATCrTTCCCC^CAGGGACCTGGGGCA 

AGGTGTAGCTCATCCCTCCGAGGACTCAGGGGACACTTGGGGAAGGTG CTGGAGCTCTGTGCGG CTCTTT AT
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TAAGTGTGCCTACAACAGGGGCAGAGACACAGCAGCGACMTGATAAC^ 
AGCCAACTGCAGAATTCCAGTCCATAACGGCACCTCCTT^ 

CCATCTAG AGCAAATGCTTAT ATCCCCCATGTTTGCTCAGAGCAGAG CATGGAGGGACCTTCCTGTCACCAGAGGCAGAGG CATTC 
TGTATACGTGCATACACGCCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTGGTACAGATCT 
CATTGAGGTTGGTGATTCAAAGGCTCACTGATGGGCAAGTCGGGTTCGAGTGCGCT
XGCTGTTTTGGAAAGGAATGAGGGAGCTACTGTTTCTAAGGAAGAGAAGAAT^ 
TAAGCACTTACAGCCGAAGCCCAAGACCTGGAACCCAAGGGGAAAAACAGGTGTGGC 
TGAGTTCAAGTCCAGCCAGGATTACATAGCAAAACTTTCCCTCAAGAAAAAA 
GCTCTTCCGAAGGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCA 
T GTGTCTGGAGACAGCTACAGTGTACTTACATATAATAAATAAATAAATCCTAAAAAAAAAAAAAA
AGGTAGAGGAGGAGGAAGAGGAGGAGGAGACGGCAAGACAAAACTAACTACATAGCTG^ 
GCACTTGGAAGGCTGAAAGAGGAAATCAGGAGTTCCTATGCAGCT^ 
AGCAAAACATGAGGTGGTAACTCAGCTGGAGAGTGCTTCTCT 

TCCCCCAGGGAGGGGCTGGGGGCGTGGCTCAGTGGTAGAGCCCCTGCCTAGAATCCCCCATTGACGGGCT 
ACAGACTGCTCACCTAAGATGTTCAAGGCCATGACTTTCATTGAAAAC^

GTATCCATGTTTGCTTGTCTCCATTTAAAGAACACCTGGAAGATCTCTAGCTGCTGG 

CCTTTGTATTTGAACTGTGTGCCTGTGGGTGTATTATCTGTTCCCAAGGACT 

TGCTGAGATGAACCTCAATTCTCCACAGATGATCTCACCCAGAGCCAGATGGAAGTCC 

CTGGACCCAAACTCATCTTAAGGGAAAACCCAAAGAGCCTTTCTGTG 
CACGTGTGTTTGCTTGTGTGTGTCTGTGTGTACTATGTGCATGTGTGTGAGCATGTGTGTATATGTA

GTGTGTGTATACATGTGAGCATGTATGTATAGGCCAGA^ 

GTCTGTGTGTGAGCGGCTGTGGGCACTGTACACCACAGCACATGTGTGGAGCT 

TCTGTGTGGGTCTAAGACCTGGATCCAGCGCTTTCGCTTGCTTGAGACATCTCG 

TCACTATGTAGGTCCAGTTATCITGGAATTTGCTCTGT 

AGTACTGGGATTAAAGGTGTGCACCACTACCACCCAGT

AGAGACTGTTGGATCTCTGGGAACTGGAGTTAGAGGTGGTTGTGAGCCCCCACGGGTGCT 
GAGCTCACTCACTGCTGAGCCCCCAC^rTAGATTTGAGTCCCT 
AGGGTGGGCTTTTCGATGTGCACCACAGGCCTGGCTTTATAGGTGAGGTC^ 
CTCCCCAGACCTTTTTTCTTTTGCATTTACX3TGTGTGTGTGTGTGTGTGTGTGT 

GGGG C AGTCTGGGAAGGGT CAGTGG ACAACTTAAGGG AGTTGGTTTTGCCTTCCATCATG ATGG

TCAG CCTGGGAGCGTGCCTTT ACCCTCCGAGCT CCCAGCTTTGGGTTTTTGAGGOUVGGTTTCATATAGCTTAGGCTGGCCTG^ CA 

CACCCAGGCATCCCCCAT AGCTTAAGCCTAAAGCTCTCGGATGG CCAGCACGCC CATGCCCTGACGTGAAGTTCAGTGACTAATCA 
ATACAGAGCGTTCACGGAGCCCTGACACATAGCAGCATTGA 

TCTGCCCACGATGCGATGCTCTGGGTTAAACAGGTTTGAACAGAGTACCGGATGGCAGGGAGGGCACCCTCAGT

CTGTTACTGCTTCCAGTCTCCTGACCTGCTCCCC^ 

TGGAATTACAAGTGTGACTCACTGTACCCAGGAATATGTGTCTTTATTACAGAGGAAGGA 
GCAAAGTCCCACAGCAGGAGAGGGTGGCTGGCTATCGGC^ 

GTCC CAGCAAGGATTCTAGGGACTTGTCAGG ACTGACCTTTG ATG ATGGG AAAGCCATGCCATCTCC 

CTGGAGGGGTTTGCCTGGGACAGAGATGTGTTTGGACACAGATCACC

GCTGGGAAGGGCCAGGGGCCCAGATCTTGTGGGCAGC 

TGTGGGAATGTCCCTCTGGTCrGGAACCCTGAGGGGCTGCAG 
CTTTTCTGATGTCTTCATAAACAGAGACCACTATTCCTGCCTTACCTACCCT 

ATCACCACA^TAGGGACTTTTGTGGGCCACTGACTTGGGCC^

ATGCTGACCCGGGAATAGAAATCTCGGTTTCTGCTCAGGAG^ 

GTAGGGCTCTGCATGCGCCrCTCTAGAAATTAACCTCCTTTCTACCCCAGACCCTT 

TCTAAGGCTACTTCCTGGTACACCCTGGACCCTC^ 

GTTTGGGGGTAGGGGTGTTTCAGGATGGCCTCAGAGAGCAGGAGAAAGCT 

G G AACAGTT AAATGGACT AG CAAAC ACG AACGGG CCAGAGGCAG AAGG ACTG CT ATGAGTTCG AG TC CAGC CTG AG C CACACAG AT

ACTTCAAGCCTAGTCTGAGTTACAGAGTGAGACCCGGTCTTAAACCCCAGAAAAACACTC 

CGCTGCAGAGCCCCCTCCCCGGCAGGCCCGXMTAGGGGCGTGGCCACGGTGACGT CATCCTCCTATAAAAC CCTGGGCG CCGCCGG 
GCTGGCTTTGTGGAGAACTGCAGCGGGCTAAGCCGTGrrTG^ 

GCCGCCCTCCGCCACCATGACCGCCAACGGCACX^CAGAGGCTGTGCAGATTCAGTTCGGGCT 

TGACAGCCGAGGCGTTCGGGTT CAAGGTGAACGCATCCGCTAGTAGCTTGAAAAAGAAGCAGATCT CTGGAG CAACCTCCC

G CGCGAGGTGCCCG ACGG CX3 ACTGCCGCTTTCTCGTCGTGGCGCACGACGACGGC^ CTCACCGGC 
GCTACTTTGGCGGCACCGAGGACCGCCTGTCCTGCTTCGCGCAGAGCGTGTCGCCGGCCGAGA^^ 
CACCCGCAGGTTAACATCTACAGCGTTACCCGCAAGCGCT^ 
CGACGTGCCTTGGGGCGTCGACTCGCTCATCACCTTGGCCT

TGCGCCACG ACGGGCGCCTTGTGGCACGGCCGGAGCCCGCCACGGG CTTCACG CTGGAGTTCCG CTC CGGCAAGGTGGCCTTTCGC 
GACTGCGAAGGTCGCTACCTGGCTCCGTCCGGGCCCAGCGGCACCCTCAAGGCTGGCAAGGCCACCAAGG 
CTTCGCCCTGGAACAGAGCTGCGCTCAGGTGGTGCTGCAGGCGGCCA^ 
CATGTCCCCTCCCTTGGGTCCTGACCCAGTGCACAAAAGC^ 

AGGCTCACAGGCCTCTGGGCTCCGCCATGGGGAGGTGGGCTTCTATTTTGAAA

GGAACTCGCrATGTAGACTGGGCTGGCCCCGAACTOSCAGAAATCTTCCTTCCTGTCT 

CCCCTCTGTTCAGGACGTAGGCTTCTTCTGCT^ 

TGTCCACCACCACCCCCCCACCCCCCCACCCCCC 

CTTCCTTCCCAGTCTGAGCCATCCCCATTCATGAGCCGATTCTTTAGGATAGGCT 

TACACCCCCCCTTGTTCATTTCCTCGTGCGTCCTCTCCCGTTTCGCGGGCCTTT^

CTCCCCGCCTTAAATGGCGAGAT AG AGGAGGTTGGAGGGGCrTTTG AT CCCGGCGGGGTGTGCTGGGCGGGGTCGGCCCTG CGACT 
CTTGGTGGTGGTGGGGTTGTCCGTAGGGGCTTCCCCT 

TTTGTGAGCAGGGGGGCGGGTCGCCACGGCTGGK3CCCTCCCTCGTCCCCTTTGTCCT 
TCCGGCGa^GCTGGGGGAG<^ATC0GCrrTAGGCGa3TCCCTCTGCG^ 
CAGGGTTCATTGAGAAGGCGCGGAGTGAAGC
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GCGCCAGGCGCCCTCCCTGCCCTCCTATGCACGCCTTGCCOT 
TGAACCGGTCCCCAAGGCCTCTCTGGCTGGGAACAACCGTTTGCAGCT 
GCC^TTTCCGACCGGCA.TGGGCATGGTACAAGTAGGCGCCTCCTTTCTGGCGTACCC 
CTGGTAGGGGCTTGTGAGATCTCGCTTCCITGGTTGGGA 
GGGAGGAAGACCCCCCTCCCCCAATGCAGAGGGCAAAGCCTTGGGTGGGGCCTGCT^

G CG AC C ATTT G CTTACTG TT ACACAGTAAGT CAGCACCATGCTG AACTGGGTGT ACTTGATG CCAGAATGTTCT CTCATT ATACAC 
ACACACACACACACACACACACACACACACACAC^ 

GTTCCCACCTGTTTAGCAAACATTGAGATACGAGCTGCGGTAAACGTCTCTTGGGAATGTGACT 
GGGTACAGGATCGGGGTAGTAGATTATAAAGATCTGGGGTCCCTTCTGAGGACCCAGGAGCTGCCCCAGG^ 
AGCATGCACCAAGTTTATGGACTGGTTTTGCAGGCAGGGGCCTCAGCAAAG^
GCATAGCTAGGAGGAAGGCTGGGATTGGAGGGAATCCTGCAGATGACGG^ 

AGAGCTGTGGCCGAGGCTTGTGCC CTGGGGAAGGTTAGGCAAGG CACAG^TGACTTGGG CCCTTGGGGCCTGAAAGGAGCT AGAGG 
GTGTGTTTTGCGAGCCTGGACAGGTACCCTTTTCTGAGGGTGCCATGATTGTGC^ 
GAATTGAAGGGGTGAGAATCTGACGGGAACTTGGGGTCACACACAGTC^GAC 
ATTCCCGAGTCAGCCAACTCCTCCCCCGGCTTTTTGAGGGTGAGCTGTTGGGTG
GCGTGGCAGGGAGGCGGGGCTGGGCTTCGGGGAAACACGTGTTGAACT 
CrGGGGTTGTGAATCTTCCATGQAGATTTGCTGCTTCCGG^ 

GCCAGGATCTTCCIGGGAGGTGGAGATCCTATGAGAGTAGGATGGGGGGTAGTGGGTAGGGT^ 

TTTCAAAGAATTTTGTCCCCATCATTTCTAGGGCGGCTGGCTAAGAATGAGTGACA
TGACGGCTTCCCTCTTGGGTCGAGGAATGGCTTAGTTTTGAGCACTGTACGACTT^ 

AAGCCGCCTTCTGGGTTGTG AGGATTGCTT ATTTCGAACAACAACG CTAGGG AGTCTGGCAAACAGCTAAG CCCAGGAAGGGGGTT 

GCGGGTTAGAGGACCCTCTGAGTGACCGCAGAGACGCCTAGGAAGCTGGAAGACAGCG^ 

AGTGACTCAGGTTTTCTGTGCTGCTGCTGCTGAAAAAAAAAAAAATCCCACAG 

TGATTCCAAGGCCAAGTGGGGCCTTTCCCCCTGGCTGAGCCTTAGAGGGCCCATTCTGGAAGGGGGCT

AGGTGGGAGGGCCCACCATCTCCTTGGCAACAGGCCCTCACGTCCTTTCACGGTCCC^ 
AAGGGAGGGAAACTGAGGCAGAGAGCGGAGAAGACATTCCCCGAAGGGTGCCTGGAGTCCACCT 
ATTAAAAAGTGGTTGCACAGGCCCCTTTGTTCCAGTACCTGGAGATGAGACCCACATGGTGAGGCCCri'TTTG 
CGTCTCATTTTTCAGAGGTACTGGGAGGTTGAAGGTTGGTGTATGTCCACAGCACT 

CTTGGGCCAGGGCCTCTCCTTTGTGTGAGGGAAGCTAGCATGCTG

TTTTGTCTTGTAGCCCATCTGGCTTCCAACTCTCCATCCT 
CIAGGCTTCTGGCCTCCACTTGATGGTTGCCGGAGTGGCTGGGAGTGGCTTTTCT 
AGGGTTTATCTGTGTAGTCCGGCTGTTCTGGAACTATGTNNNNNNNNNl^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNmi^^

NNNNNNNNNNTWNNNNNNNNNIWNNNN^^ 

NNNNNNNmiNNKNNNNNNNNNNNNNN^ 
NNNNNNNNNNN^n^NNNNN^n^NNlWNNNN^ 
NNNNIWNNNNNNNNNNNNNNNNI^^

TTGK5GACTGACTGGTCCCAGTTCTGGCTTTCCCTGACTGTTCCATT 
GCTCGCTAGCATTGGGTGTGGCTCTTTTCTGCC 

AGGCTGAGGGGACAGTGGGTGACACCCTGTATATGTGTAGCCAATTGTTCTCTTAATATTTAGCC^ 
CAGGAGAGCCATTTCTGCCTGAGAAGGTTGAAC^GACTACAGCCCAG 

CTCCTACTCTCCTCTGTGCTTCAGTGGTAGATCCTGTGGAGGCACTGGGCCAGGGTCCTTGGGGACCC^

GCTGATCCATATGTCTAGAAGGGGAGX3CATCTGTCCATCCCC^ 

GGCCAAG^CCAGCCACCTCTGAGCTCCATCTTCTGCTTTGTGAGGTAGAGGTAAGAGCCTTTG 

TCGGTTGGGTCAGGGTGGCATTGGCTGGCATACTCTTGGGTGTAGTGTCAACTAA^ 

GCATCTACCATCCCATACCACGTTCTGCTGCCTAAGGACGCCTGAAGACAACTGT 

AGGACATTGTTAGAATCCCAGATCCTTGAAACAATGAGTTTATCTTGTTAGGACC

TTGGGAGCGGCACCTTATTGGGGCTCAGTGAGGTCATGTGACATGGAAACCATGTA 

CATG CAACAAACCTGT AATTC CAACTCCTG AGG CAGAGGCAGG TGG ATCTCTGATTTCAAAGGT AGCTC CAGAT ACACTGTG AG AC 
CTTCTCAAAACAAGACAAGCAGACTGGAGAGATA^ 

CCAGCAACCAGTTGGTGGTCACAACCATCTGTAATGGGATTCGATACCCTCT 

CATACGTAAAATAAATAAATCTTTAAAAAAAGAAAAGAAAAATGAAAAG^

TGTTTGTTTTGGCTTTTTGGTOTCTTTTCTTCTTTC 
TCTCTCTCTCTCTCTCTTTCTCTATCTTTTCTCTTCT^^ 
TATAGGTCAGGCTAGCCTTGAACTCAGAGATCIAGCCrGC^ 
TCTGGACCACnTGTTCTTGTATTCACCTAGTTCTCACAG 

CACATGTGCGCATATTCATACACACAGGCAAGACGTTCACACACGAT^

AATAAATGAGAAGCGAGAGX5GAAAGGAGGGAGAGCCACTGTTTGGACTGGACCACCACTGGCAGGGA 
AGCATTCAG<MTTTGTGGCTGGTGGGGGCAGGGATAAAGGAGGAGTTTTCTAA^ 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTTACTAGGGGTAGACTC^ 
ATATTTTCGACCCCCTCTTTATTTTATTTTTATTTTATTTTATATATAGCTCTGA 
GACCTCAAACTCACCCIATACTCAAATGAGATTCACCTGC

TTCTCTTTAAAAAAAAAAAAAAGCCTATTGCTTTTGTTTTTA 
TGGAATCAGTTCTCCTGCCCTATTCTACCTAGGGATrc 

TCTCGCTGGCCTGTAGCCCrrTTGTGTGTCTGTCTGTCrrGTCrGTCTGTCTGCATGGAGTGGA 

GCGAGTTATGATCTTCCTGACCCTGGTATCCIAAGTGCGGAGACTGCAGGCTCACTCTG 
GACTCTGCATGTCAAGTGAGTTATAGCCACGGCCCTATTTTATTTTATTATTTTTGAGATCGGGCCCCA

GGTCGGCCTCCTGAGTGGTGGGATGACAGGCGTGGGTCGATGTGCCT^ 

ACAGGAAGCGAGTAGCGAGCCCCCTTTTACCAGC^CCCCGCCATGTGACATTCCGC^ 

TGGTTTTGGTTGTATTTAAAGCAAATTCCCCACACTGGTG 

TTCCTTACaTAaCG^CAGCGTTGCCTAACTGGAACCGATCTTTAAGACCCAAT 
AACATCTTTGTGGGTGGTTCGTTTGAAGTTTGATGGCTTCCTTCCTCCCCT
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TTT AAAG AAAC CCG CCTGC CACTG AACTAAAT CC C CAG AT1TT1T1TTTTTT AAGTTTTTTTT CG AG ACAGGGTTTCT CTGTGT AG 
CCCTGGCTGTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCGAACTCAGAAATCCAC 
TAATGTCTTATGTATCCCAGACTGGTCTTGAACTTGTGGTCATGCCTGGAGTAT 
TATGGATGTCAGAGATTCATATCATATGTGCCGTCTTTGGTCATCCTCCATAC 
TAGCCTGGGCTGTGCTAACACACGCTITTCATTCCAGCATTAAAGCA

ACACAGGGAAATTCTGTCTCAAAAAAAAAAAAAAACAAAAAGAAAGAAAGA 
GGCTGTCCTAGAACTTATTATGTAGACCACGCTGGTCTTGGACTCACAAC^ 

GTCTTGGCTTCCCTGTGACAGGATTATCGGCTCATGCCACCG<X3CCTGGCTCTTACA 
TATGGTCAAGCCATCCTTCAATGCTCCCrCCCCTCCTGTTGAGACAQ

AGTTAGTCTCCCGTTTGTTTCCrGAGTGGTGAGATCGTTCCTGGCTAGCCT^ 
CTTAGGATTATCACrGAGGAGTTTAAGCCACACTTCAGG^ 

GCCTGACCGGCTCTGGTCTCrCTCTGCAGGAATGGACCTGTCAGCCAATCAGGATG 

AGATCGACCGCGACACAAGAAAGTGTGCCTTTCGCACCCACACGGGCAAGTACTGGACACTGACGGCGACC 
ACTGCETCCACCAAGTGAGTGACACCCTACACCCCIT

CCCCACGTTGCCAGCCCCTCrTCrCrTCCCCAGGAACGCCAGCrc 

GAGCCTCCAACGGCAAGTTTGTGACCGCCAAGAAAAATGGCCAGCTGGCCG 

CX3TACCCTAAGCCTGTTTCCCTAGTACCCCGTGGTCA 

GCTGGTGTACCCTCAGGCCAGCAGCCTGTGACCCTTAGCTTCTTGGTATCCCTCTC 
CGTGTTCCGGGGGGAACACGGGTTCATTGGCTGC 

AGTTGGAATTCAATGACGGCGCCTACAACATCAAAGGTGGGTTCACTGGGTGA 

GAACGGGTGTCCTATGACCGCCTGGGGTAGCCCCTCCCCCCTTGTCTTAGAACTTTCCTGGCCCAACCTGGGCA 
GGGAAGCAGCTAGGGAGAGTGGTCGTGGCTCCAATCTGGGAATCGGAGACAGGAGAATTATL"1'T 

ACAGAACCCAGGGCTTTGAGCTTATTAGGCAAGCATTCTACTGCTGA^

AAAGAAAAGGGAAAAAGAAGGTGGAGCCTGGTGGTGGTGGTGGCGGCGGCGGCGGCACACGCCTTTAGTCCC^ 
AGAGGCAGGTGGGTCTCTGAGTTCTTGCCAGCCTGCTCTACAGAATAC^ 

T CAAAAACAAACAAAAAAAGGGGGTGGGGAGAG AAGAT AGAAG AT AAGGAT ATT CCTTCTT AG CTAG CTGTGGG AC CAAAACCAGT 
GAGAGGACACAGCAATCCAGAGGTGTCACTCAGAATC^GAGCCCTGTGCTGTGTGTG^ 

TTAC7VGTGCTGAGGACTGGACTCAGAGCTAAGATCCAGCCCCTCC

CCCAGCCCCTCACTGGGGGATTCTAGGCAGGTGTTCTACCACTGAGCCACGCC 
CTCTACCACTGAGCCACACCCCCAGCCCCTCACTCGGGGATTCTAGGCAGGG^ 
TACTTGTTTGTTOSTTTGTCrTATTGTGGCCCTAGCT 

AAATGTTGGATTTAAAGGCATGTGCCACCATGCCCCACCCTAGCCTAGCCCTCTTATC 
GATTTCCCAGGATGGCCTTGAATCTAGGTAATCCTCCTGCCTCAGTTTCTTGAGTATT^
GCTGAGCCCACTCAGTTCTGGAAGCrrGACAGCGGGAGGGAGTGTTAATCCTTT^ 
CCCTCCTCAGACTCCACGGGCAAGTACTGGACGGTGGGTAGTC 
CCTTGAGTTCTGTGACTACAATAAGGTGGCTCTCAAGGTGGGCGGCCGCT 

GCGCGGAGACTATCGACCCGGCCTCACTCTGGGAGTACTAGGGCCACCTGCCCTCTGCAGGCCGCTCT 
TCCTTACTCATCGGGTGGC CCTGCAGCAGGTGGCAAACCCCTTGCCTTT CAAACTGGAAACCCAAGAGAAAACGGTGCCCTTGCTG

TCACCCTCTGTGGACCCCITT^CCCTAACTCACTGCTCCCCATGGG 
CTCTGTCCCCTTCTTTCCATCGGGAACTCTGGCACCTT^ 
CCTAGGAGAAGCTACAGGGCCTAGGGACTTACCCTGAGCTTGTAACTGGAA 
ACCCCACCCCTGCTCTGGCCCCAGCCTCTGGAGGCCAGC 

CTCCGTGTCGCTCCTGCACATTCTGGAAGGGGCGGGCCT 

AATGACCAAATCAGTATTTTT TTTTTT TTT CTT^ 

GTCGTGAGTGGTCTTGGGGGGAGATGCTTGGCTCCTGTCCCTGCCT 

AGCTCTGGCTCTGTGATTGGTGCTCCACGTCTTCCCAGACACCTCGGGGCTCCTGGGCGGGAGAAAGCCGGATGTG 
GGAGCCCTGGAGTAAACCTCAGGGGGCCCTTTCCCAATCACCCCCTTC
CTCGCTCCTTTATTTTTTTGTAACTGTCATTTCTATAACTCT 

TCAAGTCTGCGGCCCGTGTGTGTCTCAGGGGAAGTGGATGCTGGAGTGGGCAGAGGGCCGGCTGGGA 

TTGG CAGCC CG CAG ACT CTTGTTTCGGT CTCGGACCCT AGCGGCTT C AG CAT ATG CCTT AG T AACTCTGG ACTGAGAAATGG CTG A 
CACGAGGTTCCCGAGCCTTACAAGCTGGGCAGGCT^ 

CCCCAGCACCGTGTGAAAAGTTGGGCACAACCGTGTACACCCACCCCAGGG

GCCTGCTGGOSGCCCAACCrrTGCCCAATCrGTGATTCCCATTG 
TAGGCCTGCGGCCCCCCACACCCGGAAATAAAAACAAGCCAC^CTTGTCATCCCAG 
AGATTTAAGACTGCCCIAGGTTGAAGAGTAAGTTCAAGGGCAGCCTGGACATCT^ 
TTGTTTTGTTCTTTTTTTCGAGACAGGGTTTCTCTGTGTAGCCCTGGCTG 

ACTCAGAAATCCGCCTGCCTTTGCCTCCCGAGTGCTAGGATTAAAGGTGTGCGC

GGCTGTCGAGTGTGGTTCACGGGTAGCCTCGCTCAGACAGGCTGC^ 
GGTTTCCATTGCTGTGAACAGACACCATGACCTGGCAGCTCTITCTG 
CAGGAAGCAAAGCACAGGCATGGTGCAGGAGGGGCTCAAGTCT^ 

TCCAACAAGGCCACACCCACTCCATCAAGGCCACACCTCCT CTTATTCAAAC CACCAATCAC 

AACTGCITTTCTAGAGGACCCAGGTTCAATTCCCIAGCATCCACATGA

CCCTTACACAGGCACACTTACAGGAAAAGCACCAATCAGTGCACATAAA^ 
CCTTCACCCTTAACAGGCAAGCrGAGGTCCTXXKSAGTGTGTTAGA 

AGGGAAAAGACAACCTTGTTTGCTGGCTC^GAAGAGTGCCAGTGGGGTACCCACCCACAAGGCT 

GGCAGTTACAAAGCCGCATTGATTGGCTGCTGATGACGAAATGAAGA^

TGGGGAGATTGACAGGGGCAGATGGCCACAAAGGGTCCAGTGACT^ 
CAATGACATGGAGAGACAGATGAATCCTTCCCAGTTGAATTCTC^GGCT 
CGTTACAACCACCAAGCGCAGCCAGGCTACGTGATGACTGGGGGAAGTCCCAGACG 
CC^CGGAGGAGGCTGAGAGTCTGTTGGAGGCAGCCGAGGATCC^GAGGGAAGAGGCACAGGT 
TGGGACTCCAAGGGAGGACGGATGTAGGGTGATGGGAAGAAAGGAC
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AGGGGACGCTGGCCCTCCGGAGGGAAAGCTGTCRAAGAAAGGCCTGGCTAATC^ 
AGGCATCAGACACCTTGGGGCAAGGAGGGCACCCATGAGCTAGTACA^ 
GACACAGGTGGGTGAGCAGCACCCACGGGCAGCTGAAGCTGAGTATCCTC^ 
GAAGGATCCCTATGAATATGGCTTTTTTTGTTTGTTTGTTTGGTTGGTTGGGT^ 
TAAAGACATACACCACCATAACCCAGCTTTGTAAATCTGGTT^
AAATTGAGGCAAAGTCTCAAAGAAGCTGACAGCACAGGCT 
TGAGATGGCTCAGCGGTTAAGAGCATTGACTGCTCTITCC^ 
ACCATCTATAATGAGATCTGAGACCCTCTTCTGGGGTATCTGAAGACAG 
TAAAAAAAGGAAAAATTATTTATTATTTTATTATTAAATTAATTGTTGTATTTA 
TTTATTTTGTGTGCATGAGTGTTTTGCCTGCATGTGTGTGCACTACCTGTG

GGTCCGGGAACTGGGGTTGTATGGTTGTTAGCCGCCGCGTTCATGCTGGTTCTCTGCGAGGCC^ 
TGTCAGCTCCTCrrACTTTCTTTTAGACAGGGTCTCCTGTAGCTCAGGCT 

TAAATCATCTT ACATTTCTGCCTCACTTCCACCTCCTGAGTGCTGGG CTTG CAGGCATGTG CCACTGGGATGGGTTTTGTTTCCTG 

CTGG CTTCT CACCCCGTAGCTCCGTGTGTTGGGT CAGGG CCAG C CCTGTGCC CACTGAG CCATTAACCCT ACTG AGCC CACTGT AG § 
15 GGAAAAACCATAGAGGAGCTTCTATGACCCTCATGGGTCCCT ' 

AGCTGTTGACTTTAGCCTTGGAATTTCCCAACCAAGTCTACACTGAGCA 

CTCTTTTGGAGTGATCTTAGTCCCCACAGGGCCCCCATCTTCTCCA^ 

CTGACTTGGTTAAAATGTTGAATGCAAGGAGCGGAGGAAACGGCCCAGCT 

TTCCCAACAGXSATGGGGCTCACTGGCrGCCCTG^ 
ATGACCCCTGGTGACAGACACACTTGCATATGCATCTGTCTGCACACA

GC CTG ACTGT AATCCCAGT ACTTGG GAGACAGGAAGAG AGAATCAC CTTCAGTTTG AG G CCAACCT CGTCTACAT ATC CAGCAC CT 

TAGGAGACTCTGTTTCCAACTCTTCAGGGTGGCTCA.CATCTCTAATCTCAAC 

GGAAGGTTTGCTTGGGTTCAGTGCCAGCCTCCAGAGGCAGAGGCAAGCAGATTTCTGAGTTCGAGGCCAGC < 
TCCAGGACAGCC^GGCAGCCAGGGCTATACAGAGCAACCCTGTCTCAAACAACAACAATAACAACAACAACA 

2 5 GAGTGTGCGTGTGCGTGTGCGTGTGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGITTACTATTTACAGTAAACGAG 

AGGCAGCCAAAGTAGGCGAXXSAACTCCTTGTCCCCTGCTGGGCTCTCTGGGTTGAGATCm 

TT CT CTG TCC CT CAGGGTG CT ATCAGCAG TGGGAACAG ATGGGGTCGGGAGAG AG ACTGGGGG AGGGG AGAAT CAGGGTGGAGG AG 
GGCTGTGATTTAGGTTTACCATGAAACCGGAGTCATCACCGGTGGGAGAGGCTGGTCAGAGTTCCT 
TAGGACCCAGGCCTGATCCGGTGGGGAGGGGAATGATTGGAAAATGCTGCCT^ 
30 ' TCAG CCATCAATGACAACCTTTCTCTTCTCACTGAAAGCAATTAAAG AT CTGTAAG ATGAGTCAGTGTT ACCCATTGT AAGCTGCT 

ACACACACACACGCATGCACACAAAAATA^ 

CAGTACTCGGG AGGCAGAAG CAAGTAGAT CT CTG TG AGTTT G AG ACCAG CCTGG TCT ACAG AG TG AGTTT CAAGC CAG CCAT AG AT 
AGCTACATAGCCAGATCTTGTGTTAGCCTCCTCCTTCTACCAACAGAAATTAAGATC^ 

3 5 ACAACTCAGTGGCTATGAACATTTGCTGTTCTTGCAGAGAAACC^ 

CTAGCTCAAATTCCAGAGGACCTGGCTTCTGAGAGCACGGTACACATGGGGTGTTCTCTCT 
ACACACACACACACACACACACACACACIACGCAG 

AGGGGATT CAACG ATGT CATG AG TC C ACAGAAAT C CT ATC CTC CCCTTTTT CGGGGTGTT ACACGGGCGGATC AAACC CAGGGNNN 

NNNNNNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN*n^ 

NNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NTWNNNNNNNNNNNNNNNNNNNNNlWro 

NNNNNNNNNNNNNNNNNNNNNNNNN1W 

NNNIWNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNWTONNNNNl^ 

NNNNITONNNNNNNNNNNNNN^^ 
55 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNIWNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNIWNNNNN^^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNMWNNNNNl^ 

NIOJNNNNNNNNNNIWNNNNNOTWNNN^ 

NNOTWNNNNNNNITONNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 
65 NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNM^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

CTTGAGTTCAATCCCAGCAACCACATGGTGG CTCACAACCATCCGTAATGAGATCTGACGCCCT CTTCTGGTGTGTCCGAAGACAG 
CTACAGTGTACTTACATATAATAAATAAATAAATCTTAAAAAAAAAAGAAAGAAAGAAAGAA^ 

aVGCTTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTACACAGA 

AAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGGAAGGA 

AGGTCCAATAGGTTGGGGGCATGGCTCAGTGTTAGAGGGCrrTGCCTAGCA^ 

GGGATGGTAGGTAGGTAGGTAGGTAGGTAGGTAGGTATCTAGATAGATAGATGATAAATAGGTAGATGGATGGTAGATATATAGAT 
7 5 AATAGATGTGGGATAGATAGATGATAGAC^TATACATAGATGATAGATTATAGATAGATAGATAGATAGATAGATAGATAGATAGA 
TAGATAGATAGAO^GACAAATGGTAGGTATATAGATGATAGAAAGATGAGGGATAAATAGATGGTAGATAAGTAGATGATAGTAGG

TAGATGAGAGGGAGAGAGAGAGAGAGCGAGAGAGAGCGAGAAAGGATATAATAAGGTCATTAGGCTAATTTG 

TGGGGTTCTTATAAGACAAGTGATCAAGCAGGCCAGATATGGGGGGAACTGATC^ 

GCAGGAGGAAGGGGAGGTCGAGGGTATCCGCAGATACnTAGT 
5 TCCG C ACT AACAACAAAAATTCTGAAAGGCAAACCT CCCTT AACTCAT AAAG CATG AATTGGGG CG T ATGCGGGCCTT CAGAGCCA 

TGTGGTGTGTTTGGGAACAGACGACAGGAAACACTGTTGAGAGGACTCA 

ACAGGCGTTGGGGCTTGCTGGGCCGCTCACAGGAAGCAGCGCA 

GAGTGGGCCCTGACACACCCGGCCAAAGCAAGCACTGGCCTCTGGACAGAAAGC^ 

TCACTGTACTTGAAGATGTXyiGCCCAAAGAAGTGGGGGGTC 
ACTTAGATATATTTTTAAGAAGAGCAGTCGGTACT

AG AG CTTAGAGGCACAGGCACTTGTCACCAAG CAGCAACATGAGTCAG CCCTG AAAAT CCACAGGCAGAAGG AGAG AG CTGACT 

CCTGTCCGTTGACCTTCATCTGAATACAGTGGTGTGTGGTGGTGTGTGTGTC 

TGCCATCTGTGTCAAGAGTGTGAGGAGGACCCAAGAGAGTTCCT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

N 

MOUSE SEQUENCE - mRNA

GAGCTAAGCCIGTGTTGAACAAAGGAGGTCGGGCACAGCTATCC^ 
CAACGGCIACGGCAGAGGCTGTGCAGATTGAGTT^ 
AGGTGAACGCATCCGCTAGTAGCTTGAAAAAGAAGCAGATCT^ 
TGTCTGCGCACGCACCTGGGTCGCTACCTGGCCGCCGACAAGGACGGCAA 

2 5 CCGCTTTCrCGTCGTGGCGGACGACGACGGCCGCTGGTC^ 

GCCTGTCCTGCTTCGCGCAGAGCGTGTCGCCX3GCCGAGAAGTGGA^ 
GTTACCCGCAAGCGCTACGCGCATCTGAGCGCGCGGCCGGCCGACGAGATCGC^ 
GCTCATCACCTTGGCCTTCCAGGACCAACGCTACAGT^ 
CAO^CGGAGCCCX3CCACGGGCTTCACGCTGGAGTTCCGCTC 

3 0 CCGTCCGGGCCCAGCGG CACCCTCAAGGCTGGCAAGGC CACCAAGGTGGGCAAAGATG AG CTTCT^ 

TGAGGTGGTGCTGCAGGCGGCCAACGAGGGGAACGTGTCC^CGCGC^ 
ATCAGGAGACCITCCAGCTGGAGATCGACCGCGA<^CA^ 
GCGACCGGAGGTGTGCAATCCACTGCGTCCACCAAGAACGCCAGCTGCTACT 
GAGAGCCTCCAACGGCAAGTTTGTGACCGCCAAGAAAAA 

3 5 TCCTCATGAAGCTGATTAACCGCCCCATCATTGCGTTCC^ 

GATGCCAACCGTTCCAGTTACGATGTCTTCCAGTTGGAATTCAATGACGGCGCCT 
GACGGTGGGTAGTGATTCCTCGGTCkCCAGCAGCAGC^ 

CTCTCAAGGTGGGCGGCCGCTACCTGAAGGGGGACCACGCTGGGGTCCTGAAGGCCTGCGC 
TGGGAGTACTAGGGCCACCTGCCTCTGCAGCCK5CTCT 

GAAACCCCTTGCCTTTCAAACTGGAAACCCAAGAGA

TGCTCCCCATGGGTCGGTGGCTGCAGACTGTCCCCAGGAGGACT 

TGAGCITGTAACTGGAAGACCCCGTCCCTATCCCCGCTCCCG^ 
CCAGCCTTTTGGCGGGACTGAAGCCGGGCATGGCCAACCTTGCCCACAAGTGTT^ 

4 5 CATCCTGCAGTGTTTGGGCCTGGCTCTTTGACTCAAA^ 

GGCCTCTCACCCACCTCATTCCnTTTCCCCCTG<3CCT 

GGAAATGTTACTGTTGAAAGGCCCTAGGCAAGCCTGCCCTG^ 

GTCCCTGCCTCCCCAGCGGTTCCCTCCCTCCCTCCT^ 

CACCTCGGGGCTCCTGGGCGGAGAAAGCCGATGTGCCCCrCCCTGGGAGCCCTGAGTAA^ 

5 0 CCTCCACTOACCCCTCAACACGATGCATCTGACTCTGGGTGT^ 

TGAAGACCCATGATAGTAAGCTTTGAACTGGAAAATAAAGTAAAATCAAGTCTG 

MOUSE SEQUENCE - CODING 

ATGACCX5CCAACGGCACGGCAGAGGCTGTGCAGATTCAGTTCGGGCTCAT 

5 5 CGGGTTCAAGGTGAACGCATCCGCTAGTAGCTTGAAAAAGAAGC^ 

CGGCCGTGTGTCTCCGCACGCACCTGGGTCGCTACCTGGCCGCC^ 

CX^GGACCGCCTGTCCTGCTTCGCGCAGAGCGTC 
TCTAQVGCGTTACCCGCAAGCGCTACGCGCATCTGAGCGC^ 
60 GTCGACTCGCTC^TCACCTTGGCCTTCCAGGACCAACGCTACAGTG 

CCTTGTGGCACGGCCGGAGCCCGCCACGGGCTTCACGCTGGAGTTCCGCTC 

ACCTGGCTCCGTCCGGGCCCAGCGGCACCCTCAAGGCTGGCAAGGCCACCAAGGTG^ 

AGCTGCGCTGAGGTGGTGCTGGAGGCGGCCAACGAGGGGAACGTGTCC^ 

AGAGACCGATCAGG AGACCTTCC71GCTGG AGATCG ACCGCGACACAAGAAAGTGTGCCTTT CGCAC CCACACGGGCAAGTACTGGA 

6 5 CACTGACGGCGACCGGAGGTGTGCAATCCACTGCGTCCACCAAGAACGCCAGCTXSCTAC^ 

ATCACTCTGAGAGCCTCCAACGGCAAGTTTGTGACCGCCAAGAAAAATGGCCACGTGGCCGCCTCGGTG^ 

GGAACTCITCCTCATGAAGCTGATTAACCGCCCCATCATTGCGTTCCGGGGGGAACACGGGT^ 

GCACTCTGGATGCCAACCGTTCCAGTTACGATGTCTTC 

AAGTACTGGACGGTGGGTAGTGATTCCTCX^GTCACCAGCAGCAGCGACACCCCT 

7 0 TAAGGTGGCTCTCAAGGTGGGCGGCCGCTACCTGAAGGGGGACCACGCTGGGGTCCTGAAGGCCTGCGCGGA 

CCTCACTCTGGGAGTACTAG 

HUMAN SEQUENCE - GENOMIC 
TTGCATTTGAAGCTACCATGACTGCAAGAGGGAGTTCC 
CACTGCCCCGACAGTGCCCATGTATGAATTAGAAAAAGTGGGCCAGGCGT
CTGAGTGGGCAGATCATGAGGTCAGGAGTTCGAAACCAGCCTGGCCAACATGGTGAAACCCCATCTCT

AGCTGGGCTTGGTGG CACACATCTGTAGTCCCAGCTACTCAGGAGGCrG CTGGGAGGTGG AGGTT 

GCAGTGAGCCGAGGTCATGCCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTTTC 

AAAG AAAG AAAG AAAAAG AAAAAAAG TGGG CCAGGTGTGGTGG CTCAT AC CTG T AAT CC CAGCACTTTGGGAGGCTG AGGTGGG AG 
5 GATTGCTTACAGCCAGGAGTTTGAGACCAGGTTGGACAAC^ 

CATGGTGTTG CATTCCTTTAGTC CCAG CT ACTT AGGAGGTTGAGGTGGAAGGATCACTTG A C CCTCAT 

ATTGTCTTATGCCCAArrrCTGCCTCCAAAGAAAGAAAAAGTAAAAACTAAAAG^ 
CGCCACACCCTGGGCCTCGTAGTTAAAGATCGACCCCTGATCTAATCGGTGATGTTATCT 
GCACTGTGAAAATCCCTATCTGGTTTTGTTCTGATCTAATTACC 
1 0 CACGACCCTCTCACGTGCACCCCCTTAGAGTTGTGAGCCCTTAAAAGGGACAGGA^ 
CAGGAGTCTTGCTGATGCCCCTGGCCAAATAAACCCCTTCCTTCTTTAACTCGGT 
CAGAGTCTAGGAGGCAGAGGTTGCAGTAAGCCAAATTCACGCCACTC 

AAG AAAAGAGG C CAGGCGCAGTGG CTCACG C C CAG CTAATTTTTG TACTTTT AGTAGAGACG GGGTTTCACCATGTTGG CCAGGCT 

GATCTCAAACTCCTGTCCTCAGGTGATCCGCC^CCTTGGCCTCCCAAAG 
AGAAGAAAAGGACGAATCAGGACAGGAGACAATTAGAAGCTCTGTTCATTCG^

AGTGATTGGTTATATGTTGTAGACAGACCCACAGGGTGG 

CCATGTAGCAAGTAGCTTCAGGACGTAATTATTTAGCTCAATAGAAAG 

CGGAGTITCACTCTTGTTGCCCAAGCTGGAGTGCAATGGCGCGATCTC^ 

CTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCGTCTGCC^ 
2 0 riTCACTATGTTAGCCAGGATGGTCTCGAACTCCTGACCTTGTCATCCACCGGCCT 

TGAGCCACCACGCCCGGCCTTTTAACTGCTGTTACTTTTTTTTTTT^ 

ACACCAGCATTCTATACATCTTGCTGGTGAACTCACATAATGGTTAGTT 

TGACTGCTGTTACATTTTAAATGCCTTTCTGGGCCTGATAATTTAAGGG 

2 5 TTGTTTG TTT CTCACT ATTG CTCAAAAGCCACATG TGT CAAGG ACAG CT CTTGTT AACGGG CAG AGAAG CTT ATAGAACCTTTC CA 

GGGCTGGCGCGGTTGTTCATGTCTGGGGGACAAGAGCGA 

GGCACAGTGGCTCACACCTGTAAGTCCCAGCACTTTGGGAGATTGAGGTGGGAG 

TAGGCCACATGGCAAGAACCCAGTTTCTACAAAAACATTT^ 

CTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCTGGGAGGCGGAG^ 

3 0 CTGGGCAACAGAGGGAGACTCTGTCTCAAAAAAAATCAAATAAAGAAAAAA^ 

GGCGCTGGGCCAGATGCCTGCCCC^GCCCTGGGACACCCAAACCTGGTGAAAGTGCCAAG 

GGGGCCGGGAGCTGGGATTACAAGOSTGGAGGCAGGCGC^ 

GCAGGGCTTTGGCAGGCAGCTTTGTTGGGCAGGCAACA 

GAG AAG AG AAG GGAGGGTGGCAGGAGAGATGGG CAGAGGC CCCCCAGG AG CC CAGAG C CTTCAGGGCTTTGGG C C CTCTTGGGG C A 

3 5 CTGGGG AG CCACGGGAGAAGTGTGGGGTGGAGAGGGGCGC C CTGG AT TT GAC CAC CTTCAGGAAC CT ACCTTGGCTGCCCGGGTGG 

GTGGGATGGAGGTGATGAAGGTGGAGGTCGTAGCATTGTC^CTGAGAGCGATGCTTATTCTGA 
AATCGGTCAGAACAGTGCTCGGGCTGGGCXSTGGTGGCTCACGCCTGGAATCCCAGCACTTT^ 
TGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCAAAACCCCGTCTCTACTAAAAACA 
GCATGCCTGTAATCCAGCTACTCCAGAGGCTGAGGCCTCCCTGGGAGGCGGAGTGCA 

4 0 AGCCTGGGTAACAGAGTGAGACTCTGTCTCAAAAC^UVACAAAAAAGAACAA 

GGAGAAGAACAGGTGCCTGGGTCCTCCCGGCTGACGGTGACGTCACTATGAGCCAC 

AGGTGGTT CAGGG AGG AG TG AGTGGCAGATGGTG CAGG CT CAGGGAGGAGTG AG rGGCAGGTGGGGCCTG AGGC CAGAGG CAGGAG 
GACCGTTCAAGACTGAGGTGTCTTGGGCACTGGTGCCGGG^ 

AGGATGAGGTGGCGGCCAAGCAGGGAGGCCAAGGCCTGGCAGCAGCCGGCAGCCCCTCTCCT 

4 5 GGGCTCTGCCCAGGGCGCTAATGTCCTCTTAATTAGCCAGGC^ 

CTTGGGTTTGGAGCAGGACCAGCATCGGCCACCGGGTGAGTAGGGGGGAT^ 

TGAGTCCCCACTGAAGCCAATCCAGGCCCCGACCTCTGGTCATC^ 

ACGTTTTCAGGGCCAGATTCCAC<3TCCCCCACCTATTTTCCrrTTGCT 

CCCAGTGTAGGCATCACCTC CTCCTGAAGGCCCTCCTGGACTGCACAGTTAACTCTGGAGCCTCCCCTG CACAGGGCAGCTCATGT 

5 0 ATGGATCCACAGGTGAGTTCCTGTCTTTGAAGGGTTACAGTTTAGG^ 

CCCTCTTTGCAAATCCCATGGGAACTATGAGCCTGGGTCCCCCAA^ 
CTGTATCCAC^CCTCAGTTTCCCTGCTTGTAACCAGAAGTC 
CrTGTAATCCCAGCACTTTCAGAGGCTGAGGCAGGAGGTTCZACTTGTGATC^ 
ATCCCTGCCTCTACCAAAAAATTAAAAACAATTAGCCAGGCCTG 

5 5 GTGGGAGGATCACTTGAGTCCAGGAGTTTGTGG CTGGGGTGAGCTCTGATTGTACCACTGCACTC CAGC CTGGGTAAG AGAATGAG 

ACCTTGTTTTATTTTATTTTATTTTTATTTATTTTGAGA 

ACGC CTGG CT AATTTTTTGTGTTTTT AGT AG AG ACAGGG TTTCACCAT CTGGACTTGAACTC CTG AC CTT AGGTG A 

TCCATGCCCCCTTGGCCTCCTAAATTGCTGGGATTATAG^ 

6 0 CAGGTCCAGTGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGAGGGCAGATCACCT 

CCAGCCTGACCAACATG CAGAAAC CCCGTCTCTACTAAAAATACAAAATTAG CAGGGCGTGGTGGCGCACACCTGTAATCCCAGCT 
ACTTGGGAGGCTG AGGCAGG AG AATCACTTGG AC CTGGGAAGCAG AG CTTGTGGTGAG CCGAG ATTGCAC CATT C CACTCCAGCCT 
GGGCAAAAAGAGTGAAACTCCATCTCAAATAATAATAA 

GAGGCAGGACAGTGGGAGTGACAGCCTGGTCCTGTCCTCCCGGGCTCTGGCTGACCATGCrGTAGTG 
65 GCIACAGCTGTCCTGTGGCCCTGTGGTCAGCCGGGAACCCAGGCCT 

GGGTCCCATGCAGGGCGCTXKjCCGTACCZAGGCCTCCTCGCGGGTGCCACAGAGCCAGCGGCTC 

CAGG CT CT CGG C CACG AGGCCG CTCATTT ACAGAGGG CTGAG CCG CTG ACAT CTG T G CGTTTGT AACATGGACAG CATTTCTGGTC 

CCATACAGACGCCTGGGCCAGCCACATTTATCTCCCCTTGAGAGAAGGGAAACGA 

GTCACACAACTTTAATGATGACCCTGGGCAGGAAGCTCCC^ 

7 0 CTTGTCATCCCAGCCCTTTGGGAGGCA^ 

ACCCATCTCTAAAAAAAAAAAAAATAGGCCGGTCGCGGTGACT 
TCACGAGGTCAGGAGATTGAGACCATCCTGGCTAACACGGTGAAACCCCGTCTCT 
TGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCCG^ 
GACGAAACCCTGTCTCrACTAAAAATACAAAAATTAGCCGGGCGTGGTGACACGCGC 
GCAGGAGAATCGCTTGAACCATCGAGGTGGAGGTTGCGGTGAGC
ACTCCGTCTCAAAAAAAAAAAAAAAAATTAGCCGGGCGTGGTX5GCGGGCX3CCTC

GAATGGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACT 

TCTCAAAAAAAAAAAAAAAAATTTAGCCGGGCATGGTGGTGCATGTCT^ 

GCTTAAGCCCAGGAGTTTGAGGCGGGAGTGAGCTGTGATGGCACCACTGCACT 
5 CCCTATCCCCCGCAAAAAAAAAAAAACAACTCAGGGCTTTCCCIA 

GGTATATTTTGCCCCCAGGCCCCAAATCCTGGTCTTTGGACT^ 

TGAGGAGGGTGTGAGGCTGCAACGTTGCGGTGAGAAGGGGGTCCC 

GAGGGAGGTTGTGCAGCCAGAAGGAGCAGCCCGCAGCCTTTGGTTGTTGACTCCTG 

CAGGGTCCGAGTCCTCACTCAGGGAACTGAGTCCAGGGTGAGCTGAGCAGCCCTTCCT 
1 0 GCTG CATGGTGTCCAGTGGCCAGCCTGGGAGGAGCTCAGGACTGG CCT CCCT CGTACCAAGGTGG CTCAGAT 

GGCACCTGGGTTCCCGAAGGGCCCAGGAACACAGCGTCCATGTCCCCATC 

GGATCATGTGACTTGTTCAAGGGCAGCTTGTCCTGCCCCGGACACAGAGGT^ 

AGGGCATGGTGCCGAGCAGGAATGATTTTCGAAAATGCTACTCTGTGGTCGGGCACGGTGGC^ 

GGGAGGCCCAGGCGGGCAGATCATGAGGTCAGGAGTTCAAGACCAGCCTGGCCA^ 
1 5 AAAAAACT ACCCGGGCGTGG TGGTGG ATG C CTGT AATCCCAGCT ACTCAGG AGG CTG AGGCAGG AG AATCGCTTG AACC CTGG AGG 

TGAAGGTTGCAATGAGCCGAGATTGCACCACTGCACTCCGACCTGGGCGACAGAGAAAGACT 

ACCCCGAGATTCTCTTTTCCCCCGTCCGGAGCTCTATGGCCATCTGGAGCTCGTTCTGTCCACAAGGACAC^ 

CTGTGGACCAGGGCTCACTCACTCACATCTGTACCCAATCT^ 

ACAGTGGCAGCCCCATCACCAGAGGCATTCAAGCCAAGGCATTTTCT 
2 0 ATCTATCTATCTATCTAGATGAGATCTTGCTATGTTGCCCAGGTTGAACrCCTGGCCTCA 

AAGTGTTGGGATTTCAGGTGTGAACCACTGTACCTGGCTGGAGTGCAGTGGC^ 

GCTCAAGGAATCCTCTTTCCTCAGCCTCCTGAGTAGCTGGGACCA^ 

CATAGTATATGGCTCAGGGCAGAGCAAAGAAGCACAAAAACATGTTGGCTTGCGG^ 

AGCCCAAAATTAACAGCCACAACGTCAGTGTCTGTGGGAGGGGTTGCCAGGGGCGTGCG 

2 5 AAACCCATTTGAAGCAGGATGGCAAG AAGGTGACACCATCTTCCCCCG CGCACTGAAAGCCCCTGG CTGGGATTGCCTGGGGCAGA 

CACAGGCTCGGACCAGCCCCAGCAATCCCAGTTTATCAGCGA^ 
TTGTGGGCAGCCTGTGGTCATGCTGGCATTCCAGGGGCCTTTTGGCATGTGGG^ 

GCGCAGAAAAGGGAAGTGTCCCTAGAGGGGGTGGGTGAGGGCGTGGGAGGTGGTGTCTGCAGGGAATGTCCCCTTTGG 
GATGGAGGGTTGGGATTCTGAGGATGGGGGGGGGGGCTGTAGCCAGCACCATGTCCCTCCTGTCT 

CCGTGTCAAAAAACGAGAACACCAATTCCTGTCCTGCCCACTCACCACCAGGTGGGACCTGAAC 
ATGTCGGCCACTGACTCAACCACTCTCCCGGAGACCTATTTGGGCCACCCGAGGCGGGTGCCTG^ 
GGTCTTCAGGGCAGCGGCTGTGGGGCTGAAGCCTCAAGGAACC 
ACCTAGTTGGGCGTGTTGGGGTCTGTGGGTCCCAGGTCCTGGCCT 

3 5 CTCGTCTGCCTCTGGGCTTCTGTAAGGCTATGTGCTCCAAGGCCACCTCCT 

CCGATCTGCACCGTGGTCTCTGAAGTCTCCATCGTGACCTCAAACCTCGCTCGTCCTTGCT 

TCCGAGGTGGGGGACATCCGGGGGGGTTAGGTGGCTGGCGCGGGGAGCCGGGGTTGTGAGGGGTGATGTCCTCAGGC 

GCGGGGTGCGGCGAGGACACCGGTGGGGTGAGAGCACCGGCGGGGCAGCAGCGGGGGCCGCAGCGCCGGGTC 

CCCTCCCGCGCGGAGCCAGGGGCGGGACAGGGGGGCGTGGCCTGGTGGCGCTGA03TCA 

CGCCACCC^CCTCCCGGGGCCGCGCAGCGGCCTCTCGTCTACTGCCACC^ 
AGTTCGGCCTCATCAACTGCGGCAACAAGTACCTGACGGCCGAGGCGTT^ 

AAGAAGCAG ATCTGGACGCTGGAG CAGCCCCCTGACGAGGCGGGC AG CGCGGCCGTGTG CCTGCG CAGCCACCTGGG CCG CTACCT 
GGCGGCGGACAAGGACGGCAACGTGACCTGCGAGCGCGAGGTGCCCGGTCCCGACTGCCGTTTCCTCATCG 

45 GTCGCTGGTCGCTGCAGTCCGAGGCGCACCGGCGCTACTTCGGCGGCAC CG AGGACCGCCTGTCCTGCTTCG CG CAGACGGTGTCC 

CCCGCCG AGAAGTGGAGCGTGCACATCG CCATG CACCCTCAGGTCAACATCTACAGCGTCACCCGTAAGCGCTACGCG CACCTGAG 
CGCGCGGCCGGCCGACGAGATCGCCX3TGGACCGCGACGTGCCCTGGGGCGTCGACTCGCTCATCACCCTCGCCTTCCA 
GCTACAGCGTGCAGACCGCCGACCACCGCTTCCTGCGCCA03ACGGGCGCCTGGTGGCGCGCCCCG 
CTGGAGTTCCGCTCCGGCAAGGTGGCCTTCCGCGACTGCGAGGGCCGTTACCTGGCGCCGTCGGGGCCCAGCGG 

5 0 GGGCAAGGCGA.CCAAGGTGGGCAAGGACGAGCTCTTTGCTCTGGAGCAGAG 

GGAAOSTGTCCACGCGCCAGGGTGAGTGGGGACGCTGCCCCCGCCrrCTCCTGGTCCGTGCACAA^ 
CCAGCCTCCCGCCCTTTCTCGCTCGCGGCGCCGCTGCGGTCCGGAGCACTGCCCATTGCGCCCCCGCTAGGC 

TAGGCCCCGCGGAGTCGCAACCGTACGTGCACCCTCCTAACC^ 

5 5 TCGGATCAGCTGTCCCAGCTCTTGCGGAGTGGGAGCCCCTCACCCATTTC 

GGGGGTCCGAGGCAAGGGTCGCCACCCGCAAGGGCGCGCCTCCACCCCCACCGGCAGCCTTTCGCGGGCGAGATGG 

CCAGGCCTTTGATCCCGGCGGGGCGCGCCrCCACCTCCCCGTCTGCCCGGCCTTCCTCCACCTC^ 

GCCTCCGCTGGTGGGXK5GGGGCGCGGGGTGTCAGCCCTTCCCCCCAGCCCCTCCTCCCGCGTCTG 

CTTTGTGAGCAGGGGGG CGGGT CGCCTCG ACTGGGTCCTC CCTCGCCCCCTTTGTCCTGCTCCATCTCTG CAGATGGGAAAACCAG 

6 0 ATGCGGCGGGGCGGGGGAGGGGATCGGCTTTTGCGGTTCACCCCTGCAGAGGAGCCCCC05CG 

GTCCCGAGGTTCCCCAGGAGGCGGTCTCCCTCCTCGCGCCGCGGCCCGGGA^ 
CGCCCTTCGCCCGCCGCGCGCGTCTCCAGGCCGGTGCGCTGAGCTCCGCT^^ 
GAGACCCCAGGCCAGCTCCCTCGGGAAGCCCCTACCCTCTGGTGAACCCATCCCCTAGGGCGCTC 
CrCGATCCGTCCCCTCCGGGGCCTCCTGCAGGCACGGCTATTTGCG^ 

6 5 TCCCCGAGGGGCTTTCTCTTCCTTTTGCCGTCCTCTTCCACTCCTGGCGGAGAAG 

GGAGGGAGGGGCGTGGGGGGATCITTTTCCCTGGATAGGAAGTCGGACCCCAGGCCTTGGAGAT 

TGTCCTCAT CCCTCCTCTTGGAAGGAGCCCTCCTGACGGCCCCCCTCTTGG CTGTGGGCTCTGCAGGAGGGGGAAAGACCCCCCAT 

GGCAGTGGGGGTGAGGGTGGAGGCCTGGGTGGGGTAATGGCCATTTTGTGCCT 

GCTGCTTGTCATGGGTCACACAGCAAGTGACCACAAGGAAGCAGCATGCAGAAGTGGGTGCA 

7 0 TGTTTCCCACAGCGCATGGCGGGCAGAGCTCCT CACCTG ATCAG CAGGCATTG AG CC CT ACTAAGGGGACCTGCTG AGCCATGAGT 

GAACCAGTCGGGTTCACGTTCTCTTGGGGACATTACGAAGTGTCAAGGAAAAGGCAGCGGGT 
TTTCTGAGCAGGACATTTGAGCTGGCCTGAAGGATGAATAGAAGGTGGCAG^ 
GCTGTGGGTGTCTCCTGTGCAAAGGTCTTATGGCAGGAAGGTACAGAGAAAGCCCA 
GGCCA.CGGGACCTTGGGTGAGAGCTTAGTGCAGGGCTATGGT 
CGCAGAAGTAGGTGGCCTTGGCCCTGGGGGCCAGGAAGTTG
CCTCTCCTGAGAAGCTTGCTGGGGGCGTCTGGT

TGGTGGTGATGGCTCAGCTTTGGAGTTGAAAGGGATGTGGGAC^ 

TGGCCTCAGATCTTGGGGTCCTTATCTCGTTTCTATGTCAGCCJUl 

CGTGGGGGGCGTGTCTGCAGAGGCCATTTTGGACGGGCCTGCGTGGCAAGGGGGAGGCGTCTGC^ 
5 CGCCAGCCAGGAAGGGGAGGATCCGGACTTAGCTGGCAGGGGGGATGTGAATCATCTCTG 
GGGCAGGGTGCTCCCCTTTGAAAAACGCTGCGCCCrrGCTCTTCAGCCT 
TTGGGCTGGCCCAGAGCTGCCCTGGGCAGGGGCTTTCAGCC 

TGTCTrCTCrCACATCTCTCACATCTGGGAGTCCTCTCCCGCCA^ 
1 0 GTCCTCCCTGCTGGCTTGGGACATGCCCTCTCCCAGCCACTCTGAGGAGGGCTGACTGAGGAA 
AGGATGTGTAAGAGTTCTCCCCACGAGGCGCAGGGCATTCTGAGCCCA 

G GC CAAGT AGTTGCAGGAATCGACTGG AAACCGGGGTGGT AAGG TG AAGC CAT AGAACATT CCAGGCCC C CT CC CCT AAATGAG AT 
GGAAGGAGTGCCTGTTTTGAAC^GCCAGGGGCATCrGGGGACC^ 

CAACCAGGGGGTGCCCCTCTGAGCCACCACAGACAGAATGGTTCAGAAGGCCAGGCACACT 
1 5 TTGGGAGGCCAAGGAGGGAAGATGGCTTCGGCCAAGGAGGGA^ 

GCAAAACCTCTGTCTACAAAAAATACAAAAAATTAGCCAGGCATGGTGATGCACGCCTGTCGTCCCMCTA 

TGGGAGGATGACTTGAGCCXX5GGAGTTTGAGGCTGCAGTGAGCCGAGAT 

CCCTCTCTTAAAAAAATTTTTTTTGGCCTGGTGCGG 

ACCTGGAGGTCGGGAGTTTGAGACCAGTCTGACCAACATGGAGAAACCCCATCTC^ 
2 0 TGGCGCATGCCTGTAATCCCAGCTACTCTGGAGGCTGAG^ 

G AT CG CACCATTGCACTCCAG CCTGGGCAACAAG AGTG AAACT C CAT CT CAAAAAACAAAACAAAAGAAAAAAAACAACCAAAACA 
AACAAACAAAAAACTGTAATTAAAATAATTAAAAGAAAAAAGTTAAAAAAAAAAAAAAGAGAGGATGG 
GGAAGGCCACTGTGAGGGGAGGACAGGCAGGGCTAGACCTCTCTGGAAGGAGGAGGGAGAGGTACC 
CTCTTAATCTCCTGGCTCCCGGGGGCITCTGTGGGCCTGTGACTCAGGATTTCTGTGCCT 

2 5 GAAAACACTGTTCCCIATAGTAACACAGCTCTAATTAGAGACT 

ACTGGGGGGACGACACCCCTTTCCCCTATCCCTCCCTGCTGGGGTGT^ 
CAACATCCTGGCGGAGCGGAGAGGCTGGGCCCTTTTGACAGGTGGGGAAACT 
TGCCTGGCCCCCCGCGGTGGGTGCCTC^CTC^CTGGCCTAAACCITTGC^TCAAA 
GGGGACTGAGTCGTGGTACAGATGGCGGCAGTTGTGTGGCCCTGGGCCCAGGAATGGCTGGGAAGCT 

3 0 GGGGATGAGGTTAGTGTAAATTTCATGGAGTTGCCAGGAGGATTAAGGCCGACGTGCA 

GTTAAGCCCTGCGCAGCCATTGGAAGTCTCATTGAGAAGGAGCCTCCCGGCIT 
CTGCTTATTTGCAAGGCTGTGTGAGGACGAGCCCrGTAAACCCT 
CCTTCTCCITTTGCCCCTCTTTCTGGGGAACCTGGAGCCCCTGATGCCCTCT 
CTCCCTCCATCCTTTCTGCTGCTGGGGTGGGCCTGCAGGAGGCTGTCCTCrTA 

3 5 TTCAGCGGAGCCCCAGCGCATTGGACIAGGTGGTGCTCTGTGGCCGGCACAGGCCAGGG 

GACATGCAGGCCCTTCTGAGC^WIAATGAGGGGGCGGTGAGGGCAGCrGTGCGGGTC 
CGGCTTCrGTGCAGTGGGGGTGGGGCGGGCCAGACCTTTGCAGGGCCTCTTGG 
CCTCGTGTTCCCTGGGGTGTGGCCTTAGGATGGTCCCGGCACCCTC^ 
GGCTGGGGTGTGGTGGCTGAGCCAGCCCACCCAGTGCGGTGGATGCTCGTCGGAGGGCAG 

4 0 CCAGGGAACCTGGGGGGTGGAGCCCAAGGGGTTCCAAGAAGGGTCCGGGGCCA 

GGCGTGGCAAGAGGGCCACCCACCTCCGGTCCAGAGAGCCAGCCAGACCCTTACT1TCT 
TGCCACCACCGGCCTTAGTCTCCCTCTTTGTGAAGTAGGATGAAGATCCCCACCrrcTGGAGC^ 
CAGGATGGAACAGGATGGGGAGGCCGGGCGTGGTGGCTTACCCCTGTAATCC^ 
TGAGTCCTGGAGTTTGGGCAACATAGCXJAGACCACCCCCATCTCTA^ 

4 5 CTGTGGTCCC^CCTACTGGGGAGACTGAGGGAAGAGGATCCTTTGA^ 

CTTGCACTCCAGCCTGGGCAACAGAGCAAGACCCTATCTCTAAAAAAAAA^ 
ACCACCCACC^GCCTCCCTCrcCrGTGGGTGCTGCCTGCCACCCACCXSCT 
ACTCTCCAGCTCTGAGCCAGGCACCCATACAGAGCCCGGC^ 
ATGTGAATTTGTGCTGCTGGAGCCAAACATCCCACCTCCAGGTTTTAGCT 

5 0 ATAATGAATAGCAATTGTGGGCTGGGCGCAGTGGCTCATACCT 

AGGTCAGGAGTTTG AGACCGGCCTGGCCAACAT AATGAAACTCCGT CTCTACT AAAAATACAAAAAAATTAG CCGGGCATAGTGGC 
GGGTGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGCAGAGGTTTC^ 

AAACCCGAAAGCCACCTTAAGTTTCCCGAAGATAGAGATCCCTGGGGTCCTACT 

5 5 GGAGTGCAGTGGCGGATCTCAGCTCA.CTGCAGCCTCCGCCTCCTGGGTT 

GATTACAGGCGCCCGGCACCACCCCCAGCTAATTTTTTTTTT^ 
GCGCGATCTTGGCTCACTGCIAAGCTCCACCTCCCGGGTTCAC^ 

GCCCGCCACCGTGCCCGG CTAATTT T TT TTTTT TGTATTATTAATAG AGACGGGTTTCACCGTGTTAGGATGATCTCGATCTCCTG 
ACCrCGTGATCCGCCTCCCrrCGGCCTCCCAAAGTGCrGGGATTACAGGCATGAG 

6 0 AGTAGAGATGAGGriTCACCATGTTGGCCAGGCrrGGTCTCAAACTCCTGACCTCATGATCCGC 

GGGATTACAGGTGTGGGCCACTGTGCCCGGCCAGTCCCATTCTACTTTTTAGGGACATG 

ACAGT ATG CAAAT CCAGAGACAG AGG ACAT CTGTG CATGTTT CTTT CTTT (nT'CCTTTTTTTriTTTTl ri l l I T TG AGATGGA 

TTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCAATCTCAGCTCACTGCAACCT 

GCCTC^GACTTCCAAGTAGCTGGGATTACAGGCATGCCCCACCATGCCCAGCT 

6 5 CCATGTTGGCAAGGCTGGTCTTGAACTCATGACCTCAGGTGATCCACCCG 

GCCGCCGTGGCAACCCX3CATGCTTCTTATGTATAAGAAAAAAACAGCTAGAAAGTTGT 
TGTGTCTGTAACCTCAGCACTTTGGGAGGCTCAGTTAGGAGGATCATGTC 
CAGACCCTGTCTGTACCIAAATAGAAAAAAAAATTAACTTGATGTGGTGGTGTGTGCCT 
TGGGAGGATCACTTGAGCCCAGGAAGTCGAGGCTACAGTGAGCT 

7 0 CCCTGAATCAAAAAGAAAGAGATGTATTTGCC^ACACAGGGGTGTTCTGCA 

TTCCTTTTTTTTGTTTTAGAGACAGGGTTTAAAAAAAAAlT'ri 1 I'l l i 1 1 ITTTTGAGGCGGAATTTCATTCTTGTTGCCCAGGCT 

GGAGTGATCrCGGCrCACTGCCATCTCCGCCTCCCAGGTTGAAGTGATTCrCCTC 

AGAGACGGGGTGACTCCATGTTGGTCAGGCTGGTCTCAAACTC^ 

GGATTACAGGTGTGAGCCACTGTGCCTGGCCCCTTTrTT Tr TTTTT T AAGTAGGGTCTTGCTAAGTTGTCCAGG CTGGTCTTGAAC 
TCCTGGTCTCAAATGATCCTCCTGACTTGGCCTCCCAAAAGGCTGGGATTGCAGGA
AAGTTTTCTAAAGGAAAGGAACATGTACCATCGTC

CACGGCCCCCCAAGATG CGCTGTTCCGCTGTGCTG CAGCCCCATTTGTCCTTTT 

CCCTGACATCAGGGCATTTCATCCCAAAATACTTCCATCTGTGTTC 

ATCAACAGTAATCAATGCTTAACATCTAATTCTGGGCCACGTTAAGACCCTCCT 
TTTCTTGATTCTTTTTGTTTTTTTTGAGACAGGG

AGCCTCIAGCCTCCTGCCTCAGCXjATCCTCCTGCCTCAGCCTCCCAAGTAGCT 

TTAAAAAAAGTAATTTGTAGAGACGGGGGTGGGGTGGGGGTCTTGCTATGTTGCCCAGG 

ATCCTCCTGCCTTGGCCTCCCAAAGTGTTGCGATCACAGGCATGAGCCCCTGTCCCT^ 

GTCATGTGGGAACAGATGTAGCTTGCCTTGGCCTCTGACCGGCCCTGCCTGCGTT 
1 0 CCACTGTGGGGACT CGGCCGCCCAC CCCACCCCGTGGTGTT ACCTTGCGTGTGT AGTTCTGTGAGCTCAGGGCT ATGGTCTGCCAG 

AACTAGGGGGCGTGGGGCCCCAGTACGAGCCCAAGGCCTCCTCTCTGCAGGTATGGACC^ 

ACCAGGAGACCTTCCAGCTGGAGATCGACCGCGACACCAAAAAGTGTGCCTTCCGTACCCAC^ 

GCCACCGGGGGCGTGCAGTCCACCGCCTCCAGCAAGTGAGTGCCTCGCTCCCACCTGTCACCGCCCCCACCAC 

CCCCGCCTGACCCTGTCCCGCCATCCCCCAGGAATGCCAGCrGCTACTTTGACA 
1 5 GCGTCCAATGGCAAGTTTGTGACCTCCAAGAAGAATGGGCAGCTGGCCGCCTCGGTGGA 

TCCCTGGAGCCGTCCTGGAGTCCTGGAGGGTCTGGCCATGCCGT 

ATCCCCCCGGACTGGCCCCGCACTGTCCTACCCTGGGGCTGACTGCTGTGTGACCCCAG 
CAGCCTCCACCCCACTCCCTGCCAGGAGGCTCACTGACTCCCCTCTTTCTGGGACAGGGGACT 
ATCAACCGCCCCATCATCGTGTTCCGCGGGGAGCATGGCTTCATCGGCTGCCGCAAGGTCACGGGC^^ 
2 0 CAGCTATGACGTCTTCCAGCTGGAGTTCAACGATGGCGCCTACAAC^TCAAAGGCAGGTT 

AACCCCTCGGTCGGGGCTGGGGTCAGTGCTGCGGGGAGCGCCCTCTGCATCCACACTGGACCCT 
GGCCCTAAAGGGACAGGTGTCTGATGGCCACCAGGGGCTCTGGGATGCAAGCAG 
ACTTACCTTGCCCACCTGACAGAGAGGTGTGTGGAGGGGAGAGCAGGGA 
GGGAGGGGAGAGCCGGGAAGGAGAGGAGAGCAGGGGTGGGGAGGTTCTGGAAAG 

2 5 GACTGG AGCC CCTTCCCAGGAGGACCCCCAACAATCCAG AGGTG CCTGTTAGGATTCAGAACATGGTTTTTTTGTTTGTTTTTTTG 

AGACTCACTCCCTCACCCTGGCTGGACTGCAGTGGCGTTATCTCGGCT 

TGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGTGCACCACCACGCCTGGCTAATTTTTATATTTTT 

TATTTTGAGACCCAGTCTGGAGTGCAGTGGCGTTATCTCGGCTCTCTGCAACCTCT 

CAGCCTCCCGAGTAGCTGGGACTATGTGTGGGAGCCACCATGCCTGGCTAATTTTTTTGTATTT^ 

3 0 TGTTGTCCAGGCTGGTCTTGAATTCGTGGCCTCAAGTGATCCGCC 

ACCACACCCGGCTAATTGTTTTGTATTTTTAGTAGAGACG^ 

TGATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCGGCCGAGCAGAA 

TCATGTGTCCATCATGGACAGGAGGACGTGCGGGCCATAGGGACCCTGGCTCATTCCGGAGCCGGGACTGGAGGGT 

CCTTGGGAACACCCGTGCCCACCCTCCGCTGCCCAGGGTAGGGGTGGGGAGCCAGGCTTTGGGCCCCACT 

3 5 CCAGACTCCACAGGCAAATACTGGACGGTGGGCAGTGACTCCGCGGTCACCAGCAGC 

GTTCTGCGACTATAACAAGGTGGCCATCAAGGTGGGCGGGCGCTACCTGAAGGGCGACCACGCAGGCGTCCTGAAGGCCT 

AAACCGTGGACCCCGCCTCGCTCTGGGAGTACTAGGGCCGGCCCGT^ 

TCCCTGCTAACCCCTTCTCCGCCAGGTGGGCTCCAGGGCGGGAGGCAAGCCCCCTTC 

GGTGCCCCCACCTGTCGCCCCTATGGACTCCCCACTCTCCCCTCCGCCCGGGTTCCCTACTCCCCTCGGGTCAGCGGCTGCGGGCT 

GGCCCTGGGAGGGATTTCAGATGCCCCTGCCCTCTTGTCTGCCACGG

CTGAGCCTTATTTCTCTGG AAGCGGCTAAGGGACGGTTGGGGGCTGGGA CTC 
TCCCAGCCACCTCCTCCCAGCCCCCCAGGAGAGCTGGGCACATGTCCCAAGCCTGTGAG^ 
AACCCCTGCTTGGGAAGGGAAGCTGTCGGGTGGGCTAGGACTGACCCTTGTGGTGTTTTTTTGGGT 
CTCCCACGTGGGAGAGGCTCAGCCTGGCTCCCTTCCCTGGAGCGGCAGGGCGTO 

AAAATGACCAAATCAGTATT1TTTTTAATGAAATATTATTGCTGGAGGCGTCCCAGGC 

GCGGGGGGCGTCTCAGCACCCTCCCCAGGGGGTGCATCTCAGCCCCCTCTTTCCGTCCTTCCCGTCCAGCCCCAGCCCTGG^ 

GGCTTGCCGACACCTGGGCCAGAGCCCCTGCTGTGATTGGTGCTCCCTGGGCCTCCCGGGTGGATGAAGCCAGGCGTCGCCCCCTCC 

GGGAGCCCTGGGGTGAGCCGCCGGGGCCCCCCTGCTGCCAGCCTCCCCCGTCCCCAACATGCATCTCACTCTGGGTGTCTTGG 

5 0 TTT ATTTTTTGTAAGTGT CATTTGT ATAACT CT AAACG C CCATG AT AGT AGCTT CAAACTGG AAAT AGCG AAAT AAAAT AACTCAG 

AGGCGCCCTGGCTCCCCAGCTCAGTGCTGGGAGTGTGCAGTGGGAGGGAGGCCGTGGCT 

CCAGCACACCTCCTTAAGCGGGGGATGGAGCGCTGGGAGGGGGTGGACTGTGGCCC^ 

CTGTGGTGAGAAGTGGCTGTGGCTCCTCGTAGGCTAaSTCCAC 

5 5 GTTGAGAAGGCGCAGCCCAGGGAACACTCAAACCAGGAGACCCCACATTATCCTCTAGTTGACATGTGCCCTTCGACTAGGGGACT 

TGGTGGTAGGGGCAGCTCCGCCCCCATACTGCGAGGATCCGGGCCTTCCACTO 
CTGGGAGATGCTGCCCrGGGCCCCTCAGGGGGCGGCCGTGGTAGGATGGACCGGGCC 
ATCGGAACCAGCAGCCCTGGGGATATCTGGCATATCAGAGGCTGAGGACTGTT^ 
CTGGGTTCTGAGCCTCCCTTTGCATCCTAGGCTGCAGGGACAGGGGCTGGGCAGAGGCTC 

6 0 CCCAGGTGCCCCTAAGGATAGCAGCTGCGAGAAGACTGAGGGGAGGAGCAGTAAGGCGCAGCCTGGAGCGGAGGGACCTGGCGTCC 

AGTCATTGTGCGCTGGGAAGGCGGTGATGGCGACGGAATTTGGGA 

GAGCCCCCTGTTCCCCCGCATCCTCCCCGCCGTCGGCAGCAGAGCAAGCGCTGATTGGCTGCTGATGACGCAACT 

GCTGTGATAGGTGGTCCCTCGGGGGCGTGGGGCGCAAGTCTTGAGATTGGCAGGGGCAGGTGGCTGCTGCA 

CACCTCAGCGAGGTTCCGGCGGCCACCCCTCTCCCCTCCCCCATGAAAGCCAGGCTGGAAGCC^ 

6 5 GGACAGATTAATCCTCCGGAACCAAACTCTTTGAGACCTGGGGAGGAGGGTGGGAGGCACTGGGAAGGG 

GCTCATTTCCACCCATGGAGCTGACCTAGAATGACCCTAAGGTCCCCGGAGCCACCAGCTGATTCCGA^ 
AATTTGGCGCCTACCACGTGGCCGAGATTCCCGAGGCTGCAGCCAATGAGCAGCATAGT^ 

CACGAGATCTCTCGGGAGGAATCGGTAGGAGCTGCCCAACTGAAAGGGACGGGGGAATAACCCGGGGCGAGGGACCT 
CTGCCCTCTGGGGGGAAGCCAGGCCAGGGGCCGGGGGCATCTCTGGGGTCCCAGGTG 

7 0 TTCTCCAG ACTCGCGTGGAGGCCACGAGCGTCTTCTGGAGGAG CGGGCACTGCGCGGACCGG CAGACTCTGGGGCGCTCGTGCTCC 

CCATCTCCTGACCTTTTCCTACCTTCAGTTCCCTCCTGTCAG^ 

TGCAGTTTCCGCTCCTGCCTGAGTCACCTTCGCTTTCTCAGCGATTCTTCATTCACAA 

ACTTGTGAACTTGCACGCACACACACATATGGACCTGTGCAAAGAGCCCCATGCACAGGCAGGTAC 

ACAGACACGCACAGATGCAGGCAGACCAACTCCCAGGAATCTGTCTCCAGCCTACAG 
AGCCTGCCATGGGGTTGGGAGTGAGAGTCCCACCTCCCAC^

AT CCTTGTTGGGATTCACTGTG ATCCC CAAAGAT CACGACTT AAGGG AAAACATTCAATTAAACTTGTTAGAGCA^ 

GACTGGGCTGGGAGCAGTGGCTAACGCCTATATAGTTCAGCTACrCAGGAG^ 

GAGGCTGCAGTGAGCTATGATGGOVCCACCCACTGCACTCCAGCCrrGGGTGA 

AAAAAAAAAAAAGGAAAAAGAGGCCGGGCACAGTGGCTCACACCTGTAATCT 

GGGTGCCTGTAATCTCAGCTACTTGGGAGGCT^ 

CACCACTGCACTCC^CCTGGGCGACAGAGTGAGACTCTGTCTCCAAATAAT 



GGTCCCAGCCCTGGGTGCCCCCACTGCTGTCTGCTGTGACCTCCTCAGGAGGCCT 
1 0 CTGTGGAGCTGTCACCAGCCCCAGGAGGACTCAGGCAAGTCCCTCTGGTGGTTO 
GACACCCCCCCTCCAGACCTTCTGCCCCAGCTCAGCCTCI^CCAATTCATTA 
CC(^CTGGGCCCC^TTTCCTCCAGGGCCACAGTGGCCTCT 

TGAGACGGACTCTCACTCTGTCG C ' rr r r ri T l'TTTTT 1 1 T'f T TTTTGAGACGGAGTCTCACTCTGTCG^CCJU^CTGGAGCGCAGT 

GGCX5TGATCTCGGCTCACTGCAATCTCCACCTCCCACGTTCAAGCGATTCTCCT 
1 5 TACGTGCCACCACACCTAACTAATTTTTGTATTTTTTAGTAGAG^ 

AGGCATGAGCCACCATGCCCGGCCTTGACTTCACTCTTTTAAAA 

CTGAGGCAGGAGAACTGCTTGAGCTCAGGAGTTTGAGACCAGCCTGGGCAACA 

AATTAGTCAGGCATGGTGGTGCGTGTCTGTCGTCCCAGCTGCTCAGGAGGCTGAGGTGGGA^ 

GACTGCAGTGAGCTGTGATTCCACCACTGCAGTCCAGCCTGGGC^ 
AAAAGTCTAAAACTTTTCTTTCAAACGACGGCTTT^

TCGGCTCCCTTGGGACCCCGGCTTCCCTGAAGACCCCTGAGCAATGGTTGGCTGGGTCCCT 

ACAAGGTTGTCTGGATGGGGGAGGGGCTCTGAGACCCT^ 

AGGCCHGGAGGGGTGGGGCGGGGCAGGGCATTTGGCGTGAAACCCAAGC 

CCTCTGTGATGTGCAAATAATTAACACAGAAGCCrTCACT 

2 5 GAGGAGGTGAGCAAGGGGGGAACCCACCTTCCAGGGAACCCCCAGATAGTGCCAGGCACCAGGC^ 

AAT ATT ATTTGAAG AAACT AAG CT AAT AAAT AAAT ACGGTAT AAAAT CAAAGGG T AC CCACAGAT AT AAACAAAAATC CACAG CTA 
ACCCGGCCCCAGCCCTCTTTCCTTTGGAGGT AACTGTGTTAATGGTr riTrACACCCCTCTGGAATTTTTTTTTTTTTTTTTT 1 1 T 
TTTTTTTTTTTTTGGGAAGGAGTCTGGCTCTC 

GGTTAAAACGATTCTCCTGCCTCAACCGCCTGAGTAGCTGGGACTACAGG^ 

TTTTTTTTTTTTTGAGCTGGAGTCTTC

CCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCTGAGTGGGTGGGACTACAGGCTCCAG 

TTT ATTTTTT AAT AGAGATGGGG TTTCACTG TGTT CGCCAGGATGG TCT CGATCT CCCTTTTTTTTTTTTTTTTTTT AGACGG AGT 

CTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCATGATCGTGACTCACTG 

CTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCCACCAC 

3 5 CCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTTGTGATCCTCCTGCCTCGGCCT 

CACCGCGCCCGGCCTCTTGTTlTlTTrTIT 

ATTATTCAGCCAGGCGTGGTGGCTCACGCCTGTAATCC CAAC ACTTTGGGAGGCTGAGGCGGGTGGAT CACTTG AGGCCAGGAGTT 

TGAGACCAGCCTGGCTCACATGGTGAAACCCTGACTCTACTAAAAATACA^ 

GCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCCCTTGAACTGG 

CCAGCGTGGGTGACAGACTGAAACTGTGTCTCAAAAAAAAA

GTACAATGGGGTGATCTAGGCTCACTGCAACCTCAAACTCCTGGGCTCAGGTGAT 
CTACAGGTGCATGT<^GAACACCTGGCTAGTTTTTAACTTTTTTGTGTGT 
ATCACCITCTGGGCTCAGTGATCCTCCTGCTTCAGCCTCCCA 
• GGATACCTT AATGTTT ATTTCTCACATTTCTT CTAG AACAAACATTGC CAG CGGATCCCAGACT CTG AACTGTGGCTGGGG ACT G C 

4 5 AGCATGGAGGTCITTGCAGGCACTCTGGGAAACATGCCACAA 

GGAGCCGAGGGACTGTGCATGCCTCAGAGCCAGATGCAGCAGAACTGGT 
GCCCCCCACCAGCTCCGACAGTCCrCGTCCTTCTCCTCCCCATCAGAAGGGAAG 
CGGTTCTGAGGTTGGGCAGGCTGGGGAGCCCCAGACATGGCTTC 
CC<;ATCCTCTGAGGGTGTGGCrrCAAAGCCCTGCTTGGGCCTAAGG 

TTGGGCACCAGTCCTAGGGTACCTGGGCTGGGGGTCTTCAGCTCA

ACAGCACTGGCCCTTTCTGACCTTACACCCCTGTGCATCCTCACT 
ATGGTGGCCAGGGCACCCCACCGGCAGCACACCTCAGAGGCCT 
TTTTTTXTTTriTTrTCCGAGACAGAGTCTT 
TCCTCCTCCTGGGTTTAAGCAATTCTCTTGCCTCAGCCTCCT 
TTGTAGTTTTAGTAGAGACGGGGTTTCACGATGTTGGCCAGGCT
CCAAACTGCTGGGATTACAGGCXjTGAGCCACTGTGCGTGGCCCT 

GG AAAG AGAGAG AGAG AGATTG AG AGAG AGAGAG AG CAC CACATT CAG AT AGAGACAGGAGAGTTCTCAG CAAG AAG AGAAC T CAG 



CTGCACACTGCACACATGGGTCACAGCAGGGCTTGGCGTTTGGAGCAG^ 
TGTGCTGGCCTGTGGTTTGCTTTGGCTGCTACCATGTGGA

TGTGATGGAGAGAGGAAGGCCACCACTGTGATGGAGAGAAGCCGGGGCCAGACCGATGAGGGCT 

TGGTCTCGGCCAAACCCGAGCTGGCAGCTGATGGCAGCCACCCCGGGGTCCT 

AAAGGGAAGCCTTGTGGAAAATAGCAACTCAGGGATGTTGTGAGT^ 

6 5 GGCCTCCTGAGTAGCTAGGACTATAGGTGCATACCACGATGCCTCGCTTGTT^ 

TGCCCAGGCTGGTCTGTAACTCCntSGGCTCAAGTGATCCTCCTG 

GTG CTGGC CTTGGGGTG TGG TTTT ATGTGGC CAT AGATG ACTC GG 
GGAT C CGACAGGGGGACAGG AGGG CAGACAGCTC CTGG AAGGC CTGGG AAAG AGAT AT CAGGGCTGGAC CCTGG AG AAGTG AGG CT 
TTGAGGGAGGGGAGGAGATGGGGCCTGGCGGGGTACGGGCGCTGCTCCTAGGTCTGAGGGC^ 
70 GGGAGGGCTCCCTCCCGGTGCTCGGTGCCCTGGCTCTCCT^ 

GGAAC^CAGGGCACTGTCCCTCTGCCTGTCCAGAAAAGCACGT^ 
CCCTGCTGCTATTCCCAGGCCTrCGGGCCTCACTGCCTGGGGTGGCCGTGCT 
CCTGCCCCGAACTGCCCCTGGGACCAGTTCCrrCAGCTCCCAGGACAGGGC 
CCCCATTGGTGCCCAGTCTCCTTCGACTCT 

TGGCAGCGTCCTCCAGGACTCCTGCCGTGGGGGAAGTtTT
CTCK3TTCTCAGGAACACCTGGAACTGGCCAGTCAG
AGCAGCAGGCGGGGCTTCTCCTGGAAGCCTTC 
GCCACCTG<i»TGCCGGTTTCrCTGTGGCACCTACTAGG 
ACCCAGCTACACTCIAGGGCACGTGCACAGGT^ 
G

HUMAN SEQUENCE - mRNA 

GCGGAGGGTGCGTGCGGGCCGCGGCAGCCGAACAAAGGAG*^ 

CAGCGGCCTCTCGTCTACTGCCACCATGACCGCCAAC 
1 0 ACAAGTACCTGACGGCCGAGGCGTTCGGGTTCAAGGTGAACGCGTCCGCCAGCAGCCTGAAG 

CAGCCCCCTGACGAGGGGGGCAGCGCGGCCGTGTGCCTGCGCAGCCACCTGGGCC 

GACCTGCGAGCGCGAGGTGCCCGGTCCCGACTGCCGTTTCCTCATCGTC^CG 

CGCACCGGCGCTACTTCGGCGGCACCGAGGACCGCCTGTCCTGCTTCGCGCAGACGGTGTC 

ATCGCCATGCACCCTCAGGTCAACATCTACAGTGTCACCCGTAAGCGCrACGCGCACCT 
1 5 CGTGGACCGCGACGTGCCCTGGGGCGTCGACTCGCTCATCACCCTCGCC^ 

ACCGCITCCTGCGCCACGACGGGCGCCTGG 

GCCTTCCGCGACKTOSAGGGCCGTTACCTGGCGCCGTCGGGGC^ 

GGACGAGCTCTTTGCTCTGGAGCAGAGCTGCGCCCAGGTCGTG CTGCAGGCGGCCAACX5AGAGGAACGTGTCCACG CGCCAGGGTA 
TGGACCTGTCTGCCAATCAGGACGAGGAGACCGACCAGGAGACCTTCCAGCTG^ 
2 0 CGTACCCACACGGGCAAGTACTGGACGCTGACGGCCACCGGGGGCGTGCAGTCC^CCGCCTC 
TGACATCGAGTGGCX5TGACCGGCGCATCACACTGAGGG 
CCTCXSGTGGAGACAGCAGGGGACTCAGAGCTCTTCCTCATGAAG 
TTCATCGGCTGCCGCAAGGTCACGGGCACCCTGGACGCCA^ 

CT ACAACAT CAAAGACTCCACAGG CAAAT ACTGGACGGTGGG CAGTG ACT C CG CGGTCACCAGCAG CGGCG ACACT CCTGTGGACT 

2 5 TCTTCTTCGAGTTCTGCGACTATAACAAGGTGGCCATCAAGGTGGGCGGGCGCT 

GCCTCGGCGGAAACCGTGGACCCCGCCTCGCTCTGGGAGTACTAGGGCCGGCCCGTCCTTCCCCGCCCCT 
CTGCCAACCCTCCCTGCTAACCCCTTCTCCGCCAGGTGGGCTCCAG 
CAGAGAAAACGGTGCCCCCACCTGTCGCCCCTATGGACTCCCCACT 
GCTGCGGCCTTGGCCCTGGGAGGGATTTCAGATGCCCCT 

3 0 TCAGACGGCTCTGAGCCITATTTCTCTGGAAGCGGCTAAGGGACGGTTGGGGGCT 

CTTTTGCCTCTCCCAGCCACCTCCTCCCAGCCCCC(^GGAGAGCTGGGCAC^ 

CTGTCCCCGAAACCCCTGCTTGGGAAGGGAAGCTGTCGGGAGGGCTAGGACTGACCCT^ 

AACAGCCCCTCTCCCACGTGGGAGAGGCTCAGCCTGGCTC^ 

3 5 CTGGAAGCAGAAAATGACCAAATCAGTATTTTTTTTAATGAAATATTATTGCTGGAG^ 

GAGTGATCTGGCGGGGGGCGTCT CAGCACCCT CCCCAGGGGGTGCATCTCAG CCCCCTCTTTCCGTCCTTC CCGTCCAGCCCCAGC 

CCTGGGCCTGGGCTGCCGACACCTGGGCCAGAGCCCCTGCTGTGATTGGTGCTCCCTGGGC 

CGCCCCCTCCGGGAGCCCTGGGGTGAGCCGCCGGGGCCCCCCTGCTC^ 

GTCTTGGTCTTTTATTTTTTGTAAGTGTCATTTGTATAACTCTAAACG 
AATAACTCAGTCTGC

HUMAN SEQUENCE - CODING 

ATGACCGCCAACGGCACAGCCGAGGCGGTGCAGATCCAGTTCGGCCT 

CGGGTT CAAGGTGAACGCGTCCGCCAG CAG CCTG AAGAAGAAG CAGATCTGGACG CTGGAG CAG CC CCCTG ACG AG G CG GG CAGCG 
45 CGGCCGTGTGCCTGCGCAGCCACCTGGGCCGCrTACCTGGCGGCGGACAAGGACGGCAAC 
CCCGACTGCCGTTTCCTCATCGTGGCGCACGACGACGGTCGCTGGTCGCTGCA^ 
CGAGGACCGCCTGTCCTGCTTCGCGC^GACGGTGTCCCCCGCCGAGAAGTGGAGCG^ 

TCTACAGTGTCACCCGTAAGCGCTACGCGCACCTGAGCGCGCGGCCGGCCGACGAGATCGCCGTGGACCGCGACGTGCCCTGGGGC 

GTCGACTCGCTCATCACCCTCGCCTTCCAGGACCAGCGCTAC 
5 0 CCTGGTGGCGCGCCCCGAGCCGGCCACTGGCTACACGCTGGAGTTCCGCTCCGGCAAGGTGGCCTT 
ACCTGGCGCCGTCGGGGCCCAGCGGCACGCTCAAGGCGGGCAAGGCCACCAAGGTGGGC^ 
AGCTGCGCCCAGGTCGTGCTGCAGGCGGCCAACGAGAGGAACX5TGTCCACGCGCCAGGGTA 
GGAGACCGACCAGGAGACCTTCCAGCTGGAGATCGACCGCGACACCAAAAAG^ 
CGCTGACGGCCACCGGGGGCGTGCAGTCCACCGCCTCCAGCAAGAAT^ 

5 5 ATCACACTGAGGGCGTCCAATGGCAAGTTTGTGACCTCCAAGAAGAATGGG 

AGAGCTCrrCCTCATGAAGCTCATCAACCGCCCCATCATCGTGTTCCGCGG 

GCACCCTGGACGCCAACCGCTCCAGCTATGACGTCTTCCAGCTGGAGTTCAAC 

AAATACTGGACGGTGGX3CAGTGACTCCGCGGTCACCAGCAGCX5GCGACACTCCTGT 

CAAGGTGGCCATCAAGCTGXMCGGGCGCTACCTGAAGGGCGACCACG CAGGCGTCCTGAAGGCCTCGG CGG AAACCGTGGACCCCG 

CCTCGCTCTGGGAGTACTAG

Table 4

MOUSE NOMENCLATURE 
ICSGNM Map3k6

Celera mCG20807 

HUMAN NOMENCLATURE 
HGNC MAP3K6 
Celera hCG19300

MOUSE SEQUENCE - GENOMIC 

GGGCTGGACCTTCACAGAGAGGCAGAGCTACACAAGGCAATACCAATTCT 
GGCTCCAGTTGAGGTGGTGTGCCTCTGGGAGACAATGGGGAGAAGTCTTGCTT 
1 5 GCTGGCACCCCCCTCTAGTAGACCCTACAAGATGGGTCTACTATGAGCCCTA 
CCTGAGGGTACAGGGTCACCCCAGCTGCCCATGTAGGGTCCTACAAATGCTC^ 
GAAGCAGGGTGAGAGAGTTTGGTAATCTGGCCTTTTCATGTAAATTTATGTAAATAGGATATA^ 
GATATAAATACCACAGAACCTCAGCATCTCAGGAGGTCCTGATATCTAGAATCT 

2 0 GGTCCCCCTCCCCAGTTATTCAGGTGTCCCAAGAAGTCCTAGAGAGAAGAGATCTGTAGGTTCTCAAG^ 
CTGGCAGAGAGGGCTCTCGAGGTAGCCTGTAGAGGCrGGATGTGGCTGCTTGGG 
ATGCTTAGTTATTAAGGAGTGGTACTATITAAGAAGGATTAAGAGGTGT^ 
CTGAAGTTTCAAAAACCCAACTCTGTCTTCCCTCTGCATGCCTTTGTGTGTATCTTTCTG^ 
CTACTTCTCCAGCAGCATCCCTGCCTCTGTGACCTCGTGCTCCCCACTATGGTGATTGTGGACTAAACCTCTG 

2 5 TCCCCCAGTTAAATGATTCTCTAGAGGAGTTGCTrGCCTTCXSTGCCC^ 

AAGGCCAGCCTGGTCTACAGAGTGAGTTCTGGGACAGCCAGGGATATACAGAAAAGCCTGTCT 
AAAGGTAGTCATGGTCATTGTGTCCCCTCCCAGCAATAGAACACTGACTAAGA 

CGACGTGGGAACCACCTCTGCACCAGGCTCCAAGTGGGAGAGTGCCAGGATCCTGGGGTGTACCAGGTA 
TTTACTGCAGAGGACTGTGAGGCTCAGAAAGGAACAATTGCTTCT 

3 0 ACCAAATTCTGTTTCCTCCTCTTCCTCTTCTTCCTTCTCCTCCTCCT 

TTGAGCACTGGGCAGGACACAGGACATCTGCCTCTGGAATTCCAGGGCTACCTTGGCTCCCTA^ 

GTCT ACC CT CTG ACTTT ACTTTTTG CACAG ACTAGGGCTGG GCCCT CTCAG CT ACTT AGGG CTG T AAGTG C CCT CGG AGG AGCCAG 

CATTGGAAGGAGTAGTAAGAATTTGGAGGAGCGGGGGCTGGAGAGGTGGCCCAGTGGTTAAGAG 

CCTGAGTTOU\TTCCCAGCAACCAGATGCT 

3 5 GCTAGAGTGTGCTCATATACATAAAATAAATAAATCTTAAAAATAATAATTTGGAGAAGGGCTG 

GGGGCCTCCCTTAGGGTACTC^GACCTCAGAGTCAGTTGACACCTGGTCCTTTATCTCTGGGCAGGAC^TCAGCCA 
CACTAAGAGGCTAAGGAGGCGTCCACGGGAGTGAGGAACTTGGA^ 

CCTCTCCTCGTAGCTCTATAGTGCGCACCTCITCCCCAGGTCCCAGGATCTGAGAGGACTGCTCGGT^ 
CTGGGTGGAGGCTTTTGGGAAACAACTTGGGGGCCTTTCTCAATCCACCACCCCCCCCCCTT 

4 0 GTCTTTCCC^TTTTTTGGACACTCCAGGGGGTAAGAAAAA 

TAGGGGGAGCCAGAGTCCACTGAAGCCAGGCATACCTGACAGGACAGCTCCTACAGGGCCAGAAGTCTGAGAAGGC 
AGGTGTTATTTGGAGGAAAGTAAAGGCTCrGTGAACATTCTCTAGGT CTGTGAGTG CAGAGTACTTTGCGGGTAGAAATAAAGGCG 
AGCAGGAGGAGGTGGGAGAGAGGAAAGGCTAAGGAAAGTAAGCTGGGGGTCGTGGGAGAGCCAA^ 
AAGGCCACTTCATAGAGGACATTGCATTACTGACATGGAGGGCAAAGGGGAGCCA 

4 5 TCAGACTTAGTTTTAGAAAGATAAGTCAAGAGGCTGGAGAGATGGCT 

AGTTCAATTCCCAGCAACCCCACATGGGGCGACTCACAACCATCTGTAATGTGATCCAGTGC 
GCTACAGTGTACTCATATACATAAAAAAATAAGAGAAAGATAAGTGAGGCGTGGAAGTGCTTGCCTGT 
GCCGGGGGCAGAGGTATTGAAATTAGGGGCCTGGAGTGGGCCCAGGGAGGCCCTGTC^ 
TGACTTTATGGTTAAAAGCAAGCATTGACTGCGGAGCXSCCCCCACTGGATAGC^ 

5 0 CTGACGTGCTTGTCTAGAGTCCTCGAGCATCTGCGCACTGATGCACACACATTCTCTCTGTTAGACACATGA 

CGTGCATACACACACACACACACTCTACTCTCrCCCTCTCTCACCTATTTTTAATCTA^ 
ACCCTAGAAGAGACACTGGATGGGGCAAGGCAGAAGGTGGAGACAGAGGTTAGGAATGGA 
GAAGTGACATCCACAGTATTGGACATATGTCATCTGAAAGATACTGGAAGAGCAGCAAGCACCACCCCC^^ 
TAAGATTTCCCTGCATAGCGATGGAACTTGCTCTGTAGACCAGGCTGGCCT 

5 5 ATTGGGGAGCTTGCCAGA.CCACTGGGCTGTCCTAAGAGCAGAGCTCCAGAGCTGGAGGACGA 

GCCACGGGAAAAGGCTAGGAATCCAGCTGAGGCCTCTGGGGGCAA.CATCTATGCTCT 

TTCTTCTTCrTCTTCrTCT T CTT CTT CTTCTTCTrCTTCTT 

TTCCTTCTTTTCC T TCT TT TCCITTCTTCTCCTTCT^ 

TCTCCTCCTTCTCCTCCTTCTCCTTCTTCTCCTTCTTCTTCTCCTTCTTCTCCTT 
60 TTCTTCTCTCCTCCTCCTCCTTCCTCCTTCTGTAAGTATATATATATATATATATA 

TGAGCCTTCGTGTGGTTGATGGGAGTTGGACCTCTGCTTGCTCCAGTCAACCCTACTCACT 
CCCAGTCCCTGCTTGCTCCAjGTCCAAAGATTTATTTATTATAA 

ATCAGATCTCACTACGGGTGGTTGTAAGCCACCATGTGGTTGCTGGGATTTGAACTTGAACT 
TGCTCTTACCCACTGAGCCATTTCGCCAGTCCCTAATACCTGGGrrCTTAACCTGCAAATT 

6 5 ATATATATATATTCATTCCAATCTTCCCTGGAGCTTACGAGATTTITGTCATGAGTTGAGTG 

GGATTTATCATTCATTATTCATAGGCTAAACTCACTCAGAGGAGTCACATTTGA^ 

AGGGAAGGGTCCAGCCAGCCCTCACACCCATCCAGCCCCTCGCT 

CAGCAGGTGCAGGGGTGGATTTCTTCCACTCCTGACTGCTACJVGAGCAGTGGGGAG 

AAATTTACAGGAGCTTCCTCGGTAGGGTGAATTCCAAGTGAGAGGGCTGGGCCAGCCCAAACCCCTACCATCTCAG 

7 0 CTCCCAGGTATCTGGGAAGGTCCTGTAGAGTCTGAAGACTTCAGCAGTAGCTGCATCTTTGTTTATTATGCTA 

CAGGGAGGAGGAGCGCTGGAAGAGTACATCCTGGGTAAATTCT 

AACTTTAATTTTTTATTTTTTTAAATCTTATTTTTAAT 

GTGGAAAAGAAAGAAAACGGGArSAAATGGACCTATTTAGAAAGGTTTTT^ 
GTTCAGTTCACAGGTTAGCAGGCATCGATGGCTTGATCCA

CGGGTTAGCAACGGTGGTGACACGACCTAGTAGAGACAGCOIGGCCTCAGCCTCAGC^ 
GTITCTCAACTGTGCCrCTCTCAAGGAAAGCCACGATCAGCAAAGACr 

ATCTAG CTCTGTCTCCATCACTCCGTGGCGTCCTATTTAT ACCCTCCAAACATCACGTGT CCrCCATATGCCTTGTCTTGCCTCAG 
CACAGGC^TCCAATCAGCCCCAGTCCTTA

AGTTCCGACAAAGGCAGTTCAACTACACAGTATAAGGCA 

TTTTCACATTCTTGCTTTAGCAGAACATCCTCTCTCCTGTGTCT 

TTTCACACCTGTGTCCACTTTAACAAACTGTTCCTTCACGTG 

TAAGTTTCCACTTCACCCGGGGATTGGGGGGGGGGCAGTGGCAGAGGAACAGGAAGG 







AGTTTTGCCTTACACAAAATTGATGGGGGTGGTGC^ 

GCAGGGGTGGGGGAATG CAGTGCAGGGGTGGGGGAATGCAGTGCAGTGCTGGGGGGCATGCAGTGCAGTG CTGGGGGGCATGCAGT 
GCAGGGGNNNNNNNNNNNNNNNNNNKNNNN^^ 



15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNGGCTGG 

GGCTGTTATCTGCTGCTTTGGTGTGTGTGCACCAGTGTGGTCTGCTCCAGGAGAGGAACCG^ CT T T CAG AGAAG CAGTG 

AGCTCTCCCTGGCAATGTTGCTC&GGCrGCTTGCT 
2 0 GATAAGACTTGGGTCAAGGGGGGAGGGTATAGGGGTCTTTCGGGATAGCATTTGAA^ 
AAAAGACTTGCTGGGCGTGGTGGCGCACACCTTTAATCCCAGCACTCGGG 
CCTGGTCTTCAAAGTGAGTTCCAGGACAGCCAGGGATATACAGAGAAACCCCTGTCTTG 
AAAAGACTTGGGTCAAACTGATGGCTTGGAACAAAAAAGTCCCACTTCCACTGTGACTA 
ACAAAAATTGGCTCCTTGGTGCTGTTGGCTCrrCATGGGGCCTGATAACATTaVCCT 

2 5 TATCTGTAAGGGCTTCAGGCATCCTGTAGGGTGCCAAGGTGTCTTCCACTCCTTGCTGCAG 

AG ACTTTTTTTTTT AAG ATT AATTT ATT ATTACATGT AACT ACACTGT AG CTGTCTT CAG AC ACATC AG AAG AGGGTGT CAG AT CT 
CATTATGGGTGGTTGTGAGTCACCATGTGGTTGCTGGGA 

ATCCACCTCTCCAGCCCAGTGGAAAGACITCTACCACTGATGGTGCAGAGCC^ 

C CTGGTGCTGGGTTGTACAGATCCTGCAACTGAACCAATCTCAAGGGGCATTATG CTGCATCTGAGGCCACTGCCCACATGTATGG 

3 0 CATTGACTGGACCT CAGAAAAAGGCGTGGGT CACCCCTACTGTGGG ATTACACATAATGCTTTG CTAGAGCACTCAATTGTC^ 

TCGTTGT CACTTCTTACAG CTTC CAT CTCTTGGCCACCT CACTGC CAACTG C CTCTG CTT CCAGG GC CACTTTGT AACACTTTACA 
AATCTTTTCIT ACTCAGGAGAAAACCTAT AGAAAACGGGG CAGGGACACAAG CTTCAGTTCCATTCAAGGAAGAACTTCTAATCAG 
GAAGATGAAGGGTTGAACTTCTCAGGGCTTCAGACCCTA 

3 5 TGGTTGGTTGATTTTGTGAGACAGGGTTTCACTCAGCAGG CAG CCCTGGCTGTCCTAG AAACTCTGTAGACCAGGTTGGCCTTG AA 

CTC7VG AGATTCACATG CCTCTGCCTCCTGAGTG 

AGTGTGTTTTCTTTI rTTCTTTTTTTTTCATTTTrr ATTAGG CTCATTTACATTT CCAATGCTATACCAAAAGT CCCTG 

ATAGTGATGATC^GAACATTGTCTCCCTGATTGAAGAGGACACTGGGATTGGGCTTCT 

CCCCCAAATCTCCTGATATTACATCCTGAGCCTCTATGAGTGTAAAGGAGCTCGTGTCTGAT 

4 0 CTTTAAGGGACCTCITTAATAGGTCCCCCACCCrCTGCCAAAGTGTGCrC 

CCCTTITTGGCCCTGGCGTTCCCCrGTACrGGGGCATAT^ 
GGCCIATCTTTTGATACATATGCAGCTAGAGTCAAGAGCT 
GATCCCITTAGCTCCTTGGATTCTTTCTCTACCTCCTCCATTGGGAGCT 
CATTCrCCCTTTCTGTTGCTCGTCTATCCCATTGGTTCAAACCTAGGAAG^^ 

4 5 GAAACAGAGGCTTCCTACCGTTTGTTGAAAGGTCAATTTGTCCAAAGCCCCAACATA 

TGTGCATCTTTTGCCTGTAATTATATATGTGCACCAAGTATC 
GGCCTACAGTTAC!AGATGATGGTGAGCCACCATGTGGGTGCTG 

ACAGCTGTGCrGTCTCTCATGCCCAAGCCCTACCATAATAAACAAlTATGAAACATGGCCCT 
TTCTAAGTTCAGGGCCCTCTAAGGTTTATAJGCAACT 

5 0 AACCTTGTAACCCAAAAAGCGGAGACTTCTGGAGTTCCTTCAAAGGGTAGATA 

TGGTCAAGTTTCGGGTGGACCAATAAGGTCCACAAGGTTTCACTA 

CCTACCAGCAGGAACATTACATGTGCCGCCTACACAATGTTT^ 

GTCCCMCAGGGCGGGGCTGTCCTGTTTTCTATCCCTGGGCTGGGCAC 

TTGGCCCGAATAGGGGCACAGGCTGTT CCGGAAGTGGACAGTGTC^AGAGGTTCT AG CCGGAGAGGAGGGAGGGAGAGC CGCTGGGG 
55 AGCCTC^TTGCCCCAGGGCCGAGGGTCGAGCCTTTCCCAGTCGGGAAAGGGCTGGGCCCAG 
GACTGCCAGCGCTCCCTTGAGAAGTCTGGGTGTAGCACGCTGGCGGTTGACTTCCC^ 
TTACTGGGTACCCCAGATTCCCAGAACCCAGTCCTGGATTGC^ATCrGTAAGGACCGCGAAAGAT 

TCCCCGCGCCCGGGGCCCCX5CGCTTGAGCGCCCCCGCATGGCGGGGCCATGTCCTAGGGCCGGGGTTCTAGAACGG 
60 GCTGGCAGGACCCGCTGGCGGAGGCGCTGAGCCGGGGCCGCTCGTCTCCTGCGGTC^ 
CTCAGTGTGGTCTACGTGCTGACCCGGGAGCCGGGGCCCGGGGTGGA^ 
ATGCCTGCGCGAAGCCTGCGCT(IAGCTC(IAGGGCACACGGCCACCCCCGCAGCTGCGCAGCC 

GCGACACCGCTG CG CTGGACTCCTTCTACAACGCGGGTGAG CGACTTGGGGCGGGG CTTGAGTCTAGGCXK1AGCAAGGGGTGGAGC 
TTTAGAACCCTGACGGGGCGGGTGTGGGTACCAGK3TCTAGGGAATTTACGGTAAACTAAAGCCGAACTTC 

6 5 ACAGGGCATTTAGTGGATTATGGATGTGATGGGCTTGAGGCTGGATCTGGGGATTCAGGC 

GGTGAGCGTGGTGCCAAGCAGGTGGGGCGGAGCAGGCGAACTCTAAGGGCGGGGCTGGGGAGGGACGGGGCTC 
GCTGCGCTTCCCTGCGTAAGCGGGAGAACTCTTTTTCAGGATGATCTCTCGTC 
AGTTCGGTCTATGGCAAGGGGCCATCTCTGCAGGATTGGGGTTGATAAGTCCTAGCATACTACA^ 
GGGAAAGGAAGTACTGAGGCAACTCTTCGC^ACTGGATTATAGGAGAAAGGGATTGTATTT 

GATGCTCCACCTCATCCCT 

AGGCTGACCCGGTCAGACCACAGATAAAGTGGAATCATCTGCGTTGACT 

ATGGCTGGATACAGTAAGCTACACAAACACACATAAAAGCACGACGTT 

TGGCTTCAAGATTG^AGAGATGGTTCTGAGX?TTAAG 

7 5 GTGTGCATGCTCACAACCACCTGTAACTTTATCTCC^ 
ACATATAAATAAAACAAGAGAGAGAGGAGAGAGAGAGACAGAGAAAGAGACT^AGAATGGGACTTC l"l"riTGCAAAGTTCCTGTCT
CCTACATAACCTGGAAGTGCTCAGTGGGATGAAGGTCCCTGGGCATCTCCTGTAACACCACCA^ 
GGAAGCTCTCGAGAGCAGCACCGACTTTGGCATGAGGTGCAGGCAAGGCACTTGACGTGACA^ 
TGAGCTGGGCTTCCACACTCAGGCCTACACTTCrGTGTCCACAGATGTAGTGGTC 
5 CCTTGTTCTACCACCTCGGTGTGCGTGAGAGCTTCAGCATGACCAACAACGTGCT 

CAGGCCCTTCGTGTAAGTGGCCAGGGGGCTGGGGGCTGGGAAGACAGGTGCAGGGTTGGGGTC 

CTGATTGTCTGCCTCCTGCAGGAGGATGTTTT^ 

ACCCCTACTAACCTTCCTAGCTTCCTCC^^ 

GCTCTTTTCTTCTATTCCTATCTTCACCAGTACTAACCGCCAGTCCGTCTCT 
1 0 TT GCTGC CTTCTTCCCACCAGGATTGTGTTGG CAG CT ACACACTG ATTC CTT ATGTGG TGACAG CCACTGG CCGGGTCTT ATG TGG 

CGATGCAGGCCTCCTGAGGGGCATAGCCX3ATGGGCTAGTACAGGCTGGGGCGGGCACTGAAGCC 

GGCTTGTCCGTCTGTTGGAGGCTACACCCACAGACTCTTGGTAATGG 

GTAAAGTGTGCTGGGGCTGAGGGGCAGGCTCATCTCTCTCTCTCTCTCTCTCTCT 

TCTGGTCTCTGGGCATC^GCGGCTATTTCCGGGAGACCATTCGTCAGGATATCCGGCAGGCTC 
1 5 CTGAGCCAGGAGCTGGCTCGCCTGCAGCGGAGGCTGGA 

GTACCGCGATGTCCAGGTGTGTGGCGTTCGGAGAGGCCATGACATAGGATATAACTCCAGC^ 

CAGGAGTCTGAGGTCATCCTGTTTACAAGCAAGTCTAAGGCCAACCTGGGCTTCAAG^ 

CCATTGTATGACAGACAGATACAAGTGCAGCTCCTGGCTGAGCCTCACTAGTGGCTTTAAGTAA^ 

TATTTGTTTCATCATCATTATTATTTTTTAAACATGGTCTCACTGTGCATTCCTG^ 
2 0 TGGCCTACATATCCACCTGTTTCTGCCT^ 

TATTTTTTTAATTGTGTGTATATATGTGTGTCTGAGTTAGGGTTTGTACCTTGTGTC 

TGTTGGATCCTTTGGGACTGGAGTTAAAAGAGGTTGTGAACTTCCT 

GCAGCCTGCACTCTTAACTGCTGCTAGTTATCTC^ 

GGGCATTGGTGAGGTTAGAAACACCCAATGTGGGGTGCA 

2 5 GGGAGGTGTGCAAAGTCTTCAGGTTGTGGGTAGTAGAAATTTCCTTCAAACTGA 

CAGTAAGCAGAGGCAGGCAGATATCTGAGAGTTCAAGGGTA 

AAAACCTCATCTCAAAAAAACAAGCAGAAAAAATG ATTTCTG CTGG CAGTGATGGTGCGGGTCTTTAGTCCCAGCACTCGGGAGGC 
AGT AGCAGGTGGATCTGTG AGCTTGAGGCCAG CCTGGTCTACAG AG AGAGTTCC^GGGCAG CCAGGGCTACCGATCTTTCCTCG CT 
GTCAAGGGCTCATCGAGATGCTGCACGCTGTCTCCCACAGGACTACTCGGCCATCA 

3 0 CCACCTGTGATGTGGCTGAACAGCACAATGTCTGCTTTCACTACACATTTGCCCT 

CAGAGTCTTGGAAGTCTCTGTGACCTGCCAAGCAAGAGCACTGCAATCT 
CCTAATGTCACAAGGTTCCTGTTGGTCTGGGCTAAGGATGCATGGTGGGGACAGAGGACGTG 
TGTCCAAGGTCGCTACTCCTTAGAGTTGTTTGCAACTAGAAATCCCTTCCATCTGGAGTCACT 
GGGGTTAGTGTCATTTGAGGCTCATTAATAGTAAGCCCCCAGAAGGGTCCCTTCCGTTT 

3 5 GGGTCAGAGCCATCAGACAGGTCTTCAGTGTATGTTCCTCCGCCTTAGGAGGAACAGGCCT 

TCCTGCTGCCATTGGTGAAGTATGAAGGTCCCGTGGCTC^ 
ACTTCTGX3CTTCCAAAATGCCGGGCACCTGGAGCAGGCCT 
ACTGAAGGGTGGGGTTGCC^GCATCCATTCAGCCTGGCATC^ 

GCCCAGCCTGCACTCAGGCATCAATGCAG CTGTTCTCCT CATTGCAGCCGGGC AGCATTTCGAAGACTCTGAGGAGCTCCGGCTGA 

4 0 TTGGTGAGGCCTGCTGTTAGCCCACAGCACCCTGCACA 

CAAAGTGGGCTTCGAGACTGATCGGTGCCTGAGGTCTGAGGTCTCAGGAGCCTTTGAGTGCCT 
GTTGTCACACTTTATCCCACATCTATTTGTGCCCTAGCTCTCCCCT 

GATCTACATTCATATCCACACTCCTGTGCGTGAGGGCCACAGTGGGACCCACACTTGAGCCTTAC 
ATTGAGCCCACGCCACCCTCCCCCAGGCATGAA 

4 5 GGGATGTGGGATTCTACCTGGGAGCCCAGATTCTTGCCAACGACCCCATCC^GGTGGTGCTGGCTGCAG 

TGATGTCACTACGATAG&TAGACAGTGCCCGTGTTCCCCT 

TGGCACCAGCCGGCTCCTAGGCTGAGCTGAGGCCAAGGCTGTCTGTGCATTTAG 

TGTACCAGCATTTCAGACCCACACCAGAGCCCrrCAGGAGGACCCCCGCTTCGAGCTCACTTCT 

5 0 TGCCAGCCTTTCAAGATGGCTCCrCCrCAGGAGGAC 

AGCACATTATGACCGATCCCTTTCCCaSCTACrGCTTACAGGTACTGGTGCTGGAGAT 
GAGATTCAGGGAACAGACCCCATGAGCGCAGTGACCCTAAGCTTGCTGGAGCCAGAGACTCA 
ACCX1ATCCACCTTTCTTTCTCCTATCCTCTCTAGTCCCAGCG 
AGCTGGACCTTCCCAGTCACCTCCATCTGTGGGATCAG 

5 5 TCAGTAGCCTCTCTTCITCCCTAGCACCrCCAAGCTGGACCAGCGCTGCT 

CAGCTGTGCTTTCCCAGTGTGGAGCGCTGCCAGCGGTGCGCTCAACCTTTCTC 
ACCAGGTGCTCCTGGGGGATGGTTCCTGGCTGTTCCCCTTCCAGCTGTTTCCCCCT 
GGTGATGAATCCGGATTCCTCCGCGCC(^CTGAGGAGGCAGAAGGCGCAAGGGAAGTGCT 
GTTGGAGGAGAGCGAGGGGTGAAGGGTCCCTGCTAAGCGGGTGCTCTGGGTT^ 

6 0 AGCGGCTGGTGCTGGGCAGAGGCACATATGGGGTGGTGTATGCTGGCCGAGAC 

ATCCCAGAGAGAGATAG CAGGTGGGGTG CATTGGTCGATGAAGGTCTGGGGTCTCGGG AGGGTATATGCAAACAGGGTGTGTCT 
GTGTGTGTGTGTGTGTCTGTCTGTCTGTTGAGGAAACGTGAGAACT^ 
CTAGGGGTGGTGGCCCTTACAGTAGATAAGAGAG^TCTGTGGGCGGGGTGTTGGATCT 
TGGGAGGTTATCTCTGCTCAC^TCTGCCCATTGCTCTCTCCTCGACACTCCCAGATTCT 

6 5 TTCACAAACGACTTCGTCATAAGAATATTGTGCGTTATCT 

GTGCCTGGAGGTACCAGCCCTGCGTGGGAGGGGAGTGAAACCTGAGGT^ 
CCTGAGGGGAAATGAACTCTAGCGTGCGTGGGTATCACCT^ 

GGAATGGGTGTGTTTGGGCTTTGATCAGAGGGATGTAGGTAGAGTAAGCGGACATCAATTATA 
CAGCCTCTCCTCCTTGCTTAGGTCAGTGTGGGGACCCCTAAAGGACAAT^ 

ACTACCAAACTGAAGCTGACAGTAATGGTGGGAGA^ 
CGAAAAAGTCACGTCTGTGGAAGATCAGACACGAGCTCCT 

CTCGGGATGGGGGGGTCTGGTTTATAGCAGAGCCCAATCCCAGTGTCCTCTTCA^ 
AGTGGGCTGCTCAAGATTTCTQACTTTG^ 

7 5 GGGGTGGGGGTGGGGTGAGGGTGTCAAGCTCTTGGAGACTTCT^ 
AGATTTATTTATTTATTTTATGTATATGAGTACACT^^

TTAGGACCTCTGCTCTCTCTAGTCGGCCCAGCTTGCTCAGTCCCTGACTGCTCTGGCCCAA^ 
TACIATTGTAGCTGTCTTCAGACACACCAGAAGAGGGTCTCAG 
TTGGACTCGGGACCTTCGGAAGAGCAGTCAGTGCTCTTACCCGAT^ 
5 ATGGAAGGAATCTGGCTGGAGACAGGTGACTGGATCCAGGAACCGCT 
AGACCTCAGCATCGAAAACGGATGCAAGGCCGCGTGTGTTG^ 

CGTCACGTACAAGCCAGAACCCTCCTGATGGTTGAACGCTATGCAGCCTAGAATGAA^ 
AGGTGTGCACATGGCCAGTGGATGATGGGGACTTGATGCCTACCTGGCATTGTTGTT 
TATGGCCCOSGAAATCATCGACCAGGGCCCGCGAG 
1 0 TGGCAACAGGTCGACCGCCCITCCATGAACTAGGGAGCCCTCAGGCTGCGATGTTT 
GAGGGTGGGCGGAGCCAGATGGGGCCAGTXK5GGTTATCTATCCCATCTGC 
ACCCGTGCCCGGTTCCCTGTCAGCTGAGGCCCAAGCC^ 

GCTATGGGGGTGGAGGGGGACAACCTAGTGAAGGGGAGAGGGGCTGATGAGGCGAAAAGACTAACA 
1 5 CCCCTTCCGGTCCTTCCAGTCCTTCAGCTGACTCAGCCACACAGTCTCAGACATTCCC^ 

CCGA.GCCCCCCAAAGCGCTGCCTTAGTTATGGGGACACCAGTCAGCTCCGGTGAGCGCCT 

CTTTTTTTGCCGTTGTATTAATTTCCTCCAGAAAAAAAAAGCCTCCCTGGGT 

TACrACCTCCTGCCCTCTCCCAGATGCCGGTGGCATTCACTCTGCCAACCCCCM 

AGGCATAAAAGAGCCAAGATTCAAACCCTGCAACATAGGTGGTTGGTTGCATGGTC 
2 0 GTG CCCAGTGTG CCCG AGG AGCCCGCTCCCGAGGAACCCGCGTCCCCAGAAGAGAGTTCAGGCCT 

CAAGCGCCGGGCCATG CTGGCTGCGGTGCTGGAACAGGAGGTGCC CACACTAGCAGAG AATCTCCTGGAACAGGAACAGGTGGGCG 

GCCCGCCCCAGCGCCCCCTGGTGGTCGAGCTGGGATAAGAATTTATAGTGCACCGGCXaAG 

ATCCACGGACGTCCCCTCTTTTTAGGACTCTCGACTCAGCAAGATTCATGTGGAACTTCTGCTTC 

ACACTCCTAACCGC CGGCAGCTGGC CCAGGAGCTGCGGACCCTGCAAGCTCAGCTGCX3GGCCCAGAGCCTGGGCCCTGCGCTTTTC 

2 5 AAGGGGCCGCTCTTCGCCTTTCCAGACGCGGTGAGGAAGCTATCT^ 

TGATGTTCCAGGATTCTATGATTTCCAGGAAGACGCATGTCCMTC^ 
CTCTTTTAGACTGCAGCCGACTTTCrAGGGCTTTGCMCTCAAGCCCCACCT 

CTTCCCTCC CCAGGTGAAGCAGATCCTCCGCAG ACGCCAGATCCXjCCCACACTGG ATGTTCGTGTTGG ACTCG CTGCTCAGCCGTG 
CAGTCCGGGCGGCGCTGGCGGTGCTGGACGCGGGTAGGTGGGGTTGGATCCC^CCGGCCGGCCGGGG 

3 0 GGCGTGGCGAACATAAAATTGGTACTTCGCGGCCCTCCCTTCTCC 

AGCTCAGTCTACTCAAATTGGTAGAGTCGGAGAAGAAAGCGGTCTTACCGAGGTCAGAAGAGTCGAGTAA^ 

AAACCTCAGG AGAGCCAGGCGCTGCAG AG CCAGCTCCCACCAG AGCAGGG ACCCCCATCGTTGATGGTGGA^ 

AG CCG AGACIX3 ACAGGTGAAGCCTTTTGG AATCACTCGGATAAC CAATCTGCAGGCCCTTCCATTCTACCACCC 

CTACCTTCCCACAGGCTTCGGGACCTTCTGGCTGAGAAGGAACGTGAGTGCCAGGCGCTGGTGCAAC 

3 5 TGCAGAGACCAGGAAGTATGCCCCAGCTTCCGAGACCCCAGCTGAG 

ACACCTTTGCC CTCCCTCATCACCTGC CTTTTTG ACAGCCACTCT 
GTGTAGATCCAGCCAGTATCCAAACGGTGAGTAAGTGTGAGCTGGCT 

AGCTATTCAAACTTTGTCATTGCTGCAAAGATGCTCAATACTGAAAGAGCCCTTAGTATGCACCT 
GGAAACTCAGGCC71CAAGGGGGCCGCCTGCCTGAGAGTTCTGTTAGAAGCAAGAACCCGAGTTTCT 

4 0 TGATCTTTCTCTGTTTCTGTGGTTCTCGGTTCATAACCAGCT 

CTCAAGATGATCTAGTCTACACCAGAATCAGGTATGTCTTGGGCCTTCCAGGGGTC 

ACATCCCCTTCCCTATGGGCCTGGCCTGCCTTTGATTCCTCCCTGAAAACCATAGTGGTTCCT 

CTGGCrGAATTACATTGATCCCTGATGTCCCCAGGGGAGGGA 

GCCACATCGGTCACCCCAGTACCCCGGGACGCTGAGTGATGACATCAAAGGACACAGATACGGATGGATGAACA 

4 5 ACAGCTTTGGACACACCTGTCCTAGAACCCAGGAGCAACTACAGGGACCAGGCCCA^ 

ACCCAAAGGACACTACATACAATCATAGACTATTAAACAGAA 

GGGGAGTGAGGAGTGTCCAAGGAGAAGCCCCTTGAGGTCTCCATGTCTGAGTGACCCTTT^ 

GCCTTACTACCCAGACTTCATACCCAGTTAAGCAOTCAGTAGCTCTGTAAACA 

GGTGGGGGGTCCACAGAGGGGTTTGACAAGGCTATGCCCTGGGGAC 

5 0 CAGGGCCGCTCCAGGAGTGTCTGCCACAGCTGCITCTCCT 

TGGATGTGGGTGAGCCAAAGTtaGAAGCTACTGTTACCTrGGAGACCT 

TGTGCCCAAGCATCCCGAACCTCCCAGCTTGGCTCCCCTCAAGGGCCTGCTCTGCTCACCTOT 

CCAGCTGGCGACTGGCCAGGGCCCCATGGTCCCACAGGGAGAGCTCGGCACA^ 

TAAACCATGGTGTGGTTCAACACAGGGCTGAGACTTCTTC 

5 5 CACTGAGCTGCGACAGGGTAATAAAGGAGGGCTAGTCTTAGGCAATCACCCTCTGACACrCCCT 

GTCCTGTCCTTTCATATGGCCGTGTTCCCTGACCGTTCCCTGGCCCTGGCCTGAGATCAGGTCTCTAGCCCT 

TTCTT AAGG CTTCGT ACGTG AAGCT AACTGTT CA CAGG ATT CCAGAGAG AAGT CTCTG AAATGCAATT CTG CT ATTTTCTGGCCCA 

GTAACCTTCAGTATTCCCTG CC CTCCCCTTTAGTTT CCAAAGCCTCAACCACTCTGTAG AGTAGAATAATATT ACTOT 

GTAATAGAAAGGAAGCCAGGACGGGAGCCCCAACTCAGAGCAGTCTACTCTGTTTC^CTCCTGGAGGGTGC 

6 0 AGCACCCAGAAACACTTTTAGAGGTTCTGTAAAAACTTACTTTGTAAACCACA 

ATGTGTGTATGTATATATATGCATGCATGCTTAGGGTGACT^ 

CACCAACCATTGACTCACAAGTTCAGAG CCAAGCCCCATAGCTTGTCCCTCCAAGAGTTC CAAGGAGGACCCTTGTGGCCTTTGCA 

TCATAGTGCGTGCATAGTAGGTGCACAGCCTCTGCAGACTGCAGCCTATGCTGGCTAATG 

GTTCTCAACCTCCCTAAGGCTGTGGTCCTTTAAGACCCCCA^ 

6 5 CTGTTATGACTTGTACTGTAAATATCTGATGTTTGCCACCCACAGGTTGAGAACCACTGAGCTAG < 

AAGCTAAGCAAATGCCTCCATAAGATGGGTCATGCCGTGGCCAAGCCTGTAGAGCATTT^ 
GCAAGCCTTGGAGAGCAACCCATGAGGAGCCAGGCAGTCAGCAGCCCTCCTCTGTGGCCT 
CTGCCCTGTTTGAGTTGCTGCCCTGGCTTCCTTCAGTGATGGACTACAGC^ 
TCTGGTCGTGGTGTTTCATCAAAATAATAGAAACCCTAACCACATA 

7 0 AGGTCTCACTATGTAGCTTGGGCTGACCTGGAACTAGCAATGTAGACCAGGCT 

TCCCCAGTGCTGGGATTAAAAATGCACACCACCACCAGGCAGTGGTGACGTATGCCT^ 
GGTGGATTTCTGAGTTCCAGGCCAGCCTGGTCCACAGAGTGAGTTCC^GG^ 

ACAAACAAAAAAAT AAAACCCAAAAACATG CCAC CGG AC CT AG C CT AAG CTTTT ATTTTGAG ACAGGGAT CTTGTGT AT CCCATGT 
AGCCTTAATGACCTTGAACTAATCCTCCroCCTCCAATCCTAGT^ 
7 5 TAAGCCAGCAACTCCACCAACGGATACCCAGCCCCAGCCCCC^CTCCAACCCCCTGCCT 
TTGGCTGGGGGGACAGGTGGATCAGAGGTCAAGAGCAGAAGGGACTAAC^
AGGGATCCTGGTCGCAGTGGTACCAGGCTCTGAGCTTCCTTTACCCAGAACT 
GGGGACACTCCAGTTCAGACTACGACCTCACAGAGTGGGTGGGTCTCCCACCT 
CAGTCACTTACCCTCTGAGCCGGCAGGGACATACTT^ 
5 GAACCTGAAAAGGCGGGGAGTGGAGAGAAACAGGATTATGTGTGAG 

GCAACGTTCCCCAGAAGAGGCGCCCTGGCTGGCAAGCCTGGCCTAAT^ 

CCGTTGGGTGTTTGGTCCAGTGGGCCTGTGGGAA^ 

GCCTCAGAGTCCCAGTTCCACGTGTCTAGGGGCACT^ 

GGACAGGCTCAGCACGCGGCCTGGGAGATCAGCCTGCTGGACAGAGTGCTGGGGACAGGAGG 
1 0 ATCGCTGTGGCTCACCTGTTAGCTGGAGATGGTCTCCCAGGTTC 

AGCACAAGTTGTGGGGCITCAGATTGCAAGACCACTAGCTGTCTTGTC^ 

GGCAG CCCCTG CCACT CACCCCACTGAAAATGGT AC C CTGTTT ATTTGCAGGGATG ACAGAAC C CTTGATG CCT CACCCGCAG ACT 
CTCGTTGAAG ATCGGGTTCAGATT C CGTTT CTTCACTGACGTCTT ACG CTTGCT CTG CTT AT C CGGG AGG AGGT AG CTTTT CACGT 
AGCTGGGCAAGGGATGTGAAGGCTGGAGGTAAGGAAGAGCCC7ITC 
1 5 CATAACACAGCGTGCCCATGTGGATGTTAATTAACAGGC 

GAAGAGAAAACTGAGGCCGAGGCAGTTAAGTGGTAGGCTGGGTGAT 
GTAGGCACTCACGGATCGGLAGCXK3CGTCGCCG<^ 

AGGCTCGTAGTGCAGCGAGAAGAGCACGGAGCCTCGCACCTGCACCGTC^ 
TCAGCTGTGGGCGGGGCXKX3TGGATGATCAGGGTCCC 
2 0 AGCCCCTCCCTAGCTCTTCACCGTAGAGGAATTGAGGCTGGACAC^ 

GCCCCGGGGCCTCTTCTCCATTCTCCAGGATCTTGGAGGTCGCCTTGGACTGGAGAGGGGAG 
TCCCTGGACCTTGCAGGGGAGTTGGGGGTAAGGGGTCATTCACCGACGGTTCCCACCCCCAGGGCAGGTACA 
TCCGCTCCCGGTCTTGAGAACAGCCAGTGGCTACTCCCGCCTACCTGGGCTGGAGTTGGCT 
AGGATCCAGCTCTGGGTCCX3CCTCTGCC!ACCTGCACACCCCCTTCGCCAG 

2 5 TCTCCTATGTGTCCCTAAGCAAACCTGGAGGGTTCACCTGTCAGCTTCAGGCCCCAGGGGTG 

AGTTGGGGT CCCAGGAAGAAGAGGAAGGTGGACATGATGGTTGTGGGGG CAGTCATT AAAAATGGCCTCTG ATCCCCTCAACCTCT 
TAGGTTCCCCAGCTTACCCTCCAGCTCACATTCCTGGTCC^ 

GTGAGGACAAATCAGGCCCCTGAAGAAAGCAGAGGGTAGAGGTCACGGTGGGGGTTAATGAGAAAGGGATCAA» 
GTCAGAGGTCAAAGACTCCAGCCTGCTCACCTTGCTTGGTCTGAGT^ 

GTGGGGGCCCCAACATCTGAGTCCCTGGCCT 

ATGACCTTGGACTCCTGTTTCTACCTCCTGAGTGCCAGGATCTGAGGTGTTCACCAGAATGCCGGG 

CAAACCCAGAGCGTCATGCGTGTTAAGCCAGCACTCTATCAACTGAGCTACATCCTTTGTGTTTGTT^ 

GGGGTTCCCTGTGTAGCCCTGGCCTTCTGAAACTCTAGAGTTGTTAAAAATTGGAGGCTGGCCTCAA^ 

3 5 TCTCTGCCTCCCAAGTGCTGGGATTAAAGGGGCAACCTGCCCCAC 

TTTATTTAGACTTATTTGTTTTATGTAAGTACAGACACACCAAAAGAGAGCATCTG 

GTGGTTGCTGGGAATTGNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNIWNNNNNNNN^ 

NhWNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNKNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNiWNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 
5 0 TCCAGGTTGAGCAGCATAGTAATCACTGTTCATTAAGTGCTTATT^ 

TGTGTGCGTGGTATGCATGCCTGTATGCTCCCATGTATGTGGTCATACGCATGTGGAGGTCAGAGTATGAACTTGCTCATCTTCAG 

CTCTCTCCATCCTATATACTGAGGCAAGGTCTCTCAC^ 

CAGCAATGCCCTAGCACCATTGCCGGAGTGCTGGGACAGTGAGTGGG^ 

GAACTCCATTTCTCAAGCCTATGCAGCAAGTGTTT^ 

5 5 GGGGAGGTTCAAGGTCAAAGTAACTGCCCAAGGTTACACAGCTGTTCCCAGACACTGCTTGTCACrCT^ 

TGCTCCTCTGGGGAGACGAAATATCTTCGTCTTGGAAGTCAAAAGTACTCAAGGATTAGGGGACATGCT 
TATATATATCTGGGAGCTGCCAGGAACCGATCAGGGTGACCTAT 
GGAGGGTCCAGAAGGGTAGTACTAAGGACAGGCAGTATCTGTTCACCAA^ 
CCCAAAGACATGGAGGCTAGGGAGGGACCTGTCACAGGTTTGAGAGAAGTGAAGAGCT 

6 0 AAAGGTAGCACCAAGAACCACCCAGACCCAACCACCAGCATCCAAGGACCCCGAATCCCTTCCTTCTATCT 

TCACCTGACCCGCCCTTCCTCCAGCTGGCGCAGGTGGGCA 
GGAAGCTGAGGTCC^GCAACCCCTCAACTTCAGATCCT^ 

GATGGGTGGCCTCTCTGGGGCATCAGCTGGGGTTGTGCCCCAGCTGGGCATATGTAGCTTC 
GTATCTCTCAAATGCACAGCACACATCCTGAAGCCAGCTGCCGGCTGTCTCCCAAGCATAT^ 
65 GGAAGGCCAGGTCAAGGTGAGGTGGGGCGTAAGTGATGCACTGCAAGAGATGCAC^ 
AGTATTGCAAGACAGTACACGCTGATACACATACCCACTGTTCTACAC^ 

CACATCCC CATG AAGGAACTGTGCGCACACAT ACATCATGAAGGG ACTGTG CACACATACCCAGAGTCCTATGCCCAAG AGCTGTG 
CAGGACAGTAATTCTTAAACCTGCACACAGTATGCC 

GATCTCTCTGGATTTCCTC^CACACTGTCTAACCTGCTGCATACTACTAGAAACT 
70 TTTGCACACAACATAACCTGAGTCTTAATGAAATGCAAAAATATACCTGCATAAACATG 
CTCATGTTTATCCCCTACCTAGCCTGCCTCTGCCCTTGCCT 




AGCTGGCAAAACAACTCATGCCCAGGTACCACAGCCACTCCGGA 
7 5 GGGACCTGCACTCAGTAGTGGCCCTGGT CTGGGCCCGCTGGGCAGTCTCTCTXriTTCCCAGCCC CTCCT ATGCCAGGCCCCCAGCT 
GGGGTTGAAAAGGGAGGAGGGCAGCTCAGGGGCAGGTCCTGGCTTGTG^
ATCTAAGCACAGCAAGCAAGCGAGCCTCCTCTCCTACCCCACTTTCCCT 

C CCGATG CCTGCTCACTGACAGTGACT CTTT CT CTGT ACTTGAGGTG CTGGT AT CAAG CT AG CTTGGG ACVnTCT ATGGCGGGGT 
GTATGAGCATCGCCGTGTCCTGTTGTCCTGCTGCATTAGTAGACATGTCTGC^ 
5 G CTGTGTTGCCTTATGATACTACAGAGTCTTACAGGAGTGTCGCTG CTGTGGGCAGTG ACTTGCTGGG AG CCAGTGCCGTGTGGGT 

ATGTGC^TTTCTTCCTGTGTGGTCAGCCAGCCCAGGCTGGCT^ 
GGGAGTGAAGGTGGAAGACATTTGGCAAAATTGCCTGGCTTTG 
CAGGATCTCTCTACCTCAAGTTGTATCTCTTTCAG 

GGGACCCTTTAGTGATAGAAAGTCTGTATGTGTATCTGGGCAGGCCAGAGTCAT 
1 0 AAACTCCTGAGGTACCAGAATGTGGTCAGGACACCAGGTGGCAGTGCCTGTCCCTrAGAC 
CAGTAGGGACGACAACTTCAAAGGCGACTCCAGCTCTCCAGGAAGACAGTTCAGAG 
TCGCCOUSCAGCTGCTGCCCCAGTCCCGAGGAGGTGGAGGAGGAGGAG^ 

MOUSE SEQUENCE - mRNA 
1 5 TCTGGGTGTAGCACGCTGGCGGTTGACTCCCAGGACTCTTCTGTGACCTGTGA^ 

ACCCAGTCCTGGATTGCAGATCTGTAAGGACCGCGAAAGATTGAGCAAAGAAGTCTTT 

TCCAGCCCATCTCCGGATCCCGACTGGATCCACACCTTCAGCCTCGTGCCCTGTCCCGGGT^ 

GAGCGCCCCGCATGGCGGGGCCATGTCCTAGGGCCGGGGTTCTAGAACGGGCGGGCAGCT 

CTCAGCCGGGGCCGCTCGTCTCCTGCGGTCACX3GGTCGGGGCTGCGCGCGG^ 
2 0 GAGCCGGGGCCCGGGGTGGAGCCCGGGTCGGGAACCGAGGCAGAGCCGCTGCCG 

CCAGGCACACGGCCACCCCCGCACGTGCGCAGCCT ACCCTTTGCGACGTGGCGCTAGG CG ACACCG CGCTGG ACTCCTTCTACAAC 

GCGGATGTAGTGGTGCTGGAGGTGAGC^GCTCCCTGGCACAGCCTTCCTTGTTCT 

GACCAACAACGTGCTCCTCTGTTCCCAGGCAGAGCTCCCTGATCTGCAGG 

GTGTTGGCAGCTACACACTGATTCCTTATGTGGTGACAGCCACTGGCCGGGTCT 

2 5 GCCGATGGGCTAGTACAGGCTGGGGCGGGCACTGAAGCCCTCCTCACTCCCCTGGTGGGCCGGCTTCT 

AC CCACAGACTCTTGCGGCTATTTCCGGG AG ACCATTCGTCAGGATATCCGGCAGGCTCGGG AG CGATTCAGTGGGCAG CAG CTG A 

GGCAGGAGCTGGCTCX5CCTGCAGCGGAGGCTGGACAGCGTGGAGCTGCT 

CGCGATGTCOVSGACTACTCGGCCATCATTGAGCTGGT^ 

TGTCTG CTTTCACT ACACATTTGCCCTCAACAGGAGG AACAGGCCTG CTGTCCTGCTGCCATTGG 

AATGCCGGGCACCTGGAGCAGGCCTATCACTGGTACCGCAAGGCGTTTGATGTGGAGCCCAG 
TGTTCTCCrCATTGCAGCCGGGCAGCATTTCGAAGACTCT 
GCAAAGGCTGTGTAGAGAAGATGCAGTACTACTGGGATGTGGGATTCT 
GTGGTGCTGGCTGCAGAGCAACTGTACAAGCTTAATGCCCCCATTTGGTACTTG^ 

3 5 GCATTTCAGACCCACACCAGAGCCCTCAGGAGGACCCCC^ 

CTTTCAAGATGGCTCCTCCTCAGGAGGACCAGTGCCTGGTACTGGTGCTG 
ATTCAGGGAACAGACCCCATGAGCGCAGTGACCCTAAGCTTGCTGGAGCCAGAGACTCAGGAGG 
AGTGACCTCCATCTGTGGGATCAGCACCTCCAAGCTGGACCAGCGCT 
AGCTGTGCTTTCCCAGTGTGGAGCGCTGCCAGCGGTTCT 

4 0 ACTGAGGAGGCAGAAGGCGCAAGGGAGGTGCTGGAGTTTGATTATGAATACTCGGAGACCGGT^ 

CACATATGGGGTGGTGTATGCCGGCCGAGAC^GGCACAC 
TCTCTCAGCCTCTGCATGAAGAGATCGCTCTTCACAAACX3ACTTCX3T 
GGCGGCTACCTCAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGC^ 
CAATGAGAGCACTATTAGCrrTCTATACACXSTCAGATCCTACAAGGACT 

4 5 TCAAGGGAGACAATGTTTTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCT 

ACACCGTGCACGGAGACTTTCACAGGGACTCTGCAGTATATGGC^ 

AGCTGACATCTGGTCrCTGGGCTGCACTGTAATCGAGATGGCAACAGGTC^ 

CXATGTTTCAGGTGGGCATGTACAAGGTACATCCACCCGTGCCCGGTTCCCTG 

TTTG AG C CAGATCCCCGCCTCCGAG CCAGTGCCCAAG AGCTGCTGGGAGACCCCTTCCTTCAGC CAGGG AAG AGG 

5 0 GGGCTCTCCTCGGCATACTCCCCGGCCCTCAGGCGCC^ 

CATTCCCAAGGCCCCAGGCACCCTCT<^CACCC^CCCAGC^ 

GTGCCCGAGGAGCCCGCTGCCGAGGAACCCGCGTCCCCAGAAGAGAGTTCAGGCCTGAGTCTGCTGCACC^ 
GCGCATGCTGGCTGCGGTGCTGGAACAGGAGGTGCCCACACTAGCAGA 
AGATTCATGTGGAACTTCTGCTTCGTGGCCTTGGGGCACAAATCCACACT 
5 5 CTGCAAGCTCAGCrTGCGGGCCCAGAGCCTGGGCCCTGCGCTTTTGAAGGGGCCGCTCTTCGCCT 
CCTCCGCAGACGCCAGATCCGCCCACACTGGATGTTCGTGTTGG^ 



CAGGCGCTGCAGAGCCAGCTCCCACCAGAGCAGGGACCCCCATCGTTGATGGTGGAGTTGGGCCTT 
GCTTCGGGACCTTCTGGCTGAGAAGGAACGTGAGTGCCAGGCGCTGGTC 
6 0 AGTATGCCCCAGCrTTCCGAGACCCCAGCCACTCTCCCAAAGGACCAGAACCTGGTGCGGTGGCTACAGGA^ 
GCCACTATCCAAACGCTCCTGAGTCATAGCTTCACCCTTCAAAC^ 

AATCJVGGGGAGGGATGGTATGCCGGATTTGGAGAGCCATCTTGGCACAGCGAGCAGGAGCCACAT 
ACGCTGAGTGATGACATCAAAGGACACAGATACGGATGGATGAACAGAGA 

CCAGGAGCAACTACAGGGACCAGGCCCAACCCCAATGGGTGGAACTAACTAGGCAGGC^CCCAAAG^ 
ACTATTAAACAGAACACTTGCTACC

MOUSE SEQUENCE - CODING 

GCGGGGCCATGTCCTAGGGCCGGGGTTCTAGAACGGGCGGGCAGCTGCT 



70 GGTGGAGCCCGGGTCGGGAACCGAGGCAGAGCCGCTGCCGTTGCGATGCCT 

ACCCCCGCACGTGCG CAGCCTACCCTTTG CGACGTGGCGCT AGG CG ACACCGCGCTGGACTCCTTCTACAACGCGGATGTAGTGGT 

GCTGGAGGTGAGCAGCTCCCTGGCACAGCCTTCCTTGTTCT^ 

TCCTCTGTTCCCAGGCAGAGCTCCCTGATCrGCAGGCC^ 

ACACTGATTCCTTATGTGGTGACAGCCACTGGCCGGGTCTTATGTGGCGATGCA 
7 5 ACAGGCTG^GGG CGGGCACTGAAGCCCTCCTCACTCCCCTGGTGGGCCGGCTrc CACAGACTCTT 
GCGGCTATTTCCGGGAGACCATTCGTCAGGATATCCGGCAGGCTCGGGAGC^

CGCCTGCAGCGGAGGCTGGACAGCGTGGAGCTGCTGAGCCCTGACATTGTCA 

CTACTCGGCCATCATTGAGCTGGTGGAGACGCTGCAGCTTTTGCCCACCTC 

ACACATTTGCCCTCAACAGGAGGAACAGGCCrrGGGGACCGGGAGAAAGCCCTQ 
5 CCCGTGGCTCCXK^TCTGTACTGCATGTGTGGTCGTATCTACAAGGACAT G 'I'X'C T T CACTTCTGGCTTCCAAAATGCCGGGCACCT 

GGAGCAGGCCTATCACTGGTACCGCAAGGCGTTTGATGTGGAGCCC^ 

CAGCCGGGCAGCATTTCGAAGACTCTGAGGAGCTCCGGCTGATTGGC^ 

GAGAAGATGC^TACTACTGGGATGTGGGATTCTACCTGG^ 

AGAGCAACTGTACAAGCTTAATGCCCCCATTTGGTACTTGGTCTCCGTG 
1 0 CACCAGAGCCCTCAGGAGGACCCCCXXTTTCXjAGCT 

CCTCCTCAGGAGGACCAGTGCCTGGTACTGGTGCTGGAGATAAACAAGGTCCT 

CCCCATGAGCGCAGTGACCCTAAGCrTGCTGGAGCCAGAGACTCAG^ 

GTGGGATCAGCACCTCCAAGCTGGACCAGCGCTGCTGCTITCCTGTACGCACTT 

AGTGTGGAGCGCTGCCAGCGGTTCTGTGGCCTAATTCAGGTCTTGGTGATGAATCCAGA 
1 5 AGGCGCAAGGGAGGTGCTGGAGTTTGATTATGAATACTCGGAGACCGGTGAGCGGCn'GGTGCT 

TGT ATG CCGG C CG AG ACAGGCACACG AGGGT ACGAAT CGCCAT CAAGG AGATC CCAGAGAGAGATAGCAG ATT CT CT CAG C CTCTG 

CATGAAGAGATCGCTCTTCACAAACGACrrCXSTCATAAGAATATTGTGCGTTATCT 

GATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTTA 

TTAGCTTCrATACACGTCAGATCCTACAAGGACTCAG 
2 0 GTTTTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTTG^ 

GACTTTCACAGGG ACT CTGCAGT AT ATGGC CC CGGAAATCATCGACCAGGG CCCG C G AGGAT ATGGGAAGGCAGCTG ACAT CTGGT 

CTCTGGGCTGCACTGTAATCGAGATGGCAACAGGTCGACCGCCCITCCATCAA 

GGCATGTACAAGGTACATCCACCCGTGCCCGGTTCCCTGTCAGCTGAGGCCCAAGCCTTCCT 

CCGCCTCCGAGCCAGTGCCCAAGAGCTGCTGGGAGACCC 

2 5 ATACTCCCCGGCCCTCAGGCGCCCCTTCCGGTCCTTCCAGTCCTTCAGCTGACTCAGCCAC^ 

CAGGCACCCTCTCAGCACCCACCCAGCCCCCCAAAGCGCTGCCTTAGTTATGGGGACACCA 
CGCTGCCGAGGAACCCGCGTCCCGAGAAGAGAGTTCAGGCCTGAG 
CGGTGCTGGAACAGGAGGTGCCCACACTAGCAGAGAATCTCCTGGAAGAGGAAC^ 
CITCTGCTTCGTGGCCTTGGGGCACAAATCCACACTCCTAACCGCCGGCAGCTGG 

3 0 GCGGGCCCAGAGCCTGGGCCCTGCGCTTTTGAAGGG 

AGATCCGCCCACACTGGAIX5TTCGTGTTGGACTCGCTGCTCAGCCGTGCAGTCCGGG 

GAGAAGAAAGCGGTCTTACCGAGGTCAGAAGAGTCGAGTAAAGAGGAGTCCCAGCAGAAACCT 

CCAGCTCCCACCAGAGCAGGGACCCCCATCGTTC^ 

TGGCTGAGAAGGAACGTGAGTGCCAGGCGCTGGTGCAACAGGCCCTGCATCGGGTGCATGC^ 
35 TCCGAGACCCCAGCCACTCTCCCAAAGGACC^GAACCTGGTGCGGTGGCTACAGGAACTGAGTGTAGA 
GCTCCTGAGTCATAGCTTCACCCTTCAAACCCTGCTCACCrrGTGCCACT 

TGGTATGCCGGATTTGGAGAGCCATCTTGGCACAGCGAGCAGG AG CC AC ATCGGT CACCCCAGT ACCCCGGGACGCTGAGTGA 



HUMAN SEQUENCE - GENOMIC 
4 0 ACCACCAGG CAAGATGGGCCCCAAG<^TGAGCCAGGTGAGTAAGTGGGG CTGGAGGCCTCCCCGCTTTCTTTGCCTCTGTCCCTGG 

ACTCTAGGGTGGGATCTCTGCTTGGCH^CTGTGOT^ 
TAGGAGAAGGGCCTGGCTACCCTATGATGGCCCTCCTATTGG^^ 

GGGTACCCTATGAAGGCCCTCCTAC^AGGGTTTGGAAGGCrGGGAGGAGTGGAGAGGATTAGGAGAAG 
GGCCCTCCTACAGGGTATGGAAGGCTGGGAGGAGTGGGAAGGATTCGGAAAAGGGCCT 

4 5 CTGGGGTGCTGAGACATCAGAGCCAACACrCATCCCTT 

GGCAGCCCTTGCAGGAGCCAGGAACn^GGGATCACATGGTGTTGGGTGTTGG 
GCOTCCTCAGAACCCTTCTTCAGGGACCTGAAGGAGCCAA 
GTCAGGGAATAACCTGGGCACCACCCCAGTCCATCTCCCCTTTCCCAGGCCCC^ 
ACCTTGAGCGGCTGGTACCATCTGTGCCTACCTGAGGGCAGGGCCCT 

5 0 GCTGGTGAGTGTCTGAGAAATGGGCTAACAGGCCAGGCCCACTG^ 

AGAGCAAGGCATTTCCTAGAGG AATT(^CTGACAAATGTTTAGTGAG CCCC CAATGGGACCTGAAG AAGTATGAGGG CTTCACAGA 
GGAGGTG CCATTTG AAAAGAC CTTAAGTTG AGTGCAGT AG CTCACGC CTGT AATCCCAGCTACTTGGG AGGCTGAGG CAAGAGAAT 
TGCCTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGTTCTTGCCATTGCACTCCAGCCT 
CAAAAAAAAAAAAAAAAAAAAAAAAAATAGGGTAGGGAGGGGCATTCCA 

AGGACCTCTGTAAAGGCCAGTTCATGGAAGGCACTAAATGATCAAm 

GACGTGGTGGCTCACGCCTATGATCCCAGCACTTTGGGAGGCCAAGGCG 

GGCCAACATGGTCAAACCCCATCTTTACTAAAAATATAAAAATTAGCCAGX3 

GAGCCTGAGCTGGGAGAATTGCTTGAACCCCGGAGGTGGTGGTTGCAGTGAGCCAAGATTGAGTCACT 

6 0 CGGAGTGAGACTCTGTCTTAAAAAAAAAAAAAAAAAAGAAGAGTGTGCAGGGCAAT^^ 

AGG^GCCAAGTAGAACTTCTGTTTTAAAAAGATGATTCGGGCTACACCATGAAAGATC 
GGACATGGTTTAGGAAGACIAGCAAGGAGCCAGGCAGTAAAGACAGTGGA^ 
CACCTTTTGGGAGGCCGAGGTGGGTGGATTGCCTGAGCTC^ 
CTAAAAATACAAAAAATTAGCTGGGCGTGGTGGCAGGTGCCr^ 

6 5 NNNNNNNNNNNNNNNNNNN AGC CG AG AT CACAC CACTGT ACTC CAGCCTGG T AACAGG^GG TG AG ACTCT ATCT CAAAAAAAT AAA 

TAAATAAAATAAGACAGTGGAGCIAATGGAAATAGAAGTACTTAGGTTGACAACTCTATAGGACCT 

GGGAGGTGGAAGACATCAGGGTAAGGCCAGGGGTCCAGCTTGGGCTGTGGGAGTGGC 

TGTTTGTTTGTTTGTTTTGAGACGGAGT 

ACCTCCCAGGTTCAAGCAGTTCTTGTGCCTCAGCCTCCTGAGTAGCT^ 

7 0 GTATTTTTAGTAGAGACGGGATTTCACCATGTTGGCCA^ 

CCAAAGTGCTGGAATTACAGGCATGAGCCZACCAAGCGTGGCCAAATATCT 
TGACACCCAGCAGCGCTGGCCATCTCCTGTGGGTGGGGACAGTGTTTTGTCAA 
TGAAGTTGAGTGTCAGTAAAAGTTAACAAGGGAATGTAGAGTTC^TAGAAATATTCAAAG 
AACAGGCCCCAATCCAAAGAGAGAGTGGGT^^ 
GAGCAGGTTTTXXX3AACCAAGAGTCTGAATTCT

GGGTCCTGGAGTCTGGGGGTTACTGTCTGCCATCCTGAATCCCTGCCCCACCCCCAAGAACT 

CCTTCTTTCCAGGTAACTGGGAGCTGCGGGTAGAGCTGGAAGACm 

CTCCTCGGTGAGGTAGACCACTACCAGCTGGCACTGGGCAAG 
5 GAAGGAGGCTGGATCCTTTCCCAGTG<rrGCCACTGCCCCCATGAATGA 

CGCATCTGAGAAATGGCAATCATGAAACAGGATGGTGCAACAGTGAAGAA 

CCATCTCTGCCATTTAGTACTTATGTGACCriTG 

GGGAGGCCGGGCACGGTGGCTCATGCCTGTAACCCTAGCACTC^ 

ACCCAACCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAAT^ 
1 0 CAGCT ACTCGGGAGGCTGAGG CAGG AG AATCACTTAAATCTGGGAGG CAG AGGTTG CAG CG AGCCAAGAGAG TGT CACTG CACTCT 

AGCCTGGGCGACAGAG CGAGACT CCAT CTCAG AAAAAAAAAAAAGG CGGGGGGAG CG CAGGG AGG CAAGGAGGAAGTTTTGGGTGC 

CGCCTGAGGCTCTGTTGAGTGTTAGTGTTCTCrrGCTTTGGAAGAGGAACCACQ 

GGAGAGATTTAAAAGAGGGAAGTGGTTATGAGGGAGTGCCTCAG 

GCTTTTCCTAGCAGAGTTGCTCAGACTGGATGGGAGCTGAGAGAAGACAAAGGAGCT 
1 5 AAGACAGGGGGAAAAGCTGGGAGCTTTGGAGCCAGACAGTCCCGAGTTCAAGTCCTGCCGCTG 

CGCAAACAACCTGGCCCCTTGGTGTTGCTCGTGGGGCCTGATAAAATTCAACTCr^ 

CTCCAAAGGTGCCAGGCATGGTGCTGGGCACCAAGGTGCTTTCCATCCCTTCCCTGCT 

AGGC CCTTT ACCAC CTATGACG CTGAC C ACG ATT CAAG CAACAG CAACTGTGCAGTG ATTGT CCACXK5TG CCTGG TGGTATG CATC 

CTGTTACCGATCAAATCTCAATGGTCGCTATGCAGTGTCT^ 
2 0 GTGTGGGCCACCCCTACCGCAGGGTTCGGATGATGCTTCGATAGG^CACT 

CCGGATCGTC!AGCCACCTTGCCTTTGCCAACCACCTCTGCTTGC 

CTCTTCCAGTTACrTGGGAAAAAACCCTAGAGGGCAGAGCAGGGA 

AGGAAGATGGAAAGGACAGCCTCTAAAGTAGTGAGGCTTCTGTCTCTGG^ 

AGGACTG^IAGGAGATTCAAGCATCCATTGACAATTTCCT 
2 5 ACAATTGCACAGCTTTTTTGTTTGTTTGTTTGTTTGTTTGTT^ 

GCACCATCCTGGCTCACTGCAAGCTCTGCCTCCCAGGTTCACGCCATTCT 

ACCCGTTACCACGCCCGGCGAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCAT 

CerCATGATCTGCCCTCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAG 

TTTTTGAAATGGAGTCTTGCTCTGTCGCCCTGCCCAGGCTGGAGT^ 
30 NNNNNNNNNNNNNNNNNNNNNNJTONNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
35 NNNHNNNNNNNNNNNNNNNNNNNNNNNNNNIWNN^ 

NNNNNNIWNIWNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNTOINNNNNNNNNNNNro^ 

NNNNNNNNNNNNNNNNNNNNNNNNIWNNNN^ 

NNNNNNIWNNNNNNNNNN^^ 

NNNNN1WNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
45 NNNNNNNNNNMNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNm^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNOT^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TAGGO^CACCGCGGCTCTGGATGCCTTCTACAACGCGGGTGAGAGCGCTGGGGGCGG^ 
TGGGAGGGGACCAAGCCGGGCTCGAAGGCTCAAGGGAGGGGCCGAGAGGGGTACCGGGTCT 
6 0 GAAGGGCTTGGG ACAGGG C C C GTGAAAAAAG TGGG ATTTAGGGG AT CTGGGG CG CGG C CGGG CTCGGG ACG AG ACAGGGAT C C GGG 
CGCGGCTCTAGGGCTCGAGGTTGGGGCTTAGGCACCCTTCAGTCGGAGCGCT 
TGGAACTCTGGGGCGGGGCGGGGAGCGGCGCGGGGCTGTGGTGTCTCAGGGGTGGG^ 
ATCTCACGCGGATCTCTTGAGGACAATTACTGGGAGAGGAGCTTGGACrTTAG 

T AGCATACTATTTC CAAGGATCGGG CTAGGAGAGGCAAAAAG AG AAGCG AAGGGGCCCG ACGGGACTGGATACTGGGTGAGGG AGC 

6 5 TTAGGTTAAGTGGCCTCAGCGGCCCAGAAAGAATGGGCGGGGAAG 

GTCCAGTAGCTGTGCTTCCTGGATGTCCAIGAAATCGTAGCCTAAAATGAAGGC^ 
ATCGACAGACACAAATCGCATACATGTTCACTCATGCAGGCCCTAGC^ 

G CTATACATG TCACAGAT AT ACACATACAT ACATTT AGG CAC AGCGTTGT CT ATG CACACTGATGGT AG C CCCTTTCAAG AG CCTG 

7 0 TG CTGAAGAC CAAAC^Ka AGGGG ACTTCT ATGTTTGCG CAACTCC CACCT CCTAAATGGC CCGTAACAGACTCAACTGGTC CAC AG C 

CTCCCTTCCCCCATGCCAGGTGCTCACAGGCTCTGAACCGTGGCCTGAGGGTCCCAACCCAG^ 
ACTACAGAGATCAGTTCTGTGAAAGTGCCAGGGGAACCCTAAAAAGGGCCGACTGAAGCC 
TCCCA.CAGCAAGGCAGGTCAAGCCAGGATGGACCCCGGCTCCTTCCCCATCAAGCT 
TGCCTGCAGATGTGGTGGTGCTGGAGGTGAGCAGCTCGCTGGTACAGCCCTCCCTGTTCT 
7 5 AGCATGACCAACAATGTGCTCCTCTGCTCCCAGGCCGACCTCCCTGACCT 
GGCAGGCACAGGGCTGGGGTAAGGCCTACACAGAACACCAGTATCTGACTGTCTGCCTCCTGCAGG^
TCGGTGAGCAGAATCCACCTCTCACTCAAAAGTGCCCTTTGA 
CAGTCACCTAACCCCCCACTGACATCCCAAGTATTCCTCrGTTTCCTGTTGCCTC 
CCAATTCCCACTGATGCTGACCTTTGATGCTCTCACTCCCCrTGTCTAGCCCT 
5 CCTCCCACCAGGATTGCGTTGGCAGCTACACACTGA^ 

CTTCTGCGGGGCCTGGCTGATGGGCTGGTACAGGCTGGAGTGGGGACCGAGGCC 

CCIG CTGGAGGCCACACCCACAGACT CTTGGT AATG AAGG CCC CCAGGG AGGGTGCT C CAGGAGGGGGCTGGTG AGGGT ACGG TGG 

GCGGGACATCCGGCAGGCGCGGGAGCGGTTCAGTGGGCCACAGCTC 
1 0 TGGAGCTGCTGAGCCCCGACATCATCATGAACTTGCTGCTCTCCTACCGCGATGTGC^ 

GCAGCKKSCATTTGAGGTC^GGCAAACCCAAGTTCAGTTCCTGGCT 

TGTCTCTGAGTGTTGGCCTCCTTATTTGTTAGAGAGGATGAGCATGTCGCTGCCT 

GCATGGGGCATACAGTAGGTTCTCCCTGTGAGGTGTGTCCGCTGGGCCrGAGGGA^ 

TCAGGAGTGGGACAGGACCCTGGCTCAAGKTTTCCTTGGXK5CCTC 
1 5 TACTCGG CCATCATTGAGCTGGTGGAGACGCTGCAGG CCTTGCCCACCTGTGATGTGGCCG AGCAGCATAATGTCTGCTTCCACTA 

CACTTTTGCCCTCAACCGGTGAGTGGCAGAGCAAGGGTTAGGTCAT^ 

ACTGCTGCCTGGGTCGGGGTGGGGGAGGTGGTGCTGCCTGTGGTATGGTCACCG 
CATCTAGGAATCACTTCCATCTGAGGTCACTAGAGAGGTCAGTGCT 
2 0 CCCTGGAAGGGTCCCTGCTGGTTCAGTGTCTAGAGGTCCCTG 

TTCCCTAGGAGGAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTG 
CGATCTGTACTGCATGTGTGGCCGTATCTACAAGGACATGTTCTTCAGCTCGGGTTTC 

ATCACTGGTAAAGAGCACAGTGAGTGGCCGGTGGGCTAAGCTGTGAGATGAGGGGTGGTG CTGTGGGCATCAGTCCAG CCCTGTGC 
CCCCTACCCCCACCCCAGGTATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACT 

2 5 CTGCCGGGCAGCACTTTGAGGATTCCAAAGAGCTCCGGCTAATAGGTGAGCACAGATCCT 

TTCCGAGGGGCCTGGGAAGAGGGCAGGGAGAGAGCACACTGGGCAAGAGCTAG 
TACCCACACCTGGGTCCTCATACCTGTTCTGGGCCCTCAGGAGCCCrCAAACC 

ATTACATCCTG AATTCCACATATACCTGTGCCTGAG CTCCTGTT CCCACCCGAGATCTTG CCAT ACT CAATGC CAT ACT CAT AT CT 
ATGTTCATGTCCACACTCGGGCCCACAGAAATGGTGGGCCTCACTCA^ 

GAG^CCAGATCCTCGCCAATGACCCCACCCAGGTGGTGCIG 
GCCCCCTCITGCATTCITGGCCTGACCTGGGCTGTGBlGCT 
ACCTGGCTGCTTTTAGACCAGTGCCCGTGGCACCCCATGCCAAC^ 
TGGTTGGTGCCTGGGCTGAACrGAGGATGTGGCTTTGCATGC^ 

3 5 CTTCAGGCCCACGCCAGAGCCCCCTGGAGGGC CACCACGCCGTGCCCACTTCTGGCTCCACTTCTTGCTACAG 

TCAAGACAGCCTGTGCCCIAC^GCGACCAGTGCITGGT^^ 
TGACCCCACrCCCCAACTGCCTGCAGGTGCTGGTCCTG^ 

GACCCAGTAAGCACAGTGACCCTGAGCCTGCTGGAGCCTGAGACCCAGGTGAAGTCGCTGCTGAGG 
GGACCTTTTCTGTGGAGCCTTTCTCCCATCCTCAGACCCCTCTCCTGTC^^ 

4 0 GACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCC^TATGCGGAGT 

AGGTGAGGGTCGCrGCCACCCAGCCTGACGCTTACTGGGTT CTTCATCCCTTCCCAGCGCCTCAAAG CGCG ACG AG CGCT GCTGCT 
TCCTCTATGCACTCCCCCCGGCTCAGGACGTCCAGCTGTGCTTCCCCAGCG^ 
CGGGGTACTCTGAGGGCC^GGATGCCCGGGGGGGAGTGGGGGAAAGAAAGGGAC 
ACQVAGATGCCCCTCCCK3CGGGTCTGACCTCCCCCIACCTTCAGGTTCT 

4 5 ACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAGATGTTGG^ 

GCGGGCCCTGCTGAACTCCCMCTCACCCOT^ 

AAGGGCACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCAC^GAGGGTGCG 

CAGGTGCGGTGCGTGGCAGCGCGGGTGGGGCTGAGGGCTCXSCGAGGGCC^ 

CGGGGCCTAGTGGGCGGGCTCXSTAGGACCTXaAGAGTTCGAGCCCTGTGGA^ 

5 0 CGGGATTGAGGTCTGGAGGGTCCTGTGGGCAGGGTAGTAAGGGTGGGGAAGGTCGGAGAG 

GGCGGGGCCAGGCTGGCGTGXSGATGGGCGGGGCTTCTCTGCCC^ 
CTCAGCCCCTGCATGAAGAGATCGCTCTTCACAGACGCCTGCGCCACAAGAA 
GGCTACCTTAAGATCTTCATGGAGGAAGTGCCTGGAGGTACCTGCCCTGTGTGGGATGGG 
ACCGAGGCAGGCM3GGGTAGAGAGCCCTAGGGGGAAATGCACCCTAGTGCG 

5 5 GCGCTGGGTGATGGAGTTGGGCTACAACATGGGTGTGTCTGGGCC^ 

CAGTTTCTACACCCGCCAGATCCTGCAGGGACTTGGCTACnTGCA 

GGG CTGGCC CGCCCTCIATCTGCTGAGGGTGGCGGGGAAGGAAACATGGTCCCAG AC^ 

ACCTGTTGCAAAGGGTAGGG AATTCATCAGGGAGG CTC CTTAGAGAAGGAGGGGAAAGGTTCAGCTCTGAAGCCAAACCAATTGTC 

6 0 AGCTCCCAGAGGGGCATATAAGTGGGATGGCAAGGGGAGAGTGTTGACAGAGGC^ 

CCCAGAAAGAGGGTCTAGTTCTGCTGGCAGGGGCCCAGACCAGTGTCCACTAATGGCTG 
ACACCTTCAGTGGGCTGCTCAAGATTrCTGACTrCGGCACCrr 

AGGCATGGAGTTAGAGTCCTAATGGGGAATGAAACCCAGGGTGAGTGATTGGGCCAGG^ 

6 5 CTTGGAGAAGTGATGAAATTGACTTTAGCIACTTGAGGA 

AGTTCAGGGGGACGAGGCAATCCAGGATGTGTGCTGGAGCACTACTGT^ 
AATGACGGAAGCCTTGGC^GGCGATGGAGCCCAGGGTGGTGATGGAACTCAGGATAGA 
TATATTTTGGATGATAGGTTCAGTATGGGTGCAGTCX^ 
CTAGATGGGCArTGGTGCTCCTGTTTCTGTCCT^ 

7 0 ATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGCACTGTCATTGAGATGGCCACAGGTCGCC^ 

CCACAGGCTGCCATGTTTCAGGTGAGACCTCTTTGGGC^ 

GAATGTCACCCTTGTTCACCCTCCCCCCACftGGT^ CTGTCGG CCGAGGCC 

CAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCCCAGAC^ 

TGGGAAAAGGAGCCGCAGCCCCAGCTCCCCACC^ 
ACTCCTTCAGCCAACTCAACCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCT
CTGCCTCAGTTATGGGGGCACCAGCCAGCTCCGGTGAGACCAAGGCTCCTGGGAT 
CCTCTCCTACAAAAGCCCTCCCTGGGGACCTGGGCACCTTGACCACTTGTG^ 
GGCAACTCTAACTTGGACTTGTTGTGTCCTCTCAGTGCGGGGTTTATCTATGTTC^ 
5 GCCAGGATTCAAACCCGGCAACAGAGGTGACTGGCCGGTTGCTGCCCGTCCTCAATCAGCAGCTCGCC 
GAGGAGCCTGCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTC 

GCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAGAATCTGCACCAGGAG 
TCTAGCGCCCCCTGGTGGTCGCAAAGGGTCAGGACACCTGTTGATGTCCCCGCCCCCTACCT 

1 0 GCACACATCCACACTCCCAACCGCCGGCAGCTCGCCCAGGAGCTGCGGGCGCTGCAAGGACGGC^ 
TGCGCTTCTGCACAGACCGCTGTTTGCCTTCCCGGATGCGGTGAGGGCGCCTTCTGGTAGTTAGAG 

CGTCTGCCCAGTCCTAGCTTCCCAGGCCTTCCCTGGACCTCAAGCCCTGCCCACCNNNNN^ 
CTGCCTCTAGTTGAGGCTCAGCTGTCrrCAGCGGCGCCTCCCTCCCCCTG 
1 5 CACACTGGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGC 

GCCACGGGCCTCCTGCCAGCGAGGGTGTGGTGCTTGCTGGGCGCGGTGAACATCGAA 

GGCAGGCAGGCGCCGGTGGGGGCGTG ACTAAATGGACATCTCCCATACCGGCACCG CCTGGGG CCTGCCCAGAGCCGGGCACCCTA 
CCCCCCGCACTTTGGGGTCTGGGGTTCCCAAACGGGCGTTGTAGCCTTGCCTGA 
GAGGCGGTCTCACCGAGGTCAGAGGAGCTGAGTAATGAAGGGGACTCCCAGCAGAGCCCAGGCC^ 
2 0 GCCCGAGCAGGGCCCCGCTCCTCTGATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGTAAAGC 
TAGCCTCTGCCGGGCTTCAACCCACTCACCACTCACCGCTCCATTCTACTCTTCrrcCGCC 
GGAAGGAACGGGAGTACCAGGCCCITGGTGCAGCGGGCTCTACAGCGGCTGAATGAGGAAGC 
CCTCCAAGTGAGTGGGCCTTGGCTGGAGAACATTATTGGTGGGTTCAAGGCAG 
TGACTTGGGCTCATCACCTGAGCGGGCCTTGGCGGGAGAACATTATTGGTGGGTTTO 

2 5 GCTCATCACCTGACTTGGGCTCATCAC CTG AGTGGGCCTTGG CTGGAGAACACTATTGGTGGGTTCG AGGCAGCTTCTAG CAGG AG 

GCACTCTTGGGCTCATCACCTGACTGCTTCTTGACAGCTGCTCTTT 
TCTGGATTCAGGCACCATCCAAATGGTGAGTCGGGGAGTGCTTGGCT 

TCTTCCTGCTCTTCTACTGTGCCCTCTGATGCAGAGAATCTCACTACCAGAAGAACACTTAGTATTCTCGTC 
AGAAAACTGAGGCCCAGAAGGGTGTGATCTGCCTCATTATTTTGGCAACAGAGCAGAGCT 

3 0 TGT AATTTCTTATTTCCCTCTTTCTGCGGGTCTTTGACCACATCCAGCTGTTGAAC CATAGCTTCACC CTCCACACTCTGCTCACC 

TATGCCACTCGAGATGACCTCATCTACACCCGCATCAGGTACATCC^ 

GGCCAACTGCAGGCCTGCCTAGGTTCCTTCCTGGAAACCAAAGGGGCACCTGGGTCCCCTAGGGACAGAG 
AGTGTGTGCAAGATAGAGTCCAGGGCCACCTTCTGGCTGAGCCAC^ 

CTGGAGGGCCAT CTTGG CACAGCGAGCAGGATCCACACCAGTCACCTCTGGACCCTGAG AG CTGAATGAGGGCATCAT AGGCCAGA 

3 5 CAGGCCCCAAGGATGGATGAATGGAGAGGACAAAGGCAGCTTCTGACACACCAGCCCCAGGACCTGGGG 

GCGAGTGGGGCCCAGGACTGGTTCCAGTGAGAGAAACCAACCACAGGCACCCAAGCACTACCAGACA^ 
CTTTTGAACCTTTGTCCTGGCAATTTTGGAGGTGTATGGTGGGAGGGGTATCCT 
TTTTTTTGTTGGCCGGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAG^ 
TCAAGACTATCCTCGCTAACATGCTGAAACCCTGTCTCTACTAAAAAT 

4 0 GTCCCAGCTACTCGGGAGGCTGAGGCAGG AGAATGGTGTGAACCTGCGAGG CGGAGCTTGCAGTG AG CCGAGATTG CGCCACTGCA 

CTCCAGCCTGGGCAACAGAGCXiAGACTTTGTCTCAAAA 

GGCTCATGCCTGTAATCCCCGCACITTGGGAGGCCGAGGTGGGTGGATCATGAGGTCAGGAGTTCGA 

CAGGAGAATTCCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCCGAGATCG 

4 5 AAAAAAAAAAAAAAAAACCTTTTTTAGAGACAAGGTCTTACTCTGTCATCCAGGCTGGAGTGCAGTG 

AGCCTCAAGCTTCTGGATTCIAAGCGATCCTTCCTCCTCAGCCTCCTGAGTAGCTGGG 

T AATTTTT AAACTTTTTTTTTTGT AGAAAT AGCAT CT CG CTTTGTGGCAT AG CTCACTG CAGC CTTG AACTCCTGGG CACAAGCAA 

TCTGCCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCATGTGACACCACCACCTGCAGCTAAT^ 

GGGGTCTCATTATGTTGCTCAGACTGGTCTTGAACTCTTGGGCTAAAGCAATCCTTCT 

5 0 CAGGCGTGAACCACTGTGCCTGGTTTCTACTGAAGTCTTT 

GCTCTTTCCTACAATCACCTTTCTTCCTCCCTGCCCCAGCAGACACCAGCCTTTTATTGCT 

GCCAAGGGCAGGC CCTG AG ATGGGGGT CCAGAGAGAGAGG CTTGGTGGGGCTACGTCCTGG<K^CCAGGTTGGTTCTGAGGGGTAG 
AAGGCCATCCACCCACTCGCACGGCTGCTCCAGGAGGGCTTGCCACAGCTGC 

GCAGCCCATAGCTGCTGCCTGGATGTGGGTGGGCAGGGGTTGAGGGCATGATCACACTGGACACCTTGGGG 

5 5 CTGGTCCCAGATTATGCACTGTCCCCCGTTACCACCCTCAGTTCTCAGTAGATATCAAAGCTCAGGAAAGGACAGAGAGGTA 

GGGCCACACACTGAGCCCCCATCTCCCAGGGCCTGTCCCAGGG CCTGTCTCAGGGTGCTGTC CCACTT ACCGGTGCCCAGGCTG AG 

GCGTGTGCCCCCCAGCTGGCGGTTGGCCAGGGCCCCATGGTCCCAGAGGGAGAGCTCGGCACAAGCCTGGCGCAGGT 

CAAAGCCATCGTACACCATGGTGTGATTGAACACAGGGCTGAGGCTGCGTCGCACAA 

TCATCAGGCAGCACGAAGCTGTGCGGGAGGTGGCCTGGCATCAGGTGACCCCCCTCACACTCCCTAGAGTCATCCAGTCACCACGT 

6 0 CCCACTCATTCTCCCTCACATATGTCCATGTCCCCTGGCTCCT^ 

CAGCCTCCTCTCAGGTCTCCTGGCTTTAAAAGTTACAGCTCCTGGCC^ 
AGGCCGAGGTGGGTGGATCACCIGAGGTCAGAAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAAATTAGCTGGGCATGGTGGCATGCACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAA 
GCGX3CTGCAGTGAGCTGAAATCACACCACTGCACTCCAGCCTAG 

6 5 GGGCGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGAGATTGAGACC^ 

GTTAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAATTAGCTGGGCGTGGTAGCGGGCGCCTGTAGTC 
GGAGGCTGAGGCAGGAGAATGGTGTGAACCCAGGAGGCGGAGCTTGCAGTGCACCAAGATGGGGC 
ACAGAGCGAGANNNNNNNNNNNNNNNNNNNNAATAATAATGTTACA^ 
GCAGTTCTGCTATTCTCGCTGGCTCAGTGACTTTTATTATCTCCCTGTT^ 

7 0 TTTTTTTTATTTTTTGAGATCGAGTCTCGCTCTGTTTTC 

GGGTTCGAGTG ATTCTC CCG C CTCAGTCTCTCAAGT AG CTGGG ATT ACAGGCAC CCAC C ACCACG CCCGGAT AATTTCTTGT ACTT 
TTGATAGAGATGGGGTTTCACCATGTTGGTCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACC 
TGCTGGGATTACAGGCATAAGCCACCGTGCCTGGCCTGTTTCCAAGGTTTTCAAAG 
TCCTCAGAAGTAC n -GTACCTCTGTACTG^ 
7 5 TCTCTAACAGTGGGGGCAG CIACCTGGAGGCACCCCT AGAAGTCCCCTAGAAGCTTACACTG AACTATAAGTTC CCAGCCAAGCCCC 
ACAGCCTCX3CCTCCCAGGTTCTCTATGGAAGGACCCTTC

GCTGAAATGGAGCTCCTCAAAGCTGTCCTCAACTTCCCTG^ 

TGAAAGCCCAGGCTGCGTGATCAAAGGTTGATTAGTAAG 

CCGGAGGGCCAGC^CTCCTCACCATTGTAC13TAAGTCTCCAGGGATCCTGCCCG 
5 AGAAGTGCAGCTCCCCCCrrCGGGGGC^TCCTGCGCCTGCGGAGA 
CAAACCCGGGGACTGGAGTCTCCCGCTTTTCAGGGCGTGTCCA^ 
ACGTACTTGAGGGACAGGGCGAGTAACCCGCGCCTCGGAAGGTCGTC^ 
AGCAGGGGCTAATGGAAAGGGGCCTCAGGAGGGTGGGGTCTGCAGA^ 

GGGG CAG AACCT AGGGGTC CT AG CGCC CTCX^GGAAI3GGGC CX3TGGCCGGCCGGGCTTCGCCTGGGG CCATGAGG TT AAGAGGTTG 

CCCACGCGGCCTGGCrcCCTCACCCGGGGCTGCAGGGGGA 

AACTTOSCCCAGAAAGATGTTGCGACCCAGGCTTTCGCC^^ 

GGACGGAGTACTGAGGACAGGAGGTGAGCrrTAGTGGA^ 

TCCTACCGCGTTCCTACCATGATGGAATTCTGGGGAGCCCCTAAGGGCATTAATGGGGACCTGCACTATC^ 
1 5 GGCTGACAACACTGGATAACGAGTTAATGGGCCCCCTCOSCCGATC 

CCCGCAGGAAGGGGGCGTCCAGTTCATTAACGGGGAACCGCATCGTGGTCAC^ 

TCAGATTCCGTTTCTTCACCGCCGTCTTGCGCTTGCTCTGCTTATCCGGGAGGAGGTAGCT^ 

CCCGAGGTGAGGAGGGGCCCTCCCAGAGCCTTCTCCCCTrGCCTGTGGGCTCCCCCAGCCA 

TGCTAAGCGCTTTACAGGGATTGTCTCGCTTAATCGCACGCCTCCCAATAACGCTGT^ 
20 GGCCGAGACCCTGTGAATGCTGGGCTGGGCTGTGACAGCCGCGCCCCGCTTGGCCGGCGGGGCACTCAC^ 

CCGGGCGGCGGCCAGGCCCTGGCACTGGATCACGTGCACGCGCAGCTCGGCGGCGCCCGGCTCGTAGTGCAGCGCGAAGTC 

AGCCGCGGACCTGCACCGCCTCCGCGTCGCCTGACAGGCTCATCTG^ 

GCCGCTCG CAGGGCTCACGTCGGGG CCG CGGGCAGCGCGGAGG CTTCAGACCCG<K5TCAT AGGTGGCAGGCTGGGGGGCTGCAGCG 

2 5 CCTCACCGTGGAGGAGTTAAGGCTGGACACOSAGGAGCTGCTGCrGAGCATGCGG 

CCCCATTCTCCAGGATCTGGGACGCGGCCTTGGTCTGCAGAG 

AGCGCCCTGGGTCGCGGGCGGCTC3CAGCCCTGGGCAGGAGCGCTCCCCX3CCCGGAGTCCGCTCCGG 
GCCACTCCCGCCTACCTGGGCTTGCTGGGGCCGCGGCTCCTGCTCTCCCCCCGACGCGGGCTCCAGCT 

CACAGACCTGTTGGTCTC CTTGG CCAG G ATCTGGGG TGGGG CACG AGATGCT AAGCGAGCC CTCTCCTG C CCTCCCCT AAG CAAAT 

3 0 TTGTAGGAACTGGTTTACAACCCC!AGCGAGCTGCAGATTCAGGCCCCGGTGCTGCCCT 

GGGAGTGGGGGTGACGAGGGTGAACACAGGTGGTTGGGGCCCCTTAGAGCTC 
CAGGCGCTCCCACTCACCTTCCTGGGCCTGGGACGCCTCCTCAGGATCTGAAGCCTT^ 
GTCCCTGAAGGAAGAAAGGGGTAGAGGTCACAGTACGGGGGCTGGT 
GGGCTCCAGCCAGCTACCCCAGTTTCAGGTTTAGGGCAGCCCTCCGGGCAGGGCT 

3 5 GCTTCACAGCCCACCCCGCAGGGCTTGTGCC^CCTGACTCCCTACTCCCATTGGTGGGGTC^ 

GCCAGGCCAGGCCAGTCCCATTCCTTTCTCACTCCTTTTTTTATTTTTATT^ 
CTTTTTTTTAATNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
hnWNNNNNNNNNNNNNNNNNNNNNNNN^ 

4 0 CACCATCATGCCCAGCTAATTTTGTATTTTTAGTAGAGATGG<K3TTTCT 

GGTGATCTCGGCTTCCCAAAGTGCTGGGATTATAGGCGTGAGCTC 

GGGGGG CATTTTGCTATGTTGCCCAGGCTGGTCTTGAACTCCTCGGCTCAAGCGATCCTCCC^ 

CCACTCACTGTTCACCACCGTTCACCACTGTTTACCACCATTCACCACCGTTCCCCAC 

ACCTCCCCTCCAGGGACACTGTACTCACATGTTGGCT 

TGGTGAGCCTAGGAGTAGGGAGAGGACGCTCTCAGAGGGAGGCACCCTGGTGACTGCTGGGCACC^TCT 

CTCCTCACCTGGGCTCTGGCCCCTCTTCCTTCTCTTTCACAGCAGCCT 

CGCAGGTGAGAAGTTCTGTGAGCCCCCATGCTCAGGAAGCCTGGCTGTGACTCGGGAGA 

5 0 TGCCCACGGAGGGGCATTGCCCACTGAAAGACACACAGTACTGGGGGCAGCCAGGACT 

CACATACAG CCAG CGCC CTGGACGGCACATGGCTTGCCCTGCCCCGAGGATCT 

AGACG CTCGG ACAAGGTCAG AGCCG AAGTGGGCATTGTGGTG C CGCTGGGAGCGTG CTTCCTGGAACCAGTCCCCTGTCAGGATCT 
TCAGCTGCCCAGGGTCTGCCACTGAGGCCCGGAGCTTGCTGGAGGAAGG^ 

T C CCG CCAC CT ACATTG TT AACAGG CTTGGG AC CAGGGG CCAG GG AC CCAACAG CAAAT CAGACTTG AT CTGTG CTG TGGAGG AG C 

5 5 TCXX5GCTGCTGGGGAGACCAAGTCCAGAAGAGACAAGTACACTGTAGTGAGGTGGATGATG 

TGCCTGTGAGAATCCAGGAGGGCTTCCCAGAGGAGGTGACATTTC 

GAGTCTC CCTG CT ATG C CCAGG CTGGAGTGCTGTGG CG CAATCT CAG CTCACTG CAACCTTCAC CTT CCAGGCTCAAGTG ATC CT C 

CCACCTCAGCCTCCTGTGTAGCTGGGACTACAGGCGTGTGCCACCATGCCCAGCrAATT 

GCTTTTCCATGTTGCCCAGGCTGGTCTCTAACTCCrGGACTCA 

6 0 GTGCCCAGCCTTGAATTTCTTCTTAAAAGACAAATTGGAACTCTCCAGGTAGAGAAGGATA^ 

TATTTTTTTTTGAGACAGC^TCTTGCTCTGTTGC^^ 

GG TTCAAGCAATCCTCTCACCTCAGCTGCCTAAATAGCTGGGACT 

GTAGGAATGGGGTCTCACTGTGTTGTCCAGGCTGGTCITGAA 

TAGGATTACAGGCGTTt3CCACTATCTCTGGCCAATTCCT(ZTTAAATCA^ 

6 5 AG CATGGTGCTCAAGGAGGTCAGGTGAGTTG CCAAGG CT A 

GCTCTTCTACAACTGTGCTCTTCCAGGAAGAAGTACCAGCATAGGCATAGAAGTG 
AGTGGCnK3AG<K3GTCACrTTGTGTCTGTGTGT(X3GGTCG 
AGGGAGK3CCCAGTG<1ACCATGCCTAGGAGCTTGCCCTTC 
TCTC^CCAGGC^CC^TGGCTCGCGTCTGTAATCCCAGCACTTTGGGAGGCTC 

7 0 ACCAGCTTGGCCAACATGGCGAAAACCCATCTCTACTAAAAATACAAAAATTAGCTGG 

CTATTTGGGAGGCTGTGGCAGGAGAATCGCTTGAACCCGGAGGGGTGGAGGTTGCAGTG 

CCTGGGCGAAAAAATGAAACTCTGTCTCAAGAAAAAAAAAAAAAAGCAGACA^ 

AAGAGGCAAGCCTGGAGGCAGGGAGGCTGGCGAGAAAGCT<X5 

CCAG AGG>jGACTGAAGGG CAAJXACC AAG AACCACGC^ CCACTC 
GAGGACGCCAGCAATGGCCTCCTGCTCCTC CTCTGTCAGGAAG CTGAGGTCCAACAGTC CTTCAGTCTCAGGCTTTGGCCCATG CG

CCATGGGGAGGGAGGGCAGAGCCCAAAGCCCCTCTTGCGATC 

GG CACACGGAG CTT C CTGGGGG CAG ACCGAGGGTCAG 

HUMAN SEQUENCE - mRNA

GAATTCGAGGATCCGGCTACCATGGGGCTGGAGTGGGGACCGAGGCCCTGCT 

G AGGC CACACC CACAG ACTCTTGTGGCT ATTTCCGGG AGAC CATTCGG CGGG AC ATCCGGCAGG CG CGGGAG CGGTTCAGTGGGCC 
ACAGCTGCGGC^GGAGCrTGGCTCGCCTGCAGCGGAGACTGGACA^ 
TCTCCTACCGCGATGTGCAGGACTACTCGGCC^TCATTGAGCTGG 
1 0 CAGCATAATGTCTGCTTCCACTAC^CrTTTGCCCrCAACTO 

GCCGCTGGTACAGCTTGJW3GGCTCTGTGGCGCCCGATCTCTACTGCATGTGTGG 
GTTTC(^GGATGCTGGGCACCGGGAGCAGGCCTATCACrG{jrATC^ 

AATG CAG CTGTGCrrCCrCATTGCTGC CGGG CAGCACTTTGAGGATTCCAAA CATGAAGCTGGGCTGCCT 
GCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTATTACTGGGATGTGGGTTTCTAC 
1 5 CCACCCAGGTGGTGCTGGCIt5CAGAGCAGCreTATAAGCTCAATGCC 

CTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCA.CCACGCCGTGCCCA 

CTGCCAACCATTCAAGACAGCCTGTGCCCAGGGCGACCAGTGCTTGG^ 

AGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTGGA^ 

2 0 GGACGTCCAGCTGTGCTTCCCCAGCXSTAGGGCACTGCCAGTGGTTCTGC^ 
CGGCGCCOGCGGAGGAGGCGGAGGGCGCGGGGGAGATGTTGG^ 

GGCAAGGGCACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCCATC^ 

CAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCrTCACAGACXSCC^^ 

CTAGCCAGGGCGGCTACCTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCT 

2 5 CTGAAGGACAACGAGAGCACCATCAGTTTCTACACCCGCCAGATCCTGCAC^ 

CAGGGACATAAAAGGGGACAATGTGCTGATCAACACCTTCAGTGGGCTGCTC^ 
CAGGCATCACACCTTGCACTGAGACCTTCACAGGAACTCTGCAG 
GGGAAAGCAGCTGACATCTGGTCACTGGGCTGCACTGTCATTGAG^ 
ACAGGCTGCCATGTTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCT 

3 0 TCCGAACH'TTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCCCAGACACTGCrGGGGGACCCCTTCCT 

CGCAGCCCOlGCTCCCCACGACATGCTCCACGGCCCrCAGATGCCCCTTCTGCCAGTC^ 
GTCTCAGACATTCCCGTGCCCrrCAGGCACCCTCT 




3 5 GGGGGCCCGTCTGGGCAGAAACCATGTGGAAGAGCrGCTGCGCTGCCTCGGGGCAC^CAT 

CCCAGGAGCTGCGGGCGCTGCAAG^ACGGCTGAGGGCCCAGGGCCITGGGCCTGCG 
GATGCGGTGAAGCAGATCCTCCGGAAGCGCCAGATCCGTC 

GGCAGCCCTGGGTGTGCTAGGACC^GAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGA^ 
AGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGC^^ 

4 0 GCAGAGACTGATCGGCTGCGCGAAATCCTGGCGGGGAAGGAACGGGAGTACC^GGCCCTGG^ 

TGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCITTCAACGGACCAGGGCCT 

TGAATGTGGATTCAGGCACCATCCAAATGCTGTTGAAC CATAG CTTCACCCTCCACACTCTGCTCACCTATGCCACTCGAGATGAC 

CTCATCTACACCCGCATCAGGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCT^ 

CTCTGGACCCTGAGAGCTGAATGAGGGCATCATAGGCCAGACAGGCCCAAGGATGGA 

4 5 CACACCAGCCCCAGGACCTGGGGCGACTGGAGGAAGCCAGGCGAGTGGGGCC 

G CACCCAAG CACT ACCAGACAAAG CG T ATT AAACAGAACACTTTTGAAAAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTaSGCCATCATTGAGCT 

5 0 TGATGTGGCCGAGCAGCATAATGTCTGCTTCCACTACACTTTTGCCCTCAACC^ 

TGTCTGTGCTGCTGCCX3CTGGTACAGCTTGAGGGCTCTGTGGCGCCCGATCT 
TTCTTCAGCTCGGGTTTCXAGGATGCTGGGCACCGGGAGCAGGC 
TCACTCAGGCATCAATCCAGCTGTGCTCCTGATTGCTGCCGG^ 
AGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCTGra 

C^A^OTTCCTGCTCrACCAGCACTTCAGGCCCACGCCAG 
TCTTtSCTACAGTCCTGCCAACCATTCAAGACAGCCTGTGCCCAGGGCGACCAGTG 
CTGCTGCCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCT 
TCCCTCCAGCTGGACCTTCCCAGTCX3CCTCCATATGCGGAGTCAGCGCCT 
60 TCCCCCCGGCTCAGGACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCT^ 

AACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAGATGTTGGAGTTTGATTATGAGTACACGG 
GCGGCTGGTGCTGGGCAAGGGCACGTATGGGGTGGTGTACGCGGGCCGCGATCGCC^ 
TCCCGGAGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCTTCACAGACGCCTC 
TATCTGGGCTCAGCTAGCCIAGGGCGGCTACCTTAAGATCT 

6 5 GGTGTGG GG ACC CCTGAAGG ACAACG AG AG CACCAT CAGTTT CT ACACC CG C CAGAT CCTGCAGGGACTTGGCT ACTTG CACGACA 

AC CACAT CGTG CACAGGG ACAT AAAAGGGG ACAAT GTGCTGATCAACAC CTTCAGTGGGCTG CT 

T C CAAG C GGCTGGCAG G CAT CACACCTTGCACTG AG ACCTTCACAGG AACTCTG CAGT AT ATGG CCCCAGAAATCATTG ACCAGGG 

CCCACGCGGGTATGGGAAAGCAGCTCACATCTGGTCACTGG^ 

AGCTCGGGAGCCCACAGGCTGCCTATGTTTCAGGTGGGTATGTACAAGGTCCATCC^ 

7 0 cAAGCCmCTCCTCCGAACTTTTGAGCO^CCCCCGCCTCC 

TGGGAAAAGGAGCCGCAGCCCCAGCTCCCCACGACATGCTCCACGGCCCTCAGATGCCCCrTCT 

ACTCAACCACCCAGTCTCAGACATTCCCGTGCCCTC!AGGCACCCTCTCAGCA 

GGGGGCACCAGCCAGCTCCGGGTGCCCGAGGAGCCTGCGGCC^ 

C^ACCAGGAGAGCAAGCGTCGGGCCATGCTGGCCGCAGTATTX3GAGCAGGAGCTGCCAGC^ 
7 5 AGAAGCAAGAGCAGGGGGCCCGTCTXXXJCAGAAACCATGTGG 
CGCCGGCAGCTCGCCCAGGAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAG^
GTTTGCCTTCCCGGATGCGGTGAAGCAGATCCTCCGCAAGCGCC^ 
GCCGTGCTGTGCGGGCAGCCCTGGGTGTGCTAGGACCGGAGGTC 
GAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTT^ 

GAGCCTCTTGAGGG GAGAG ACTGATCGG CTG CG CGAAAT CCTGG CGGGG AAGG AACGGG AGT AC CAGG CC CTGGTGCAG CGGG CTC 

TACAGCXX3CTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCCA 

TGGCTACAGGAACTGAATGTGGATTCAGGCACCATCCAAATGCTGTTGT^ 

CACTCGAGATGACCTCATCTACACCCGCATCAGGGGAGGGAT 

CCACACCAGTCACCTCTGGACCCTGA 
Table 5

MOUSE NOMENCLATURE 
ICSGNM Fosb 
Celera mCG4848 

HUMAN NOMENCLATURE 
HGNC FOSB 
Celera hCG20725 



MOUSE SEQUENCE - GENOMIC

CTCACTCCGCCAGTCTGAAGCATCCGGGCTCHCTCTGTG^ 
TCCTCCCAAGGCCCCCCATGCTGCTGCTTCATCTGACCAGGGCT 

GCATGAGTGGAGTGAGTTCTGGGGACAGTGCCCAGTTGTGCGATAGAC7U3CAGATTCCG 
GTTGTATTTGGTTTGTGTTTGTGAGTATGTCTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAG 
1 5 GACAGACAGAGACAGAGACAGAGACAAAGAGAGTTGTGTGAGGACCACTGTGC^ 

GTGTGTTG TTGT ATTTGGTTTGTGTTTGTGAGTGTAT CTGTGTG TGTG AG AGAGAG ACAGAGAG ACAGAGAG AGAG AGAG AGAG AG 

AGAGACAGACAGACAGACAGACAGACAGAGAGACAGACAGAGAGAGACAG^ 

CTCCCAGGTGATCTGG<1AAGCTTGTGAATGTGAAAGTGCCTGTC 

TAACTATGTGTGCTCAGACAATCCACTCAAAGATGCTCTGGGGCTGCAGCTGATAGAGCA^ 
2 0 GGTGCAAACACC CCAGAACT ACAATCCCAGAACTCTCT CTG CG CACTG AACT 

GACGGAGTCTAACATTGTCCGGATTGCTTCAAACTTTATA 

CTAGGATTGCAGGCATGTACCCTCATGCCCAGTTGAGACAGTATCAGTGATTGAACC^ 
GTACCATGC^GCTACAAGTCTAGTTCCTTTTCTTC^ 

TACTCCCCATCCTCCTGCCTCAGCCTCCCAAGTGCTGGGATGATAGGCTTATGGCACGGCCATGATAGCACCT 

2 5 TTGCCTGTCCTTTGCAGTGTACTATGCCTTGTGAGGGACAGGCTCTGAG 

TGGGAGTTCTGTCCCCATTTCITTGCCAGGCTGGAAGACACT 

GACTCCCTCTGCTCTAATGGGAGGGTCTCAGTGCCCCGTATCTGCTGTTTCTCAAATG 

GACACAAAACATATCAAACACCAAGATAGC!AAAGGCTAGAATGAGACCCTGTCT 

TTGCACCCACTCACAAGACTGTTGGCCATAAACAAGTTTGTATC 

3 0 GCACACCTGGATTCTCATACTTGGAGAGAAGAGCCAGGAAGATTAGGGACGGTTCCAGTCA^ 

AAAAACAAAACAACACAACACACACACACZACACACAC^ 

AT(TTGAATGGTAAAGTGTAGGGATTAAGAGTAGAGACTAGAGTCCCTCCATGGGACCGAGATGTAGAAA^ 
TGCTAAAGCAGGCCAGCCTAAGGGCAGACTCCTGAAAGGACTTGCTCCAAATGATTC 

ATTGTGGGAAAAACTCTCGGGGAATG ACAAAAGCCGGTTTG CAGGTTGGTG AG ATTAGAAAGTAATGG CCACAAT ATAGGG CACAA 

3 5 TATACAGGCTGGTCTTGAACTTGTAATCCTCCTGCCTCAACCTCCCAACT 

TGTCTTGTTTCTTTCTGTGCCCCAGGGCTTCAGTGAATAGGGATGGCTGCCGATTA^ 
TCAGGGCTTTAATCCCAGCCTTCAGGAGGCAGAGGCAC^ 

AGGACAGTCAGAGCTACACAGAGAAACCTTGGCTCAAAAAACAACAAAATATCT 
AGATAGACAGATGGACATGTACGATCCACAGACATGTACGATCTTTGGGATGG 

4 0 CAAGACCTGCTTCTGGGAACCCTTTCTTTCCTTTGTATCCT 

GAAACCCTAGCCAC^TACCTGACTTCTGACATCACACAACCAGGGCAACCCTGGTG^ 

AAATCAGT CACCTTGGGCATGGG CACGTGG C CGCT AG TGAC CCC CTGAG ACACC CC CGAGTGG CTGTCTGGCTGTGTG GG CATGCA 
AACCAC^TCAAGGTCAGTAAAGGCTCTCCAGCCTGAGTCAGGGACAGG^ 

4 5 GGACAGGAAGAGAAAGTTGAGGGAGAAAAGGCAGAGGGTAGGCCCCAGTCAGGGAGAGG 

TGTGG CTTGCGG CGAGT AG CCTGGTGACCTGTGG CTGGG AT GTGG AATG CG CACAG AATGGCG AAGTTG CA 

AGCGAGGCTCAGAGGACAATGGAGTCAGCAGACAATGGGCTCTTACACr^ 

GAGCCCTACCCTCCAGCCACCCC^AGCTACAGCTTGCCACACACACACAAG^ 

TAAGTGACATCCGTAATTCT CATTGTGAAGATTG CAACT CT CGGT ATTTT GATGAT CT C ATTT GAT ATGAAT AACTTAGACATTTG 
50 CAGTGCCAGGCACTGTTGTAGACTATTTAGCTGTGCTAGGGATTC 

GAGCTTG CTCCCAGACCCACGTGCCCATGGTTTGTCCCCAGACCC CTGTTTT AT ATGTAATTCTG AGACAAG^ 
CCCAGAGACCCTTGAGCTCACTCTATTACC CAGCCTTGAACTCACTG ATCCTTCTG CAGGGATTGTAGGCATGACTAGCTAGACAC 
AGCTAAGTATCTGTTAGAGCGTGTATGTGTACCTTGGG^CAGACATCTGTCCAACT 
CAAATGGCTGAGGTGCCTATTTAACATATCAAGAGGACCTG^ 

5 5 TGGCTG^CATACCCCACCCGCTATGCTGAACTTCTCCAGCCCAGGGGCTGGGCT 

CTCCCCTCTCTTTGTACCTTAAGGCTGC^CCTGGCTCCCATGGCT 

TGTTCCCTTCCTTTCTCTTTCTTAATTTCCTTTTCTTTTCATT 
TAAAAATATACACAGCTAGCCAGGCATGGTGATCCAC^ 

6 0 TTCGAGACCAGCCTAGTCTACAGATTGAGTTCCAGGCCTGTCn'GGCCTATACAGTGAGA 

TACCCCCTrOUVGTCAGAAGACCCTGAGACACAAGCCCATCCATCGCCGG^ 

AGTCAGGTCTGAC ACTCACACACAAGGTCCCTGCCACCAGTTGAT CCAGCAG CT ACATTCGAAGGATCAGAGAGGTCCAG ATAACT 
GTGCTGCCCTGTCAGACCTTATGAATGAAGACCTGAATGGAGGCT 

AGGAGG ATTGG AGGGAGTTAGCTGGAGGCCAG CCTATGCTATAGAGTAAGACTC CTTCAAACAAGAGAAATGAG AACTGAA 

6 5 TGCTCAGTGGGTAGAGGCGCTTGCCTG CAAGTCT AAGGAACTGAGTT CAATTCCTGACATTTGCATGATGGAAG AAAAGAAACTCC 

TTCAAGGTCACACAC^CACACACACAC^CACTCAAAGAAGT 

TGGTGATACAAACCTGTAATTCCAGCACTGGGGGCrGGAGCCAAGAC^ 
AACATAGCCTGAGCTACCTGACACTGGTTGCTAGTGTGAGCTAT 

7 0 ATTCATGACGACAGCACCTACAGATTGCCTCTCCTGTGCT 

GACGACATGCTTGTTGCAAGGCGCTGGCTCIATCTGAGACCAAAGCGACT 
CAGTGGGAAGGGCATACAGAAACCTCAGGAACTGAGGCACAGCAG 
TACATCATACCATCTTGTCCCTCTGAATCTCCATGGGCACCCCCTCT^ 
AGCCAATTCAACCCTGGCCTAGGGGGTGGAGGCAGGGGGTCTATCAGC^ 
GGGATGGCTCTCTTGTCAAACCATCAGAACACCCAC

GAGACTTGTGGGAACTGAGGAATGTAGTGGCAAAAGTTTGGCCAAG 

TGACATTTCAGGAAAGAAAAGCAC&GAATCAAGCATGGTAG^ 

NNNNNIWNNNNNNNNNNNNNNICJNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNKNNNNNNNNOTWNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^^ 






NNNNNNNNNNNNNNNNNNNlWNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNTOJNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNIWNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 



NNNNNNNNNNNNNNNNNNNNNNN^ 



NNNNNNNNNNNNNNNKNNNNNNNNNNNNNNN^ 

TATGGCTAAGAGTCTCCCCACTCCTAAGATGGAAAGATGCCAAGCTGCAAAGGAGATCCTG^ 
TCAGAACCTAGCCAAGCTGCCTGCAAGAGGTTTAGACCAACTAAGACATCTGGAGA 
AGGCTCTGCAGGCTGCTCCAGGCTCCCAGCTTTGTAAGCTGTCACT 
GACTTCTCCCGTAAGTCACCCCTCACCCATGCACACTCCTGTAAGTAACTCTAATAG 

TTGGCCTGGGGTCTCCCAGGCTTTTTGC^GTTGCTGGTGGCATCCTTCTTAG 

G CAG CAG C CGTTGGCAG AGGGCTCC CCT AGCATG CAGGAGG C C C CTGG ATTACAT C CC CAGT AAT AAAC CAGGCCTGGCGACAAT A 
TAGAGGTGAGAGCAAGATGTTCATTGTCATCCITGGGTACATAG 

AAGCAGGACATAAGATCAGATATGCCTGGCATGGTGGAAGGTGCCTTTAACTCCAACACT 
CAGAATGAATTCCAGGCCAAAGGTGCTAGATAATGAG 

GAGGAGGGGATGTTCACAATAGGGGGAGGGGATGTTC^CAATACGTGTATATATATTCT 
GACTTCACACATGCrAGACATTCATTCCACTTTAAGCTACATTCCCG 

TTATTATTATTATTATTATTATTATTATTACTACATATGAGTACTGGTTGTTTTCTGTCTTrc 
AATTTTTAGGAGTCATTTTTCTTCTTCCACCGTGAGTCCCAGGGATGGGACTC^ 
CTGTGAACCATCTTGCTGG<^CTGATACCTGCTGATTGCTAAAC^ 
TATTTTTCGAGACAGGGTTTCTCrTGTGTAGCCCTGGCT 

TCTCCCTGCTTCTGCTGGGATTAAAGATGTGTGCTGCTGCCGCAGCTGCCACCAC 

T ATTATTGT ATACACACTTT CTTTTCTGTCT CCTG AGCTAAGTGGG AAGG AG AGGCTTCCAACACCATC AAGCCTGAAG AACTCAC 

CGGGTGTGGGTTCCTACCTGCrrCCAATTCTGCAGGTCCCCCAGATTCAGGGACCCGGAAAAGCCACCT 

AGTACTGAAGTTTTAGATCAGGGAGGGAGCACATTTTCCAAGCTCACCGAG 

GACTGCATCCTTACCCATCCCTAC^CCTGGGGATATCACAGACTCTCCA^ 

TATACCCCTCTCGCCAGAATCTGCACTGGGGACCTTGCTCTTTTTCACCAGCA 

CCAGGGGTATAAGCAGACCTGGGATCTGGAGTTGCACCTTCTCC^ 

GTCAATCTCAGCCTCCGGCACAGCGTGGAACTGCGCAGGCGCGGGCGGGTTCCGC^ 

GGGATTCCCTCTGACGTAATTGCTAGGATACCAAACAAACACTGGGCCG 

GGAACTCCGGGGAGGGCGGGG^CGGGATCCCCTCCCGCGAAGCCCCTCAGAACGCAGCCTTGGGGACCCCC 

CACTATGGGCAGGTGG CAGGTGCACGAGGCCCTGG AGGGCGCCACAAGGAG CTCAGTCGGCGGGAAGGGAG AGTTTGGG AGGTTTG 

TGCATGGAGTTGCGGGTGACGCAAGCGCGGGGGGCGGGTCCCGGAGGCATAAATTCAGCCC^ 

ACTGGAGCGGCTACGCCGGGGACACGCGGAGCCGGGGCTTGCTGGACTTGACTACTGGTGACT 

GCCGTGAAAAATTTATATATATATATTTATATATATTTTTAAACTGGAGAGAAAGTTTTGTGGGTTT 

CTAGATCGTTCTTCCGATTCGTCCGGAGGATAGACTTTCTTTCTGTGGGCTTCTGGATCGCGCCCCTCTG 



CCCGCGTCGCCCCCCGTCAAACCGACAGAGCCTGGACTTTCAGGAGGTACAGCGGCG^ 
GGAACTTGCATCGAAACTTXSGGCAGTTCTCCGAACCGGAGACTAAGCTTCCCCGAGC^ 
TCCGGACTCGCATCrCATTCCACTCGGCCATAGCCTT^ 

GAAATGTTTCAAGCTTTTCCCGGAGACTACGACTCCGGCTCCCGGTGTAGCTCATCACCCTC 

GGTGGACTCCTTCGGCAGTCCACC CACCGCCGCCG CCTCCCAGGT AAGTTCTAG ATCGTAAAG ATTCT ACTTTAGTGGTTGGGGGG 
GGGTG TTT CTTTCAG GCT AAAGTG T AG ATTG AG CATCCT CTCAAAT AGAG ACG CG CCAAAACCCAGG ATCTGGG ATTG CAAT AGTT 
CGGTTTG GACTTTGGTTTTTTGTTGTTT GCAAACTGC TG ATG TTTG CAAAAGG TT ATTTG CG CGGAG CG CGGG AA 

AGGAATGCAGCTGGG CAAACGTTGGCGATGCCCGGTGC AAAGT ATATACCCGGTGGTT AGCAG AAG CTG AGAACTTTTAGCCGAAA 
GCCGGCTCCCTAAGCCGAAGCTAGGCAAGTAG^GAAGAAAAAGAAACAAAAAATTCCAGA 
TCCCTCTTCCTTCAAAACGCGGACTGCAAGTCCGCAGTCACCCrCCACCCAGCAAGAGTTAGGGCCT 

CT CCGCCTCCTGCGCGGAGACGT AACGGGGG ACCCGTGCGTAAAGGCTGACGCGCTGG AATCCT CCGTCTGACGCGGGGCACGCAC 
AGCGCGCAGCGCCCCCTCCGCCCGCCCCGCCCCTGACGTCCCGGGCACGTTCT 

GGGGGCAGCGTGG CTTTGTG ATTGGCTGTCG CGGCGCGAGCTTTAGCCAATCAGCGTTCCCTTCCTATTTGT AGAGCGT AGCTCCC 

TCACCCCATGGGTCTGCGAGGACCTTGTGTGGACCTGGTCCTGTTGTCATAAGCTAGAG^ 

AGGGGGAACrrGAAGGCCTCATCCCTTCTCCAGGCACACATATACGTGCrCCCTG 

AAACACTAGATGAGCTAGCCTACGGAGAGG^GCCAGGTGGTCTCTAAA 

ATGGACGCTTCTTGGGTTCGGGGACGCAGATCCTATGTCACCCCATCCCCTGCAAGAC^ 

CT CTG C CT ATCAG TT CACTG AAACCTGT CAG TCTCACTGGGGAAGAGACAGACACTCGG AAGGG ATG CT CTCAACTCTT AG^ 
TCCCCCAACAGCXSTTGGAACTGGGATCTCCGCCCCTGCGGGAGCC^ 

3TGGGGTGTTTTGGCTGTATGTGTGTGTGANNNNNNN 
NNNNNNNNNNNNNCTGACTTGAAG<X^

TGGAACTCAATCTGT AGACT AGGCTGGTCT CGAACTCACAG AG AT C CACCTG TGTCTG TGGCTCCAG TCC CGGG ATT AAAGG TTTG 
CATCACCAGGTTCTGTCCAGCCTCCGTCTCCCACCCACCCCCCCACACCTAAGAGTGACCAA 
TGGAACCGTGCAACCTTTCCCCGAGGAAGAAGGAGGAGGTAGAAG^ 
5 TAGTGGAAGGGTGGGTGTTGTGGCTTITTGCCTGTGACACAC^ 

TGTGTTATGTGTGTTGGGCGTGTGTGTGTCGGTGGCTTTGTTTGTC 
CGCCGGTCTCGGGGAAATGCCCGGCTCCTTCGTGCCAACGGTCACCGCAATCACAACC^^ 
CCACCCTCATCTCTTCCATGGCCCAGTCCCAGGGGCAGCCACTGGGCTCCC^^ 
ACCAGCTACTCAACCCCAGGCCTGAGTGCCTACAGCACTGGCGGGGCAAGCGGAAGTGG 
1 0 TGGACCTGTGTCTGCCCGTCCAGCCAGAGCCAGGCCTAGAAGACCCCGAGAAGAGACAGTA^ 

GGAGGAGCCTAGCTAGGGATGTGGGCTCAGTTTGTACAGTGCTTGCTGCCATGCATGAAGATCCC^ 

TGGTTATGCAGACCTGTAACCCCAGCTCrCAGAAGGTGGAGGCAGGAGGAGCA 

TCCAGCCTGCACTGCAAGAGATCATTATTTTCA 

TTTTATCACTTAATAGTTGGAGGTTCCTCTGAGGCCTCAAGTCTGAAGGAAei rrACCA 
1 5 TATTTGGGGTTCAGGAGGAAGGAGTlTrcrrAGGGCTGATAGAGGTACCCCCAGA 

CCCAGAAGAAGAAGAAAAGCGAAGGGTTCGCAGAGAGCGGAACAAGCTG^ 

CAGATCGACTTGRGGCGGTAAGGAGGAGTCTGGGGGTGTCTTGAGGCCGTGCTGGGAGC^ 

CACTGTGCCTGTGTCCTAAACGAGGAAACCCCCTCTTAGGGAACAGGGGTCAGTATAGGCT 

CAGACCCATGCCCACTTACTTTCGACTGTTCCCCACTTTCCCT 
2 0 AAGGAGACAAGCTGGAGGAGGTAATTCTCTCACCTl'CTTrCCTTCACTAAATAATAA 

TTTCCTGAGCTGGGGATCTACCTGTGTAGTTCAGCCCTCCTCCCCCAACTTGATAGCCTCAAGTT 

AT CAT C CTGACTGG CT CTGG CTGG AAACTATTTTGTGCT AAG T CAATT C CTT ACCTG CT ACTC CAG CT ATCT ACAGTT CTG C CGAA 

CTTGAGCTGGTGGCGCCCACCAAGCCCACTTCTTTCTCTCTTCTACCT 

GCCCCTTGAAACCAGGGTGTGTCTCTGATTTCCCGTCGGGAGGCTGAAGGA^ 

2 5 AGCATTACCTACrGAOTCAACAAACTGTAGTGITrrrCTTTTTTCCTCTCAAAAAAT 

ATGTTTGAGTGAGTGCTGGTGCACCACAGCACACATACGAGGTCAGAGGGAAATTTTCATAGTT^ 
GGTGCTTGCTGGCAATCTCCTTCACTCAGTGAGCTACAATGCCCCCTTCTGCCCTTTA^ 
CCCTTTCCTCGGCCTCTCAAAGTTGAGATTACAAATGTTCACCATCACACCAGGCTTGG 
CCTGCCTAGCTTCTTCCCAACCATCTCTTAGTCTGATGGGGAAACCGAGGCACGAGTA^ 

3 0 GGACGGTCCCCTGAGTTGGGAGGGAGCTGTCGAGCCCCCTGGATCAGCA 

TACTCTGTGTGGTCGCTGACCAGCAATTCTCCTTTCTCTGTCTC 

CTTGAAGAGGAAAAGGCAGAGCTGGAGTCGGAGATCGCCGAGCTGCAAAAAGAGAAGGAACGCCTGGAGTTTGTCCTGGTC 
CAAACCGGGCTGCAAGATCCCCTACGAAGAGGGGCCGGGGCCAGGCCCGCTGGCCGAGGTGAG^^ 

CTAAGGAAGACGGCTTCGGCTGGCTGCTGCCGCCCCCTCCACCACCGCCCCTGCCCTTCCAGAGCAGCCGAGACGCACCCCCCAAC 

3 5 CTGACGGCTTCTCTCTTTACACACAGTGAAGTTCAAGTCCT 

TGTCCTCACCTGCCCGGAGGTCTCCGCGTT'CGCCGGroCCCAACGCACCAGCGGCAGCGAGCAGCCGTCCGA 
CCTCCCTTCITGCTCTGTAAACTCTTTAGACAAACAAAAC^ 

GGAGGAAGCAGTCCGGGGGTGTGTGTGTGGACCCTTTGACTCTTCTGTCTGACCAC 
GGACCTCCTTTGTGTTTTGTGCTCTGTCTCTGGTTTTCTGTG 
40 GGGCGGGGATGGACACCCCTCCTGCATATCTTTGTCCn'GTTACTTCAACCCAACTTC^ 
GGTGGGGTGCAACGCCCACCTTTGGCGTCTTGCXjrc 

CGAGCTGGCATGCACCTCCAGAGAGACCCAACGAGGAAATGACAGCACCGTCCTGTCCTTCTTTTCCC 
TCAAGGGTGCAGGGTGACCAAGAT AGCTC1G ri l iGCTCCCTCGGGCCTTAGCTG A 
TCCTCCGGGACGAATTGAGCCCCCGACTGAGGGAAGTCGATGCCCCCTTT^^ 

4 5 ATGTGAACCCCTATCTGATTGCTCACTCTTGCCCTCCTGGGAAAACT 

CCCTCCCAACTCAGGCCCGCTCCCACCCCTGTCCAGTATTATGCTACGTCCCTCTCACCCT 
CGTCCTCGTTGGGCCTTACTGGTTTTGGGCAGGAGGGGGCGCT^ 

GTCGGATTGCTGGGTGCGCCGGATGGGATTGACCCCCAGCCCTCCAAAACTTTCCCTGGGCCTCCCCT^ 

CCTCC CCTTGACAGGGAGTT AGACT CG AAAGGATGACCACG AC G CATC C CXXH'GG CCTTCTTGCTCAGGCC CCAG ACTTTTT CT CT 

5 0 TTAAGTCCTraSCCTTCCCCAGCCTAGGACGCCAACTTCTCCCCACCCTGGGAG 

TTCAGAGAAGTTTTCAGGGCTGAGGCTTTGGCTCCCCT 
AGGGGGCTGAGTTCCCACTATCCCACTCCATCCAATTCCTTC^ 
GTGAGAATCCCACGAGTCAGATTTCTATTTTTTAATATTGGGGAGATGGGCCCT 
CATACCCTGTCCTGGGCCCTAGGT^CCAAACCTAATCCCAAACCCCACC^ 
55 CTTATATCTATTATGTATAAATAAATATATTATAT^^ 

TGTTTTCAAGTGTGCTGTGGAGTTCAAAATCGCTTCTGGGGATTTGAGTCAGACTT^ 
TGTTGTCTCGGCTCCTCTGGCTGTTGGAGACAGTCCCGGCCTCTCCCTTTATCCTITTCT 
AACATGTCTCCACTCTGAATGACTCTGATCTCXGGTCT^ 
GTAAGTGTGACTGGGTGGTAGATTTTTTACAATCTATATCX3TTGAGAATTCT 

6 0 ACTG CCGAC CACAATTCATTG ACTC CAT AGCCCTCACCCAGG CTG TA 

CTGACCCCGGGGGTGCTGGGGCAGTCTAGCACTGGGCAGCTCCCCTCCCCCCCTTGGTT 
AAAAAACTGTATTCTTCAGGTCAAAGTGTCrGTTrTCCCrGGACATCT 
GCTAGGGAGGTCTTGCTGGCACTTAGTGGGACGCCTAATGAATCAGAACCTACAACGGGACT 
TTTTCCCATGTTCCCAGGCTCGGCCACCTACTTGAAAAAATAAG 

6 5 AGACTTCATCATCGGAAAAGAATTTATTCACCTTGGGTGTGC^ 

CACTTGTAAATCCTAGCATTTGAGAGGTC^AGGCAAGGG 

AAAGATCTGCTATGGGGAGAGGGCTTGGTACACCAGGGGAGCCAGAAGTTCGTGGTGAGGG 
AGTTAGCCTCAGGGAGATTCTACAGGC^TGATGCAGAGTTCAGACGCTCCrTTGA 

CAG AGAAGGTT AGCG TT AGGTGG T CTCTTCT AGCCCATC CCAGGCTGAG G AGGACX3 CTG AGGGTTT CAAG AAGG AT CGAG AATGG A 

7 0 AAGCAGAGGAGAAAAAGGATCCAAGAGGGA.TGGAGGAGGCAGAACACATTTCT 

GCTGCAGGAGGAGATGCTCACCAGTCGGGTGGTCTAGGGGGTTCTTG^ 
TCTCGCTCTTCrGTCAGCTTGTCTTCGRTTAAGTGTGTGTCTC^ 
CTCGAGTTCATTAAAAACACAATTGCCTGGTGCCCIGCT 
TCACAACTCTATCCCCATTACTGCTCCAACGCAGAGACAG^ 
7 5 GCGTGATTAAATACATAAAAGAGCAGATGACTGGATC 






WO 03/008583 PCT/US01/51291 



ATTT AAATCCAT CTTTGAGATGT ATTTGGTCGGAAATT C CTGGGAGGAAAAAAAGTGT AAG T ATG AAG AG AGAAT AAATG AG AAT A 

GGGGTGGCTTCAGAGAGGTTAACTGCGCGCTGGTCGCTTTTGTACAAGAATGTGAATTC 

CGCCCGAAAGGTGGAATTGAACC^CTCTGTCGCTAAACAGCt'ACAGGTTTGAAGCCTC 

CG AAAGG AGCTATTTTCGG TT AGTT AT AT AAAGGCCAGAT ACT ACT ACTTTTTTCACTT ATGGT CATT ATTTGTGGT AT ACAGTAG 
5 ATAATTAATTTCAATGGTTTCGAACIATTTTTTT^ 

CTACTAACTAAAAAGTAAGTAGCTTCATTTGCAT^ 

AAGCAGAATTTTTTGCAAAGTGAAAAGTCAGTTTTATT^ 

CGGCCCATTCGTAGCAGAATTCGAGATTTTCTC 

TG CTTT AT AGAACTC CAAGTATGT AGGG AATCT ACTGAGTCCCT TTGACAACCAACT CCCCTTG CGTTT AGACG 

1 0 CTAAAAACCATCCCTTTTTATATCTATGTGATTAGCCCAGGGAAACTAAGGCT 
AGCCCAACTCCCTAC^GAAATGAAACCrACAGTTGTGGTTTTAATATGCTT 
CCTTATTAGAGGAGGTGTACITTGITAGAGGAAGTGTGTC^ 
GTGATCCTGGCTGTCTGCAGAACGCGGTCTCCrrGGCTGCCITCGGATGA 

CTGC CTGCTTAATG CTTTG CTT (SIT I' C CATGACGAT AATGAACTGTG CCTCTG AAACTGT AAG TCAACC CC^ 
1 5 TGTTTTCITTTATAAGAGTTGCATATATATATATATATGTCAC^ 

GATTGCCTGAACTTTGAAGCAATCTTCCTGCCT 

TGATGACTTATAAGAAGACAGAAAATCAGAGTTCCTTTACCTA^ 

GCCCTACCCCACCCTCCTGGAACTGCnTTGAGGAATGCTGCAGGCrCT 

GTTGCCTTCCCTCCCCATGTCCATGTGGCCCAAAGCCTCTCAT^ 
2 0 AC C C CGGTTT AAAT ATT AG AAAAGGGTCACTTTCT C CCCTG C CACAGACAAC CAAAC CACCAT ATGCTTG TCACTTACT ACCTGAC 

TATGAAGGTGAATAGATGTCTTCACAACCTTTCTCTGAGCCTC 

AGAGCTGTGGTTCTCCTCATTCCTAGTGCTGGGACCCTTTAATACATT^ 

ATTTTCATTGATACTTCATAACTGTAATTTTTTCTATTGTT 

ACCCTGTGGAAGAGTCATTCCACCCCCAAAGGGGTCCCCACCACAAGTTAAG 
2 5 GAATTACACTTTC^GTCGACTTTTGGGCTGGCCTCT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNMNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNN^^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNhJNNNNNNNNNNNNlJNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNTOINNNNNNNNNNNNNNNTINNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNCTTGGATTAOVAACATGTGCCACCAAGTCTGGCTGCCCATTATC 

GCCTTGTATATACTAGGCAAGTACCCATCCACTGAACTACAGCCCTAGCATGGACCTCACAGGG 

CACT ATCACAGAGT ATAAT AAT AAAACCG T AGCATG CITTTGAG ATTGAGCATTT CTG CTGGG TCACACTTGG AAAAC CTTCTT AT 
6 0 TGTTATCATGAGTATGTGTGCATACATGTGGAGGTGAGAGGATGACGTCATAGAGACAGCTCTGTCT 

ATT ATTTT ATGTTTTGTCTG CGTG T ATG TTTGT ATACCAAAG TGCATGCAGTG CCTGT AG AGACCAAAAGGGGGCATC CCTG AGAC 
TGGAGTTAGAAGTGGCIGTGAACGACTGACTCTGTGGGTGCT 

ACCACCG AATCAACTTCTCCAGCCCTGCATTG CTTTAATCTTTTGGTCG ATTC CC CCAATCCAACTCAT ACCG CCAAGTCTGAGTA 
GCAAGTG CCTTTATCTGGGAG ACTCACTGATCTT ATTTTTTTT^ 

6 5 GTATGATTATTTAGATGGCCTCGAACTCACAGAGATCCACCTCCCT 

ATGCATATATAGGGCTCTGCAAACACAGCCTGA^ 

CCAGCTTGAAGGTAGKSTGCITTCACCAGCCCCAAGAATTTATTTTT^ I AATTAAATGATTAAATGACACATGT 

TCATTGTGGGTAGTGCAGGTATGCGATGGAGCAC^TGTAGAG 

AGACCACTG CTTCCTTTACCAGATCATCATTCTTG CATCCCCCAATCATTTCTTTTTGTTGATGGTTTTGTTTGTTTGT^ 
70 T^GGGGGGTTTTTAAAGTTTTTTTCGAGACAGNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

7 5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATTAAAGGC^ 
GTTTCGAAACGGAGTCTCATGTAGCCTGAGGTGGCCTCAAACATC 

ACCTGTTTTATGCACTGCTAAGAGATTAAACCTGGGCCITCGTGCTAG^ 

TTTTTAGTTTTTCAGACAGGGCrTTGCGACATAGCCCTA^ 

CT CAGATTGTTG ATGCTACACC(^CATATCTCAGACCTGAGGGTTGGCAG CTTGGGCCATGCTCTG ATTTGGGAGTGACAT ATTAA 
5 CAGTAGTTT AAT ACGAATTTGAATC CACCTT ACTTTTTTGG TG CTTTTACGTG A 
GAAATGTTTTACCTCCCTTCCTTGAGCCATGAGCGAGCCCCT 

CCTGCCCCACTCTCCGAGTGCTGAGTGTCTACAGAGCTCCTGACTCTGCCTGGGACGTGATGTC^ 

GGAGGCAGTCACACCACCCATTTTTGGCTTCCATGTCGGCAATGGAGAGCTTGCCATACTC^ 

GGACGGTAAACAAAGCAACCACTCCTACAGATGCAGAGAGAGAGATGG 
10 CTCCGTTG AGGGCITTTCCACT CCCGACCAAATTCCCAAAAT AACGACTCTGAGACTCAATATTTCATGAACAAATG CTTGGGCTA 

GCTTGGGCTTGTTCCCTGACAAGCCCCTATCTCATTTAACCCGTTTATTCT 

CAGTTTCATGAGACCCTCTCATCAGCACTAGGX3CAAACCCTGATTCT 

TCCCTAAGCTAATAGCCCAACAGCTTTTGATTGiACAGGTGATGCTTC 

NNNNNNNNNNNNNNNNinWNNNNNNNNN^ 

15 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNIOWNNN^ 
NNNNNNNNNNN ^^ 

GAATCAAGAATTATCAAGGATAAAGGGAGAGGTCAGGATTTACA 

GGCAATGCX3TGAGCTGAGGTCAAGGGTCACAATGGACGGT CCGTCGGCGGGG^ CATTCTCACTGGAAAGGG 
2 0 TTGGTGCCGTCGCCCCGGTGCACGGCCTGCAGCACTTTGCGGTAAACCCTGAGAG^ 
GTGCGCAGCCACGGACACGATGCTAAAGTGCAGGAGGCAGAGGAGAGAG^ 

TGTCCTTCCAGTACAACAGGTCTG CCACTGTGGAAAAG AAG ACAATGTTGGGACCCGACCTGACCAAG CTCACTCCCT 
GTTTTTTAGAGATAAGGTCTCACTATGAAGCTAAGG CTGACTTCAAACTATG AAACCCTGG CTGGTCTTG AACTCCTGATTATTCT 
AATTCATCCTTTCTTTTTCTTAGACTGAGTTAAGTTTGACCGGCTTCAAACT 

2 5 TTCAAGGAGGTGCTAAGGATCAAATTTAGGGCTTCATGCATGCTAGGTAAAACCCAACT 

MOUSE SEQUENCE - mRNA 

ATAAATTCTTATTTTGACACTCACCAAAATAGTCACCTGGAAAACCCX3CT 
AATCACTGAGAACTAGAGAGAAATACTATCXICAAACTGTAATAG^ 

3 0 TG CACAGTG CAATTGCT ACATGGC AAACT AGTG TAG CAT AG AAGT CAAAG CAAAAACAAACCAAAGAAAGGAG CCACAAG AG T AAA 

ACTGTTCAACAGTTAATAGTTCAAACTAAGCCATTGAATCT 

TG ATTT AACTTTTACAGAACACAAG CCAAGTTTAAAATCAGCAGT AG AGAT ATT AAAATGAAAAGGTTTG CTAAT AG AGTAACATT 

AAATACCCTGAAGGAAAAAAAACCTAAATATCAAAATAACTGATTAAAATTCA 

AATCATGCAGTGTTTTATTTAAGAAAACATAAAACAAAACTATTAA 

3 5 ATGCTAAAATTAGACTTCAGGGGAATTTTGAAGTCTTCAATTTTGAAACC^ 

GTGCACGCAACAGTGACACGCCTTTAGAGAGCATTACTGTGTATC 
TGCTCAGTCATGAACITAATACAGAGAGAGCACGCCTAGGCAGCAAGCACAGCT^ 

ATCAATCCGTGTACTTGGTGTATCTGAAGCG CACGCTG CACCGCGGCACTGCCCGGCGGGTTTCTGGG CGGGG AGCGATCCCCGCG 
TCGCCCCCCGTGAAACCGACAGAGCCTGGACTTTCAGGAGGTACAGCGGCGGTCT 
40 TGCATCGAAACTTCX5GCAGTTCTCCGAACCGGAGACTAAGCTTCCCCGAGC 

CTCGCATCTCATTCCACTCGGCCATAGCCTTGGCTTCCCGGCGACCTCAGCGTGGT 
TTTCAAGCTTTTCCCGGAGACTACGACTCCGGCrrCCCGGTGTAGCrrCATCACCCT 
CTCCTTCGGCAGTCCACCCACCGCCGCCGCCTCCCAC^ 
CCGCAATCACAACCAGCCAGGATCTTO^TGGCTCGTGCAACC 

4 5 GCCTCCCAGCCTCCAGCTGTTGACCCTTATGACATGCCAGGAACCAGCTA 

GGCAAGCGGAAGTGGTGGGCCTTCAACCAGCACAACCACCAGTGGACCTGTGTCT^ 
CCCGAGAAGAGACACTTACCCCAGAAGAAGAAGAAAAGCGAAC^TTCGCAGAGAGCGG 
AACCGTCGGAGGGAGCTGACAGATCGACTTCAGGCGGAAACTG 
CGAGCTGCAAAAAGAGAAGGAACGCCTGGAGTTTGTCCT^ 

5 0 GGCCAGGCCCGCTGGCCGAGGTGAGAGATTTGCCAGGGTCAACAT 

CCACCACCCCCCCTGCCCTTCCAGAGCAGCCGAGACGCACCCCCCAACCT 
CCTCGGCGACCCCTTCCCCGTTGTTAGCCCTTCGTACACTTCCTCGTTTGTCCTCACC^ 
CCCAACGCACC^GCGGCAGCGAGC^GCCGTCCGACCCGCT^ 
ACAAACAAACCCGCAAGGAACAAGGAGGAGGAAGATGAGGAGGAGAGGGGAG^ 

5 5 ACTCTTCTGTCTGACCACCTGCCGCCTCTGCCATCGGACATGACGGAAGGACCT 

TGTGCCCCGGCGAGACCGGAGAGCTGGTGACTTTGGGGACAGGGGGTGGGGCGGGGATGGACACCCCT 

GTTACrTCAACCCAACrTCTGGGGATAGATGGCTGG CTGGGTGCIGTAGX3GTGGGGTGCAACG CCCACCTTTGG CX3TCTTGCGTG AG 

GCTGGAGGGGAAAGGGTGCTGAGTGTGGGGTGCAGGGTGGGTTGAGGTCGAGCTGGCATG 

ATGACAGCACCGTCCTGTCCTTCTTTTCCCCCACCCACCCATCCACCCTC^ 

6 0 CCCTCGGGCCTTAGCTGATTAACTTAACATTTCCAAGAGGTTACAACCTCCTC 

GATGCCCCCTTTGGGAGTCTGCTAACCCCACTTCCCGCTGATTCCAAAATGTGAACCCCTATCrr 
GGGAAAACTGGCTCAC^TTGGATTTTTTTCCrrCGTCTGCTACRGAGCCCCCT 
TTATGCTATGTCCCTCTCACCCTC^CCCCCACCCCAC^CGCCCTTGGCCG 
GCGCTGCGACGCCCATCTTGCTGGAGCGCTTTATACTGTGAATGAGTG^ 

6 5 GCCCTCCAAAACTTTCCCTGGGCCTCCCCTTCTTCCACrTGCrTCCT 

ACGACGCATCCCGGTGGCCTTCTTGCTCAGGCCCCAGACTTTTTCT 
CTCCCCACCCTGGGAGCCCCGCATCCTCTCACAGAGGTCGAGGCAATTTTCAGAGAAG 
ATCCTCGATATTTGAATCCCCAAATATTTTTGGACTAGCATACTT^ 
CTTCAGTCCCAAAGACGAGTTCTGTCCCTTCCCrCCAGCTTTCACCTCGTGA 

7 0 TGGGGAGATGGGCCCTACCGCCCCTCCCCCGTGCTGCATC 

CAAACCCCACCCCCAGCTATTTATCCCTTTCCTGGTTCCCAAAAAGCACTTATATCTAT^ 
AGTGTGCGTGTGTGTGCGTGTGCGTGCGTGCX3TGCGTCCGTC 
TCTGGGGATTTGAGTCAGACTTrCTGGCTGTCCCTTTT^ 
CCCGGCCTCTCCCTTTATCCTT^CT 
7 5 GTIH'GTCTGTTAATTCTGGAITTGTCGGGGACATGCAATTTTACTTCT 
TATATCGTTGAGAATTC 
MOUSE SEQUENCE - CODING 

ATGTTTCAAGCTTTTCCCGC'AGACTACGACTCCGGCTCCCGGTGTAGCT 

TCACCGCAATCACAACCAGCCAGGATCTTCAGTGGCT 
CTGGCCTCCCAGCCTCCAGCTGTTGACCCTTATGACATGCCAGGAACC^ 
CGGGGCAAGCGGAAGTGGTGGGCCTTCAACCAGCACAACCACCAGTGGACCT 
GACCCCGAGAAGAGACACTTACCCCAGAAGAAGAAGAAAAGCC^GGGTTOSCAGAGAGCG 
1 0 AGGAACCGTCGGAGGGAGCTGACAGATCGACTTCAGGCGGAAACT 
CGCCGAGCTGCAAAAAGAGAAGGAACGCCTGGAGTTTGTCCTGCT 
QK3GGCCAGGCCCGCTGGCC<*AGGTGAGAGATTTGCCAGGGTCAACAT 

CCrrCCACCACCCCCCCTGCCCTTCC^GAGCAGCCGAGACGCACCCCCCAACCTGACGGCrTCTCT 
AGTCCTCGGCGACCCCTTCCCCGTTGTTAGCCCTTCGTACACTTCCrrCGTTTGTCCrC^ 
15 QCGCCCAACGCACCAGCGGCAGCGAGCAGCCGTCCGACCCGCTGAACTrc 

HUMAN SEQUENCE - GENOMIC 

CACTAAGCACTCTCACCACCCAATGCCTGGAGTGGTTGTAGT 

TGGGTGAGAGACAGCCCCCACAGTGAGTGGCACCCCTGGCCAG^GGCCTGGGACAAG^ 
20 TCTGTGTTTGTACCTGGGTGTGTCTGTGTCTGCCCTACTCrGTGCATACTCATATATGTC 
AGTGTGTGTGGAAGAGGCTGCATGGCAGTGGAAGCTAGGTCTC 

TTGGATGCCACTCCAAGCAAGTCAGTGGGTCTTTTTGTTTTTTC CCCAGGCTGGAGTG 
CAGTGGCACGATCrTGGCTCACTGCAACCTCTACCTC 

CAGGTGCCCACCAATACACTCAGCTAATTTGTTGTATTTTTAGTAAAGATGGG^ 

2 5 CCTGACCTCATGATTCTCCTGCCTC^CCTCCCAAAGTGCrrGGGATTACAGG 

TTTAGGAGCTGTTTCGTACGGTGTGACTGTGAGTGAACCTrC^ 
CGGGCAO«nx3GTTCAGGCCTGTAGCCTTAGCACTra 
CTGGGCAACATAGTGAGACTTrcTATCTATAAAAAATATC 
CTTGGGGGGCTGAG^CAGGAGGATCCCTTGAGCCTGGGAGGTGGAGGCrG 
30 GGCGACAGAGGGAGACCCTGTCTCAAAAAAAAAGAAAAGAAAAGAAAAAAAGAA^ 

ACTGTGTTTGGGGTGTGAACACCTATGTGATCTGTTTC 

AGTCATCTTGTCGCCCX^CGTGGTCGCTCATGCCT 

GGTCCAGACTGCAGTGAGCCGAGATTGCGCCACTGCACTCCAGCC^ 

3 5 AAAAATAAACAAATAGTCATCAGGTGCCTGCACAGACAAAGGTGAAAGGTGTCTGCCTG^ 

TGGACATCTCAGCCTGTCTACGTGTGTATCTGTCTCTGTCCTCACGGCAAAAGAGAGGTTGG 
GTAATCCCAG^CTTTGGGAGGCTGAGGCGGGCGGATCACCTGAGGTGAGGAGTTCAA 
C^TCTCTTCTAAAAATACAAAAAAATTAACCGGGCGTGGTGGCACACGCTTGTAATCCC^ 
AATTGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCAAGATTGCGCCACTGGACTCCAGC 

4 0 CTCIAAAAAAAAAAAAAAAAAAAAAAAGAGGCTCAGATATC 

AATGGTGAAGATCCATTTACTCAACAAACATTTATTTATTTATTTATTTATTG 
TGCAGTGGCGCAATCTCGGCTCACTGAAGCCTCTGCCTCCTGGGTTTAAGAGAGTCT 
ACAGGTGTGTGCCACTGCGCCCAGCTAATTTTTGTATATTTAGTAGAGATGGGGTTTC!ACCA 
CCTGGCCTGAAGTGATCTGCTCACCTCAGTCTCCCAATATGCTGGGATTACAGGCA 

4 5 AT ATTAT ATT AT ATTG TTTTT ATTTT AAATTTTTTTGT AGTG ACAGGGT CTT ACT ATGTTG ACTG GGCTGG CCTCAAACTGGC CT C 

AAGTGATTCTCCTGCCTCAGCXrrCCCAAAGTGCTCGGATTACAGGCATG^ 

ACCTATTGGGTACCACGTGCTATGCTGGGGACAAGGCAGTGACCGGGATGGTTTTGGCCGAGCT 

GAGGGAGACAAACCTATCCCCAGACAATGATGAGCCCAGATTGGCAGAGTTGGAGGGAGGGA 

AG CCTGG AG AT CAGGGAGGG CTTCCTGG AGG AG AGGTT ATGGGAGTTAAG ACTTGG AGGA^ CCACAGG AAG C G AA 

5 0 GGGAAGAGTGACCCAGGCAGAGGGAAAAGCACCTGCAAAGGCCTGAAGTCCAGGGAGAG 

AGAGGAGAGACGAGACTAGGGCACCCTCGGAGCAGGTCGTTCAGGGCCTCAGGAGC 
GGTACTGGGAGGGTCAAGCTAACAGTGTCCAATCTGGGTTACTCTGCrTCCCrrGAAACTCCAGGGA 
AGGGATTGCTCAGTGTGGAGCGGCTCAGCCACCCAGTCACACTCACTATTCAATGCTG 
CAGCCTGGATTTGCCCTCCCAGAGGTCACAGTCACAACAGTCAA^ 

5 5 TCCTCTGGAGCCTCAGAAACCCACAGACCTTGTTCCrrCCCAGGGGGT 

TTGGAAATGCAAAAGATGGCTGGGCACAGTGGCTCACACCTGTAATCCC^^ 
GTCAGGAGTTCATTTGAGGTTGGGAGTTCAAGACCAGCTTGGCCAACATAG 

AGCCGGGCGTGGT AG CACACGCCTGTAAT CCCAGCTACTTGGGAGGCTGAGGCAGG AG AATCTITTGAACCCG AGAGGTGAAGGCT 
GCAGTGAGCCAAGATGGCACCACTGCACTCCAGCCTGGGTGACAGAGTGAGTCTTA 

6 0 GTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGCGGATCACCT 

AATAGTGAAACCCCGTCTCTACTAAAAATACAAAAAAATTTAGCTC 
GGCAGGAAAATTGCTTGAACCCAGGGGTGCAGAGGTTGCAGTGAG 
GAAACTCCGTCTCAAAAAACAAAATACAAAAATTAGCC^GGCATGGTGGCACACACCT 
a\GGAGAATCACTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCTGAGATCGTGCCA 

6 5 ATT CCGTCTCAAAAT AAAAT AAAAT AAT AAAAAT AAAAAGATGACCACAGCAG ATCT CTGG CT CT CCAGG CAATTCGTGATCACC 

ACGAATTCAGAAGCGCTGCTCTCAGGCITAACCCCCCTGTACCT 

TAGAGGACTGACGTGGAGTGAGGCTTAACTGAGATAATGAGTCTGAAAGTC 

CTCCTCTTCXTCTTTTCTAr rmTr ATT^ 

TGGCAGAGTC^TAGCTCACTCCAACCTCAATCTCTCAGGGCTCAAGTCATGCCTO 

7 0 GGTGTGTGCCACCATGCCCAGCTAATTTTTAAATTTCTTGTAGAGGTTGGGGGTGGGTCT 

ACTCCTGGCCTCrGGCGATCCTCTTACCTAGGCCTCCCAAAGTGCTGGGATTA 
CCTCTGTATTATTAAATTTCAGAGCTGGAAAGAAATTTCGAAGACCGCCATCA 
AGGAAC^GAGGAGCAGAGGTTTGGCCAGAGGAAGGTGGGAAGGACGTGGTGTGGCrrc 
TCTCCTTCCAAGAGGAGCCTCTTATTTTATTTTTTATTATTA 
GCCGTGGCATGATCATGGCTCACTGAAGCCTTGCCCTCCCCAAGCTCAAGTGAT 
TACAGGCACACGCCACCACCCCTGGCTAGTITTTGTATTTTTAGTAGAGATGGGGTCTTGCC^ 
CQ3GGCTCAAGTGATCTGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACA 
5 TATGGAGAAGAACTTGATTTACCCACTCCATGCTCTGCAGGCAGGTGTC 

TGTGTGTGTGTGTGTGTCTCCAGGGCCAATTTCTCCCTTAGGCCCAGGGGCACAGTC 

CACGGAT AT GGGT CAGG AGGGT AT ACATGTATAAAGTT CAGG AAGAC CAG 
GAGCCTTATGATATCTTCCCACCAGTCTCATCAGTGGGC^ 

TGTCCTAGCTCAAGTTCACCAAG CTACTTGGTGATAGAAGCGGAACATAAG CCTAGGCCACCTAGCTTCTGGGTCTGCGATCGTGT 
1 0 GAGGATAAATGCCCAGTGTGTTCGTGAACATGAGCATCCCTGTGTGAATAAATTG 
GCCAGGCGCCX3TGGCTTACTCCTGCAATCTCAGCGCT 
GCCTGCAACAAAGTGAGACCCTGACTCTAAAAACATTTT^ 
CCAGCACTTKKK3AGGCCAAGGTGGGTGGATCACTTGAGGTCA 

TACTAAAAATAGAAAAATTAGCCAGGCGTGGTGGCACATGCCTGT AATCC CACCT ACTGGGCAGGCTGAGGC CGGAAAATTGCTTG 
1 5 AATCTGGGAGGTAGAGGTTGCAGTGAGCCAAGATCATGCCATTGCACTCCAGCCT 
AAAAAAAAAAAATTAGTCAAGCATGGTGGTACGCACCTGTAGTTGTTAGTTAC^ 
CAGGAGTTCGAGGTTACAGTGAACTATGATTGCATCACTGCACT 

TAAT AGTT CAAGG AACAG ACTTTGAAGCCTGACACC CTG CATGGTGTT ATTCCAG CTTTG CT ACTT ACTTGCTGTGTG ACT CTGGG 
TGAATAACTTAACCTCTCTGGGCTTCTGTTrCCCTTCCTGTAAAA 
2 0 AGTTAAT ATAAGCTCTTGGG AAGAATTAG CT CTTGTT ATGGTTGTGTGAAG AG CTTT ACACG AGTC C CTCTG CAGGATGCATGG TT 
GTT GGTCTGTG TGTGTGT AT AT AT AT AT ATGTGTGTGTGT ATGTGTGT AT CT ATGTGT AT ATG TG T AT AT GTGTGTGTTG CTCTCT 
TGATCTCTGAGTGTTGTGTGTCTGTCAATGTGTGAGATCATAT^ 

GGTCTGT AC CTGT CCTTGGCTG T ACACCTGAAC C CTACGT CTTCTTGTGTGTGTAGGG ATGTGTGT CATGTGTGTTG AT AACCAT A 
TGACAGTGTGTGTTTCCACATGCCAGTGTGTGTCTGTGAAGCTGTT^ 

2 5 AGCGGCTX5TGTGGAAGTCTGTGTTGCCTGTGTCGGGTGTGGAGTCTCT 

CATt»GAGGAGGAGATGAGGGGCTTGGATGAGACAGAAGAATGAG 
TAGTGGCTGCTGTGTCATCTCCCAGTCTCTGTCCCTCCCTCCT<K3CTGG 
GTGGGCTTC^GGCTGGACTTTTGTCTGGAAACTTCAACCTCCACCTATGC 
CCCTCCCTACTATTGGGGCAGGGAATCAGGACTCCCrGAAAATTAAGAACCA 

3 0 CCGGAGACAGGCCCTGCCCCTCTCTCAACCTGTGAAATTCAATGTTTTAAAAGTC 

CTGTGCACTGGGAGGGAAGTGCTGGGTTCTCTGAGGTTTGTGAA^ 
GGCTGTAGCATCTGAAATACCCAAGTCAGTGCAGGCATAGTGGCrCATGCCT 
GATCACCTGAGGTTGGGAGTTCGAAACCAGCCTGGCCAACATGGTGAAACCCCCATCTCT 
CACAGTGGCTTACGCCTGTAATCCCAGCATTTAAGAAGGCCAAGGCGGGAGGATCACCT 

3 5 CCAATATGGAGAAACTCTGTCTCTACn'AAAAATACAAAAAAACT 

GAGGCTGAGGCAGGAGAAGCACTTCAACCCX5GGAGGCGGAGGTTC 
CAAGAGCAAAACTCTGTCTCAAAAAACAAAACAGAAa^AACAAAAA 

CTCGGG AGGCTGAGG CAGGAGAATCACTTGAACCCAGGAGATGT AGG CTG Cy^TGAGCCGAGATTGTGGCACTGCACTCCAGC CTG 
GGCAACAGAGTGAGACTCTGTCAAAAAAAAAAAAAAAAAACGTCAGA 

4 0 TCACCTGTACCCCTGTGAGCCAAGATTGC^AACAGTACATGCT 

GGCCAGGCGCTGTtXJCTCACGCCTGTAATCTCGACACTTCGGGAGGCCGAGGCAC^ 
CCrGACCAATATGGTGAAACCCCXSTCTGTACTAAACATATAA 

GAGAGGCTAAGGTGAGGCXGAGGTTGCAGTGAGCCGAGATCGCCACTGC^CTCCAGC 
AAAAAAAAAAAAAAAAAAAAAAAAAGACCrrGACAAGTGAAAGCTACTAATATT 

4 5 TTTTTCAGGGAGTTTCATCCAACCAGGCAGGTCTCAGAAATCAGAA 

TGAGTTTCAATTTCTTGTGATGGGATCTIt5GGCA 

TTCTCTCTTAAC CTTCCAGAG CTGTTTT AAGGATT ACAAAT CTTCT ACTG AAAGG CGTAGCACAGGAG CTGTG CTTGGCAAGT G C C 

AAATACAAC^CATTAATTATTATTATTAGAATTAATAATAA 

GGTGGAAGGAGAGGCTACCAACACCATCAGAAGAGAGGCTTCTCTCT 

5 0 CAAACGCCGGACCTGTAAGGCCAGCAGGGTCCAAGACACACATCC 

AAAACATTTCCTAAGCTCGCACAGCGAATCAGGACAGAAACCAGGAC^ 
GCATCACAGCCCGCTCGGGACTCAGTTTCCCCACCTACGTGACCAC^ 
TTCTCGAATGGGGGCGCTGCACCATCGGTAGAACAGGGTAGGTGGGGGCGCCAGAGGT 
CCGCCCGGGTCAGCGGGGTCCCTGCGGGGCTAGTCTAAGCGCCTATTATTACCAG 
55 GGGCGGGGCGCGGGCGCGCGCGCGAGCGAGCGAGGGATTCCCTCTGACCTCAT^ 
GGCCGAGCTCCTTATATGGCTAATTCCGTCACAGGAACT^ 
CAACCCCOTAGACCCCCAGGGCCCCGAGGGTCATGCAAGTGACCAGATC 
TCGATTTG<3CGGGGCCGGAGATTTCGGGAAGTTTGTCCAGCAAGGGGC^^ 

AT ACAGG CTGGCGGGT CTG TG CTTCATTCAT AAGACTCAGAGCTACGG CCACGG CAGGG ACACG CGG AAC CAAGACTTGG AAACTT 

6 0 GATTGTTGTGGTTCTrCTTGGGGGTTATGAAATTT 

TATTTCTTCA T T TT C TTT TGGAGGACCGACTTA CT 

GTGGAGGAGACCGTAGCTGAAGCTGATTCTGTACAGCGGGACAGCGCTTTCTGCCCCTGGGGGAG 
GTCCTACGGAGCCTGCACTTTCAAGAGGTACAGCGGCATCCTGTGGGGGCCT 
ATTTGTTGGAACGGGACGTTGCTCCITCCCCGAGCTrCCCCG 
6 5 GGCTCACCCCGGACTTGCACCTTACTTCCCCAACCCG^^ 

CTGTGCCCAGGGAAATGTTTCAGGCTTTCCCCGGAGACTACGACTCCGGCT 
TATCTGTCTTCC^TGGACTCCTTCC^GAGTCCACCCACCGCCGCCGC 
AGGTTrTATTTTTTAAGTCAAGGGTGAAAAGAATAAACCCCAACCCCCACAA 
AAGGCTCACAGCXSCACTTTGCAAACTGCAAAGAGTCG^ 

0 AACTTTGA^ 

AAGCTTCCAGCAGTCTCTTTTCCTTTCCTCCTCOT 

GCCTCGAACCCTCAGCCGCGCTGCCTCCGCCTCCTGCX3CGGAGACGTAACGGGGGACCCGTGCGT 
75 CCCCGGGGCC^CGTTGCTAAGGGAGGGGGCAGCCCGGCGTTTCrGAT^ 
TCCTATTTGTAGGGTGCATTTTCCTTCCCCCCTCT 

CTAGAGGCAGAGGGAGGGGATCCCTGTCX3TGACAAGAGCGCCTGTCTGCGACCCAATGGATCTG 

TCCCTGGG CTCTCAGGAGAAGGGGGTGTCTGCTTGTGTGCTGGCG'ri"rCT TGGAG AGATACGGTGGCTGTCACC CTCTTCTCCAGG 
CACACACAGACACATTCCCACTCCCTCTGCTCCTCATGCCC^TTCCrCCGGTC 
5 AGGGCAACCAGCCAGGTGGTCTCCAGGAGCTCCGCCCCCT 

ACCACGCCCCCCGAGAGAGTAGTTAAGCCTTCAGAGCAGTTCCAGGAGTCCATTTACGGGAGGGGGGA 
TTGGGGGCTCAGGTCCCGCACCATTCCCCCTCCGCGACAATCTGAGAGAGCTCCAGTGGTTACT 
CTAAACTGTCACTCGTCAGTCTCACTCTGAGAAGAGACAGTAACTTGAAACGT^ 
CCTTTTGACTGGGACCCCCGCCCCTGCATGGGACCTCGCGCAGAGGGGGGTGTGTATGTGTGTGA^ 
1 0 AAGGCCTCTCCTTCTCCCTCCCCTTGCCTCTGGGGTGGGGGTGGGGTGTTGTGGCT 
TTCTGTCACCCGGCTGTGTCCAGCCTCCTCTCCACCCCCCATACCTAAGAGTC^ 
ACCGTGCAACCTTTCCCCGAGGAAGAAGGAGGAGGTAGAAGCCAGTTGAGCAGAAATCCT 
GTGGAAGGGTGGGTGTTGTGGCTTTTTCCCTGTGACACACACATCCACACTCGCTCACCCT 

GTT ATGTGTGTTGGGTGTGTGTG TG T CGGTG TCJ.TTG'1 I"1GTGTGTCTACGCCTGTGTGTGTATGTGTCACCCCGTAGGAGTGCGC 
1 5 CGGTCTCGGGGAAATGCCCGGTTCCTTCGTGCCCACGGTCACCGCGATCACAACCAGCC^ 

CCCTCATCTCTTCCATGGCCCAGTCCCAGGGGCAGCCACTGGCCTCCCAGCCCCCGGTCGTrc 
AGCTACTCCACACCAGGCATGAGTCGCTACAGCAGTGGCGGAGCGAGTGGCAGT 
GCCTGGGCCTGCCCGCCCAGCCCGAGCCGGGCCTAGGAGACCCCX1AGAGGAGACGGTGAGTA 
TAGGGGGAAGCAAGAGAGGCAGGAAGTTTCTTATGAATCX5AGGGGGGCT 

2 0 TTGCTACACGAGCGAGGACCGGAGGCTTGTTTTTTGGCT 

CTGTATCCCCAAATCTCATGGCCTCTATCTCCCTGACT CAGCTCACCCCAGAGX3AAGAGGAGJUVGCGAAGGGTG CGCCGX3GAACGA 
AATAAACTAGCAGCAGCTAAATGCAGGAACCGG03GAGGGAGCTGACCGACCGACTCCAGGCGCT 

AGAGGGGATGTTGAGGGG AG CT CTCTCCCCATTCTCTGCCCCCTCTCCACCTGTACCCTTATC CTGGGTTGAGAACTAGACGTTCC 
ACACATGGAACTAGGTACTGCTGTGGCCAGACTGGGTAGCCCAGGGCACAAACACAG 
25 CCCCCCCCCATTTCCTGACCCCACGGACTACTCTCCTAGCCTrCATTTCATCCC^GG^ 
CGCCCCCATCATTTCTCCCATCTGGTCTTCAGCAA 

TCTCCCGGG CTCTTTCTACCCTT AACACTCTGGAAAGCCTGTG AAATGAAATTATTCC AC CTCCTGCCCT AGCCACCCACAGCTCT 

CCTGGTGCTGGTGGCATCCCCCAAAACCCACTCCCTTCCTACGTCCTCCCTTGGT 

GAGCTGTTACTCCTTGAGGGAACAAGGGAATTGTCAACTTTCC^ 

3 0 AATAGAAGGGAACACATTAAAAACACATAACAGTGGCTCATGCCTC 

TGAGCCCAGGJU3TTTGAGACCAGCCTGGGAAACATAGGGAGGACTTGTCT 
TGCACACCACTGTGGTCCCAGCTACTATGGAGGCTTTAGTGGGAGGATCG 
ATTGCACTATTGCACTCCAGCCTGGGGGAOVGAGCGAGACTCTGGCrCA 
TCATTCAACAAACCTCCTGCACACCAGAAACTTCCCCATAAACCTGGCCC 

3 5 AGGTTCCTGGAAGGCAAGACCCTTGCCCCAACTCCC 

GCCTTCCAGCAGCAAGTCCAAGGGAGCCATGACCCGAGTTTCCCGGTCACTGACA 

GACCTGGCCTCC CTGGCCTC AGG AGACAGAT CAGTTGGAGGAAGAAAAAG CAGAGCTGG AGTCGGAGATCGCCGAGCTCCAAAAGG 

AGAAGGAACGTCIGGAGTTTGTGCTGGTGGCCCACAAACCGGGCTGCAAGATCCCCT 

GCGGAGGTGAGAGATTTGCCGGGCTCAGCACCGGCTAAGGAAGATGGCTTCAGCTGGCT 

4 0 GCCCTTCCAGACCAGCCAAGACGCACCCCCCAACCTGACGGCTTCTCTCTTTAC^ 

TCCCCGTTGTTAACCCTTCGTACACTTCTTCGTTTGTCCTC^CCTGCCCGGAGGTC^ 

GGC^TGACCAGCCTTCCGATCCCCTGAACTCGCCCTCCCTCCTCGCTCTGTGAACTCTT^ 

GGGGGAGAGAGACTTGGAAGAGGAGGAGGAGGAGGAGAAGGAGGAGAGAGAGGGGAAGAG^ 

CTCCTCCGTCTGACCCTCTG CGGC CACTGCGCCACTG CCATCGGACAGGAGGATTCCTTGTGTTTTGTCCTGCCTCT^ 

4 5 GCCCCGGCGAGGCCGGAGAGCTGGTGACTTTGGGGACAGGGGGTGGGAAGGGGATGGA 

CGTCAACCCAAGCTCrGGGGATGGGTGGGGAGGGGGGCGGGTGACGCCCACCTTCGGGCAGTCCTGTGTC 

GGTGGGAGGTAGGCTCTGGGGTGGGCTGGAGTCCTCTCCAGAGA 

ACACCCACCCTTTTTTTGGGGTGCCTAGGTTGGTTTCCCCTGCACTCCCGACC^ 

G ATCCTCTTTACTCTGGG CAGAAGTGAG CCCCC CC CTTAAAGGGAATTCG ATGCCCCCCTAGAATAATCTCATCCCCCCACCCGAC 

5 0 TTCTTTTG AAATGTG AACGT C CTTCCTTGACTGT CT AGCCACTC CCT C CCAG AAAAACTGG CTCTGATTGGAATTT CTGGCCTC CT 

AAGGCTCCCCACCCCGAAATCAGCCCCCAGCCTTGTTTCTGATGACAGTGTTA 
CTGGCCTTCCTCGTTGGGCCGCTCTGATTTCAGGCAG 
GAGTTGGCCAGATTGTGGGGTGCAGCTGGGTGGGGC^GCACACC^ 
TCTCCCTTCCGTCCATCCCCCTTCTTCCTCCCCTCAACAGTGAG 

5 5 CCTCCATCCACGTGGCCTCTCTCTCTCTCCTCAC^ACCCTCAGCCCTGGC 

AGGACGCCAACreCTCCCACCCCTTGGCCCCTC^CATCCT 
TCAGAGCTGAGGCTTTGGTACCCCCAAACCCCC^TATTTTTGGACT 
GAGTCCGCCCATCCCTGACCATGGTTTTGGCTCTCCCACCCCGCCGTTCCCTGC^ 
AATTTATATTTTTTAATATCGGGGGGTGGACCACGCCGCCCrCCATCCGTGCTGCATGAAAA^ 

6 0 CGTCTCCCATCCTGATCCCAGACCCATTCCTTAGCTATTTATCCCTTTCCTGGTTTCCGAA 

GTAAATATATTATATATGGATGTGTGTGTGTGCGTGCGCGTGAGTGTGTGAGCX3CTTCTGCAGCCTCGGCCT 
CTCAAAGCGAGCCGTTGAATTGGAAACTGCTTCTAGAAACTCTG^CTCAGCCTGTCTCGGGCT 
CCCTCTGATTGTTCCCGATGGTCTCrCTCCCTCTGTCTTTTCTCCTCCGCCrGTGTC 
TGACTGTCCCTGCCAATGCTCCAGCTCTCGTCTGACTCTC^XJTTCG 

6 5 GGATCGTAGATTTTTACAATCTGTATCTTTGACAATTCTGGGTGCGAG 

ATTCAATGAATCCCCGACCCCCCTACCCCATGCTGTACTTCTGGTTCTCTTTTTGTATT^ 
AGATTGGCAATGGGCCGTCCCCTCTCCCCTTGGTTCTGCACTGTTGCCAATAAAA^ 
TGTTTATTTCCTCCTAACACCCATTGCCTGGCTTCCT^ 
G^GATCATACACCCATGCACACACACACACAGACACACACCTATC 

7 0 GGAAGGCAGACAGTa^GGAGGACTTCCrGTACTOSC^GGCAGTGCCT 

GCTTGCGCCTGTAATCCCAGCATTTTC^3GAGGCCAAGATGGGCG 

GGTGAAACCCTCTCTCTACAAAAAATACAAAAATTAGCCTC^ 

GGCAGGAGAATCACTTTAACTTAGGAGXJCAGAGGTTC 

AACTGTC^AAAAAAAAAAAAAAAGTAAGGGGTCAGAAGGTTAGCCTGCAGGTGTG 
7 5 GTATGCTCTATGGGTTAAAAACAGACGCTAAGGAGATAAACTATACACAGAAGAAGCT 
TGCAATCCCAGCTCTTT CGGAGGCCGAGGTAGGAGG CTAGGAGTTCGAG AACAG CCTGGGGCAACATAGTGAGACACCCCCCACCC 

CAACCCATCTCATTATGTTGAGAAAAAAAAAAAAAAGAGGATCTTGAGAAACTTGCACA^ 

GGAATGAGGCGTCATCTTGGATTTTTTACAAACTACAACTACAAAAAACAATTAT^ 

CCAT ATGAC CACTATTGGGGCCAACC CAAGACTGACTGAAG CAAAT AT ATGAC CATTTTTTTTTTTTACACAT AG AGTGGAG C CAT 
5 CCCCATTGCAAAATCCTAGCCGTCTCTTATTTGCATAACAGCAAGTCCT 
GCTTTGGTAAATCAATAAGCAAG-GGAATTTGTC^^ 

AACTGTGCTGGGAGTTGGGGACACTATAGGGACTCAGCCrrCAGGTCTTGCCCT 
CGTTGCCAGTOVGGGACAACCTAAAGTGGATTG 

CrGGGAGTCGGGAAAAACTTCCTGGCGGAGGGGACAGCn^AGCTGAGTCCT 
1 0 AGGCCAAGGCCAAGAAAATAGCACAGCCATAGATCTGTTGGTGGGGAGAAGGCTTC 
CGATGGCTCATCCCTGTAATCCCAGCACTTTGGGAGGCCG^ 

AACGTGGCGAAACCCC G T CT CT ACT AAAAAAT AT AT AT AT AT ATATACAAAAAATTAG CTAGG CATGGTGG TGGG CACCTG TG ATC 
CCAGCTACTCX3GGAGGCTGAGGCAGGAGAATCGCTTCAACCTGG^ 

CAG C CTGGG CAACACAG CG AG ACTCTGTCTCAAAAAAAAAAAAAAAAGAAAGAAAAAAAAGAGAAAACT CAGAG ATTCGTGGAG AC 
1 5 TGGAACC^CGGGTGTGGAGAGAGGGGTTAGTAGAGACCAGATTCTGCAGGTACTATAATC 
CTTTTCTTCAGGGCACTGGGGAGCTATTGCAGGTTT^ 

GGCCCCAGACTGAGCGTGACTG<5AAGGGAGAGGAGTGAGACTGAGCAGGAGACCAG 
GGGGTGAGGAGGAGTCCAGAACAGGGCCAGGGACATGGGACAGAGAGGAGG 
TTGGGC!TTG<3TGAGCAGCCGGCrTGG 
2 0 GTGGTGGCACGCACCTATGGTCCCAGCTACrrCAGGAGGCTGAGGTGGGAGGAT^ 

CTATGATCGTACCGCTGCACTCCACTTTCGGTGACAGAGTAAGACCCTGTCTCAAAAAAA 
ACACTGCCTATGAGTTAACCCTGCTCTACAAC^AGCAGTTTTTAAAGTAAAATTAAAA 
AAGAAAGAGAGAGAAAAAAGAAAAAAGGCTGGGTTGCCTTGGACAACAGACTT(^TCT 
AATGGGGGCTCTTAAGAGGAGTTGCAAGGCTTGTGCATGCCAGCAGTA^ 

2 5 TCTTTATTG^XXJTCAGCCTGAACCCTCGA 

CCGACCAGCTCAGCTCTCCTTATGAGAGGGCCTC&CTTTTCTATT^ 

GTGTAATTACATGATGGCTTAATGTTTGTCTCCCCCACTGGGCTGACTGCACCAGGAGACC^ 

GCGTCCTC^CAAATGCTTGTTTCATGAGTGAATGACAGAT 

TTAAATAAATGAGTCAGCAAACX5ACTGAATGAGTAAGCAATTGAAGGG 

3 0 GAACTGGGTAGATGACTTTTTCTCCGTCTTTTGGAGTGGGTTTTTATACT 

GAGTGACAGAGAATGAGTTTGCGATGCGGAAGGCCGCAGCGACCTCTCCAGGTGCAGGCT 
AAAATAAGTCCGGTTCGATAAGTAAGATTCAAGGCGCTTAGTTACTACCGCCCGAAAGGT^ 
CAAAAGCTTACCTTTTAGAGTTATGGTCGCGTCCTTTGGAAAGGTGTGAGATGGTTGACT^ 



ATAGCGCCACX3CATTCTGGGAAGCAGCGTTTGAGCACCTGGCAAG(TTC 

TCAGCTTTTTAAGAGG AG AG ATTCATGATT CGGCATTCAGAAG CG CCTTCTCGG AAAATTCCG AATTCTCTTTCTCCGGAGG AGTC 
GTGAGTGGGTTCCTGTTAGAGTCTCATTCACCCTGGCCCCAAAGGCGCTGAGC!AGGCGGCTATAGGGCT 
TCTTTTATGGAATTG CGTAGGCCGATGGAG CTCCGCGTTCATTCAAACGTTT ATTG AGCGTCT 
4 0 GTGCrGGTGACGACAGACATCCTGTCCCGATTGAGTTTAAACACTAGACATAACCAA 
AACTGAGACAGACTCAGAAACGGATAGTACCATATTCCGGTTAAAACAGCCCAT 

TTTATT A CAAAAAAGGTG ATTCTTTTTTCTTAAATTTTT AC^ T ACAG G TGCCGATTT CT ACATGCAT ATG TTGAT 

TGCATCGTGGTGAAATCTGGGCTITrAGGTGTACTCATCA 

ATTCTCCTGCATGAAACTCTCCCATGACTTCCATTTCA 

4 5 GCCCCTTTGGTCCCTCTGACCCTGTCTCACATACrCCTCTTCCCTTGGATAT^ 

ACATGGATCATGCTACCACGACAGGGACTTTGTGCTTGCCGAT^ 

G C CTCCCTTCACTCTGT CATCTG ATCAAATG TCAAT ACCAAAAACT CACTG AGTG AC CAGCACTCAGTGGGGCCTTCC CTG ATCC C 

AGTTTAAAATAGCGAGTGGCGGCCGGGCGCGGTAGCTC^CGCCTGTAACCCCA 

GATCAGGAGATG^AGACCATCCTGGCTAACACGGTGAAACCTTGTCTCTACTAAAAATAC 

5 0 GCGCCTGTAGTCCCAGCTACTCGGGAGCCTGAGGCAGGAGAATGGAGTG 

TCCACn"GCACTCCAGCCTGGGG<5ACAGAGCGAGGCTCCGTCTCAATAAA 
GGCTCAGGGCCTGTAATCCCAGAACTTTGGGAGGCCGAGGCGGGTGG^ 

ATGGTGAAACCCCGTCTCCJVCGAATGATACAAAAATTAGTCAGGCATAGTGGCTCACGCCTGTA^ 
AGGCAGC^GAACCGCTTGAACCGGGGAGGCAGAGGTTGCAGTGAGCTGAGATGGTGCCATTGTACTCCAGC 

5 5 AAAACTCCATCTCCAAATAAAAGTAATAATAATAATAAATAAAATAAAAATTTTTAAAAA 

CCCTGCn" AGACATTTCCCCACCACTCTG ATCACCAC CTGCCATTTTCTGTGTCTATTTG CTTACCGTCTGTCTGCTTCTGTAG AAT 
ATCAGCACCATGAAACTATGCACTTTATTTTTGATCATTGCTC 

(2AGTAAATGTTTGTGGAATGAATAAAAGAATATCATCACTGTCTTTTTTTCATTCTTCT 
AGGCTGGAATGCAGTGGQ3CAAAAACGGCTCACTGCAGCCTCGACCTCCCAAGCT 

6 0 ATGACAGGCATATACCATCACGCCCAGCTAATTTTAATTTTTTTTGTAGAAACAG 

AACTCCTGGCCTCAAGTGATGCTTCTGCCTTGGGCTCCTAAAGTGCTGGAATTTCACGCGTGAGCCACA^ 

CCTTCTTTCAGCCTCAGTTTTCTCATTTGTATAACCAGACTAGTACAACT 

ACTGGCTGAGGCAACTGGGAGGAGCTCAGTAAAGGCTGTTTCTGCTGG^ 

GGC CGAGGTGGGTGG AT CACAGG AG ATTAG AAGTTCCAG ACCATCTGGCAAGCATGGTAAAACCCCAT CTCTACTAACAATTCAAA 

6 5 AAGTAGCCAGG CATGGTGG CTCACACCTGTGATCCCAGCTACTCGGG 

GGTTG C AGTGAG CC AAG ATTGTG CCACTG CACT C CATCCTGGG CG ACAG AGCAAG ACTCTG TCAAAAAAAAAAAAAAAGTTTTTTT 
TTTTTGG CTGG AATT ACAGG CG CCTGCC CC CACACCTGGCT AATTTTTGTTTITTGTTT^ 
TTCACCAGACTGGTCTTGAACTCCTGACCTCAGGTAATCCAACT 
TACACCTGGCCZAATAAAGGCCGTTTCAGTCrTC!AATCTGTTTTGAGCT 

7 0 AGACCCTCTTCCACCACTTTTTGTGATAGATCAATAAACATTTTGTC^ 

GGACAGGCGCTGTAATCCCAGCACTTTGGGAGGCCGAGATGAGCGGATAGCTTGAGCCCAGGA 
TAGTGAGACTCTGTCTCAAAAAAAAAAAAAAGAAAGAAAGGAAAAGAAAAAAAGAA 
CAGATTCAAGATATATCCCCTAAACCCTCTTTGTTTTAATTAGATAGTT 
nj^GGACTCCTTCGAACACC^TTTATCTOT 
7 5 TTTTTTGGTTTTTTGTTTTGAGACGCAGTCTrcCT 
TGCCTCC CAGGTTCACG C CATTCTC CTG C CTCAG CCT C CCT AGT AG CTGGAACT ACAGG CG C C CG C CACCACTC CC GGCT AATTTT 
TTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCCAGGATGG 

C CCAAAGTGCTGGGATTACAGG CGT AAGCCAC CG AG CCCG GCTTTTTTTTG AG ATGG AGTTTCC CTCTTGTTGCTCAGG CTGG AGT 
GCAGTGGTGTGATCTCGGCTCACTGCAACTTCCX5CCTCCCG<K3 
5 ACAGGCATGCACCACCACGCCOSGCCAATTTTGTATTTTTA 

CCGACCTCAGGTGATCCACCTGCCTTGGCCTCCTAAAGTGCTGAGATTACAGGCGTGA^ 
TAAACGTCAATATTTTTGTGTTTATTTTTACTATTC 

CTT CTT AAAAT ACATTTGTT ATT ATATTAT/T AAAT ATT AAAC CAATTTT CAAGAAAAAT ATT AG AT AAATAGCAGAACAGG TTGT A 

GAAAGATGAAGGTCATGTGGGGAATGACTGAGGTTTGGGAAACAATATTATAACT 
1 0 AATGCCAAGGGCTTTACATGTAGGTATAGTTATC^ 

CTGTCGCCCTGTCTGGAGTGCAGTGACGCAATCTCAGCTCACTAC^CCT 

CTCCCGTGTAGCTGGTCTTACAGGTGCCCACCACCACATCCGGTATTTT^ 

TCTTGAACTCCTGACCTCAGGTGATCCACCTTCCTTGGCCTCCCAAAGGCT 

TTTTTTCTTACTTAATTTTTTTGTAGAGATGAGGTCTATG 
1 5 CTTGGCCTTCCAAAGTGCTGGGATTGCAGACGTGAGCCATCGTGCCCGGCCTAATAA 

AGAGATGAAGTCATATGCTCAAGGTCATACAGCGAAGGGGTAGTAGAGCC^tt^ 

AGTGTTCTCCCAGTCTCCCCACX5GGTCCCTTTGCTTCATCCTTTCCGAGTTT 

GTTTTTTGATAGAGTCTCGCCCTGTTGCCCAGGCTGGAGTGCAGCGGCATGATCCTG^ 

AAG TGATTCT CCTGCCTCAGG CT CC CG AGT AG CT GGGATT ACAGGT ATG CGCCGC CACACC CAGCT AATTTTTGT ATTTTT AGTAG 
2 0 AGATGGAGTTTCATCATGTTGGTCAGGCTC^TCTGGAACTCCTGACCTCAGTGATCT 

TTAGAGGCGTGAGCCACTACGCCCAGTATGTAGTTTTTTTGTGTT^ 

CTCAGGCTGAAGTGCAGTGGTGTGATCATAGCTCTCTGTAACCTCX3AACT 

AGTACCTGGGACTACAGGCATGCACCACCACGCCCGGATAAATTTTTTTT^ 

GTGGTCTCGAACTTTTGGCCTCAAGTGATCTTCCCACATTGGCCTCCCX^ 
2 5 CCTCAGATTAGTGTTGAAACTGGAAGTCCAGGGAGGCCCTGTATTTGTTCCCG^ 
GTGTCTGGCCTGGTGACACCGTGGGCAGTGAAAACATCTCTGAGTTO 
GGGTGGAAGGGGC^TCCAGAGGCCGTTGGACACGTGGGTTTGGGCTCAGTTGA 

TTAGTGATGTCTGCCTTGAGCACAGGCATGGG AAGTGG CTGC CAAGGCrT^^CTAACCCTGG AATGGGTGGTCTGTCAG CATCTTC 
CCCTCAATCAl T r CTl ' TCriTTrTl ' Tl ' l ' X' TTTGAGACAGAATCTCX3 

TCACTACAACCTCCATCTCCCAGATTCAAGCGACTCTCATGCCTCAGCCTCCCGAGTAGCTGTGAT^ 
CCCGGCTAAATTTTGTATTTTTAGTAGAGATGAGGTTTCACCATGTTGGCCAGGCT 
CCCTCCTCX^CCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCGCCCGGCCTCCCCT 
TGGCAGAG 

HUMAN SEQUENCE - mRNA 

CATTCATAAGACTCAGAGCTACGGCCACC^CAGGGACACGCGGAACCAAG^ 

TTATGAAATTTCATTAATCTTTTTTTTTTCCGGGGAGAA 

GACCGACITACTTTTTTTGGTCriTCTTTATTACT^ 

TGATTCTGTACAGCGGGACAGCGCTTTCTGCCCCTG 

AGAGGTACAGC^CATCCTGTGGGGGCCTGGGCACCGCAGGAAGACTGC^ 

CITCCCCGAGCTTCCCCGGACAGCGTACnTTGAGGACTCGCTCAGCT 

ACTTCCCCAACCCGGCCATAGCCTTGGCm 

GCTTTCCCCGGAGACTACGACrrCCGGCTCCCGGTGCAGCTCCTCACCCTCTGCCGAGTCTCAATATCT 



TCACAACCAGCCAGGACCTCCAGTGGCTTGTGCAACCCACCCT(^ 

CAGCCCCCGGTCGTCGACCCCTACGACATGCCGGGAACCAGCTACTCCACACCAGGCATC 

TGGCAGTGGTGGGCCTTCCACC^GCGGAACTACCAGTGGGCCTGGGCCTGCCCGCCCAGCCCGAGCCC 

AGGAGACGCTCACCCCAGAGG AAGAGGAGAAGCGAAGGGTGCGCCGGGAACGAAATAAACTAG CAG CAGCT AAATG CAGG AACCGG 

OTGAGGGAGCTGACCGACCGACTCCAGGCC<3AGACAGATCAGTTGGAGGAAGA 

CCAAAAGGAGAAGGAACGTCTGGAGTTTGTGCTGGTGGCCCACAAACCGGGCTGCAAGATCCCCT 

GCCCGCTGGCGGAGGTGAGAGATTTGCCC^CTCAGCACCC^CTAAGGAAGATGGCTTCAGCT 

CCGCCCCTGCCCTTCCAGACCAGCCAAGACGCACCCCCCAACCTGACGGCTTCTCTCTTTACACACA^ 

CGACCCCTTCCCCGTTGTTAACCCTTCGTACACTTCTTCGTTTGT^ 

GCACCAGCGGCAGTGACCAGCCTTCCC^TCCCCTGAACTCGCCCTCCCTCCT^ 

AAACACATGGGGGAGAGAGACTTGGAAGAGGAGGAGGAGGAGGAGAAGGAGGAGAGAGAG^ 

GTTTCTGTGCCCCGGCGAGGCCGGAGAGCTGGTGACTTTGGGGACAGGGGGTGGGAAGGGGATGGACAC 
CTCTCTGACGTCAACCCAAGCTCTGGGGATGGGTGGGGAGGGGGGCC^TGACGCC 
GGGACGGGGGTGGGAGGTAGGCTGTGGGGTGGGCTGGAGTCCTCTCCAGAGAGGCTG^ 
TGTCTCCCACACCCACCCTTTTTTTGGGGTGCCCAGGTTGGTTTCCCCTGCA 

TGGTGTGAG ATCCTCTTrACTCTGGGCAGAAGTGAG CCCCCCCTT AAAGGGAATTCGATGCCCCCCTAG AATAAT CTCATCCCCCC 
ACCCGACTTCTTTTGAAATGTGAACGTCCTTCCTTGACTGTCTAGCCACTCCCT 

GCCTCCTAAGGCTCCCCACCCCGAAATCAGCCCCCAGCCTTGTTTCTGATGACAGTGTTATCCCAAGACCCT^ 
GACCCTCCTGGCCITCCTCGTTGGGCCGCTCrGATTTCAGGCAGCAGGGGCTGCT 

GTGAAATGAGTTGGCCAGATTGTGGGGTGCAGCTGGGTGGGGCAGCACACCTCTGGGGGGATAATGTCCCCACT 



TGACAGTCCTCCATCCACGTGGCCTCTCTCTCTCTCCTCAGGACCCTCAGCCCTGGCCT^ 
CC^CCTAGGACGCCAACTTCTCCCACCCCTTGGCCCCTCACATCCTCT^ 

7 0 GAAGATTTCAGAGCTGAGGCnTTGGTACCCCCAAACCCCCAATATTTTT 

CATGCCCGAGTCCGCCCATCCCTGACCATGGTTTTGGCTCTCCCACCCaSCCGTTCCC 
TGAGGCAAATTTATATTTTTTAATATCGGGGGGTGGACCACGCCGCCCTCCATCCGTGCT 
TGTCGCGCGTCTCCCATCCTGATCCCAGACCCATTCCTTAGCTATTTATCCCTTTCCTGGTTTCCGA 
TGTATAAGTAAATATATTATATATCGATGTC 

7 5 GTTGGCCCTCAAAGCGAGCCGTTGAATTGGAAACTGCTTCTAGAAACTCTGGCTCAGCCT 
TCTCGGCCCCTCTGATTGTTCCCGATGGTCTCTCTCCCTCTGTCTTTTCTCCTCCGCCTGTGTCC^ 
TCCTTTCTGACTGTCCCTGCCAATGCTCCAGCTGTCGTCTCACT 
GACnXSAGGGATCGTAGATTITTACAATCTGTATCTTTGACAATTCT 
AACCACAATTCAATGAATCCCCGACCCCCCTACCCCATGCTGTACTTGTGGTTCTCTl^ 
5 CTGGGACAGATTGGCAATGGGCCG TCC CCTCTCCCCTTGGTTCTG CACTGTTGC CAATAAAAAGCTCTT AAAAACG C 

HUMAN SEQUENCE - CODING 

ATGTTTCAGGCTTTCCCCGGAGACTACGACTCCGGCTCCCGGT^ 
GGACTCCTTCGGCAGTCCACCCACCGCCGCGGCCTCCCAGGAGTGCGCCGGTCT 
1 0 TCIACCGCGATCACAACCAGCCAGGACCTCCAGTGGCTTGTGCAACCCACCCTCATCT 

CGGAGCGAGTGGCAGTGGTGGGCCITCCACCAGCGGAACrACCAGTGGGCCTG 

GACCC CGAG AGGAGACG CTCACCC CAG AGGAAGAGG AGAAG CG AAGGG TG CG CCGGG AACGAAAT AAACT AGCAG CAG CT AAATGC 
AGGAACCGGCGGAGGGAGCTGACCX3ACCGACTCCAGGCGGAGACAGATCAGTTGGAG 
1 5 CGCCGAGCTCCAAAAGGAGAAGGAAC^TCTGGAGTTTGTGCTGGTGGCCCACAAA 

CCGGGCCGGGCCCGCTGGCGGAGGTGAGAGATTTGCCGGGCTCAGCACCGGCTAAGGAAGATGG 
CCGCCACCACCGCCCCTGCCCTTCCAGACCAGCCAAGACGCACCCCCO^CCTGACGGCTTCTCT 
AGTCCTCGGCGACCCCTTCCCCGTTGTTAACCCTTCGTACACrrCTTCGTrTGTCCT 
GCGCCCAACGCACCAGCGGCAGTGACCAGCCTTCCGATCCCCTGAACTCGCCCTCCCTCCTCGCTCGGTGA 













Table 6 

MOUSE NOMENCLATURE 
ICSGNM Cmkbr7 
Celera mCG15362 


HUMAN NOMENCLATURE 
HGNC CCR7 
Celera hCG1643840 

MOUSE SEQUENCE - GENOMIC 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

TTGGGAAAGTCCTTAATTTCrCTAGTTCTGTTATTTC 

AGATGAAATGATGTGATGGAGTAAGTACTACTITCCCAGCCT 

CCTGAAGGCCAGGCTTTGGGTCCTTCATCCAAGCTTGGATGAG 
1 5 GGATGCTGGCClTTGGAAGCGGGGCGGATGACrCTGCGTGGAAATATTCITC^ 

GTTCTATGTGGCATCTATTTGAGGTCAAATCTATTGAAATATACTTCAGCTTAACT 

ATACTAAGTGCTATGTAAGAAATAGATAAAGGACCCM^ 

CAGTGATAGGGACAGCCGGATCGCTTTCTGTAGGCTTGAGGGTTTCTGAAA 

TTTTT AAG CGT ATG AGTGTITGGTCTCTCTCr ATGCAT CTGTGT C 
2 0 AGACACCCTGGAACTACAGATAGTTGTCAGTTGCCATGTGGGTGCTGGGAAACCCACCCT 
TGAAGGGACTACTGAGCTGTCTCTCCAGCCCCTTACGTTAAATCTTGA^ 
CGGAGAGAAAAATGTGTGAAAGTAGTCAGGGCCTTGACCTGGAAGAGTTAGAG 
ACAGCTCCTGAGTGTTCAATGGGCCAAGTTGAGAGGGGCTTTGGATGCCATC 

TCCTTCAGCTACCACAGGGGAAGGGTGGGAAGGGGGTGAGGAGATTTCCTAAATGTGCATCATTGGCCT 

2 5 ACGACCAGTGGGGTCCCCAAAGGGTGAGACCAGACAGAACAGACAGCCACTATTC 

GACGAAGAGAGAGCTGAGAAGAGACCCTGAGCCTCTTCACCACAGCAGACAGG^ 

GTCTACGTCAGTCGCTAGCTATG CACATGGGTACCCTGGG CATGGAAAGTACAGACTGGCTTTCTAACACATCATTTTGCCGTGAG 

ACCAATCCCAGCATTCCCTCrATTGTCCTCAACAGAATCAAACGTCCAAGCTAAGGGGC^ 

(TTTAGAGGTCAAGAAAGCATTGACACAAACCTTGGGT 

3 0 GCCATCTCTACTCTCTCCTAAATGGCACGAGGTTTAGATGAGTAACTCTGAAAGAAC^ 

AGTGTCCAACAATGAGCCCTCCC^GCAAAACAGGAAGCCTCTTGCTGCCAAGG 

ACTACCGTGCC!ATGGCAACAGGTGTTGCTGAGACCTTAGCCTTCCCGCC!AACCTGTG CCCTGCCCTTCTCTCTC CTCCTTACTCCT 
AG AG AGTGT CCTCGTTGTG CAGTGTCCTTG ACTCAC CTGGACACTG CTGTG AG TG AG CGT C CT AATT AC CTT AACTTGG ATG AGAA 

3 5 GACTAATTCACCCTATAAGTCAGTGGCCCAGGGCCAGGGCTTGGGCCTTAGA 

TGTTGATCTCCCCTTGTTTCCTGCCTATCAGTGGAATC 

GCTGCCATGATTCCCCAACCAGGATGGACTCTACGCTGGAACTGTGAGCTTGTATTAACG 

GTCTTTTCATCACAACAGCAGAAAAACCAAAACCAACT 

ATTTGGACACCAAAGATCTCACTGATGATCATCCAAACACAGTTCT^ 

4 0 ATTTAGCAAGCTTCTTAGCTCTTTTGGATCGCAAAAGTCTAGGTAAGGTCTGA 

TGTATGCCAGACCTTCCTGTAAGGTTGCTGAGTCCTGAAGCTAAACTGACTGTCTTAGTTAAGGTTAGT 
AACAAAACAAACAAACAAAAAAAAAATGAACAAGCCAGGCGTGGTGGTGCATGCCT^ 
GCAGATTTCTGAGTTCAAGAC<ZAGCCrGGTCTACAAAGTGAGTTCCAGGACA^ 
AACAAACAAACAAACAAATAACAGCAACAACAACAAAAACATGAAC^ 

4 5 TCCAT AT CATGGTCCATT ATCAAAGG AAGTCAGGGC AGG CACTCAAGG CAAGG CAGG AAC CTAGAGGCAGG AG CTGATGCTGAGG C 

CATGGAAGAGTGTTGCTTACTGGCTTGCTGAGTCTGCTTTCTTTT 
AAGAAGAGAGAAGGACCTCCTCAGACGC^GATGGAAGCATGTGGTCCTTTG 
GGCH^TCAGCCAAGGCACAGGGGGTTTGAGGGTTTTATAG^ 
TGTTTCCTCAAGTCTCAGCACTGTAAATACCAACGGAGGTTC^ 

5 0 CTGCCTCCAGATGTTAGCAGTGTATCCCTGGTGCCATCCGGAGTTTCTCAGCTCTCCTGG 

CAAGACCAG CAG CAG CAGCACCG C CCAGTGAACTAGCCTCAGAG ACTCACCCTACAGGCCAATCTGATGGACGCATTTTTCT 
TGTGGTTCCCTCTTCTCAAATGACTCTAGAGTGTGTTAAATTGACATAAAA 

AAAGG AG CTrGAAGATG AGGGTGGGGACAGAGG CAAGGGTATGTCTT AG ACTTT ACTGATTG AATTGCAAG CCG AG AAGGATTAGA 
AGACATCAGCAGTATCAGTGGACAAGTGGCATTTGAATGCCATATGGAAGAAGGACA^ 

ATACTAAGTTTTCTTTCATC CTGTG AACAATCAAGTCATCA 

CCTGGTT C CTG CTCIACACTCTG AAAC CTTTG CT CCTGG GCAAAACTT CC C CTAAAG AACAAT GACACAC ATTCT CAG AG AACAGAG 

AACAATAAGCCGTGTGTAACAACATCTTCCAGAGAGAGAGAAGCGTGGATTTAGTCC^ 

CTAGCTTGACATAAGGAGAAAAGCCCCTTAGCTTGGAGCTAAAGGTGAGA^ 

6 0 GAAAGTTACCAAACACAGCAGCCTGAGGGTTTCATGTGAGAAGGTCATGGAGTCCCAATCACTC^ 

TGCAGCCCCCAGCTGGCCTTGAACTTGTGGCAGTGGAGAATGTGCT^ 
GTCACTGGTTGGAAGCCCTGGACCATGGTGGATGGACACTCX5ATTCT 
AGTTGCTGGATGATGGTGGATGGACAGACAGACACGGACTGG03CCT 

GAG AG CTTCATGTC^AAG ACACAAAG CAAAG CCAT CT CATGTGC CAAC CACAACCT AG CCAATGGG C AGTTATCAAAGCCAGAAAT 

6 5 CTGG CCT AGAGG CCATAGCACTCT AAAATGAGG CCAAG TT AAACT ATGTGGCG CTCT CTGCCCT COUVACTGC CAACTTCAG CAG C 

CCTTTTGAAGCTGAGTCAAACTCCTGACCATAGCX5TTTCTTTTCACCACTC 

GTGTTGTGATGCAATATTGTCCTCCATTATTTCCTAAATACCATGAAATATTTAATATATAAAGCaVTAAA 
GCTCACACCTTTAATCCCy^CACTTGGGAGGCAGAGGCAGGCAA^ 

7 0 AGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCGAAAAAACAACAACA^ 

AATGAAGACTTTAGTTC^TCATAAATAAAACATCACCTTCCGAGCCCT^ 

CTCTTCATATTGTTTGAGAGTGATTGTGTCTTGTCATTTGTCT 

TAAAGGTGGCAGATCCAAGTGTGGCAATCTTCCTACCTGAGC^ 

CTTCTTCTTCTTCTT CT rCTTC 1' 1'CTl C TTCTTCI 1 CTTCTTCTTCTTCTTCTTCriVrTClTlTTCT 
TCTTGATACGGTATTCTGCCTCCATGTATACCTGCAGGC^
TGGTTGCTGGGAATTGAACTCAGGACCTCT^ 

CCTCCrCTTCCTCCTCTTAATAATTTACTTTTATTTTATGTACATT 

ATCAGGAGTCATAGACAGTTGTC^GCTGCCATGTGGGTGCTGGGAACT 
5 CTGCAGAGCCATCTGTCCAGCCCCTGCCTCAGCTTCTTACTGCT 

GATCCATCTTTGTCTATCTTCTTCACTGTG<3TGGACTGGATACT 

ACCCTACGATGGCTGGATTTGAGAATAAAGCTGTTGG^ 

TGCATGTCCAAACACTAAAGATCCTTGTCCCCAATTGGTTTTTC 

ACTGAAACAAGACCTTTAGATTTGCTCAGGGTGGGAACTGGGAG 
1 0 TCAGAATTAGAGACGCAAAAGGAAAAGCATCCGAAATGTAGGAGAGAA^ 

GGGCAGCAAAGACTGGGGT AGGGGCAC CCAGAAGGAG CCAGGGCAACAAAG ATAAAATATAGAATT AGAGGGCATT AAGC CAGGAG 

T ACCAAAGGGAAATGTGTTCT AGC CACAGGG AAGATT AGAACTGCCCAG CCTTT CT AGTCAAAGCATATTT AAAATTAACTGG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCTTCATTCTTGAATCCAG 

ACGAATCGGGAGCACAAAATGATTTAGCTAACTACTGCTACGTAGAGAATCT 
1 5 TAGGGTCCTGGTTCCATGGAATTGGTATGTGAGTGACATTACC 

TAGAACCATGAGGAAACATATTTCTGITGTGTAAGACATCCAGTCGATGGTATTT^ 

AAGTTAAGCAAGGCCAAGGATCATGTGTGCTTGTCTTTGGATCAATACTGTCCATC^ 

TATCTGTTTGTTGAATGAATGAATGAATGAATGAATGAATGAATGAATGAATTAGATAATTACT 

GGGGTATGCAAGATGGTTTAATGGGTAAAGGTGCTTGCTGACAAGCCTGACAACCTGAG^ 

CCTTCCCCCACCCCC^TTCTCTCCTTCCTTCCCTCATGCCTCMCATGAAA 

CCTGCCTCCTGCCATGCCAT CATC CAC CATG ATGGATGTG AACTCATCCTCTGAAACTG T AAGCAAGCCT ACAATT AAATGCTTTC 

TTATATAAGTTACCTITGGTCITGCCATCrrGTTCACAGGAATAAAA 

TTGTAAATGAATATTTATAGCAACAATACTCATGATAACTAAATAGTAG 

2 5 AATGTGGTATATCCACACAATGGAATATTATTCAGCCACAGAAAGG^ 

TAAAAGAAGATAGATATGAAGGTAACATTGTGTCTATTTA 
GCGGCTCAGAGCAGTTATGTGCACAGGTGGAAAGGACATGGGAATCAT 
CAAAGTTTACCGTGGCGAATAAACGCCGTGACTATATAAACACCACAGAAATGTCT 
AGATGGCGCAACAGTTAAGGACACTTGATGCTCTTGCT 

3 0 TGTAACTCCAGTTCCAGAGGATCTGACGCCGTCTTCTGACCTCCACGG^ 

GCAAATGCTCATACACATAAAATAAAAATCAATGAATCGTTTTCAAAGGTGA^ 
GTCTTTTAAAAGAAAGGAAGGAAACACAGATGAATTCCTTTCTTTCAACCTTCCATTAC 
TCTCCAGAAGCTAGAGGGAAAGTCGTCCGGGAAATGTAGTCCCTGTATGCAGAGCA 
GTTATGTTGTCATGGCAACTGTTTTAACTATTTATTTAT 

3 5 GGGCIATCGGGTCCCATTACAGATGGTTGTGAGCCAACGCGTGGTTGCTGGGGATTGAACT 

CTCTTACCCACTGAGCCATCTCTCCAGCCCCCTGTTTTTAACT 
GGGTCCTGTCGCTCACTGAATTGTCTACAATCAGTGGCT 
GATATTTGCTCTAGAACTCAATGGCCACCCACAGTCCCCACATGATCTCTATC 
TGGCTTAGTAGCTMCAGAGAATTGCCCCACAAACACGAGG 

4 0 CATGAGCATTCGTCATCCTAGCAGTCCCCGAAGGAGAT 

GTATAGAATGGGGCACTCAATAAAGACGTGATACCTGCAACAGAATGGAAGT^ 
CGTGGAATGGCATAAAATCCCGACATTCACGCATACGATCGCAC^ 

AACAATT AAAAATGC CAAT AAAT AT AT CAATCAGTTT CC CTGTGGGG GAAAATGGGTGTGG T CACAAAG AAT AAAAAAAG T AAAAT 
AAAAATAGGCCAGGCATGGTGGCATATGTCTTTAATACCAGCACTCGGGAGGC 

4 5 G CT AGGTCCACAGAACAAGTT CTAGG ACAATCATGG CACTGACAAAAAT AAAAT AAAAT AAAAT AAAAT AAAAT AAAAT A 

TAAATAAATAAATAAATAAATAAAATCTCTTCTTGGGGAACCAGTTTGGATAATGA 

AGAAAG CCGAGGGGCT CCAGGCAC C CTCATAT CTAGCATCTAGG TC CTGTG ATGTTT AG CTTAAGT ACT AAAAAT AAACACTCAAA 
CTTTCACTTCGTGGTTGTGTCACTGAAGCCTTGCTGGC 

CTGCCTGGCGGCTGCTGGGACAATAGCCACTGATCAAGTTCATACTGAAAGG^ 

5 0 GTACACCTCTGACTCCCGAGCCTC^GCCrrATCTCACCCCAC^ 

CTCCCTGAGTCTCTACAAGGCACTACTACCAGGGGCTTTCT 
TGGTTATCACATACCTTGTGGCCCAGGAAGACTCTGAGGGGTGGGAC^ 
TTACCAC1TCCTCCCTGCCAGAGAGGGCTAGCTGGAGAGAGA 
CAGAGCACCATGGACCCAGGTAAGTAAGCTTTCTCnTAAGAGAAGTCAAAACTTCT 

5 5 ATG CATGGGGGAT AGG^GTCATGGGGGGGGGGGGGAGTTTCC CAGAGCAATCAESGGCAGTTGGTGTGAACTGGTGGCTGCrTTC AG 

AGAATGCCATCCCGTTGGATGAGGC^TTGGAGAGAGGATGTACCTGCTCTTT 
GACAGACACAGGCATAAAGTTGGGTCCTCTCAAGCAGCAGGCAATGAAC^ 
GTTCTCAGCTGGTAGGGATGGGTGATGTTCTCTGGCACCTCTGGGGACTAA 
CCCACTCTGGGTCCCTCCTTTCTGTCCTGTTAAACCAC^ATTTGAA 
60 GCTCTGCATAGTGCAATACAAATACTAGCTGCC^CCTCTGCCTCTTGTTGTCTCTGC 

CTT C CTC CCT CCCTT C CT CC CT CCT ACACATC CCAGG TTTTGTG CCT CTTGCAGG TT AAAGGGTG AATGACT AACAT AAAATTGAG 
TGGCCCAGGGGGCACTCCACGTAGAGAAACTGTAGCCAGCACAGCACAGCCCCTA 

TAACAAG CTATTCTACCAGGCAAGCCT CAGTG TCT ACTAT AG CAAAGTG C CAAGGGG AGTCCAG CT CC!AG ACACCC CAG T^ 
TCTTAAATCTCGGCTG CAGAGTCAG ACTCGGCTGGGGCTGGAGGGG CCAAGTTCCT AGAGTTCCGCT 

6 5 AGGGGTTGACTTTAAAAAGTGAAGGTTCCCTC^TGCTGTCCTCACACCCCCT 

GTCTCTCCTGCCCAGCCTGCATTGCACCCTAGATCTAGATCCAGCTGTGGAAGGTATATAGTGGGGTCCA 
ACTCATGGGAAGCIAGGATTTTGTTCTTACACCTCTGCCTCTCA^ 
GAGAGGGATGGAGGAATGGATGGATGGATGGATCGATGGATGGATGG^^ 
CAGCCTCTCTAAGTATATTTTAGAAAGCAAATATCTATTCCAGCTTCTCCTCCCACCTGCCT 

7 0 ATCTACCTCCACCAGCCATAGTCTCAC^GAACCCACACCATCCACCAAAGATCTTCTCCACCT^ 

AGGGAAAGGTTCTAAATTTACAGCATTTGCCAATTTCCATGGTGT 
GGTCTAAATGTTTTGACACAAAAAGGAGACAGACACCTGCCTC^ 

TGGGG ATG CT AAATG CATGG CTTTGAT ACAGGGGGTGGGG CAAACTCTGG TC CTGG AAG AAGAG^ CTTTTTT ACTCAA 
CTTAAGAAGCACCCTAATAGCTAACTCTGAAATGCTGGTGTGAGGGGAAACTGCTT 
7 5 cCGTTGGAAGGGCGGGTGGGCCACAAACTATCTGTTrTTCACT 
GGTTTCTGTTATACAAAATGAGAAACTACAAA^

TCCTAACTTCAAGACATGGGACCACAGATCTCAGCACAACATCTCAAAATCAGAGA 
AGACTTGTGTCTCACAGCTCAGCTGTAGATGGCAGTAGAAGAGAATTC^ 

5 TCXTGAATACAGAGTCTGGTGGGGAACGTAACATGTATGTCAACACGC^ 

ATAAGGACAAGATTCAAAAGCAAAAAGCAAGGCTGTGAACACGTGTCCAGAAAGCATGGCA^ 

GTTGT CAAACCT AATAAAGAAGGGCCAGGTTGTAG CCAGAT CTT CAGACTCTACAGCCTTAAGC CAACAAAGT AAATGAGG CATCC 

GAGTGTCAGGAACGTGACtGGCACACAAGCCTGAGGAACCTGAGCTTGTAG^ 

CACGTAATACCTAAGCCTGCTTAGGGAGGCAGGAATAATAAGACTTGCT 
1 0 CATGAGAGACACTGATGTCAGCCTCTGACCTCTGTATGTGTGCACACCCATA 

TCAGCTCTGGTGAAGGCAATAGAGCAACAAGGAAGGGGGAATGGTGTCA^ 

AAGCTAGACTGGGGTGGCCCTTACAGCGCTGTGTGACAGCACCAGGGAATC 

GGTCCCCCACCCTTI^CCAAATAGATGGGCAGATAGATGAAATGTGTGTTTT 

GACAGGGGAGGGACCAACATCAAAAGATGTGGAAGATCGAGATGTGGAAGCAAC^ 
1 5 AAG AGTGGCCTTGATCTGTGGAGGACCCTCCACACCCAAGG CTCATCAGTAATGTCACATG ACTCAG CCATTAGAACCCAGGCTAG 

ATGTTCCCCTTTCCCATCTCTGATCTATTGTTTAATAGCTATGTGAATCCAGAG 

GAATACAAAAGGAACTTACAAGATGTTATGAGGGGGGAGAGATGTAG 

GGGGATGG CAGCGCCTGTCT CACGTGGACAG CG CACTT CACTGAAAG CTCAG TGGGTG C CAATTGCAT CACTGAGTGACAG CGTT G 
2 0 GTGCCCTAATAACAA^ATCCCACCAGCTCTCACATCCCT 

TTCCTGGGCAGCCTGGAGGGACTGGGGGAATGTACAGATGGGAGCTTTAGGAC 

AGCTGGGGGCGGGGGAGGGAATTGAGTGTCAGGCTGAGGCAGGGGAGGGCAGGGGAGTTG^ 

GAGTTTTCCTTTCACCTrAATCATTCTGCAAACCCAAGAACTCAG^ 

2 5 ATTCCCAACTGCCCGTAGCTCAAGAGCAGTTGCTTTCTGACT 

CCACACGTTCAAATTCCCCCAAATTCTCCTGTTAACTGCCI^ 

CAACCCCCACC CCCAACCTCATATCCAGTATCCCTGGAATGAGACATCAATTTTGTTCACACAC CCCAGAGCCCCTCCGGGTTTGT 

GGCTACAAAAGAACCGGGACAGAACCCAGAGTAAAACAGAAACAGTT^ 

ATTGGGCTTTCTCTACAGTAACCGAAAGCCGTTGATGGGTTGAAAGAAACCT 

GT AATTCAGTTGGGGGGGGG AATGTGTGAG AG AG ATGAGCTGTG AAGC CACG TGGTG ATGTTCC CCTTCCTGTCTC CAAACG TT CT 
AGTACAAGCTGATGATAAGGGGACAGTGGCTGTCTGTCCCCAGAAAAAGATGGAGCTGGT 
AGrcTCrCTCTATCTCTACAGCTAGGATTTCGACTTATGGTTAGGG 
GCTTCAGTAAGACGTGCTTAC^TAGCATGGAGGAGAATGTAGAGGGGAAAAACXSTTT 

3 5 TGTACTGAGTGACCAAGCX3TAGCCTCCTCAGCCTGGAGTACATGTCAGGTACCCTGG 

GTGCTATGGCCAAGGACAAACAAC^TCTGCXJTTTGAACTGTTTCCCTGGC 
GTCCTACTCTGGACTGCAAGACCCATAGGGTCTGACTCTTCCTGTAGATCTGAAG 

AATAGCAAAGAAGGCGGATCAAGGGTCCCCAGGGGTGAGACGGGAGTATGTGGGGTAGAGGTCTGGGGTGG 
TGCAAAAATTACTTTTCCAGTTGTCCTCTTGCTTAGCTGTCAGGAG 

4 0 GAATGAGGAGTGGTCGCTGTCATAThftJNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNITONNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 
NNNNNNNNNNNNNNNNNNNNNNNIOINNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

4 5 NNNNNNNNNNNNNNNCTGGGCAGCAGGCCAGGTGT^ 

TGGAGCCTGGGATAAAAGTTCCCAATGGTGCCCCTGCC^ 

TAACATCATCTCCTTTGTCTCCTCTAGGGAAACCCAGGAAAAACGTGCTGGTGG 
CCTGCCAGGGCAAGAGCTCTCTTCCCTCCTCCACACCCCCTCCCCAGCCAGGAGTGCT 

GAAAGGTCTAGACACAAACCCACCTGCCGTTCACGGCCCCTTGCCCCCCCCCCGCCTCCCCCCCACGAAGGGGGCA 

5 0 GGACTTCCATTGTCCCCCTTTAAAATAAAGCCCAGGGAGACGTAACCTACCCCCACCCTCGTTCT^ 

TCAGGAGACAGAAACAGTGTCTGAGATCTCTTAGACTTGTGCAACACAGGGCTTAGAGTGG 

CATCATCTGTGAACTGAGAAAAGGACCCTCGGTCGGGCGCTCTGGTTACTCCCCAGT 

GTCCAAGGGTCTCCTTGTAATCTCATTCriGTTTTCCTTCATA^ 

GAGAATGAGTGGAG CCCGGGACCCGGGTGGAG AGGACGGTG AAGGGAACACATAGTGGGTCTGAATT CTGCAGAGCCAAGAGCCCA 

5 5 CTCCCTTCTCTATCCCAGCAATGGAGGCAGAACGTGTGT 

CAATTGGGGTAGGACTCTCTCAAGGTGGGGAATGGCCACGGCCAGCCTGGA 
CCGGAGGCCAGAGGGGGAGCTTTCCAGGAGCTCAGGCCAGGCACAGGCCCCAGG 
CCCCCTGGAGCTTGGGTTGCAGCCCCTTAGCCTGGCCCTTTCCCCA^ 
AGAGAAGGGTAAAGACATCTGTGGCTGGTTTTTGTCTTTCACTTGGGAGGAA 
60 TGGGGCTGTGGGCTGCAGAGGCGGCAAGCTTGGGGX3AAGGGGTC 

GGGAGCACACTTCATAGAAGCTGGGAGTTCTGACCTCCAAAGGGTCTAGAAGCCCTTTAAGCCTC^ 

ACAGACCCATTCGTGAAGAGCCAAGCGCAGCTAACCTCTCCCAGGGCCCCCAGGCT 

TGCTATGGACTCTTCTTCTCAAGTTAGCCATGCCC^ 

CAATGGTGGTG CGATCTG TGGATCACCACTGAATG ACTG AAAAT CT C CT ACCCAG AGGAGG AAC CCT AAACTTTC CAAAGTGGGC C 

6 5 TTCAAACCACCCCACCCCCCTCCCATCTGTGGTGGTCCATCC^TCACACAC^ 

TC^CAGCAGCrGCTGAACCGTCCCCCAC^CX3CACCCrrGCAGCCCrrG 

GATTGGG ATCTG CCGGAAG AGAGG CAAAGAGGTGTGGG AAG AG G^TGGGAAGACAG AGGGAGAGTGGGGG AAG AG AGCCG CTTCCT 
TCCAGGGG CACAAGCCGCCATAGG AAGGCAGTCACATTTACCCAAGCACTACCC CACAGG CCCTACGGTACATTrGTCTACGCGCT 
GAGTGTAGTGGGTACCAGGGAGCTGTGAAAATCCCCATGTCTCTTATCAGCAGTCCCTCGACTGAA 

7 0 TTTGATCTTCTCTTCCTGCTTCGGACCTTCCTCA 

AGGCATTTTGTAGCCCCCACCCCAAAAAGAAAACTCTCCIGGGA 
TCACAGGGTCCCAAACCCCACAACGGTCCAGCACAGGAGTTG^GATTTGC 
CAGCTCAGGGCTCAGCAGACCCCAAAGCCCTAGACTATCCCCAAT 
AAAACATCAGGCCITTGCTACTGATGAGTAGAAATGTAAACTTTCTGGTCCTGCCT 
7 5 ACTAGATCCTCCCTGAGTAATTCATACATGTGCTAACATCGTTACTTAGATTT 
AAGATTTTTTT ATTCATTG AACGT AAATG C^TACACTCTAGCT ATCITT AGACACATGAGAAGAAGGT ATCAGAT C CCATT ACAG A

TGGTTGTGAGCCACCGTGGTGGTTGCTGGGAATTGAACCCAGGACr 

ATTTTATTTTATGGGTATAAACATTTTGCCrc 

CCATGGAACTAGAGTTACAGAGAGCCACGACGCACCGTGTGGGTGCTGGGAAC 
5 TCTGCTCTTAACCCCTGAGCCAGTTCTTCAGCCCCACATGCrGA 

TTTGAGGGGGAAACTATAGTTTCCTGGGCTCCTCCGTCTAAGATGCTGACCC^ 

AGAAGCCCCGGGCAATGTCCTGACTGTGAGATCCGGACTGTGAGCTCATCAGG 

GATGAGGTCACCGATGACTACATCGGCGAGAATACCACGGTGGACTACACCCTGTACGAG 

GAACTTTAAGGCCTGGTTCCTGCCTCTCATGTATTCTGTCATCT
1 0 ACATCTATTTCAAGAGGCTCAAGACCATGACGGATACCTACCTGCT 

CCCTTCTGGGCCTACAGCGAAGCCAAGTCCTGGATCTTTGGCGTCTACCTGTGTAAGGGCATCT^ 

CTTCAGCGGGATGCTGCTGCTCCTATGCATCAGCATTGACCGCTACGTAGCCATCGTCCA 

CCCGCCTGCTTCTCATCAGCAAGCTGTCCTGTGTGGGCATCT 

GGCCTCCAGAAGAACAGCGGCGAGGACACGCTGAGATGCTCACTGGTCAGTGCCCA
1 5 CCAGATGGTTTTTGGGTTCCTAGTGCCTATGCTGGCTATGAGTTTCTGCT 

ACTTTGAGCGGAACAAGGCCATCAAGGTGATCATTGCCGTGGTGGTAGTCTTCATAGTCTTCC^ 

CCTGGCTCAGACGGTGGCCAACTTCAACATCACCAATAGCAGCTGCGAAACCAGCAAGC^ 

ACAGCCTGGCCTCCGTCCGCTGCTGCGTCAACCCTTTCTTGTATGCCTTCAT 

TTCAAGGACTTGGGGCTGTCTCAGCCAGGAACGGCTCCGGCACT
2 0 AGGCGGAGACCACCACAACCTTCTCCCCGTAGGGGGCTCCC 

ACACATGGACAGACTCTCCATCCACCGAATTGCTGCTGAGGGAAGAGCAA 

CTGCTTAACCCCATCCCACTTATAACTACCTCAACCAAAGCTGTAAAAGATATGGCTGA 

ATCCCCAAAACGACAGCCAAAAGTGAAAGTGAGAGGCTGCCACACTTTCCGG 

GAGTAGTCTTCGGAGGCCTCTGAATGAACCTGCTTCTAG 

2 5 TTAGCAAGCAAGCAAGAGATGACAGTCTCTCT AAATG CTCCCAC^ 

CTGAGGTCCAGGCCACACCATCAGCCTCAAGGCAGTCCCAGCTGGG^ 

GGAATAAAAAGGTAGGGGAAAGGTGACAGAAAGGAGAGAAGGTGACCCTGCrTGGCTGAC^ 

TCTGTCAGCCACTGATACCITTCCTCATGTTCTGCrTTTGAT^ 

3 0 TGCGCCCCACACCCCTCCGCCCCAGCCTTCCTTTCTCAGCACTCAGAACCATGAGGTCACAGAGCCT 

CCTTACAGGCAGCAGTATACATAGGAGAGCACTCTGAGTATGGAAAGACGAGG^ 

AGG AACC C CGGGTTGGGGGG AAATGGACACCTGCATCCrrGTTGTGTTTG CTGT AACAGG AT AGCCCGGAAACAGC C CT AACTCTTG 
TCAGGCCCCTTCAGACTTGCGTACATTCCTCTAAGAACTCCAGTTTGGTGGGAA 

3 5 GGGAACTAGGGACTAGTCAG ATGCCACAGG CTGG ATGGG ACTCTGC AGGGTGTTTG AG AGCT AGG AC CACTCGGTCCCCTCCTTTC 

TACCC^CTAGTAACAGAAGTTACTGTTACCAGCCTGACCOT 
AACAAAACACTGCGGTCTGCGGAACTTATACAGAACAGG 
TCTCCCAAGTCCATTTATAAGGCCTTAAATCCATTCATCAGGGTGGAGCCTT 
CTCCAATACCATCATCCCCTAGGGTTCAAATTCAAACACAGGCCTA 

4 0 GCAGAGGGAGAGGGCCAGACTCTGTCGCCCCTCACTCCATCC^ 

AAGCCCTAATTTCCCCAAGCCCGAGAGCTTGGTCCAAACTATGATACCT 
GAGCCTGGCGGTAGGATGAGGGTGGGGGGTGGGAGAAAAAAGGTGAGACATTCACACAAAAA 

AAG AAAAAAG AAAAATGG AGCCTGTTCCATAACAGTGCTCATTCT CTAAGCTTGTAACCACAAAACAC CCATT ACCCTCCTCCACA 
GGATGCATGGGGCTAGAAAGACTTTAACT^CAATTTGTCAGGCGGCCTCCCGGT 

4 5 CTCAAGTGCAGGCAGTGGCTGGTGGCXaGGTGGCGGGTGGCGGGCAGC^ 

G<rGACTGTGTGACCACAAGCACGCTGCAAGAAGGCX3ATGC 
GCTTTGTAGATCCTAATGAAGTTACAGATGGGAAAACAGCCCTCGAT^ 
GAAGCTGGGCCAACCACACAAGCATGCCACCCTCGCCTCAAATACATTGCGCTACT 
ACAACAGAACCCCCAGCAATGCTAATTCCCCAAACTGCTGTGTCT 

5 0 GACCIGG CCTG ATGCACTCTTGTGCCTACCAAGGGTG ATCCCCAGGATACAAG AAAGC^ CTTC 

TTTCTCATTCTGCCCrTCTCGTCTGCTCCATACAACAG CCCCATG CCTCTAAATCTGTGTCCCTTGCCACGCCATCATTCTGCAAT 
TGCCCCAAGCTCCTTCCATTCCACTAACAGTCCCTTGTCCACGCCCCCC^^ 
CACACATCAGCCCTAACTCAGATCCCCCCTGGOTGTGACCCTAGGCCAAATGTAAGAC^ 
GCTAGAAGTGCCGAACCTTTAATAGGTTCCTGGAGACCTCAGGAAGCAGGGGCTGACC 

5 5 CTGGCTTGTCCAAAGAG ACACAG CAATGAGGGGAGGTTGCTT AAG CTAG CTGTGGAGCCTGAGGGTCTGGCAGAGTTGCCAAAAGG 

GGGGATGG AGCCTGGCCTACTCTGAAT CCCAACCCTGCCGCT CCC CAGCTGTGTGACCTTCAGCTGTTGCTCAACTGCTCTGAATC 
TCAGATTTCTCAATTGCAAAACAAGGGAGATTTGGGTAAAC^ 

CTCCCACTCCACAGCCAGATGGCCTGGTTCATGTCTCCGCTCCCTTACGGTGCGGTGCCTCGGTT^ 
TATTAGTCGTGCCTGCCTCCCAGGATTGAGGTAAGGAAGCAATTCATCTACAAAAGCAAAAT^ 

6 0 TATCATCTTTTGGGAGATAGTTGAGAAATTATAGATTTAAAAC^ 

GCT CTT CCAG AGG ACC CAGGTT CAG TTCC CAG AATCCACATGG CAG CTATT AAC GGTTTG CAAATCTAGTG TCAGAT AATGAGG T A 
CAGTGC CATGT ATGTGTGGGGT ACATG CCT ACATGCAGG CAAGACA CT CAT ACAT AT AAAATG AAAATAAAT CTTTTG TTTTGTTC 
TGTTTTCATTTTTTC^AAACAGTGTTTCTCTGTAGAGTCCTAGCAGTCCT 

AGAGATCCATCTGCCTCTGTCTCCCAAACCCTGGGGTTAAAGGTGTAGGACACCACCCACCGGCATAAATAAAT^ 

6 5 AATAAAAAT GT I TCrTT AAAGAGACAGAGAGAGAAA 

AGTCCAGATTCATAACCCACACTCACAGGCCAGGGTTGGTGTTG 

AAGTCTAACAGAAAGTAGAAGCTTCAGGCTTCACAAAGAGACCCCCATCTCAATAG 

GGCAGACACCCCAGTGTCTCTCTGCCCTCCACCTGCACACACACGAAGGAG^ 

7 0 CCAGGAGAGACATTTGAGGCACAAGGAGAAGGAGATCACCTGAGGTCTCTAGGGGCCTCT 

TAGGTCCACCCTCAATCCGGTCTGCCAGCATCCATCCTTAATACGAACCTCT 

TCTTCTTTGAGAACGAGGTACTCACTTGCGGGTTTTTTGAGTCAA 

GGAATAAAGGAOUlAGTCGGAGAGGAGATTCrGAATTCGGAGGAAGGAG 

GAGAGTATATCCAGTGGGCAGATGAATCCGTGTGTGTGAGTGAGTGAGTGTGTGTGTGTGTGTC 
7 5 TGTGTGTCCTCCCTCAAGTCTCCTACTCTCACAGGAAGAAATGATTCCATGGGTGT^ 
GTGTGGTGGCGCACGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGCGGATTTCTGAG
TGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCJUGTTC 
GTCCCTGAGGACTCTCACTCTCACCCCAACAGGAACCCAGCTTTTCCACACA 
CTGTCAAOIGAAGCACCAGCTAGAACTTCTGTGATCCTGA 
5 CCGATCACATTTCCTrcCAGATGTTAGGTTTACCCCTTCCCTTGTGCA 

G CATGC CTTTAATCCCAGCACT CAGAAGGC^G AGGCCAGCTG ATCTCTGTGATTTTGAGGCCAGGCTGG C 
AAGATGCCCAGGAGAAAAAGGAAGGCAGAAGCAAATGGCTATCGCrrCTATTTTGTCT 
ACCAGGCTGTTGTGGCACTGTGTCACACCACAGTGACCCCACCGAGTTCTCATCT 
AGTGGATACCCAAGTCCTCTATCACGAGGGCAGGAGAGGGGACTTCTCAATCCATGTCACAGCTTC 
1 0 GTTTGCTCTAAACAAATATCATTGTGAAGCTGAACAGATGTTGCAGATAAAATGCCCT 
TGGGAACTCAGGCGTGCTAGTCTTGGATGGTTTCTGCATCTGTGGTCCA^ 

GTGAATTC1TCTGGCCCCGGGGGTGTTTT CTGCAGCCTGG CTAGCTCCAGACCTGTAATTAAGTCT AGTGGTATGCTGGCAAGGAT 
GTGGAGAAAGAGGAAC^CTCCTCCATTGCTGATGGGATTGCAAGCITGTACA 

AAATTG G ACATCGT ATT ACCGG AAG AT CCAG CAAT ACCTCTGCTGGG CAT AT AC C CAGAAGATGTT CCAACAG AT AAT AAGGACAC 
1 5 ATGCTCCACTATGTTCATAGCAGCCTTATTTATAATAGCCAGAAGCT 

AGAAAATGTGGTACATTTACACAATGGAGTACTACTCAGCT^ 

AGGATATCATCCTGAGTGAGGTAACCCAATCACAAAAGAACTCACTTGATATC 

TTAGAATACCCGAGATACAATTTGCAAAACACAAGAAAATCT^GAAGAAGGAA 

ATAGGGAACAAAATACCCATGGAAGGAGTTACAGAGACAAAGTTTGGAGCT 
2 0 CCCGGGGATCCATCCGATAATCAGCGACCAAACCCAGACACTATTGC^ 

AGG CT AGGTCAGGGC CTGGCAAACACAGAAG TGGATG CT CACAGTCAT CTATTGAATGG AACACAGGG C C CCCAATGGAGGAG CT A 

GAGAAAGTATCCAAGGAGCTGAAGGGGTCTGCAACCCTATAGGTGGAACAACA^ 

TCTCTAGCTGCATATGTAGCAGAAGATGGCCTAGTCGGCCATCATTG 

GGGAATGCCAC^GGCCAGGAAGCGGGAGTGGGTGGGTTGGGGAGGAGGGCGGGAGGAGGGTATAGGGT^ 

2 5 AAATGT AAAT AAAGAAAAT AT CT AATAAAAAAG T CT AGTGGTGGAG AATCAC CCAAGTGGGAAC CATCTTTGCT CCTGCACT 

CTCTCACAGGACTGGTAGACATGCCATCTTGTTCX3AGCTACCACCT 
TGGGTTGGCCGTTGACAATCTCTGGACTAGAAAGCCTGGTCAGGGAATGAACX3CGATG 

T CTCTCTCCATCACCCCAGCAGAGCCAACCCTAAGCAG ACTACGAACTAGAAG AG CTTC CGAGCTTTAGGTTAAACACAGCAGAGC 
AGGGGAAAGACAGGAATCACGAGGGATCGGGAGGGAGGGGAGGTTTTGAGGAAG 

3 0 GTGTCCATTCTAGGGATCCACCTGGCACTGAATCCACAAGTGAGGAAGCCAAGGGTG^ 

GACACAGCCAGCAGCTGTGCTAAGATTATATTCCCTACTTCATTTATACTGTGTATTGAAACCTT 
TCTGTCTGTCTGTCTGTCTGTCTCTCTCTCTCTCTGTTTTTTTTTTTTTT^ 
GGCTGACCTGGCACTCACTTTGTAGACCAGGCTGGCCTCGAACTCAAAAATCTGTCT 
TAAAGGCGTGCACCACCACACCTAGCCTTATTGAAGTGTTTTAGCAATCCT 

3 5 CCAGAGAGGGGAAAGTGCCAAAGCCATAAGGCAATAAAACTACTAAAGGATGT 

TGCATCTGCCCTCITGCTGCACCATTTAACCACATAGTTGCCTACTCT^ 
GTGCTX^TGTACATAGCATCTCTCAAAGCACGCTGGATGCAGAGTTCTGT 
CAGGGAGCAGGGAAACAGCCCXX3CTGTGATAGGCTGTCCACT 
CCTCCTGGCTCTGCCATGGAGCCACTGGGCAACATGAACAAGTCACCT^ 

4 0 GATGGAGTGAGTTTTACACCCATCCCCTGCGGTTCTGAGGTGCCTTGGX3 

GGGCATGTTGACGCCTGGAAGGGGTGCAAAGTGCCTGGGTGAGGG^ 
GGCCACTGGTGGCAGCAGGTGAGGGGAAGTATCTGTC^CrCTTCCAACAAAGG 
AACTGCAGTT^AAGGCGCCAAAAATAAAACACAACTTTCCTTTGATAGGCACT 
G ATTT CACACCT CAGTTCAT ACCC CACACACACACAGACAC^ 

4 5 TAATAGGCAGTCAATGATCCCAGAAGAAACATCTCCX3TGGGCTGTGA 

CCTTTCCATTTTGAAATGTTCAATATTTTAAAAGAACTT^ 
ATGTCTGTGTACCCTACCATTC^CAAAGGCCAGATTAGATTCCCTTGGAACT 

CCTGGAATCAAACTTGGGTCCTCTGAAAGAGCAATCAGTGCTTTTAAAATATATATACATATGTGTGTG^ 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTAAAATTTATTTTATG 

5 0 GCTTTGGATCCTATAAO^GAGGGTTGTGAGCCACAGTATAATTGCT 

CTTGACCACTGAGCCATCTCTCCACCCCTAAAGTTTTCAGTATGTTTTAAGCA^ 
TCAAGCTACAGAACCAAACTTGCTAGCATTCATTCATAAATATC 
AAAAGTGCATACATTGCCCACACAGCGAGAGGAACAGACACATAGACCCCCC^^ 
GGCTCCCCTGTGTACACACACTGCAGTACAACTGCTTCCCCCATAAGGGACAT^ 

5 5 ACCACAACCCAGTGACATGAATGGATTGACTTAGCCAGATCAAGGCACCTCC^ 

TGTATCTTCAACTGTCACAGCTGGCCTCTCTTrCGGGCTATGGGCT 
CCTTGGGGAGTAGAGGAATCCTTGGGAGGAGAGGTTGGGAGGAGGAGAAGA^ 

AGGCG CCT AGTGAACTGGCCTGAGTCGTGGTTCTG AAAG AGCCTGGGGGAGGCX^ CGAGC 
TCCTGCTCTCACATGAACAGCCAATCCCTTCCCCCACTCCTCTTTCrAGCT 

6 0 CTCAGTCTCCATCTCCACGGAATGAGAAAGTATTGTCTGC^TGGCAGGGTTGTGTGGT^ 

GGGCTCTCCCAACATTCAACAGTTTGCCTTCTTTCCCGGCCTTGCAATCCGACTCT 

CGGCAG ACCTTCAGAATGTGTITATTTCTTGT CTCCCCACCCCCACCCCACTTC CCAGCCAGATGCAAG CATGGG AGGTGG CCGTA 

GTTCAAGGCCCACCCTCCCTTCGGGGACTTTCAACGTTCCTCAGTTCC^ 

CCAAGGCCTCCAAAGTTTTCTGCCTGTGCTCTCTACCCCTAACCCCAGTGCTGCCTGTAAGAAA 

6 5 GCCCCTCCATCCTGTTAGGTGATACCAGATGCTCA 

CTGTAACCCCACCACGCTGAGGAGGCTAAGGGAGAAAGAGAGAATTCTCGGCCAGCCTGGGCTACAG^ 

AAAATTCCAACAAAATTGCTGGGGAAATATTGCTCAACCATCGGAATGCTTACATAGCATGCATGA^ 

NNIWNNNN NNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNWNNNNNNN^
MOUSE SEQUENCE - mRNA 

GGACATGGACTGCTATCTGCGTCGCCTCAAACAGGAGCTGATGTCCATGA 
GCATGATGGGCGC&GACTGGGCTAGCTGGAGAG 

ACCATGGACCCAGGGAAACCCAGGAAAAACGTGCTGGTGGTGGCTCT
CACCGATGACTACATCGGCGAGAATACCACGGTGGACTACACCCTGTAC^
AGGCCTGGTTCCTGCCTCTCATGTATTCTGTCATCTGCTTCGTGGG 

TTCAAGAGGCTCAAGACCATGACGGATACCTACCTGCTCAACCTGGCCGTGGCAGACATCCTTTTCCTCCTAATTCTTCCCTTCTG
GGCCTAGAGCGAAGCCAAGTCCTGGATCTrTTGGCGTCTACCT 
GGATGCTGCTGCTCCTATGCATCAGCATTGACCGCTACGTAGCCATCGT
CTTCTCATCAGCAAGCTGTCCTGTGTGGGCATC^^ 

GAAGAACAGCGGCGAGGACACGCTGAGATGCTCACTGGTCAGTGCCCAAGTGG

TTTTTGGGTTCCTAGTGCCTATGCTGGCTATGAGTTT^ 

CGIGAACAAGGCCATCAAGGTGATCATTGCCGTGGT^ 
GACGGTGGCCAACTTCAACATCACCAATAGCAGCTGCGAAACCA^

CCTCCGTCCGCTGCTGCGTCAACCCTTTCTTGTATGCCTTCAT

TTGGGCTGCCTCAGCCAGGAACGGCTCaSGCACTGGTCTT^ 

CACCACAACCTTCTCCCCGTAGGGGGCTCCCCTGCCCG^ACT^ 

AGACTCTCCATCCACCGAATTGCTGCTGAGGGAAGAGCAATTCTGGCC^
CCATCCCACTTATAACTACCTCAACCAAAGCTGTAAAAGATATGGCTC

CGACAGCCAAAAGTGAAAGTGAGAGGCTCCACACTTTCCC^ 

GGAGGCCTCTGAATGAACCTGCTTCTAGCTTAGAGAGATGTCCCC^^ 

CAAG AG ATG ACAGT CTCTCTAAATGCTCCCACAG AG CACCCCTGCCCCTCCCTTCTG CCTCrCCACCGCCTTTCCTGAGGTCCAGG 



AGGGGAAAGGTGACAGG AAGGAGAG AAGG TG AC C CTG CTGG CTGACAGAGG C CAG CAAGCT ACTTC"!' 1~I'GTTCT CTGT CAGC CAG C

CACTGATACCTTTCCTCATGTTCTGCTTTTGATTCATATA 
CAACTTGG 

MOUSE SEQUENCE - CODING 
ATGGACCCAGGGAAACCCAGGAAAAACGTGCTGGTGGTGGCTCTCCTTGTCATTTTCCAGGTGTG

CGATGACTACATCGGCGAGAATACCACGGTGGACTACACCCTGTACGAGTCGGTGTGCTT

CCTGGTTCCTGCCTCTCATGTATTCTGTCATCTGCTTCGTGGGCCTGCTCGGCAAC 

AAGAGGCTCAAGACCATGACGGATACCTACCTGCTCAACCTGGCCG^ 

CTACAGCGAAGCCAAGTCCTGGATCTTTGGCGTCTACCTGTGTAAGGGCATCT^ 
TGCTGCTGCTCCTATGCATCAGCATTGACCGCTACGTAGCCATC^



GAACAGCGGCGAGGACACGCTGAGATGCTCACTGGTCAGTGCCCAAGTGGAGGCCTT

TTGGGTTCCTAGTGCCTATGCTGGCTATGAGTTTCTGCTACCTCA

AACAAGGCCATCAAGGTGATCATTGCCGTGGTGGTAGTCTTCATAGTCTTCC^ 

GGTGGCCAACTTCAACATCACCAATAGCAGCTGCGAAACCAGCAAGCAGCT

CCGTCCGCTGCTGCGTCAACCCTTTCTTGTATGCCTTCATCGGCGTCAAGTTCCGCAGCGACCTCTTCAAGCT

GGCTGCCTCAGCCAGGAACGGCTCCGGCACTGGTCTTCCTGCCGGCATGTACGGAACGCGTCG^ 

CACAACCTTCTCCCCGTAG 

HUMAN SEQUENCE - GENOMIC

AATATATGAAACAAATATTGAC!AGAATTGTCAAAGAAATAAGAGAAA 
ATAATAGAAACTTAAATATCCTCCTTTCAATAATAGATACAACAACT 

G CTATGAACCAACT AGACCT AACGTCTATCTATAAAACACCAC CCAACAACAGCAGAAT ACATATTCITCTCAGATAT ACATAGAA 
CATTCTCCAGGATAGGCCATCTGTTAGGACATAAAACAAGTCTCAAAAAATGTAAAA 

4 5 GACCACAACCAGTAACAGAAGGAAATTTGAAGAATCCATAAGTATGTGGAAATGAATC^ 

TTTGAAATGAATGAAAATGAAAACACAGCATACCAAAACTTATGAGATGCAGCTAAAATAGT 
TAATGCCTGTATTTTTTAAAAGAAGAAAGATACCAAATTAAAAAAAAACTTTTCACTTT 
AATCAAGCAGAAGGAAGGAAATAATAAAGATTAGAATGGAAAAAAATGAAATATGGAATTGGAAAAACT 
CCCAAAAGTTGTTATATCAAAAAGATTGATAAGTTTGATAAACATT^^ 

5 0 CACAAG AAAAGT ACAG AC CAAT AT CTCTCATAAGACACATAT AAGAT AGATGC ACAATTT CTT AACAAAATATTAACATGCCAAAT 

CCAGAAACATTACAAAAAAAATTGTATACCATGACCAAGTGATTTATTCC^ 
TCATATAATACACCATATTTAATAGAATAAAGGGCAAA 

ATGTCCCTCCAAAGGACATTATCTCATTCCTTTTTATGGCTGCATAATATTCCATGATGTATATGTAGCA 
TCTATCATTGATGGGCATTTAGGTTGATACCATGTCTTTACTATTGTGAATGGTGCT 

5 5 ATAATAGAACAATTTATATTCCTTTGGGTATATACCCAGGAA 

GGAATTCTATACTATCITCCACAATGGTTGAACTAATTTACACTC^ 
AGCATCTGTTATTTTTTGAATTTTTGATAATAGCCATTCrGACTGGTATAA 

CTAATGATCAGTGTGTTGAGCTTTTCAAAATATGATTCTGGGTCACATGTATGTCTTCTTTTGTGAA 
GCCCACTTTTTAATTTTTTTTTCTTGTAAATTTGTTTAAGTTT 

6 0 CAAAACTTTTTTrCCCACTGCGTAGGTTGTCTGTTCACTCTGTTGATAGTTTCT 

ATCCCTTAATTTACTTATTTGTCAATTTTTGTTTTTGTTA 

GTCCTGAATGGTATTGCCTAGGTTTTCTTCTAGGGTTTTTATA^ 

TTTTTGTATATGCTGTAAGGAAGGGGTCCAGTTCCAATTTTCT 

GAATCCTTTCGCTATTGCTTGTTTTTGTCACGTTTGTTGAAGATCACGTAGTTGTAGGAG 

6 5 TCTGTTCCATAAGTCTGTCTGTTTTTGTATCAGTACCATGCTGTTTTGGTTACT 

CATGAGG CCTCCAGCTTTGTTCnTTTTGCTTAGGGTTGGCTTGG CTATTTGGGCTCTTTTTTGGTT CCATGTGAATTnTAAATAG 

TTITTTCTAGTTCTGTGAAGAATGTCAATGGTAGTTTAATGGGAATAG 

TCCACAATACTGATTCTTCCTATCCATGAACACGG 

GTAGTTCTCCTTGTAGATGTCCTTCACTTCCCTTGTTAGCTGTATTCCTAG^ 

7 0 GTTCATGCCTGCTTTGGCTCTCTGCTTGCCTGTTTTTGGTGTATAG^ 

AATTTGCTGAACTTGCTTATCAGCTGAAGAAACTTTTCGGCTGA 
CAGGAATAGTTTCACTTCCTCTCITCCTATTTGAGTACACTTTA 
CCAATACTGTGTTGAATAGGAGTGATGAGAGAGAGCAACCTTGTCTTGTCCT^ 
TTCAGTATAATATTGTCTGTGAGTCTGTCATATATGGCTCTTAT^ 
7 5 TTTTAACATGAAGGGGTATTGAATTTTATCAAAGGCCTTTTCTGCATCT 
TTATGTGATGAGTCACATTTATTGATTTGCA^

GGAT AAACTTTTTG ATGTG CTG CTGGATTCGG TTTG CCAGT ATTTTGTTGAGGATTTTTG CT CAATGTT CATCAAGG AT AATGGCC 
TGAAGTTTTCTTTTTTTGTTATATTTCTGCGAGGTTT^ 
TC LT1 IT CA ATTTriT TGAGTAGTTTCCOT 
5 CCTGGACTTTTT1TGGTTGGTAGGCTGTTC 

TTGGTTTGGTCTTGGGAGGGTGTATGTGTCCAGAAATGTATCCATTTCn^ 

TCTCTGGTGGTTGTTTGTATTTCTGTGAGGTCAGTGGTAATA 

TTCTTCTTTATTAGTCTAACTAAC^GTCTATCTATTTTATTAACrr 

1 0 TTGGTTCT CT AGTT CTTTTAGTTGTGATGTTAGGTTGT^ CTTT CT AGCTTTTTGATGTGGG CATTT AGTG CT AT AA 

ATTTACTCCTTAATACTGCCTTAGCTGTGTCCCAGAGATC 
ATTrCTCCCTTAATTTCATGATTTACCCAAAAGTCCTTCAG 
TTTCTTAJGTCTTGAGTTCTAATTrcATTGTGTTG 

G T ATTTTACTTC CGATTATG TGATCAATTTT AG AGT AAG TX3CCGTGTGGTGATT AG AAGAATGT ATATT CT 
1 5 AGAGTTCTATAGAGATCTATCAGGTCCATATATTCCAGAGCTGAGTTCAGGTCCT 
TCTGTCTAATGGGGTGTTAAAACCTCCCACTATTATTGTGTGGGAGTCT 

TTTGC ATGTGAGATGGATCT CTTG CAG ATAG CATACCAATGGGTCTTGGTiTlTT'AT C CAG CTTG CCACTCTGTGT CTTTT AATTG 
GGGCATTTAGCCCATTTACA.CTTAAGATTAGTATTGTTATGTG 
GACITGTTTATGCGGTTGCTTCATAGTGTCACTGGTCT 
2 0 CATATTTAGTGCTTCCTTCAGGAGTTCTTGTAAGGCAGGTCIWT 

TTATTTCTCCTTCGCITATGAAGCTTAGTTTC^CTGGATATGAAATTCT 

TTTCTCTCTGGCTGCCCTTAACATTTTTTCTTT 

TGGAGTATCTCACTGGGGTTCTCTGCATTTCCTGAATTTGAATGTTTGCT^ 

2 5 TCCTGAAATATCTTTTCCAAATTGTTTCCACTCTCCCCATCTCTTTCAGATACCC 

ATCCCAT A TlTCl T GGAGGflTTTATTCftTTCCr^ 

TTCAAGCTCTGAGATTCTTTCAGAGGAAGAATCTTTCAGAGGAAGACTAA^ 

TTGCATTGTGAAGTTCTTCTAGTGTGTTTTTCAGCTCGATCAGGTC^ 

TCCTGCATTGTTTTATCATGATTTTTAGCTTCCTTGCATTGGGTT 

3 0 CATTCTGAAGTCTAC1TCTTTCATTGCAGTGATCTTAGCCTTACCCCAGTTCCAAACC 

GAAGGAAGAAGGCACTTTGGCrrriTTGAG'r'r'rTTAGTGrri i i ATGCrGAriCrrr^i CATCTTTGTGGGCTTATCTACCTTTGAT 
TTTTGAGGTTGCTGACCTTTGGATGGGGTTTTG^ 

GTGG CTTGCTG AGGGTCCCCTCCAGACTCT AGTCACTTTGGATTTTCC AGT AC CTGGAGGTATCACCAGTGAAGGCTGTGAAACAG 
CAAAAATGGCAACCTGCCCCTTCCCCTGGGAGCTCCATCCCAGGGAGGTATGAACCTATTGCTG^ 

3 5 TGGCCGGAGACCCCATTTGGGAGGTCTCACCCAGTCIAGGAGGAACAGGATTG 

CCGAAGAGAAACTATGCTATGCTGGGCACCACTTGCATCCCCAGTTGGC^ 
TCGCCAAAACAGCAAAGATGGCAGCCTGCCCCTCCCCCGAGGAGCTCTGTCCCAGGAACTCTTCA 
ATCGACAGGAGTGGCTGGAGGCCACAGTTGGGAGGTCCTGCCGAGTGAGAAGGAATGGATTAGGGACCCT 
GGCCATATTTTGGTAAAGCAGCTGTGCC^CGTTGGGAGATCCTTTCA^ 

4 0 GAATGGCTGGGATGCCCAAACAGCAAAGGTGGCAGCCCTCTCCACCATCCA 

GG CT GTAGAAAATGGGCAGAGG TGG CTGAAGGCCIT'GTTTGGG AAG AC CCACC CAGTG AGG AGG AATGGAT AGGAG T C CTGCTT AA 
AAAAACAGTCTGGCCACGTTTTGGCAGAGCAGCTGTGCTGTGCTCGGGGAT 

TCTACAGGCTGGAACAGCTGAGTCGTCCAAACAGCAAAGGTGGTGCCCCTC^CCCCCACCCCAGCCACCCC 
TCTOUGGTAGGTGCTACACTGTTGCCCGTGGCTGGCTGGA^ 

TCACCTTTGCTCGGGTTCCGGAGCCTGAGTATGTAAAGCTCCT 
ATGCAGCTCTGTGTCTAAGACCCAAAGGCCTGGTGGAGTGGGTTCACAGG^ 
AGAAGCATCATTCCCCAGGTTTGCACATTCACTCACCACT^ 
TCCTATGTAGGCCATTGCCCTGCCCTGCTTTCCTCTATCCT 
50 GGATGTTTCAG AT AAAGGTTTTGAAGTAC CCACCCCTTTGTTCCTCTCTGAG AGAGCCATG CCCCATACCTGCTT OTAGTTGGCCA 

TCTTTGAAAATAAAATTAAGAAAACAATTTCATTTACAAGCACA 
TGAAAGACTTGCAGCCTAAAAAGTATAAAGAAATTAAGGAAGACCT 
TTTGTTAGAATGGCAATACTCCCTAAATTGATTTACAGATTCAATGAAAT^ 
TGATAACCTAAAATTCATATGGAAAAGAGAGAGATGCAGATTAGTCAAACAATGTTTAAAAAG 

5 5 AACTCCCAATTTCAAAATTTGCTATAAAGCTACAGTAATCGA 

TAGAATTGATAGTCTAGAAATAAACCArTAC^TTTATGATGAATTGATTTCCAACA 
AATAGTCTTTTCAAO^AATGATGCTGGGAAAACTAGATATCCATATAAAAAATGAAGCTGGAT 
AATTAGCTCAAAATGGATCAGATCTAAATGTTGCAGCTAAAACTATAAAAG 
CTTGAATTA1GCAATGGTTTCTTAGATATGGCACCAAAGACACAATTGACAAAAGA 

6 0 AACTTTGCGCTTCTTTTGCCAGAGATTCAGGTTTTAAAAAACAAAAAG 

GATGATCCATAGTAAGGGAGAAAATATTTAC^ATATGTATCAGATAAGGGATTGTAACCAGAATATATAAAG 

CAAT AATAAAAAAT AAAT AACCCAATTTTTTAAATGGG CAAAGG ATTTG AAT AG AAATTT CT CAAAAAAT AAT ACAAATGGT CAAT 

AAG AGCATGAAAAG ATG CTC AG CAT AGGTAGTCATTAGGAAAACGCAT ATCAAAACTACAAAG AGGC CGGGCATGGTGG CTCATGC 

CTGCAATCCC^CACITTGGGAGGCCGAGGCTGGCAGATCACCrrAGGTT^ 

6 5 CCCGTTTCTACTAAAAATACAAAAATTAGCCAC^CATGGTGGCACGTC 

ATCACTTGAACCCAGGAGGTGGAGACTGCAGTAAGCCAAGATCGTGCCACTGCAC 
TCCAAAAAAAAAAAAGAAAAAAGAAAAAAAACAt^AATGAGATACCACTTTA 
AATAGTAAGTTTrGTTGACAATGTGGAACCCTCACGCATTGTTGATG 
AGTTCGTCAAAATGTTAAATATATAATTACCATATGACTCAACAATTrCACTTC 

7 0 T C CACACAAACACTTGCACAT AAAT ATTT ATAGCAG CATCATTCAT AAT AG CT AAAAG GGAAAACAG C C CAAACAT C CAACTGATG 

ACAGACAAACAAAATGTGGCATATCCATACAATGGAATGTTATTCAGCCATAAAAA 
AGGTAACCTTGAAAACACTAACTGAAAGAAGCCAGATGTGAA 
GAAAATCTATAGAGCCAGAAAGTAGATTTGTGGTTGCCAGGGGCTTTCGATC 
TACAGGGTTCTTGAGAGGGAGTGAGAAAAAATC 
7 5 ATTGAATTCTATACTTTAAAAAGGTAAATTTTATGGGATGTGG^ 
GAGGGGAGAAGAGGGAAAGGGGAGGCAGGGGAGGGGAGGGGAAGGGGGGAGAAAAAAGATACA^ CATATCTCTTCC CTTTT

ACCTGTCATTATGTGTCAGTGCCTCCCATTGGCCT^ 

ACACTGAGCAGAGCAAGGGAAGGATGGGGACACTTCrCTCACACATCTCTGACTCCCC 
TATTCATGCTATAAATATTTGTTGAATCAGTAAATGGCCCCTCATCTAGATTCTC 
5 ATGACCTAATGATTCAGGATCCTATGACCAGCGACTGTCACCCTCTGTCACC1T 

GGGGCCCTG CATGAGTCAGGGAGGGCTGGGGGG AGGAGCAAGGGTGGGAGGGGCAGGGAAGA^ 

AGGGGCTTGAACCACTTCCTCCCCAGACAGGG^TAGTGCGAGGCCGGGCACAG CCTTCCTGTCTGGTTTTACCGCCCAG AGAG CGT 
CATGGAC CTGGGTX5 AGTGAG CCTCTTCATGTG AG AAGG AACAG T ACCAGGGTCXT CCTGTGGAATGGGGG 
TGGAGGATGCGGGTGGAGCGCATAGGGGTGCTTCCTGGAGAAATCAGTT 

GGCCG AGATCATGTCTTT CTAAAAO^CAGGG ACAT AAGAGAGTGGCGGGGCTGGG CTTGGTGGTCTGGCAGAACCTGTTCTCATC 

ATGAGGGTGATAGGTTAGGCCCTCTGGGACCTCTGGGGACAGGTT^ 

CCACTCTCTGATTCCCACCTTTCTCCCCTGCCAAATATTCATATACCTC 

ACTGGGTTG TG AGAATGGTG CGG TG CATTG CAAATGCAAGCT CTGGTTG CTTT CT CT CTG CTGCACATC C CACCTG CTTTGATCTG 
1 5 GCTrTGCTGTTGCCTCTTGOVAGTTAAGGAACGGGCAGGCAACATGGAATGG 

GCCAGGCCTTCAGCACAGCCCCTGCCCCTGAGCAAACCTCACGCACTCTGCCCCGGC^ 

CTCAGGTCTAACCTAGAAGGGGTCAAGGGGAAG CACTGAGGGTGTTCTGAGTGGTT CATCCAGCTCCAG ACGCCCCCTTTATAAAT 
CTCAGCTGTGGAGTCAGACCAGGCTGAGGCTAAGAAGACCAGGT^ 

2 0 TTGATTTCATCTCTTACTTTCCCGCCTGC 

CACCCACTGACTGTGAACTCCTG AAAGG CAGGTTOlGGTGCITTGAGGATGTGAGG AGAAAATGT CTG CGG AATGAATGAATACAC 
CGTATCAAGAACTCCGTAGAGTTTAGTTTTTGAAAATCTCTTCCAAA 
CCCTGCACCCCCTCCACCACCCCCCATCACAATCCCACAGTAC^^ 
CCTCCZAGGAAGGAAACCAAGCAGTGCCCAGTGGTGTGCTACT 

2 5 ACTCCAGGGTGGAGGGAAAGTCCTGAATTTGTAGTGCTTG^ 

AGCCATCTACATGACTGAACAGGGGGTTGGGAAGATGTGCAGAGTGTCTTAC^ 

CTTTG ATGCAGGGAGAGAGGGTAAT CTCTGGTCCAGGAAGAAGGG ATATTCTTGCCACTTCACTCAACTTG AGAG 
TCAAACCCGAAATACTGGTAGGAGGGGAAGCCX5CTGACAAGC^ 

CCTTGGAGGAT CTGTTGTTCTGCTCTT CATCTGC C CAGCAAGG AGCAAG AGGTGGGTGAGG AGGGAGGGTTT CTG TTACACAAA^ 

3 0 GAAAACTCC CAAG CAGGGCTGAG CTGG CCTTTTT CACGG TTTG AATG AGCAG CTCTGAGG CACCCTCCCCAC CTTCGAG CAATGAG 

ACCGAAGATCTTTCATAATTCTCAAAAGCAAAGCAAGGAAAAAATG 
GGTGGC1AAAGGX3GGATGCTGAAGCTAATTCAGGGCCAGTTT 
CrGGGGTCGCTGTTGGAAAAAATCrcrCCTCrTrcc^ 
TCTGGTGGGGAGAGTGACACATATGTCACAAAGCCACTGCAACAGAATAAG^ 

3 5 GGGAATAGTGAACCTC^TCAGAGAGGTATTTAAGGTGGGTTTTCAAC^TGT 

TGGGCCCAGGATGTGGCAGGAATGGCAAACTCCATTGCCATCAGAGGCCGACAA^ 
AGTGGAGGGGAGTAGAGTGACCCAGAGGGTGCATGCCCCACCTAAAGGCATT 
GCAGAGTGTCCTGGGGACTAAGGCTGCAGCCGTGTAGATGGTCAGCTGAGACAACAGTAACAGA 
GGGAGGGTG^GTAGCTGGCCATGCCAAGGAGGAAGATGAGGAGGAAGGACGCAGGCACAG^ 

4 0 CGGCATCCCTAAGGAATTTGAACTTTATCCTGAAAACAATGGG 

ATGAGATTTGTGTTTTGGAAGGAGTTCTGTGGCAGCAAAGAGGCAGCAAAGCCCGTGTCAG 

GGGGCCGGGATGTTGGGGGATGGAAGCAGGAAGCACTG^TGGCCAGTGGGGCATGG 

GCGTCCTAGATGCAGCAGGCTCATCAGAAACACAGCCTGATG 

4 5 AGGGTAGCTCATGAGATGTTATGAGGGAGAGACACATACAGACCCAGTGGCTC 

GTTGGAGGGTTTCAAAGCCTGCTCCCCTGCTGGCCATTGTGTGACTCTGA 
AATGGGGATAATAGTAGCGCCTACCTGATGGGGCTTTTGA 

GAGCTCAGTGAATATGAATTATCTCATTGGATAC^GCAGTTGATAACATAGAAAATGCAAC^ 
GCCTACTGTCTGAAAGTGGATTCCCCAGATCTCTAGACCTCTCACCACCTCCCTG 

5 0 AGAGAGAGAGAGAAAAGTTGTGTTGGCAGGTGAGGAAGTTCAAAGAGGA 

CAACTGTGATTTCCTGGGCAGTCTGGAGGACCTGGAAAAATGTGGGGATGGAA 
TAATTTTCTTTCCTGGATGGGCAGTCTCTCATGTTGG 

ACITCAAG AACTCAGAAAACACCCAACATCTAGGTCAACCACTCCTrAGC^ CCAACTT CATTGGTCTCACTGAAGCT 

GCTGCCCAAGACCAGTTTCTTTATGACTTTGTAACAGACAGGGATGATCT 

5 5 TAC^GAGTTCCCCTAAGTTTCCCTCTGATGTTTCTTGAAAGCCACCTGTTTTGGAC 

CAAAACACCTGGAATGAGAAATCACAGGCACACACGCCAGG 
AGCGTCCGGGAGAAGGGGTTATCAACAGAGTTCAACAGAAACAGAGTTCTTGCT^ 

CGAGACTCGACCTCTAGATGAGTCAGTGGAGGGCGGGTra^ 

6 0 ACTCAGGAGCTAAGGGGTAATTCAGTGAAAAAGGGGAATGAGCGGTGG 

ACAAATGCCCGAGCGCAGGCTGGGAACGAGGGGACAGCGGCTGCCT 

GTGGACAG CGGAGGACTGGACTGCCAGGCCAAGCATCAGGGGCrrCATCCT CAGGGCCGGTT AGAGCCCCTG AGGATTTAGGAGGA 

AGGTGAGGTGGGGACTGCCATCAGAGCTAAGCITTAGGCAGATC 

AAATACCTATTTATTTATGTAGCCCGATGCAGTCTCTCCTTCCCGGJVGGAACT 
6 5 CCGGGGAACACATTAACC^CTGCAACCTGGGTGCCATGCAGGGAGG 

CTGGGCTCTCACTGCTTCCCCAGCCTCCAGGAGAAG^ 

TGCCAAGGGCCTGCCTTCACTTGTGGCATCGCAGTTACTGACT 

GATACGGGAAGAGGGCTACTGCCGCCCTGACTTCTAGGTAA 

CAGCCTCAGGAAGAAATAAGTTCCTGCTGGGCCCC^GGCTCAGCCACAAA^ 
70 GGGTCCCCATCAGCTGCTCAGACCCCAGGAAGCCTCCTGACT 

ACCTGCCCCCCAAGCCTAAGTTAGAAAACAAGTTAAAGGAGAGAAAGTGTG 



CTCCCTTGOAAAAAGGACAGGCATTCGTTTACrTGTCATTCATC 
AGCAC!ACAATTGTGGGGCACCTCCTCAATTCrrGGGCCTGCACAAGATAAAAGG^ 
7 5 TGTGCGCCCAGTGGACCAACAGTCAGTGCTGAATGAATGTCA.TTTCAGT^ 
TCTCCATCGCTGTGGACCAGTAAGGAACCTCATCCCT

AGACACTCAGGCTTGGCTTCATGCACGTGTGGCTGCGCACTCACCTGGATGCCAGGAAAATGCT 

CCAAAGCAGGGCTGGAAACGTGTGCACCTACCCCACGACCTCATAGCTTTGCCGCCT 

CACTCAGACCCACCAACCAAGCTCAGTCTTGAGGGGTGGTCTGGGCT 

CCAAGACAGGGX3GTGGGAGGGGCK3CCCTGCATTCTGATGCTGL , 'rrC , r T T CCAGGACCTGGGAGTCGAGGAGGACAAGAAGGAGGT 
GAGGACAGTGATGCCGGGCAGCAGGCCAGGTGTTCAAAGGCACAATCTGGTTCTGATGTTCT 

GCTTGGGAGG AAAGTT C CCAACAGCGT CTCCCCCTCCACTG CTTT CTTT AAT AACAAAGACTTG TC CCTG CCAAG CAAT AACTTTC 




GGACTTCCTTTACTCCATTTAGGTTAAAACATGGGGGAAAATAACTTACCACCTTCCCTATC 
AGACAGAAATATGGACTCTGAGTTCTTAGAGAGAGAAGAGCTGCAAAAAG 

GAAAATCACCCTGGATCTGGCGATCTTGTTACTATCCTATCGCCCCAAAGTTGTCTCTTTCCAACCTTCACC 
GCAGCC CCCG 'I'i'T r ClT T GGTGAAAT AG AGAC C CAT CCGG C CT CCCTGG CAGGGG AGGGAAG AG AGGATGGGG CCCT AGGGATGGG 
1 5 GAGAGGGGTGGATGAGCACACACTTTATAGGAGGCCTGAACTCGGCAGGAACAAAGACCCT 
TCCAGAAACAGACAGAAAGTGTGCCTCACCAAGTGCCAAAGTTAGAAGAAACTGTAACT 
AGAGGTGGGAGCTCAGAGTAGGGGTAGGGGAGTCCAACTGAGGCCAGAGCTAGAGAG 
GGTCTGGAGGCCAGAGGGGCAGCTTTCCAGAAGCTCAGGCCAGAG^ 




GGGGAGGGGGTGCTCAGATAGGGCAGGATGTGGTGGGGCTGTGGGCTGCAGAGGCAGTGAG^ 
AGGAACAAGGTTCAAGGGGAGAAGGGAAGGCAGGACTGGATGATGCAGAGTGCAC 
AAGGCTGAGACCCCCTCATGTCCAGGGACCCCGGGTCTAAGGCCTTGGGCCGTAACCATGGCTGGG 
CAGGGCTCCCAGGCTGCAGCCATGCCCAGGAAAGGGAAGAGGGGTTGCCACTGACTTTTCGTCT 

2 5 AAGG CTCCTGGACATCATTTTGTCCTCAC CATGT^CCCAAGCAGGACAGTGATGAATCCAGAACTTGGGTGTGAAAAAGAGATAGA 

GACAAGATGCGGTGAATCCCTCCAGCATAAATGGATAGAAACCCACC^TCGCTGAGCTGCTGAA 

ATGAGCCTCGTGAGGCAGGTCCTCCAGCCCAGCTCCCTGCCCTGCACAAACCCGTCCATCGCTGTGCGCAGAGGCAGGG 

TGGACCCCCTCCTITCATTGATAAAATCACAGCAGGCTGCTGAACCTTCACCCGACCCTGGA 

TGAGTCCTGCTGATTGGGATCTGCCGGAAGAGATGCGAGGAAGAGAGAGG 

3 0 AGAGTGGGAGATGAGAGCTGCGCTTCCTTCCAGGGACACAGCTGCTGAGG 

CACGGAGTTTATGGCACTTGGGCTCACGTTGCCTGAGCTCCTGCTGGGTGCTAAGAGGGGACCCA 

ACTCCAGGCCCTCCTCAACCTTGACCCCCAGGTGGATTTTCCCCCT 
AAGTTCTAAAGGAGGGTCAAGAAGAAAGACACATGTTCCCTCTCrCACG^ 

3 5 GAAATGGTG CTGGAG C CTGGG CTAGGT AC CT CACAG CAACCATGGG AAG CAAAT AG G ATTCAT CT C CATTTCACAG AAGAG AAAAC 

TGAGGCTCTGAGAGAGAGAGAGAGAGACAGAAAGAGTTTAACCAGCTCAAGGCTCATCTGGCCCCA 
CCCCTGTCAGCACATGTAGAGTCTCTCAGACTTGG 

ACCTTAGCCTACTGATTCAGGATAGACATGTTCACAGGATTAATCAGGTGATGCGTATTGG 

T AAAACCAAAG AATCAC CTGGAGACTTT AGG AAAACACAG ATT CCT AG AC C CATCACTTG AG ATT CTGGTT CAGAATTCT CAGGTG 

4 0 GGG C CTGAG AATTTG CATT C CT AACAAG TT CCCAGGTGATG CTATGATACTGATG ATGCGGACCTCACGATGAAACCACT ATTCCT 

CCCGCCTGGGCAACATGGCAGAATCCCATCTCTACTAAAAATACAAAAATTCG 
TACTCAG^AGGCTGAAGTGGAAGGATCACCTGAGCCTGGAGAGGCCGAGGCT 
CTGGGCAACAGAGTGAGACCATGTCTCAAGAAAAAAAAAAAAGAAAGAA 
CCACCCAGCTAAACTGGCCTGTTTTCCCTCATTTCCTTCCCCGAAG^ 

4 5 GAGACAACACCACAGTCGACTACACTTTGTTCGAGTCT 

ATCATGTACTCCATCATTTGTTTCGTGGGCCTACTGGGCAATGGGCTGGTCGTGTTG 

CATGACCGATACCTACCTGCTCAACCTGGCGGTGGCAG 

AGTCCTGGGTCTTCGGTGTCCACTTTTGCAAGCT 

TGCATCAGCATTGACCGCTACGTGGCCATCGTCCAGGCTGTCTCAGCTCACCGCCACCGTG 

5 0 GTCCTGTGTGGGCATCTGGATACTAGCCACAGTGCTCTCCATCCCA 

AAGCGATGCGATGCTCTCTCATCACAGAGCATGTGGAGGCCTTTATCACCATCCAGGTGGC 

CCCCTGCTGGCCATGAGCTTCTGTTACCTTGTCATCATCCGCACCCTGCTCCAGGC^ 

GGTGATCATCGCTGTGGTCCTGGTCTTCATAGTCTTCCAG 

ACATCACCAGT AG CACCTGTG AGCTCAGT AAG CAACT CAACATCG C CT ACG ACG TCACCT ACAG CCTGG C CTG CGTCCG CTGCTGC 

5 5 GTCAACCCTTTCTTGTACGCCTTCATCGGCGTCAAGTTCCGCAACGATCT 

GGAGCAGCTCCGGCAGTGGTCTTCCTGTCGGCACATCCGGCGCT 

CATAGGCGACTCTTCTGCCTGGACTAGAGGGACCTCTCCCAGGGTCCCTGGGGTGGGG^ 

ATCCCCCCGCCAAAAGCTGCTCAGGGAAAAGCAGCTCT CCCCTCAGAGTGCA CCCCTGCTC CAGAAGATAGCTTCACCC CAATC 
CCAGCrTACCTCAACCAATGCCAAAAAAAGACAGGGCTGATAAGCTAACACCAGACAGACAAC^ 

6 0 CCTAAACCAAAAACTGAAAGTGAAAGTCCAGAAACTGTTCC 

TGGGAGTGGCCTGAAGAGTCCTCTGAATGAACCTTCTGGCCTCCCACAGACT CAAAT 
CTTATCTCCAAGACGAGAGATAGTGGGGAGACTTCTTGGCTTGG 
CCCCTCCCTCCATCGTTTTCTTCACTGTCCTCCAAGCC!AGC^ 
CACTTGCCGCGTCGCCCTCCCAGGCTCTCAACAGGGGAGAGTGTC 

6 5 AGCCACACTCTGGGCTCCAGAGTGGGGATGACATGCACTCAGCT 

AGGGGCGGGGAGGGTGACAGTGGCCGCCCAAGGCCCACGAGCTTGTTCTT^ 
GTTCTGCTTTCGATTCGTTAAGAGAGCAACATTTTACCCACACACAGATAAAGTTTT 
AAAGAAAAAAAAAGTCTTTGGTAAATGGCAAATGAGCGTGTCrTTGTTTTGCAA 
CCTCCTCCCTGCCCTCTCCCAGGCCTCGTTTCTTAGAGACGCT 

7 0 GACAC!AGGACCAAGAGGCTGAAAGGTAGAAGGAAGATCAGGACAGGAG 

GG C CCCAAG C CTG CACAGT CAGCACTGAACCGG ACATG CTG ATC CT ACTGCAGCTTTGGGAAGG CCAACTITTTCCGGGT AACCTCG 

ATACCTGGGGACTCTGCATTTAGC!AGCrCTTGCACAGT^ 

TTTGTCAGCCCCTTAGACTCTGAAAGCTGGGGTTTGGAGGGAGGGCCTTC 

GTCAGGGATGAGACCCTTTGCTGGCATGAGAGCTGGAAC CTCTGCT AGCCACTGTAG CAGCAGTGGCCTCCGT 

7 5 GGTGTATGTCATAGTCCCTTCCTGCTGGTACAACAAAATGCCACCGATGGAGTAATT^ 
TTTTAGAGGATTGGAAGTCCAAGATCAAGA

CCATGCCCTCTGGAGGGGACAAACACTGTGTCCTCATGTGGTAGA^ 
TCTTTTATTGCTATAAGGGCATTAATCCCGTTCACAAGGGTGGAGCCTTCATGACCT 
CTCATACCATTGCCTTGGGGTTTATGTTCCAACCATATGAATGTC^^ 
5 GTGGAGGAGCAGGGAGAGAGGGACAGACAAAGGCCCCCACACAGGCCAG 
CCGTGACTTCTGCAAGCCRGAGAGCAGGTCAGTCCAACCTAC^ 
TGGGCCGTGGGGAGCTAGAGGGAGGGCTGAGGCACTCACGCA^ 

AAAATGGAGCTGCCTGGTAATGACACCCATTCGCTCATCTCGCAAGCACGTGACATTA 

AAGGTTTTTAACAAGATTTGTCAGCTCGTCTCCAAGCCX5GGAA 
1 0 GTGGAGCGGCGTTTGCAGCAATGGCATTAAATATTAAAA 

CAGTGTGCGTGTGTGTCCCCGAGGGGAACCTGCCGGAGCGCATGTGCGTGTGGAGCCCGCT 

AGATGGGAAAACAACCCTCGGTTCCTGAAGGC^AAGAATGTAAC^ 

CGCCCCCTCAGCTGCAGATACATTITTCTAGAATAACTGGACATCTCCAA^ 

ATTCCCTCGAACTGCTGTGTCTCAGAAGTTCTGACCTCGGTC 
1 5 GCCCATCTTGAGCTTGGGGTGACCCCTGGGAAGAAGGAAAAGGCCCCCTGG 

CrGCTTCATACATCTCAACCCTCTTCCCAGGTTCACAGCCAATA^ 

CAGGCCCTCATCTCCXrrCATCCTTGCTXKKrTGCAACCCCAAG^ 

AGAAGCACCACACCTTTCTCATCACAGACTCACAGAGCTCCAAGGGACCTCA 

AGATGGTGAAACTGAGGCCCAGCGGGGAGAAGCTGACTTGCCTGAAGA 
2 0 GAAGCCTGAGCGTCTGGCAGGGTTGCAGGAGGGTGTGGGGGAGAGCAGAGACTCT 

CTGCTTCeCAGCTATGTGACCITCAGCTGTTGCCCAA 

TCCACTGC CTGGG CTCGTT ACTT AG CCTCCCTGT AC CTCAGTTT CTTAATCTGT AT AATGGGGA 
GGATTGTTGTGGAAGGCAACGAATCCATATATGTAAAATGCTTAGAAAAACTTACATATG 

2 5 TT AAGAAACGGT AACTTAAAAAAAAAAG TTG CT AAC CACAAGG AAAC C AAATTCAATTC CTT AG CC CATTTC CATCCC CACAGAAT 

TTATAATTCAAGCTGGAAGGCAATTTAGAG^ 
GGGAAGCAACTAGCCTGAGGTCTCTTGGGAGGTCTTTC 
TCCCACCAGACCCTGCTGGGCTCA.CGGGAACATCCCTGCACATCCACC 
CGGCAGGCACCGTTCATCTTGTCCTTTGATTAGGTCCTTAAGGAGAACCT 

3 0 TGAGTCTGG AGAATG AATGAAAT AT AAAAGAAATATGG CAG CGGTCC CAAAGG AAATCAGG CAACTGATCTTTTT AAG AAAGGAAG 

ATTGTGGGGGGAATTAAGGCAAAGGAGATGGGGGAAAGGCAGGG^ 

GG CGGGACTTG AAGCAAATCTTCAGTGGGT AG TTAG ATGTC C CATC CTG CATGCACACACATGCATGTGCACACACACACACACAC 
ACACACACACACACACACACACACATAGAGCCTCCCCAGGTCTCCCAATCTC^ 

3 5 ACAGGTGGGCCCATGGCCTTCITTGCAGAGAAGATGGTT 

GAGAATAAAAAAGGTTGGGGGGGGGCCTCrCCCTCAACTCAAAAGAGCAACCCT 
CTCCAACCCACX»GGAGCAGCTGTCTGAGAAGTCCCyVACT 
CAGAGACACAGACAGGAGTCTGTGTGCAGAAAAAGCTCCCCA^ 
CTGATCTCGTGCGAAATCAGAAACGCCAGGATGTGTATTCCCCACGGGCTCCTC 

4 0 CCTCTGCCGCTGTC!AGATTCTCTCCTGCCCATATTCACTTTGAG^ 

GGG ACTT CG CTAGCTGGGCTGT CCT ACGTGG T ACAGTAAC CACCACAT AGTGTGG C CT CCAGGTTCTCAT CTCTGGGT AAATAAAC 
ATCTCCGCACAG^CTTGGCCAGCTCTGGACTGGCTCAGTTC 

TTGCTCCAAGTAAATAATATCTGACATTGGGAAGCACAG 

4 5 AAGAGGCCTGGAAAGGAC!AAGCTCATTGATCTCATTTGGTTTATGAAACT 

TAAAGACTG AATGG CTCAATCTGCTTCCCAGGCCCAGAGGGTCTTTTCTGTGGCCTGG CCCAGGCCTATGGTG AAGCCGGTGCTAG 

GGGGTCA.CCTATGTGGGAGCCAGCTCTGCCTCCTC 

TTGCTTGCACTAGTATCTCirACCTGCCACCTGCCCTCCTCCA 

GACAACCTCTGGACTTGGAAATCAGGAGGAAGGGAATGGGAAGAATGGTGGC^ 

5 0 TCTGCLTTTGTT CT CCAGGGCAAGACACAGGG CT AG ACACG CCGG AAT C CTT AAATCCTTTGT CAAAAGCAG AACTGGGATTTTCG 

TCCAAGGACTACACCTGTCCTGTCCTCAGCCTCCTTCTCA^ 
ACCATGGACCGAAAGACCTTGTGAGCTTTTGTTGCTC^AGTGTCAG^ 
GAAGGGATGGCAGCGATCACAGAGGCTGGGGGGCAGAGGAAAGAGACGTTGA^ 
AGGCTAACATTTACTAAGTGCCCGTGATGCACCAAGGACrrAGGCTAGCT 

5 5 GGAAACAGGTTTAGAGAGCTCATGTAACTTACCTGTGGTCrACACA^ 

TATTAAGCATCAGTATCTTATCTCATTTAATCCCCACAACAACTCTACT 
AGAGGTTAAGTAACTTTATCAATGTCACACAGCTACTAAGTC 

CCCTACTTCACCGTTCAACTAAACAGCTGTTTAATACCAAATATTTCTGGAGCCCCT 
GCCAACATCTTTCAAACCCTGCCAGACACAGAGCTCTGACCTGGGAGAAGGGAG 

6 0 CAGGGGCTGTGACCCCCACAGCTGGGCAATAACCAGGTCTTTGCTGCTC 

GCTTCTTACCTAGGCTTCCTCCATGGGCTAGCCCTGCAGATGACTGAGAGGGCTGAGAT^ 

AAT ATGGGCAGGT CAC CTCCCTTTTCTGGGCCTCCAG CT CCCCAGCTGT ACAATGAGG TTG CTGGAC CAAAT CATTTCTAC CCATC 
GCCTCCAGCTCTGACTTGCTATCC^TGCCTTGTCTGGGCAC^ 

GATG CTTGCACGAGGTGCIAGGGCAAGTCTGTGAGGGG CTG AGGAA CCACCCCCAGCTGTGGTTTCTGGTGGGGCCACCAT 

6 5 GGCAGCAC^TGGAGGGAAGTGTTTGTCACCCTTCCAACAAAGGCAGTCAAACAA 

TGGTGAGGAAAGGGTTCAGATTTCX3TGCCCAAGAACACT 

ATGCTCAATATTCACACAAGAAACACCACCGCACACATCACAC^ 

TACACATACAAATAGAAAACTTGCCCCCATACAAGACCAGCTTCATGGGCCGCC^CCTG 

7 0 GCCCTGCACTAAGGCCCATGCTGGGTATAATGCTTTGCTGCCTGCACTT^ 

ACCTTTTAACACTTTTACACTGTTGATGGGAGTCCAAArr 
GAACCAGAAATACCATTTAACCCAGCAATCTCATTACTGGGTATATACCCA 
CACACGTATATTTATTGCAGCACTATTTAC^TAGTAAAG^ 
7 5 AAATG TGG CACAT AT A(^CCATGGAATACTATGCAG CCACAAAAAGGAATGAGATCGTG CTG 
GAAACCATCATTCTCAGCAAAATAACACAGGAAC^

aacacatggacacagggaggggaacatcacacactggggcctgtcag 
aatacctaatgcactlagggacttaaaacttagatgataggtl^ata 
aaacctgca.cattccgcacatgtatcctggaccttaaagtaaaattaaaaaaaaaa;^ 
5 cgctacacacatacacacagatgcacaaacacatgcatgccacaaatgcaacacccacac^ 
gcgcciaaagcacacacattgatacccagaaggggt^ 
ctgacacaattcaattgacttaacattgggggcagcagaacaatc 

gggaggtgggcgcatggacggaaaggggacagggatggaggtggagagggagagagagagg^ 
1 0 gggcaggaactgagtccccaggtgagctggcccc^cagtggtctcataagagctcagaagg 

aatgcccctgtgctccccgcaagcacagggcggtcccgccctcag^ 

cagccttgggcacgccgtgtaactccgggcctcagcttcctcctttgtgg^ 

gcacacacagacacaac^cctccrggattcctgcagc^tcaacagctcgtctc^ 

ggaactccagaccccctcattacagcagacatttggaac<5cgtttatttcccatc^ 
1 5 aggtacaagca.cagaggtggatgtaattcatggcctcatggccaccatca 

ccccaccccctacccaaaccaagcaggactccaaggcctggaaatttttcttcctcctcta 

ttttgagacggagtttcactcttgttgcccaggct^ 

ccattatcctggctcagcctcctgagtagctgggactacaggcacccaccaccatgcc^ 
acagggtttcaccgcgttagccaggatggtctcgatctcctgac 
2 0 caggtgtgagccaccgcgcctggcctcctcctctattctttacccccaacctgta 

gcaacctctgcctcccaggtt caagtgactctcctgcct cagcctcccg agt agctgggattacaggcatgcgccatcacgcccag 

ctartctttgct ttt t tt tt tt t ctt tt ttt t g acagagtct 

actgcaagctccgccacccgggttcacxcc1attctcctgc 

2 5 CAGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACTGTGTTAGCCAGGCTGGTC^ 

TCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATCCCTATTTAAAATTGTCCCAGCCTCTG 
ATTCTGTTAGGTGATACCAGATTCrTAAAGAGTGATTCGGGAAAAACGAATGGGCCCTAAG^ 
ATTACAGTCACCCACAGGAGAGAGGAAAGGTCGTCCAAGTGGCCACCAACAAG 
TGTTTGAGACGGAGTCTCACCCCGTCACCCAGGCTGGAGTGC^ 

3 0 AAGTGATTCTCTTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGCACG^ 

ATACAGGGTTTCACCATGTTGCCCAGG CTGCTGT CAAACTCCTGACCTCATG ATCCGCCCACCTCAGCCTCCCAAGGTGCTGAGAT 
TACAGGCGTGAGCCACTGCGCCCGGCCTATTTCTTCAGATTTCTTATAGGACATA^ 
CAACTTACTGAGAGGGAAACTGAGTCAAGGAAGAGTGAAGTGACTTGGGTGAGCAGCAGAGT^ 
GCAGTCTCCATGAGACCAGATC 


HUMAN SEQUENCE - mRNA 

GTG AGACAGGGGT AGTGCGAGGCCGGG CACAG CCTTCCTGTGTGGTTTT ACCGCCCAGAGAGCGTCATGG ACCTGGGG AAACCAAT 

GAAAAGCGTGCTGGTGGTGGCTCTCCTTGTCATTTTCCAGGTATGCCTGTGTCAAGAT 

ACACCACAGTGGACTACACTTTGTTCGAGTCTTTGTG 

TACTCCATCATTTGTTTCGTGGGCCTACTGGGCAATGGGCT

CGATACCTACCTGCTCAACCTGGCGGTGGCAGA^TCCTCTTCCTCCTGACCCTTCCCTTC 
GGGTCTTCGGTGTCCACTTTTGCAAGCTCATCTTTGCCATCTACAAGATGAGCT^ 

AGCATTGACCGCTACGTGGCCATCGTCCAGGCTGTCTCAGCTCACCG C CACCGTGCCCX3CGTCCTTCTCATCAGCAAGCTGTCCTG 
TGTGGGCATCTGGATACTAGCCACAGTGCTCTCCATCCCAGAGCTCCTGTACAGTGACCTCCAGAG^ 

TGCGATGCTCTCTCATCACAGAGCATGTGGAGGCCTTTAT^

CTGGCCATGAGCTTCTCTTACCTTGTCATCATCCGCACCCTGCTCC^ 

CCAGTAGCACCTGTGAGCTCAGTAAGCAACTCAACATCGCCTACGACGTCACCT 

C CTTT C 'TTGT ACGCCTTCATCGGCGTCAAGTTCCGCAACGATCT CTTCIAAGCrCTTCAAGGACCTGG<5CTGCCTCAGCCAGGAGCA 

GCTCCGGCAGTGGTCTTCCTGTCGGCACATCCGGCGCTCCTCCATGAGTGTGGAGGCC^

CGACTCTTCTGCCTGGACTAGAGGGACCTCTCCCAGGGTCCCTGGGGTGGGGATAGGGLAG 

CCGCCAAAAGCTCCTCAGGGAAAAGCAGCTCTCCCCTCA 

CCTCAACCAATGCCGAAAAAGACAGGGCTGATAAGCTAACACCAGAGA^ 

AAAAACTGAAAGTGAAAGTCCAGAAACTGTTCCCACCTGCTGGAGTGAAGGGGCCAAGGAGGGT 

GCCTGAAGAGTCCTCTGAATGAACCTTCTGGCCTCCCACAGACTCAAATGCTCAGACCAGCTCT

CAAGACCAGAGATAGTGGGGAGACTTCITGGCTTGGTG^ 

TCCATCGTTTTCTTCACTGTCCTCCAAGCCAGCGGGAAT^ 

GCGTCGCCCTCCCAGGCTCTCAACAGK3GGAGAGTGTGGTGT^ 

TCTGGG CTCCAGAGTGGGGATGACATGCACTCAGCT CTTGGCTCCACTGGGATGGGAGGAGAGGACAAGGG AAATGTCAGGGGCGG 

GGAGGGTGACAGTGGCCGCCCAAGGCCACGAGCTTGTTCTTTGTTCTTTC

TCXjATTCGTTAAGAGAGCAACATTTTACCCACACACAGATAAAGTTTTCCCTTG^ 

HUMAN SEQUENCE - CODING 

ATGGACCTGGGGAAACCAATGAAAAGCGTGCTGGTGGTGGCTCTCC 

GGACG ATT ACAT CGGAGACAACAC CACAG TGG ACT AC ACTTTGTTCGAGT CTTTGTG CT C CAAG AAGG ACG TG CGG AACTTT AAAG

CCTGGTTCCT CCCTATCATGTACTCCATCATTTGTTTCGTGGGCCTACTGGG CAATGGGCTGGTCGTGTTGACCTATATCT ATTTC 
AAGAGGCTCAAGACCATGACCGATACCTACCTGCTCAACCTG^CGGTGGCAGACATCCT 
CTACAGCGCGCCCAAGTCCTGGCTCTTCGGTGTCCACTTTT 
TGCTCCTACTTCTTTGCATCAGCATTGACCGCTACGTGGCCATCGTCCAG^ 
CTCATCAGCAAGCTGTCCTGTGTGGGCATCTGGATACTAGCCACAGTGCTCTCCA
GAGCAGCAGTGAGCAAGCGATGCGATGCTCTCTCATCACAGAGCATGT^^ 
TCGGCTrTCTGGTCCCCCTGCTC^CCATGAGCrTCT 

AACAAG^CCATCAAGGTGATCATCGCTGTGGTCGTGGTCTTCATAGTCTTCCAGCT 
GGTGGCCAACTTC^CATCACCMTAGCACCTGTGAGCrCAGTAAG 

GCGTCCGCTGCTGCGTCAACCCrrTCTTGTACGCOT
GGCTGCCTCAGCCAGGAGCAGCTCCGGCAGTGGTCTTCCTGTCG
CACCACCTTCTCCCCATAG 
Table 7

MOUSE NOMENCLATURE 
ICSGNM Ccnd1
Celera mCG21251 


HUMAN NOMENCLATURE 
HGNC CCND1 
Celera hCG27985 



MOUSE SEQUENCE - GENOMIC

CACGGGATCCAAAACCCTCTCCTCACTTCTC 

TGGC1TTCTGATCCTACGGCTTCTAATTTAGCACTAACACTTA 

GCCITTCAGGCTTCTCCTTGCTTTTCAC^ 

ATGTGGCCITGGGTTCCATCCCCAGAACTGAGTGAATGAGGCAT^ 
1 5 GTAGAATGAAGAGTTCCAGTTTC^AGGTACACCAAACCTTGTACCAGAAAGAAG 
AGCAAAAGATGGAGGAAGCRTACAGTGACTTTGAAGATGTGTTAACTTTCCCT 
TAAAGAACAGAGGTTATGTGAACTCACAGTTTC^GGATTTTA^ 

GATGAGAGCGAAATGCTGAGCAAACAGAACTGAACTGCCCTGTACTACATGGCAAATCAGA^ 
TGAGGCCACCTTC^GTAACCTAAGGACTGCCAGGCTGTGCTGTTTGTTTGTTCT 
2 0 TTGGTTTTGTTGGTGGTAGGTTTTGTTCATTTGCTTGTTT 

TCTACCACCTCCCAATAAAGCCACCCTGAGAATGAAGCCITGGAC^CCAGA 
GACCAGAAGGCTCTGCCAAATTCACAAGCCACTGGCCCAT 

2 5 TCTATGCATATCTGTGTGTGTGTTGGGGGTGCTAACCK3TGTCTGTGCATGCATGCACAGT 

CGGCC^CTGTAGGCCATGTTACTGTTCGCATGGGACAGGCCTTAAAGAAGAGCTGCAAAAGAGCCTTGTACCCA 
AGACCCACCCCAGTCTGCTTTCTCAGTTAATCT CTTCCTCT CTGGGAATTTGCTGTTGAGTAGATTATA 
GGCTTGGCCTCAGCTGGTCCACTAAAAGCAGGAGGTGAGGGCCCAGGCCTGAGGA^ 
TTGTTGGAGACACATTCTCCAGGACCACAAATAGCTGACATGCT 

3 0 AGAATCAGGTGTTCAAGGCCAGCCTCTCAAAAAACAAAACTGAAGTAAGTACATCAACT 

GGGCCTGGAGAGATGACTCAGCAGTTAAGAGCACTGGCTGT^ 
CTCACAACTGTCCATGACTTCAGTTCCATAATTCCAGGGGATTTGTCATGT^ 
ACATAAAAAAAAATAAATCATAAAACAAACAACAAAGAGTTAAA^ 
ITrTL'fTTTCCCCAATTTAAAAG'rTTCTTCTGAGGTTTACATCTCACACCC^ 

3 5 TGATTCATCATGAAGTCATTTCAAAATTTCTCTTTCTGAATATAAACC^ 

TTACAGCCTTCAAATTCACTCTGCTTAACAGTAAACT 

AAATGTC CT ATG CACA1T AC CTGAATTACTTCTGGC CAT GGCCTGG CAG AG CC C AGCAG AG ATGAGGG CAT 

CTTTGCCCACAGCCTCTCCAATGCCAGAGAGCACAGCTCTGTACGTTGCCTCCT 

ATCCTGAAAAGAAGTG^CTTGCCTTAAAGTGACTTTTCTTTTTC 

4 0 GGTATGTGCTACCTCATCTAGTTCTGGTAGTACGTGGATTTTCTCTCTCT 

G TGT GTGAG AGAGAGAG AG AGAG AG AGAGAG AG AGAAAGAGAG AG AG TG AG AGAGAG AG TGTTTG TG T AC CrGTTCTTTCTTG TGT 
ATGTGTGTACCrGCT C TTTCT T GTCTGCACATATGTGCACCTGTTCr^ 

TATGGATTGTTAAGCTGCTAACCTCAGCACACTGGTTTTCTGCCCTGT 

4 5 AAGGTGGGGTCTCCTCCTCCATCCCACCTGAGAGGGAATGATTTCCTTAGATAGGCT 

CACTGGGGGCACACAGGAGCCATGGAGCCCCTAAGGCCCACCAGCAGCTTCT 
AAC^GAGGGGTTGAAAGATGGCCCAGAAGAGGTCrCCrCAAATCTCTGACCCTTACAG^ 

TAGGC CT ATC CTGG CTTCATTT CTT CTGCTGTGAT AAAATGT CATTGG GGG AAAAAAAAAAAAGCAG CTT AGGG AGAAAGGG CTTA 
TTCTGGCTCXWAACTCCAGX3TTTCCAGTCCATTACTACATGAAAGGCAG 

5 0 CTCCCAGAGCAGTGAGATATGAATCCAGGGGTGGGGACTGGAATGATGGCTC^ 

TGCCGATTAAAACTGTCTGTAACTCCAGTTCCAGGGGATCTGAC^ 
AACACCAATGTACTTAAAATAAAAATAAATAAATTATAATAAATAAATAG^ 

ACCCATAGGGCTTCTTGAGACTCTCTAGCAACGTGAATCGAGGTTGTGTCAAGT^ 

5 5 TCCCTGATGAACAGGATGTG^CAGAGACAGGCATCCCATTGGTTAATGT 

TTAACITATATTTATTTGCTTGTTTGTTTATTAGTGTTTGTATCTATGTC 

ATGTGTAGAATCCCTCTAGAGCTGGAGTAGGGGGCTGCCGGGTATGGGTGCTGGAAATCAAACCCGGTCCTC 

AGCACTCTGACCTGTTAAATCATCTACCTAACCTCATCCTTGCrCATGTT^ 

TCGCCITTAATCCCAGCACTTGGGAGGCATTGGCAaSAGGATTTCT 

6 0 AGGG CT AT ACAGAJGAAACC CTG TCTCAAAG AAC CAAAAAT AAATG AAAT AATT ATAG TGGGTTTTTGG ACATT CT CTG T ATG CT AA 

GCACCTTCCAGATGGGCTTCCTGACAAGAGATCCGTGGCCTGACCCACACTC^TTTGAAATAATGATGA 

GCAAAGG CTGTAGTT CCTCTCTGG CTTCTACCACAGTCAGTAACCAGTCCAGTTCCTTAGAG A C 
CAGCGGAGCCAAGACAGTATAGTG CTCCTTATAAACATGGAAGG^CAGTTCCTAC CCAGCGAGTCACTGCGGG CTTGGTC 
CTGAAGCGCAGGCTCAACCACCGTTCAC CGCGGGGAGCGTCCTCAGGCTCTCGCG AGCCACTGCTGCGCGTCG CGTCCAAGGTTT A 

6 5 c<K3TAAGCGCGGCGCTCAGGACGACCACGTGGCGGCGAAACACCGGCCGGCG 

CTCCCCAAGCCGT<3ACCCCGGAGTTCCCG^CGCGCX3TTTCT 
CACCAAGCACCACAGGGAGCAGGGGCCCCATGGAG^^CTCAGCACGTGAGGCT 
GCCCTCCCTGTTTGGGCTCCATCACAGATCTGGAACCCGCTTAGTCCCCATT 
ATGCCTGGTTTTGCAGGGAAGCAGGTO^GGGCTCTGCGCCCTGACCGCCCT 

7 0 AAGACCCCAGTTCCAGCAGCTGCGCCCTGGGATCGCATGAGGGTTCGCTCACC^ 

GTGGCTCACCTTGGCCCGAGTGCCCGCTCCCCACCATTTAG 
TTCAGAGCACCT<rrGATTCGTTCAGAGAGCTGGAGACCGA^ 
GGAGCCAGGCTTGCCGCTCCAGAGCTCATTGCACGGCGCGCGTCT 

GGTAAGGCACGCTTGGGAAGATAGGATGCGACGGTCCCAC^TGTGCACAGGC^ATGAGCT 
GCCTGAAATCCGCTCAGGGTAACCTAGGCGGAGCAGCCGTGTAGCACGTGGGCTGCCACGCGCGCCCCAAAACG
GGAGAGGGAAGCCGTAATGCCTCCGGAACCTTGGGGTCCATATTCGGAAGTGTTTTCT 
TGGCTAGCTGGAGCAGAAACAGCCAAGTCCITTCAAGTTC 
CTTCTCCAGGAGGGGCTACAGCCAGAGGCTCAGTATTGCCGCCCAGCCCCGGAC 
5 GTTCCTCAATGAACGCGCTCCCTCCCACTTCTCAATGAAGTTCCCACAGCCA 

CTCTTTGGAACCTTGGCTCCAGTCAGGTGGGGGTAGGGGTAGGGTTCGAGGAGGCAGG^ 
GAGATTTTCTTTCCAAATAACGTGGTTCCCTCTCGGCTGGGCAAGTGG 
AACACTTTAAGCCCAGAGCGGACAGTTCCTGCCACGAGATTTTTTT^ 
GTTTGTGTTTAAGTTACTATTTCGCTGGAAAATAAAAATCC^ 
1 0 TCTGAATTGCAGCCCACCTCCTGGGCTCGGGATTCGGACAAG CAGGGTTTCC3VGGATGGAGGATGGG CGCTCCCCGCCTGGGTCCCA 

AATTCCTGCGCCTCCCCTACGGGTTCCrGGACGGCrCTACACGCCCGCCA^ 

CCTGGTCTTTCCCGCGGTGGCGGl^TGGGGGGTGGCTGCGGGTOOT 
GCTGCCCTTGGTGGCGCACCCTGGCTAGGCTAGCCTGCrrCAGTTCGCTVG^ 
1 5 TCATGCTTACTGCCGCCCGGTAGGGATTTTATG^ 

GCGGGGCCTGTGGCCAGAGCAAGCGTGGGCACTCTGCCCGGGCCACCCACCACCriTCGG 

GGTCCGGACTTTAGGGACCCCAATGTCGGGGGAGGGGGAGCGGAAGGCATGAGCTCGGA^ 

GCCTGGACGCCCTGCGCTCCTTTACCAGTTTCTTCTG 

TGAAGTCGGG TATTGTG AAAA.CGTTTC CAAAACTAGGG AACT AAATAATGG ACG AAG^ CT AAAAAT AT 
2 0 TTTATTTGAATCTATAAATTATAAACTTAGGAGTCCATTCAGTTAAC 

CCTCAACGAAGCCAATCAAGAAGCTTCCGGTGGTCTGGT^CCTGGAAGGGCGACTAATAAC^ 

AAAAAATAAATTAGGAAGGAGCCTATCGTGTCTCAACCTTTTCTTCAACXKST^ 

AGGGGGATACTGGGGGACCCTGGCCAGGATAAACCGGTCACTGTATGAAGACAAATCTCAGATCCC^ 

2 5 GGAATAGATGAAATAATGGCCACCATCTTGAGCTGTTC 

CTGGGGATCCTTTAAAGTTCAGATACCCCTCTGGCCCTTTGCAACCACCCCAG 

GCTCGTCTGGCATCTTCGGGTGTT ACACAGTT CCTGAATTTTACACGTG^ CTGGGATTT 

CTAAGCAATGGGTCCGCCTGTGGGTGCCTCGTGGCGT^ 

CCCCGAGGCTCCCCTTCCATAC^TTCTTCCTTGGCTTGGGTC 

3 0 CTACCCCACCTCCACAGGTCTCGGAGGACCCTCTTAGGGAAAGAAGCCCCCCT 

AAGCAGTCCGAGCGATTTGCATATCTACGAAGGCTGAGGGGG 
NNNNNNNNNNNNNNNNNNNNIWNNNNNN^ 
NNNNNNNNNNNHNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

3 5 NNNNNiraNNNNNTOWNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNTO 
40 NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTTTATTG^ 

AGTAGGGGCGCTGGGACAGGCACGAGGGGCCTCGTGGCGGCCGGGAATCACTAAATAA 
GATTCTTTGTTCAAACAGCGGACCTGGGTACCCCX;CA 

4 5 CCTAGTCAATGAGACTGACTGTTGTGCGTGCGGTTCTGGATTTGGAGGAACCTC 

AGCACATTTTCAGATGTTCAGCAGGAGACTCAGGGGTCCCAAAATATTTTAAAATA^ 
GAGGCAAACAGCGCCCGCACCCTGCAAAAGGGGGGCCCCCCGGAGTTTGAGTTGCTGGA 
CTGACCCTCGGCTTGGATCAAGGGTCGTCCGATGGTTATTTTC^GGTCGTTTTTAGG 
TA1TTTTTGAAAAGATGACGTCTGGGAAATGCAGCGCGGCGGCCTGGGACGCCACCA 

CACCATTGCTTCTACCGGCCCCGCCAGCTGCCCAAAGACTTTCCCTTTC^ 
AGGAGCX3CATCCCAGGGAGTTGGCCACCGGAATCCCTCTCTGTGCAATCT 
CCTCCCACGTTGACGGTCAGGGTCCGGATCCCCGCAG 
ACTACCTC^CCGCTTCCTGTCCCTGGAGCCCCTGAAGAA 

5 5 AAGATGAAGGAGACCATTCCCTTGACTGCCGAGAAGTTGTGCATCTACACTGACAACTCTATCCGGCCCGAG 

CACGTAACCCACCACCGACAATCCCTCTTTATGCTGGGTGGTGGGAGCTGCAAATGGCAGGAGA 

TCCTGCCCCACGAAAGG CACCCGCACCGCACCGCACGACACCG AAATGG CAGTGAGCAGAATGCCAACCCATTCTACCTCTGTGCC 

AGG CCACGGCG CTGGCX^GTGGG AGA'IT CTT1T1GT CCG CAGTCTGTCCCCrT AGT ATTTCT ACAGG CACATTTTGTTC CTG 

6 0 TATGGGGTCCCGGGTTGACCCAACTCTTTAGACCTTCTCGAATCAGCCTGCCT^ 

AATGGAACTGCTTCTGGTGAACAAGCTCAAGTGGAACCTGG 

TGCCAGAGGCGGATGAGAACAAGCAGACCATCCGCAAGCATGCACAGACCTTTGTG^ 

CCATTCTTCCCAGGAAGAGCGGGCTCCTTGCCACTCCCCCTGGGGTTAAGGCCTCT^ 

CAGATAGTTTGTTGGAGAGATAGATGATACTGGGTGGCACATTCTCTTG^ 

6 5 CCCTGTGCCAACCTTTCCACTGCCAGGAAGTGGTACGGCCACAGCATGCTTTCTGAAGAGGT^^ 

TG CCTTGC CTCTG TTCTT AAGG AG CTCGAG CT ATAGAG CACACACACACACACCAG CTT CTCCCAGG CTGAG C CAC CTGCT AGGGG 
TTCCTACATGGATGGGACAAATGGTCCAGCCTCAGCCCCACGAGCTACATACAGTAC^ 

ACCAGGGGT ACTGCAGGCCAGT ACTTTTTGGCTGAGT C CTGC CTGGGAGGGTTGG CTGG AAACAGAG AAACT AG AACCTC C CT ATG 
ACTCCTCCAAGGGACCCTCTCTGGCCCCAGACCCCAGGTAGATTAGG CT^ 

7 0 ACACATAGACAACCCTtSCTACTTCTGAGTGGTTTCAGTA 

GAGGCTTAGCAATATTGAGGGGGTGTGGTGTTCTCCCACCTGTTGAGGAC 

TATAGCrrGTTTGGCCTGTCATCTCAGAAGGGCCGGTAGAGTATCT 

GACATCTTCTTATGTCTGACAGCCCCATAGCCAGTGTTTCTAAGCTGCTGGTCT 

7 5 CTAAGGGGTCiG^K^ 
GGCAGCCTTTTATGGAGCTGAAAGTGCCrrTCAGAGAGGCAGCAGAGTTTCCCAGAGATA^
CCCAGGAT AGTTCTGGCAGT AGTGTGCCCCTCCCX5GG ACCCCTAGATG CCAG CAGAGTAATCJIGT 
CTGGACTCAGCTCTTCCAGCTCTCCTTACTGCCTTTG 
CGGTAGGCCAGTGTTCTGTGGTCTGGAGCATTTGTTTCTCTGTAGGAG 
5 GAGGCCAGTGTGACATCACCGTGAATGGAAAGATGCTAACATCAAGGCCAGGAGATCTGTCGTCTC 
TAGGAC^TGGGAAGGCATAACTTCCCTTCTGCAAACCGCCCC^ 

TAGGGTTGGGG TTGTT AACACAGGTGG CCAGAG CG CTGTGTGGATGGATACAGACAGTGGG AAACTG AACTTAATGAAGT ATGT AG 

GTGTCAGACATGCCACTAAGCCAGGGTACACCTATGTCATCAGCAACCAGTGTTAGTA 

TCTTGCTACTGCCCGTGGGAGGGAGAAAAACAGGCTCTGGAAGGGGCTGGGGACT 
1 0 GTAAAAGGGCTGTTTGTGATTTGAACrTCACTCCATAGCCAGGTTTT 

ACCTGGAGGTATTGCTTGTCTGCTGGATCCCTTTAACAGCCAAAGCTGTATGTTCT 

TACAGCAACTCTCTGCCTTCCTGATCCCATTCCTTCCAGCCTCCTTGAT 

GCTGACATCTrGTTCATTCTTGTGTCCTCCTCTGGCCTGGAAGCCCT 

CTTTGTTGGACGTGACAGGGAAGCAGTTTTCAGACTCATCTTTGG 
1 5 CTGCAGGATGCTGAGTCCAGTGGACCTGAGAGGTGTGGAGTTGGTGACCCCAGATGAAGTCCTAAAA 

AGTTGGGACATGTACATATGAGAGTCCCTAAGACTCCAGGGCCCCTCATGAGGGA 

CAGCTCCCTCTGAGTCTATTCCTACTGGGTTTTCCTGGACTGGGGTCACATCT 

TAACTCCCTTCTGTGGAGTCTAGCCTGGCCTCTGAGGAACCTGGGCCTC 

TCTAGTCCTTGGGGTGCTGCCAGGCTGACTCTGGAAGAGACAGAGATCCCTGCCT 
2 0 GATGTGAAAGCACCACACCTTCATAGATTCCATCCTCTTAGGGCGACATAGCCAACCCCTC 

GATTGAACCCGGTGTGTATGCCAGGCAGCATTTTAAAGTTTGAGCCAAGGCC^ 

CACCGTAATCCTCCGATGTGCTGGGGTCTAGAGATGCAGCACTGTGTGCACCTAGTGC 

TTGATGTGGGCATCCATTGGAAGGGCTTGAAGACCAGCCCCGAGGCTGTGGGTGGGAGCTGA 

TCGCTGTAGATGTGAAGTTCATTTCCAACCCACCCTCCATGGTAGCT^ 

2 5 GGCAGCCCCAACAACTTCCTCTCCTGCTACCGCACAACGCACTIT^ 

TCAAAGGAGGCTACAGCTGGCAAGGGCCTGGGAACCTTGGTGGGCAGTGCCAAGGCACTGT^ 
CCAAATCTGGGTCCCCACGAGCCTCTTCCCAGAGTCATGGGTCCCAAAGGCATAGAGGG 
ATGTGTATGGGTAGTGGCATGTGAAGCCCGGTCTGGCCCAGCCAGCATAGGACCATATCTCCCTC^ 
ATCCATAAAGGACTCTGGTATAGACCTACACATTCTAGAACTTTCTTTGTC 

3 0 GT CT AACTGT AC CTTTCTGCCAAGGAGT ACAGCTG ATCATG CTTGGGG CT CTGG CCGTGAATG TTTT CTCAAGGGCATCACAAGTT 

GG CATAGACAGGCAAAAGTCCAGTTCTGGCTTGAACTTCAGTTGGGGGT^ 
CCAGGTGTGACAGCTGTTCTGCCCTGCCTGGCATCAGCC^CAGGGACA 
TG CAGCCCCCTT CCCCTAAGTGTCTGGGGCTCCTGAG AACTGAGAAT AGGGTATGTGGGCAG^ 
GGGAAGGXSTGTCTGTGGGTCCATACTAGGCTACTTGGGCCTCACTTTCT 
35 AGCCATGGTGTTCATCAAACAAAGGGGGAACAATACACAAGTTTTGAATGTGGGGG 

GCTTCAGGAATATAGACAGTGCCCTAGCTTCAGCCTGCCTAGGCAGGGCTTCAGTTCATC 
CAGACCCTACCTGTGCTGGAGCTGTCCTCTCTCAGAGGCCACCCAAGGCAAAAAGTTATC 
TTCAGGGTACAGCCTGGGAGGAGGAACCTAGAGAGTAGCAGTACCTTTCTAGTCTCTGG 
TCGGGAGTTTCCTCAAGCCTTGCTCTTAGCAAAGCCTGCCTCG 

4 0 CCCTCCATCTCCGCCCGCCTCTCCAGGACTGCCTCCGTGCCTGCCAGGAACAGATTGAAGCCC 

GCCCAGC^GAACGTCGACCCCAAGGCCACTGAGGAGG 

GCGAGATGTGGACATCTGAGGGCCACCGGGCAGGCGGGAGCCACCAAGTAGTGGCACC 
CTCCTGACGAOSTCCCCCTTGGGGACATGTTGTTACCAGAAGAGGAAGTTT^GTTCT 

TC CTTTCT ATCT G ATTT AAG CAAAAG AG AAAAAAAT ATCTGAAAG CTGT CTT AAAG AG AGAGAG AG AGAG AGAT AGAAT CTG CATC 

4 5 ACCCTGAG AGTAGGG AGCCAGGGGGTGCTACAAAAATAGAATT CTGTACCCCAGTAAT CAACTAGTTTTCTATTAATGTGCTTGTC 

TGTTCTAAGAGTAGGATTAACACAGGGGAAGTCTTGAGAAGGAGTTTTGATTCT^ 
CATTCCTTTAAAAAGGAAGGAAAAAAAATACAGCAAACCATTC 
GCTGAAAAGCCACAGCTTAGGCCCTCAGCCTCACTCCCTGGCTTGCTCAGTGCCTA 
CCGAGGGGTGGGCAGACCTCTTAACCTTATAGATGGTCAGTGCGACCT 

5 0 AGGGCTCAGCTTAAT<3TGCCCTCTCCCCC(^CAACCro 

TAATATATTCTATTTTTGTAACCTTCCTATTTTGTAGCTCTGTTT AGAGAGATGCTGGTTTTTGCCTGAAGGCCCTGCAGCCTGCC 

CACATCAGGTTAAACCCACAGL"l'rrTGTGTGTGGTTTGTTTTGTTGTGTri'TC 

GCACTTTTGGTCAGCTAGCTGGAGGCAGTGTTGCTGGTGTGTG^ 

55 TGTTCTCGCACCACCGGGATCTGTCCTGCAAAGTTGAAG 

ATTACCTGGACAAGGAGATTGGGGACAACTCTTAAGTCTCACACAGGAGGCTTT^ 
CTACAGAAGAGTATTTATGGGAAAGGCTGCCCATGACCAGTGTGACrCAAAGCAATG 
TGCCCTGCTGGAGAAGGTTTAGGGCC^TGTCTGAGAGATTGGTCTTTTATTGGGCAACGGG 
AAAACCACAAAGACAGAGATTTGGTCTGC1TGACTTTCCCAACCCAATTGGCC 

6 0 GGGGACTCCAAAAGAGTTTGATTCTGGCACATTCTTGCCGCTGCCCCCAAG^ 

TGTGCCTTTCTATTAGGACTTTTTGGCAAAAGGTGGAGAGCGGGAGGCT^ 
ACATGGGACAGGCCACGGCTCCTCTCATGGCGCTGCTACC^ATGACT^ 
TTGTATCTTTCACGTTGTTTTCGCTGCTATTGGAGGGTCAG 
TTTAATTTCCTCATAGAGTGTATTTACAGATGCCCTTTTTTGTACTT^ 

6 5 ACCGCTGTATTCCAAAAAAAAAAAAAAAAAAAAAAAAACAGGTTCCTGTTCACAATACCTC^ 

CTGGCAGG CAGGTGGG CGGTCTGC CT C CAGGGAT CCTGGGACC CTGATGGCGATCGTCCTGTCATG CTGGG CC CTTCATTTG AT CT 
GGGACATAGCATCACAGCAGTCAC^GCACCTGGATTGTTCTGT^ 
TGCTGCTGGCCCGGAGGGATTTGCTCTGAGTCTCCGGTGCATC^ 
ATTACAGCATTGTGCTAATGTAAAGACTCTGCCTITGC^AA 

7 0 AAACCAAAACGGGCACCACCAG CTCAGCTGAGG<IATCCCAGCCAGGCAGGACCCTTGAGGGCCG CTGTGTCCATGGTGATGGGGTG 

AGGTTTTGGCCAAAAGGCCAAAGACTGGTGGTGGGTCCAC^ 
GGCCAGGTTGGCTrTTTGTATTTCIGGTTGACACACC^ 
AAAGACAAAAAATCTAGAAAATAAAATTGGTAAAATCTCAGCTCACTG^ 
ACAGCCATGTGGGGTGGCATCTCTGGCCCTGGCACCAACCrGG 
7 5 AAGTGGGAGCCTCTGGCGTCAGGGAGGCCAGCGACAGCCACT 
GCTT ATTTGCAG ATGGAAG ATCAACAGAAC C CG CGTTCACTG CC CCTTGT CTGGCGCAAATGGG TGTTTGCTCTGGGTGAAT CAC C

GTTTCTTAGGTGCTGCCAAGCGCCCAGAGAGCCATGCTTAGTGTTAGGGCCAA 

CCTGAGTGGCCTGGCAAGACTCCTGTGATGTCTTCGGGGTGGCAGTTACrrCATGACT 

GTGGCTTTTAGACATCATCTTCTAGAAGGTAGCTTT^ 
5 AAAGTGGAGGGTG T AG AGGACAC CTGGT CTG AGTGGTG C CAC CCAATGATGAAAGCAGAAAG AGATGGACGG AGGGAAG CCCACAG 

TG ACCTTG C CAGG CG AC CT ACTCAATGAGAGT ATATGGGAACAC CAG AG CCAGAGGTGGCTT ATGAAGACACACTACTCATCTTGT 

ACCAAATTCTCTGATGGCAGGATCCAGAAGGTATTTTGTCAGCCCAGCTCCA 

CATCCAGGAGAATGGGCATC^3GGCCGTCAGCATTCCAGGAAATGGC 

CTTAGAAGCACTGGGGAGCGAGCTTATCGTGTCCAC^TAGGCTGACATGA^ 
1 0 GCAGCAGAGTGGTACT AGGAATG CAGACACAAAT AGCAAGGGG CT AAGCATCCACTCAAAGGCTG CTG CTG CTGCTG CAGCTGCTG 

CTTTCCACCACTGCCrATCAAAGGTGCTCATGGAAGGGOTCCCAACT 

ACATGGTGCCGTCTCTTAAC^TGCAATTGGCTTGTCTCGTCATCC^ 

CTTTTCTTGGTTrTAGCAGTGAAACCCCAGATACCAC^ 

TAGATCCTCTGCTGGGTGTAGAGATACrCCTGTTCTAAGGGTGATGG^ 
1 5 GGGAGCCAGCACACCCAACTGTCTCCTCTCAGAAGCTCAATC^ 

TGTGTCCTGGCCAGGCTCTCCTTTTGCAGCCGGCCTC 

CTGCCTGGCCTCTCTGAGCCTCAGCTCCCTGAACTGGAAAGATC 

GCTCCAGCCTTCCTGTTTGTCCTACAGGACTCACAGTGAAT^ 

ctgtgtagcctctcccctcctgactccacatcctagaaccaagtct 
2 0 gcactttgtcccagaaagcaggccaacttcaccaacccta 

ctg aggactccaac-g<x;tgggacacaggcagaagagtaggact 

ccaggagacagtcaatgcx^gttgggctgtgtgtttggtggcaaggct 

gccaagtgtgcttcccaagcaaacctgtacttgagctgataaagc^ 

ggcgggcatgttgtatacagggccccacgtgggcacagcacagtgct 
2 5 . tctcttactcagggatgtciatcccaggaatgtatttgatcaa^ 

NNNNNNNNlTOmJNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
35 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNIWNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNl^ 

TTGGACTGTAGACTGTAAGTCATCAATAAATTCCTTTACTATATAGA 

AAT ACTU^CATGCAT ACACAT AAAATT AAAATGTGC CAGG CAGTGGT AGCACATGC CTTT AAT C C CAAC ACTT GGG AGG AAGAGGCA 
4 0 GGTGGATTTCTGAGTTTGAGGCCAGCCTGGTCTACAGAGTGA^ 

AACCAAAAAAATAAAAATTAAAAAAATAAGTAGGTAAATTAAATTAAATTAAAATGTTTAAAAAGAAG 

CT AC AGTTTTTT AAG AG TTTGTCAAACATTTTGATGTT ACAATGG C CAATGATG ACAG T AG CC C CTCACAT AAAT ACAT AG T ACAG 
AGGGC^AAAAGTGTCCACAGGGCCAGTTCAGAAATTGCrGGAAAATTCTG 

TATGAAAATGGTCG^CTGGAGAGATGGTTAAGAGCACTGACTACTCTTCCGAAGGTCCTGAGT^ 

4 5 TGGCTCACAACTATCCATATTGAGACCTGACGCCCCCTCCTGGTGTGTCT 

ATAAATCATCAAAAAAAAAGAAAATGGTCAATTTTTAAAATCAGACATTC 

CCTGTTGAGGCCTGAGAGCACAACACATTAAAGACTTGGCTTC 

TACAAGATGGATGGACGTTGTACrrTCTTAGAATGCTGGTCT 

AATGACCTCGTGCAGTACAGAGCACAGAACCGAGACTGCATTGAAGTCTTGGGATGCAA 

5 0 AGATCTGATTATGCATTTGATGTACTTCATTTTTAGACAGGAC 

GGCCTCAAATTCCAGAGGATGCATCTGCCTCTGCCTTCT^ 

ATTTTTTCAAATTAATTCTTTTCTCATGTTTAGAAATCTTTTCCT 

TTGKX5TTTTTCTGACTCCCATACAGAATTGGGATCCACAGTTTTTA 

CACAAGGACAATCTTGGAGAGACCGCTTATGTGAACAGCAGGAGCTGACTCTGTTCAGGGAGCT 
CGTTGTAGCTTCTCATGGCCTCCTCCTCAGGGTCCACTACCACCCACCTTAAACAATGCCA^ 

AGTGTAAAAGGCC CAG C CATG CTTG AGC AGG CCTTCC CTG CACAAGTT CAAC CAC AAG CAAACTT CTG AGTGAACTGGGTGG CT AG 
GACCCTTCCAGAG CAG ACTTGGCAG CAGG CCTCTGAGCTTGAG AG CAGG 

G CAACAGT AAGGAACCTTCTGGGAG AATGGG C AAGCACTTCAAGTCTTCAATC CTGCAGACACTTC^TCTTTAGGCACTGATG CAA 

6 0 CCTCTCCAATAGAGGACT AAAGGACCCACCATC CATG AAGCATGGGGACAGGCACTCCAG CC CTGCAACCCAGGAATGG CCAGGGT 

CTTATGGAAAGTGCCTCTGGGAAGCTGACCACAGAAAGGC7AGGAGTC 
CTTCTAGGAAATGTGAGCAAAGCCCTATTCACCCCAGATGGGGCTCCAGTGACAG^ 

GGTGAACCAATGAGTTTCTAAGTGGTTAGTTGGTTATAGGAGTGTGGGAGACACCCCAGCTACAGATAATGATCTTC 
AAAAAAAAGGGTCCCCACTTCTAGTATACTCCTGAATCCCCTAAG 

(TTACATCATCATTAAGATGGTGGTAGAGTGAACIXjAAC^ 
GTCAGGGCTCrGGGGTCCCATGCCTCATTTTCTCTGTC^ 
CTCATAAAGCTGGGAGCCACTAAGGTCAGTGGGGCAATCTTrCTAGTCAGC^ 
AACATGGGAGTCTCCCTCCCZAGGAAGTCn'CCCTGCCACCrCCTCAGOT 

7 0 GCAGCCATTATCAGAGAGCCCACACGGC^TCTGAAGCATCTGACAGCCCTTCAACACTGCTCCT 

T AGACCTG CT AG ATTCAAG ACATG CCAT AGG CT CAAATG AG CAAG AAACTAGG CTGG AGCAG CAGG CACAACTCT CACCCCACT AT 
GCTTTGTCCCGCCTGGAATTGGCCCTTTACTGTGACCTCCTGC 

GTAGACTTAGTGGT ATGGGGAAGTCAGGGCTTAGCAAACAG AACTGTGGCCC CrTCTTCCAGTTTCCTCATCTCACTACCAGG CAG 
AGGAAGAAAAGGTTrGGGGTAAATAGAAAATAAGGAGGGAGTGAGCCAGAGTTTCTGGGT 
7 5 AAACATCACCATC^VGCAGGGCTGGGGGATGGAGAGGTGGTTAAGGGT^^ 
GACC^CCATATGACAGCTCACAAGCACATGGA^
NNNNKNNNNNNNNNNNNNNNNNNN^ 
NNNKNNNNNNNNNNNNNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNNNtTONNNN^ 

CTACAGATTGAGTTCCAGGACAGCCAGGGCTACACAGA^ 
TCCTCACTAACACTAGCTTTATACAGAACTTCA 

CATAGTGTCTCTGCAGGCCCAAAAGGCCTTCTGGATCACAGTTTAGACAGACTG 
AGTTACTTTTCTGTGTTGACAAAAAAAAAAAAAAAAAAAA^ 
1 0 TTATAGGTGGCAATCATTATGGTGGGAGGCATGACAGAAGGAGCCTGAGGCAGATG 
GTTGCTCAGCTCACTTTCTCCTTTCTTTTCAGTCCAGGACCCCA 

AGTTAAGCCTCCCTGAGAAGACCCTCCCAGGCACACCCAGAGGTATCTACGAGGTGATTCTAAATTGTCT 

CAGGCAGCAAGCTG AGCAGAAAG ACTTGGGGAAAG CCAGC CCAGAAACCTCTGCCTCAGACCTCCAGCATGTGACAAGTGATTGCT 

CATAAGGGATGACATCTAAGCTAGAGTCACAGGAAGCCAGAACAGGGTGGGATC 
1 5 ATAGCTCACAGGACATGGTCTGAGAAGGCAGGGGATTGA 

CTCTGCCACCTTCAGCGACAGCTTGGCCACAGCTCCCACAGTT^ 

AAGCAGAGCATTCTTCTGACGACCTATGCCCTCCCCTTrCACAG 

GCAGAATGGTCCTGTAGTCTGTCTAAACTGTGA 

TGAGAAAGGGTCAAGTGGGCAGAGGTCAAGGCTTGTACAG^ 
2 0 TGGGAGATCTGACACCTGATGTACACAAGGTGAGGGGATAGGAGGTAGAGATGAAGGGA 

ACTAGCAACTGATTGATACCTGTAGTGAATGTGACAGGCACAGTAGGCCCATATCTGTGAACCT 

GGCACTGAAGGACTGCTGCAGCTGTCAGTAGAAATGATCCCAGACAGACTACT^ 

ATATAATAGGAAAGTAAAATATGGTGATAACAGCAGCTCAGTTCTCACACCGAAGAGTCCCTGATCCT 

TGCTGAATACCTGAGAGGTAGCCGCCAGGCCCAGGAGGCCAGGTC 

2 5 CCTTCGGACATACAGCTAGGCTCTATTCTGACTAGATTATAACTGAAG^ 

TGGTTACCTGTGCTCAGGTACCATGTGTCCATCTCCTCACATCCT 
CTCTCTCCCAGATGTTCCGCCTATTTCCTGCCTAAGCCATAGGCC^ 

G AT ATTCTCTCTACGAAATG CCTG AGAGCACTGTG CCACTCCTGCCAGCTCGTCTTATATCTGCCTGTAAATACCATGCCAGACAC 
ACAAGCrGTTGCTAAAGGAAGTACAGTGTCACCAGGOVAGCCCTACA 

3 0 TCCAATAGTTCCCCTAATGCCACTTTTTGTTTGTTTGGTTTTGAATTCAC^ 

CCTACTATCTCTTGAGCACTGGGATTAAAGGCGTGCACCACC^ 
CACACCTGTTAGGAGCAGCATGAAGGCCCTGCTTC^GCTTCCTCCCTTGTCT 

G CAAAC CAAGTC CAGTGG AG CCTGAGT CCTGTGCAG AT AG C AGG AAT CAGGT CCTC CAGTGGTGTTGG CAATGACAG CG CATGTTG 
G AGGCAGAGGCAC CATGTGGTACTTCAGGGCATGGACCTGGGGTGGGGGCCAGCTCTG CCACCAGGCCCTT AGACTAG ATGGGTCT 

3 5 ATTTTATCACTTCTAAGCCATAAGGTATCCrrCCACCTCTGGGGCrGCCAGAGGGATGGAGTGAA^ 

AGCAGAGGCTCAGCAGGCAGCAAAGATCCTGTCACGGCTTCCCA 

AGGGTAACCCTACCCAGGGCCACTCACTCTGGCCAGTGACAAGCGGTCTTTTGAGTOT 

CAGGTAAGCCACCTAGGCAGAGACTCAGGGAAGAGGCCAAGCTGGTCC^ 

ATCACTGTGACACrTAATGTGCAGACATCTGCTCTTCTGTGGGTCCTCAAAACGCAAGTCCAGAGCCT 

4 0 TCTGGCTGCACGTTGAGCAGTGAGCACAGCTGAAAGTACCCAAGAGCCCATGCCTTACACCTCCTC 

TATCCCAGCACAAGCCCAAAGGCTATCTCX3CCTTAACGTCCCCAACCCACTTTCAGTCATTCT 

CTCTGGGTCTATATCTGGGCACACATTGCCCAACACAACCATCACATGAACCCTAGGGCC 

CTCAGGCACACCCCATAGAC 



MOUSE SEQUENCE - mRNA

GAGGGCTGTCGGCGCAGTAGCAGAGAGCTACAGACTCCGCGCGCTCCGGAGACCGGCAGTACAGCGCGAG 

GCCGCCACCGGAG CCCAACCGAGACCACAGCCCTCCCCAGACGG CCGCGCCATGG AACACCAGCTCCTGTG CTGCG AAGTGGAGAC 

CATCCGCCGCGCGTACCCTGACACCAATCTCCTCAACGACCGGGTGCIGCGAGCCATC 

CCGTATCTTACTTCAAGTGCGTGCAGAAGGAGATTGT^ 

GAGCAGAAGTGCGAAGAGGAGGTCTTCCCGCTGGCCATGAACTACCTGGACCGCTTCCTGTCCCTGGAG^

CCTGCAGCTGCTGGGGGCCACCTGCATGTTCX3TGG 

ACACTG ACAACTCT ATCCGGCCCG AGGAGCTG CTG CAAATGG AACTG CTTCTGGTG AACAAGCT CAAGTGGAAC CTGG GCGCCATG 
ACTCCCC^CGATTTCATCGAACACTTCCTCTCCAAAA 

CTTTGTGGCCCTCTGTGCCACAGATGTGAAGTTCATTT CCAACC CACCCTCCATGGTAGCTGCTGGGAGCGTGGTGGCTGCG ATGC 

GACTGCCTCCGTGCCTGCCAGGAACAGATTGAAGCCCTTCTGG AGTCAAGCCTG CGCCAGGCCCAG CAGAACGT CGAC CCCAAGGC 
CACTGAGG AGGAGGGGGAAGTGGAGGAAGAGG CTGGTCTGGCCTGCACG CCCACCG ACGTGCGAGATGTGG ACATCTG AGGGCCAC 
CGGGCAGGCGGGAGCCACCAAGTAGTGGCACCCGCAAAGAGGAAGGAGCCAGCCCGGGTG 
ATGTTGTTACCAGAAGAGGAAGTTTTCTTCTCTTTGT^ 

AGAAAAAAATATCTGAAAGCTGTCTTAAAGAGAGAGAGAGAGAGAGATAGAAT

GCTACAAAAATAGAATTCTGTACCCCAGTAATCAACTAGTTTTCT 

GGAAGTCTTGAGAAGGAGTTTTGATTCTTTTATATGTTTTAAAAAAAAAAGCT^ 

AATACAGCAAACCATTGTTAAAGTAGAAGAGTTTTTAG 

AGCCTCACTCCCTGGCTTGCTCAGTGCCTACAGCCCTGTTACCTGATACCTGTGCTTTATC 

TT ATAGATGGTCAGTGCGACCTCTAGTGGTCT C ATGGCGTGTGGCACAACCCCCCTCCCCAGGGCT CAGCTTAATGTG CCCTCTCC

CCCCAACAACCTGCAGGTTCACAGCGCCAGCCACACAGCGGTA^ 

CTATTTTGTAGCTCTGTTTAGAGAGATGCTGGTTTTTGCCTGAAGGCCCTGCAGCCTGCC 
TGTGTGGTTTGTTTTGTTGTGTTTTCTTTCTC^ 

TGTAGGGCTGGTAGTATGAGGTGCTTGGGAAGTTTT

GCAAAGTTGAAGGGATCCTTTGGTGCCAGCTGGTGTTTGGAAGTAGGAA 
ACTCTTAAGTCTCACACAGGAGGCTTTTAAACACTAAAATGTCrAATTT 
TGCCCATGACCAGTGTGACTCAAAGCAATGTGATCTCCCTTGATTCAAACGCACAC 
TGTCTGAGAGATTGGTCTTTTATTGGGCAACGGGGGGGGGGGGGGGGGGGTCCTTAAA 
TCTGCTTGACTTTTCCCAACCCAACCCAATTGGCCCCATT
TGATTCTGGCACATTCTTGCCGCTCCCCCCAAGTTAA
CTTTTTGGCAWU«3GTGGAGAGC^GGAGGCTTAAGAGGGGATGTG 
CTCCTCTCATGGC<3CTGCrACa»TGACTCCCAGGATCCCAGGCGTTCAGA 
TTTCGCTGCTATTGGAGGGTCAGTTTTGTrrrGTTTTC 
TGTATrTACAGATGCC C r i T lT TGTACT CT ^

AAAAAAAAAAAAAAAAAAAGAGGTT CCTGTTCACAAT ACCT CATGT AT CAT CTAGC CATG CACG AGC CTGGCAGGCAGGTGGG CGG 
TCTGCCTCCAGGGATCCTGGGACCCTGATGGCGATCGTC 

TTTGCTCTt3AGTCTCCGGTGCATCATTTAATCTGTTAGGTTCTAGTC 
GT AAAGACTCTG C CTTTGCGAACGCAG CTGCAGTGCTGT AGG C C C CCAAGTTCCCT AG CAAG CTGCCAAAC CAAAACGGGCAC CAC
CAGCTCAGCTCAGGCATCCCAGCCA.GGCAGGACCCTTGAGGGCCGCT 
ATAGACTGGTGGTGGGTCCACGGAATCTGCCCTGTGACATGAAAGGCTT^ 
TTTCTGGTTGACACACCATCGCGCTTCCCAGCACAGACATGTGACCAGC^ 
AT AAAATTGGTAAAATCT CAAAAAAAAAAAAAAAAAAAA 


MOUSE SEQUENCE - CODING 

ATGGAACACCAGCTCCTGTGCTGCGAAGTGGAGACCATCCGCOSCGCGTAC^ 
AGCCATGCTCAAGACGGAGGAGACCTGTCCGCCCTCCGTATCTTACTTCAAGTGCGTC 
AAATCGTGGCCACCTGGATGCTGGAGGTCTGTGAGGAGCAGAAGTGC^ 
CGCTTCCTCTCCCTGGAGCCCCTGAAGAAGAGCCGCCTGCAGCTG

GACCATTCCCTTGACTGCCGAGAAGTTGTGC^TCTACACTGACIAACrCT 

TGGTGAACAAGCrCAAGTGGAACCrGGCCGCCATaACTCCCCACGATTTCATC 

GAGAACAAGCAGACCATCCGCAAGCATGCACAGACCTTTG 

CATGGTAGCTGCTGGGAGCGTGGTGGCTGCGATGCAAGGCCTGAACCTGGGCAGCCCCAACAACTTC 

CGCACXTTCTTrCCAGAGTCATCAAGTGTGACCCGGACTGCCT

CTGCGCCAGGCCCAGGAGAACGTCGACCCCAAGGCCACTGAGGAGGAGGGGGAAGTG^ 
CACCGACGTGCGAGATGTGGACATCTGA 

HUMAN SEQUENCE - GENOMIC 

CATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATC^

GAAACCCGTCTCTACTAAAAATAOVAAAAAAAAAAAGAAAAAAAAGAAAGAAAAGAAAAATAGCT 
TAATCCCAACTACITGGGAGGCrrcAGGCAGGAGAATCGTTTGAACCC^ 
CACTCCAGCCTGGGAGACAGAGCAAGCCTCTGTCTCAACAAAAAAAAAAAAAA 
CATCCCAAGTCTCAGCTCCTACTCTTTTGCrATATGTCCTTCAAACCAGCTATGT^ 

CTTTTGTATCTTGCl'lTl'TTTGATTTAATGTTTTT

ATTGTCTCTAGGGTATCTCTTGCAAATAGGATAATTGTCAATGATGTAACGGGTCTCTATGCAACTTGCAAAAGTG^ 
GTGTCCTCCT C ATC CTAGTGTTTGGTT CCAAGCAACAG AAACCAATGCTGGCAGATTTCAG CAG A^ CAG AA 

GATAGGCAGGAACAAGGGAGGCAAGACAGCACGAATCACAATGGAAGCATGCT 
CCTACCCATCCCAACACCAGACCCTCCAGTAAAACTGCAGCGGCCC^^ 

AGCACAGATACTGCTGCTCATAAACTTGAATGCCTCACCCAGTTCT

CACCAGCTCCCAATTCAAAATCTTGATCCAAAAAGATGAAATGTATAAATAATACTATTTATAATGAC^ 
TAAGAAGAAATCTAATAAAAGATGCACAACACCCAACACACAC^ 

TCTAAATAAACAGAGGTCTATATCATGCCCATGAATTGAATGACTCTGTTAGTAAAAGTGTCAATTCCCTC 

ATC CAAT ACAAT CC CAATCAAAATT C CAG CATTGT GTGTGT GCACATGTG T AAGTTG ATTCT G AAATGTTT ATGG AG CC CCAAGGG 

C CAGGAGT AGCCAGGGCAAC CATAAAAG AG AAG TG GGTTGG AGGATTT AG C C CG C CAG AT AT CAAG AATTCG TAG CGT ATAGCTT C

CATGATGCTTCTGGGGTAAGGACTGAAAAAGAGACCAGTGC'AACGGGATAGAAAGT 

AGATTTT AG CTG CT CTGTT CACAAAAAAAAGT AACTATGTG AG ATG AC AGGT ATG TT ACACT AATT CACT ATGGTAAC CATTTTAC 

TATGTATATGTATCTTATAACATCATGCTGTCAAGTTTATTTCAAAAATAAATT^ 

AGACAAAAAGTGGGAAAATTCTATTTTTTTAAAGTACAGAAACTTTATTC 

GAATTAAGGC(!AGGTGTGGTGGCTCATGCCTATAATCCCAGC

TGAGACCAGCCTGGGCAACATAGTGAGACCCCATGTTTACAAAAATTAAAAAATTAGCT 
CAGCTACTTGGGAGGCGAAGGCAGGAGGATTGCTTGAGACCAGGAGGTCAAGGCTACAGTAAGGTATGA 
AGCATGGGCAACAGAGCAAGACCCTGTATCTAGGGGGAAAAAAATGATAAGCCITGTGA 
TAAACTTGTACCTCTGTGTTCATCGGCAGCATTGTCCACAGTAACCAAAG<^ 

GATAAGCAAAATGTGGTCTATCAATACGATTGATAGTATTCAACCA

TTGAAAACGTGATGCTAAGTGGAAAAAGCCAGATGGAAGAGGCTGCA 
CAGATCCACAGAGAAAGATTAGTGGTTGCCAGGGGCTGGGGAAAAGGGGGAAGAAAAG 
TTCTTTGGCGGATGATGAAAATGTTCTCAGATGGTGATGGTGGCTGTCCAACATAACAA 
AGTGAACTTTACGGTATGTAAGTTATTTTTCAATAAAGTGGTTTCAAAAAAATA 

GGAGTTTACCGTTTTTCTCCTTGCATTCTGGAGTCAGAGTGAGAAGGCATCTCAGCTCTACCAO

CGGTTTCCTGATCTGTAAAATGGGGATAAAGGGCCTCTCCTTGGAGCTGCCA^ 
AGGGCAGGTGGAGAGCACAGTAAGCGTCTCGGGGGGCCACGCCATCACCGCCAGC^ 

TCCCCTGCCC^CCCCTTACTCCTGGAAGTGTGACTCGCTCCCTCCT^ 

TTATCTCTCATCTCTTCTTTCCAAACAGCCTCTGTCTAGCACCTTG

TGGCTTCATCAGATGACAACTAAAAGGCAGAGGCCTGGGGGGGCAAGGGAGCT 

CTTGGGATGCTACAAAAGGGATTCCAGAGATTTTTTTTTCAATAACTCAAAAGGGAT 
GGCAGCTAAGGGAAGGCCTCAGGCTTCCTTTGCCTTTC 

GCTCAGAGCCTG<^3GGCCCAAGGAATGCTTTCCAGGCTGGCGGTCCCAGGAGCCCCCT

GCAGAGGCHX^CCTGGCAGGCCGTGTCGAGGCCCTGAGCTCACATGGCCT 
GCAGTGACACTCACTGACCCACACCTTTGGCCAGCGTCCCTTGTGACTGCCCCCT 
ATGGGTGCCAGGAACCTG<5CCAGTCCATCCAGAGACAGGGCAGAGCCCTGG^CT 
TATGGCTCTGCCTCTCATCCTGGCATTGACACTTGTGCCCTTAATTAAAGTGCn'CTn" 
CAGAGTAAGAGAAAGAAGAGTTAGGGGAGAAGAAAGAAGGAAGGAT^






CGTGATTTAAATCTGATGTCCACAAACAGCT 

TCCTCrTTCTGCATATAAACTGAAATGTCTAATTAAT^ 

ACATGAATGAATAAATCGAGCAGGCCTTGACAGTCACCAG^ 

CGTGGCCTTGCGAAGAAAACG ATCACAGC CCX3TTTT ACAGATGGGGACAGAG ATTTGTCCAAGG CCACACAACT AGTGGCAGATGT 
AGGTCTGTATGTGACATTCTTGTTCAGTTTCATAAATGTGGCTGCCACCTGCCT
TTGCATGAATTCTTGAAAAAGTAGATTTTTATCTTTACCAGCT^ 
TACAGACCATCACCGTTGCCAATGTTGTGAACCTGAACAAGGCACCTGGTT^ 
TTCCCTGAGATTAAGTGTGTGAAACTATAAGTTGGCTGGAGGAAACACCTCA^ 

AGG CTTTTTCTC CAAACC CT ACAGGAGATGG ACCT CAAAG C AAATGG CCATGCTTCC CTT AG ACTGGGTGAACTGGTG CTG CTCAG 
GGCTAGAAGTGTGTCTAGAAGCATGGGGACCAATGCAGCCCCCTCCACCCTAGG
AGAGACAGCTCTCTAAGTCAGGATTCAGCGAGATGGTCCAGGA 
ACACAACCTAGCCTGTGTCTCy^GGGGGCCCCAATCATGCCCTGAACCTCATC 
CTCAGGGTCCAGGGATGCACTAGATTCTTTATCTTAAAGTGACCCTCAAGA 

GACGTTTTC CTATTTGTTAAGCAC CTTCTTG CT CCGAGCTTTCCAG CCACAGTGCACTGGACGCCATCG GAAGG CTGCAG CTGCT A 
TTTGGTGCTTT AC CACCG CTGAG AAGGTGGAATGGTCACGGCCCGGTTCTG T CACTTTCTTCAGCCGG ACAGTCGCCTT ATT ACGG
ATTCCAGTAGGGCCGAGCACACTCCITCCTGCCCGTCITTACMATGAACATC 
GTAAAAAAGCAGCCCCTATCTACCCtSCAGGGAAGGAGTATGTTCGGCACCACAGC^ 

GGTCCGCATG CGCT C CGG AACCGT AATG CGCG CTTTTT CTAAG C CTT ACGGT AAACG CGG ACGCAGGG CAACCACGTGG CGG TGG A 
ACCGAGGCCCGGCGGGAATGCGCGGAATGCGCXX303CGGCCTCGCGCGGTTCCCGAGCCACGGCC 
crCGCCTCCTCCCCTCACCTCTCCCAGCCGCACCCCG^^

AACACACAGAAGGGGCTTGGGGACCTGAGCGCGGCCCATCAGTCCCT 
CGAGTCCTGCCCCCAGCCCTTAAGGCACGGGCTCCAGGGA 
GCACCTGGAAAAAAACTCCCTTTCCCTCCCCrrCCCCrGCTTCTTGAGTC 
GACCCCGCTOTCCCTCCACCCACCCCCAAGTGCCAAAGCXSAGGGAT^ 

CTGGAGAGGCGCGGGGATGCGGGGTCCTCCCCGCAGTCTTCCGGAAAGGGCGGGGGAGGGCGCG^

CCGTGGGAGCCCACGAGGGCCTCAGCGCGGATCCTCCGCCGGAA 

GGGAGAGGGCCGAACGTGACCTCCGAGGTCIACCCCCAGCCAGCTTTCTCT'CCTGTG 
GCCTACTCCCCACCACXrG<3TCTGAGAGGGGCTtKXK3CCGGAAGGCCA 

GGTACCAGCCGATGCGTTTTGAGGGGCGGGAGTCGA^

TCCCAGATGCGTCOTTGGAGCCAGGGCCGCCCCCCTACCrGTTGGGTTTGC^ 
CGGAGCTAGGGGAGGGGTCTCGTTTCCCCGCAGCCCGCCGGACAGACGACTGGGGC^ 

TGTGGCTGAAACT AATTGATCTGG AG CGGAAACG CACG TCTG CXSGTTGGC^CG ATGGGGGGGG CGGTG CXXKTTGT CCATGTGCCGA 
GCGTGTGGCTGTCTCGGGTGGGCACTGGGGCCGGAGTTCGCCCCGGCCGAC 

TAAGCGAAGCAGAGCTGCCATAGCACGTGGGCCGCCACGCGCACCC

TTCGGAGCCCAGGGGCC13GCTTTCGGAAGCGTTTTCCCGGGCG^ 

GTCCTTTCAAGTCGCCGCCAGGTATGCXX3CTGCAGGTGACCCCACCTGGGTGCGCCCGCCC 

GTGCGGGAGGTCGCTGGCGAAAGGTCGGGACTGGTCCCTGCACCACCCGCCCCCAACCCAAGCCCCGAGCC 

GCGCTGAGTCCCGGGGTCTGCGTCGCGGCGCGCCGGTTCCTGAATGAACGCGCTCCCT^ 

CCAGGGACGGTGGCGAACACGCGCCTGCAGCGGAATTCGCTTTCTCCrGACCGAC

GGCCAGGGGG CGAGGAAAG CGTGAAGGTG ATTT'CAGTTAATTTTGG ATTTTCITTT^ 
ACTTGCCCTTTGTCTCCAAATGGTCACOVAGAAATAAGAACAGAGCACTTTAAATC 
TGGGTTTTAAGAAGACAGTGTAAAGTAAAACTGCAACCGAAAAGTTTTTTAAAGTT 

ATGCTTTCTCTGCGCTTCTTGAAGCAATGACCCrC^ CCCCCTCGGGGG ACCCGGGGG CCGCCAAGCAG 

GGTTCCCCCAGGTGGGGGCTGGGCAGCTGGCGCTCCCCXCCGGGCCCCAAATT

CCGCGGGTTCCTGGACGGCTCTTTACGCTCGCTAACCGGGCrTGC^TTTTGCGCT 
CTAGTC^AGATCTGCCCGGTCCGCCTAGTAACAGCGCCGCXSCCCCCATTGGCTCATGCTAATT 
GGGATGGGGGGGTGAAGCTCCCTCCTGGACCCAGAGCCGGTTGTGCCGGAGTG^ 
GACTTCXKKTCCGCTTCGCGCCTCGGGCTGGGCCAGGGTCCACGCGGGGCTCGGGGCCCCT 

TGATAGCTCCGGGCCCCATAAATCATCCAGGCC^CCGCCGGGTCGGGATTTTATGAATGAAAAAGCAGCTGGGCCGCCCTTGTGCG

CGGGCTG ATG CTCTGAGGCTTGGCTATGCGGGGGCCAACGCGATTGTGGGTGCT CGGGGAGTGGGGGGGGGCACGACCGTAGGTGC 
TCCCTGCTGGGGCAACCCATCGCTCCCCATGCGGAATCCGGGGGTAATTACCCCCCCAGGACCCGGAATA 
COX3CGGGGGAGGGGGCGCGGGAGGAATTCACCCTGAAAGCT 
GGGAGAGGGCTTTTTCTATCAGTTrTCTTTGAGCTTTTACTGTTAAGAGGGTACGG 

GG AAG TAAATGAACAG T T1T CTT AATTTGGGG CAGGT ACTGT AAAAAT AAAAACAAAAGTTAAG AC AG T AAAATGTCCTTTTATTT

TTTAATGCACCAAAGAGACAGAACCTGTAATTTTAAAAACrGTGTATTTTAA 
ACCCTCTCATGTAACCACGAACACOTATCGATITTGCTAAAAATCAGATCAGTACACT 
TTATGCCGGCTCCTGCCAGCCCCCTCACGCTCACGAATTCAGTCCCAGGGCAA^ 
CAAAACCAATTAGGAACCTTCGGTGGTCTTGTCCCAC^CAGAGGGGACT 

AAAATGAGTCAGAATGGAGATCACTGTTTCTCAGCTTTCCATTCAGAC^TGTGTTTCrCCC

AGGGGGTGCAGTTGGGGACCCCCGCAAGGACCGACTGGTCAAGGTAGGAAGGCAGCCCG 

TGAAGGAAATGCTGGCCACCATCTTGGGCTCCTGCTGGAATTT^ 

AGCTGAAATCCCTTTAACTTTTAGGGTTACCCCCTTGGGCATTTGCAACGACGCC 

GCTGAGATTCrTTGGCCGTCrGTCCXSCCCGTGGGTGCCCTCGTXMCGT^

AGCCACCTCCACCTCACCCCCT AAATCCCGGGGG ACCCACTCG AGGCG^ CGGGGAGA 

AAGGCTGCAGCGGGGCGATTTGCATTTCTATGAAAACCGGACTACAGGGGCAACTCCGCC 

TGGCrrTTGGGCTCTXSCCCCTTOCTGCTCCCGGCGTTTG^ 

CCGATTCTCTGCCGGGCTTTGATCTTTGOT

TGGAGCCTCCAGAGGGCTGTCGGCGCAGTAGCAGCGAGCAGCAGAGTCCGCACGCT 
GCC^GGCAGCAGAAGCGAGAGCCGAGCGCGGACCCAGCCAGGACCCACAGCCCTCCCCAGC^^ 
AACACCAGCTCCTGTX3CTTGCGAAGTGGAAACCATCCGCCGCGCGTACCCCXIATGC 
ATGCTGAAGGCGGAGGAGACCTGCGCGCCCTCGGTGTCCTACTTCAAATCTC 
CGTCGCCACCTGGATGCTG(*AGG^^
TTGCTACTCACCCCCCTCCCTTCTCTCCCGCTAGAACTTTGAAGTTTGCCGT^
ATTGCGGGTATTTTCTGAAGGAGGAAGGGGTGGGGGTGGGGGTG 
GGGGTGCCTTCGGGAGAAGCCAGTGCCAGGGGCACCCCAATGGGCCCGA^ 
CCGCCTGCGCCCCAGCCCGGCTGCGCCTCAGCGGCCGGGAGCCGCCAACTCCGGGGGGAGGG^ 
ATATCCATGGACACGTATGCAAGGGCCGCTCGTGCCAGTATTATGCGCCATCTTTGCTCT
AATAATTGGGGGCAGGGTGGGGGCGGGGAGCGGCCGCCGGGC^ 
GGAGGGGCGCAGGGACGCGGCATGGGTAGTTTTGGGGGGACCCCGCTA 

GCGCCCCGAACGGGCAGCCTGGGCCGGAGAGCACGGCGAGCTGCAAGGTCGCGTGGCCCCCAAGA 
TGCAGGGATATCGGCTTGGAGGACCTTCTCCGAGCGAGCCGGGGGCCTGGG 
GCCCGCAAGTATTTTAAAATAATTTTTGAAAG

GGGGCCCCGGAGTTTGAATTCCTGGGGCTCCCCCCGGAGCCTGTAACX1AACTCCCAACCC 

GGTCATTTTCAGGGTTTTTTTATGCACTTAGTTATTTTTTTAA 

GGCGCXX5CGGCCTGGGACGCCACCTTTGTGTCTCGCAGGCGCGG 

GGTGTCGACCCCCAGTCAGAGGGACCACGGAGCTCCAGGGCGGGCCAGGGTCCCGGGGGCCGGCAGCCCGCGC 
GCCCAGCTGTGCCCGCTCCCGCCCCCACCGTGCCAGCCTCGCG

GCGCGGGGGAGGGGGCGCATCACGGGAAGCTCCTGCCGCCCCC^ 

AGCGCGATGGGGGGTGCGGGGGCGTGCGGGGGGGCGGCGCGACCTGGCGGCGGCGGTCACGGGCC 

CGCCTGCAGCTGCTGGGGGCCACTTGCATGTTCGTGGCCTCTAAGATGAAGG 
CTACACCGACAACTCC^TCCGGCCCGAGGAGCTGCTGGTAACCAC^

ACCACGGGGCCGGGGAAGGTGCAGGCGGTGGCGGCCGGCCCGCCTCTGACATATCTGCTCCTCCGAGGGAG 
GGGCX3TCCCTGTCCGGGGAGCGGGCGGGATCCTAGCCGCCCTCGTCCCGCCGCCCTGTGTGCGCTTGCCTGCG 
TCGCGCCCCGCGGTGTGGCCGAAAAGTGGGCGGCGCGCGCCCTCCAGCGGCTGCACGAGGAGCGCCGCGCTCGGC 
AGTTCCAGGTGGTGGGAGGTCTTTTTGTTTCCACT^ 

CGTGTTGCCCCAGCTCCCTTGAGTCCCCAGCATTCGCCAGCCCTCCCCTCCAACATCCAGGACC^

CTCTGAGCCGGAGGTGCGGCGTGGCCCGGCCCCCGTGCT 

CGCTCACCCTGTGTTCGCAGCAAATGGAGCTGCTCCTGGTGAACAAGCTCAAGTGGAAC 
ATTGAACACTTCCTCTCCAAAATGCCAGAGGCGGAGGAGAACAAACAGATCATCCGCA 

ACCTGGGTGCTGTCTGGGAAGATGTCCCCAGACCCCCTCCTGCGCTGGAGAGCGCTCTTCCAGCTCTGG

TTGTTTGTCGCGCTGGATGGAGGGAGATTTGCTCCCTCACGGCGA 
CCTGTGTCCCGTTACTCTGGCCTCGTCCTTCAGGCCAGGCAGCCT 

GCTGGGCCACCTGCCT^GGCGCCTCCAGGGGCGGGGAGAGCTCTCGGCCTGC 

GGTGTCCAGCAGAGGAGCTCGGCTGCCTGAGGCCCTGCCAGGGGTGCCGGCAGCCAGCCGGGCT

TTCAGAGCACTCTCAG CTTGGGC CGCCACCGTGGGCAGCAGAAGCACCCAGTCCTCACTTCCCCTGGCATGGCCCCAGAGG CCT 
CCCTGACATCGCCTTGGCCCCAGAACCCAGTGGGGACAGA 
TCTGACCCCCCTGTGACCGCCTCCTTCCCTGGCCCAGGAGGCCTG^ 
TGCTGTGGCCCACCTTTGGTCAAGCCTCAGTTGTCACATCTGTTTGGGGGCTCACTCT 

GCATCAAAGAGG CAGTAGCATCTTCTCCCCTCCCCAG AGGGCAGAGCCCCCCAAGCCT ACTTCAGAGCTCCCTTCTGACAC CGGT A

GCCCGCAGCCGGTATTCCAGAATGGGTTCTGGTTT AGGCGTG AGGCCTCCCCCACCTC CTCCACCTGCTTG^ CCCTC 
CCCCACGTTTCCAAGCG AGTCCCCAAGGTGGG CAGATGAAGATGCCAAGGATGTCGAC CAGTCTGGATGGGTCTGGGGTGGGGGGG 
CATG CGGCAGACAGGGAGGCATTCTCTGGCTGGTGCTCCTCA^ 

TGAAAGTGG CTTCAGAGAAATG CAAAGTTTCCTGGAGAGAACGTGGGG CGTGGTTCTTGCACAG CCTCCCT ACAGGGTGGCTCCAG 

CAGTGGAGCTCCCCTCCCAGGACCCCTGGGTGCTAGTGGGAGGCA^

CCTTTGTCTGCGAAGGCGCCCCCAGCGGTGGGTGAAGGAGGAGGGACAC 

GGAGTCCACTCCAGG<3TGGGTCCCGAGGGAGGGGCAGGAGACCAGGGGACCCACCCCT 

CCACCCCATGGAACGTAACTGAGCAGCCAGTGCCTC 

TCAGAAACAGGTTGAAGGAGGTGGAGGCGTGGGAAAGAGTCTAGGAAGGTGTTTTTGCCCTCCACGT^ 

GGTGATGCTGGGTGTTCTCCCTGCACTAGGCATTCCT

GGGGTTTCAGGGCAGGTGCGCCCTTGGCTCCGTGGGAGGCCAGGTC 

CTTCCCTTT ACAACGAGGCA.GCGG AT AGGGT CAAATCCTGG AG CTTTGG T GTCT AATTCTGGGTGG CT CCTAATCT AAGCACAG AC 
AGCACCACACACTGGGGTGGGGGCACGAGCTTCTGAAACAACGTGGCCCCAGTGACTCCACGCTGTCT 

AGGTTTTCATTCATGGCTCCCGGCTCTCAGACTGGGTGGAACTGCTCCCATTTAAA

ATi'TC ' r rri'TCTAAGTTGTT ACGGCGCCCAGCAGCCGG C?TTTGT CTCC CCTTCAGGGTGGCTG CCTTTCTTCCCGGCCCCTCGCCG 
GCGGCCCTCTCTTTAACAAGGC CGAAGTTGTTTArTCT CTCGGG ATGAAGTCTCGGATGGGCCGCCACACCCCTGGCGGCCCGTGG 



GGCGATGGGGCCGACAAAGATGCCACACTC^TCCCTGCCGACCTCCGGCTCCCA 
TCGTGGGTGTGACAAAAGGCTGCCCCCAGGCTCCGCTGGGGTGGGGGCCAGGCCAAGAGGCACATCCCACACTGG
ACGGTAGGCGCATGACTGCCCTGAGGAGGGGAGGCCGGCATTCCCCG 
GAAAGAGCCAGTCTGCTGTTTGTCTAGGAGGTCAGTCACAGA 
CAGTATCTGGCAGTGACCAGAGGGAGTTTTGTGCAGACCACAAAGGCT 

GCCCAGATGTGCGGGACAGTCCCOUSGAAGCCCCAGGTGAGGGCACTGGTGCCCTCTTGGGAAA^ 
CTCttXKXrCCAGTCCTCCAGGGGTGTCCCATGG

TTGTTGGGCTAGATCAGACGGCCCAGAAAAGTGTTTT^ 

TTGGGGTGT ACTTGGT CTGTGCTCTG AAGGT CACTGTGACAGTCATGGTCCCATGGT AAGGGGCATGGGTTGCTGG AAGAGCTCTT 



GGGCTT ACCCCTCAAGGGACACGG AATGGCTTCATCAGTACC CTGCAGCCCCGTGG CCTGGCCCGGGTGG AGGCCTAGGCTTCAG C 
CATGCGATGTCCCTTCMAATATGACTTGTCTC

GAAGCGACATCTCTACGTCCTCATATTTGCGTCATCTAATTTTGTTTTTGTGAATACGTG 
AAAAGGATGAGAAGGCAGTGATGTCCCCATCACCTGTCCTGTGTCTTCCCGTGGCTTTCT 
AGTGGGGCTGACGTTCCAGGAGGGTCCGTGGGCCAGGCTCTTGCTCTCCGAGTGCCCAGGGAT 

GATGTGGAGCCTCAGATACCAAGTG CTTCCCT^CAGGCCGGGCCGCTTGCTCAG AGCCAGCACACAGGGATGC CCGGATCACGSGGG 

GTGGCAGCGGGGAGCGTGGTGGCCGC^Un'GCAAGGC

CTTCCTCTCCAGAGTGATCAAGTGTGACCCAGTAAGTGAGGGTGATGTCCCAGG 

TAGTGCCACGGAAATGCCGAGGCTGGTGCCAAGGCCCCCAAGGGTGACAAGGTTGGG^ 

CACAGACTGACAGGGCACCGGCTTCTTCC^CTGCT 

TG^TCTGTGGGGTTAGGCCC^GGCGAGCTTTGTTTGGGC 

AGGTCCGAGACACCGGACAACGGGCGGATAGAGACAGCCTTGTTGTTTACGGCCT 

ATGACTGTGT LT 1 TCTT C T T A AAAATGCCATTGTTTTATTCCCGAGT C ' r T T ' l ' C 1 1 AAAGAAAGAATTAAAATGACAATCAAAAGGG 

TTTG TGG CATTT ACCAAATTAG ACCAG AGAGGTGG CCGGGTCAG CCG C CGG CCCCG CGGTGTGTGAGGGAGTGAC CGCCTGAC CCC 
AGCTTGGGGCTGGGTGGGCCTGCAAGACCCGTTTTGGCTCTG

CTCCGCACGGGTCT CAGCAGATGCTATCTAGGGTCCACCTGCCTGTCC CCTGCCTAGTGGTG CCTCTGTCCCX3GGGACACTGGGAG 

TAGCGGCTGCCCAGCCCATGTGTGTCTCGGAAGAGGAAGAAGCTTTTTTG 

GTGTTCTCGGCCATGGCCTCCCTTGCACCCTGCCCCGTGTTATCCTTTGGGGGTGGTGAGGT 

GCGAGGAGCCCGCAG<rTCTCTCAGGAAAATt3GCT 
OVAAATGAAACTCGCGTCTGCACTTTTCATTTCGAACTCCACGCCCTC

GATGTTGCTGTCTTCGGGCAGTTGTGGGAAGTTGGGCGCT 

GTCTCACAGCTGACAGACACACGGGGTGAAGTTACCCGAGGCGGAGTCCACTCTC 
CCAGACTCAGAAAAACCGTCCACAGCAGAGGCCCCTGCATTTTCTAG 
TCTGTGCAAATCGGGTGCGCAGTGCCACACACCAGTGACTTITCGCGGAGGAGCGTGCT 
GAACAGCTTTGTCCACCGGGGTAGCCTTGCAGGCAGCTGTG

GACCCTGGAGTGGGTCATGGGCGAGGGACGGGCCGCAGGTGAAGAATCCCTGG^ 
AAGCTGGCTGGTGTTTTAGGTTGAACGTCAGGAGTCTTGTA^ 
GACTGTTTGTGCAGCCAGCCCGGCCAGCTTCATTTGCCATGATGAGAATTTATCTC 
AGGTACAGGCGCTAAAATGTCATGACCTCAGTGGTCCACCTAA 

GTCCAGCCCCTTCTGGTTGGGAGTTAAGTGGCACCTGTGCGGCACGTGGTGGGG

TGTTTCCTGGGCTGCCTAGAGACTTGGCTTGAAGCCCTAGCGTGGCTTCCTC 
CCGGTTCTCCATCC^GAAGCCCCCGGGCAGTAAGCAGCCACTTCAGGCTGCGTG^ 
CCTGGCTGGGCGTGGCGTTCCAGATTTCACGGCTGCTCTTTCCCACTGACAGTGTGGTC 
CCAGAGGGTGGAGGTGGAGGACTTCCAGGAGCGTCCGTCGCACTCCACCCGAGGGCGAGCACCTCAGTGGCC^ 

CATGCTGTGCCAGGCTGATGGCTGGCCCCGGGGCACAGGC

CTGGCCACCCAGCATTCATCCTCAGTCATGCACGGC CCAAGGCTTCGACAGCCATTG ATCATGGAAGG CCAGGTTCACCTCAAGGG 
CTGCCACATGGAGAGGTTAAGTCTGAAAAGGCTGAAAAGGCAG«3TTAAAAGGGC 
CCAGGGAGCTGGCACGGCCAGTGGGAACAGGCGGTGAAGGCGCTGTTGGACATGGGGACGGGC^ 
CAAGCATCTX3GTGTCTTGTGGCTCCAGAGACCAGGTGGGAGGTGGAGGCGTTTGGTCCTGAGTGTCCTGA 

CCACATCTCGCTCAGGTTCAGAGGAGGCAGCATGGGCCGAGGGACAGTTTTTGG

ATGGCACCTGGGAAGGGGCCCTCGCTGCAGGCCCCTTCTAAGGACCCCCTCTTCCCACCT 
GCCTCCGGGCCTGCCAGGAGCAGATCGAAGCCCTGCTGGAGTCAAGCCTGCGCCAGGCCC 
GAGGAGGAGGAAGAGGAGGAGGAGGAGGTGGACCTGGCTTGCACACCCACCGACGTGCGGGACGTC 
AGGCtXMCGCCACCGCCACCCGCAGCGAGGGCGGAGCCGGCCCCAGGTGCTC 

ACCAG AAGGGAAAG CTTCATTCTCCTTGTTGTTGGTTGTTTTTTCCTTTGCTCTTT CCCCCTT CCATCTCTGACTTAAGCAAAAG A

AAAAGATTACCCAAAAACTGTCTTTAAAAGAGAGAGAGAGAAAAAAAAAATAGTATTTGC^ 

GTTGTGCTACAGATGATAGAGGATTTTATACCC CAAT CTCTGTTGTAAGAATA 

GGCATTAACACAAAGGAGGCGTCTCGGGAGAGGATTAGGTTCCATCCTITACGTGTTTAA 

CATAGAAAAATTCAGCAAACCATTTTTAAAGTAGAAGAGGGTTTTAGGTAG 

AGCTGTAGTGGGGTTCTAGGCATCrrCTGTACTTTGCTTGCTCATATGCATGTAGTCACTT^

CCGTAGGTAGATGT<nAACCTCTTCACCTTATTCATGGCTGAAGTGACCT 
CCTTTGCGCCTGTGACCACCACCCCAACAAACCATCCAGTGACAAACCATCCAGTGG^ 
GGGTCGGGAAAGGCCACCTGTCCCACTCCTACGATACGCTACTA^ 
TACTCTTCCTATTTTTGTAGTGACCTGTTTATGAGATG 

cCCACAGCTACTTGGTTTGTGTTCrrCTTCATATTCTAAAACCATTCCATTTCC^

GCGCTGTTTTTGTTGTGTGTGCAGGGAGGGCAGTTTTCTAATGGA^ 

TTGAGGCAAGTGTGGGCCACTGTGGTGGCAGTGGAGGTGGGGTGTTTGGGAGGCTGCGTGCCAGTCAAG 
TCTCAC^TTGCCAGGATGATAAGrrCCTTTCCTTTTCTTTAAAGAA^ 

AAGTAGGGACCTCAGAGGTTTACCTAGAGAACAGGTGGTTTTTAAGGGTTATCTTAGATGTTTCA^ 

TAAAATATATAATTTATAGTTAAGGCTAAAAAGTATATTTATTGCAGAGGATGTTCATAAGG

TCCCCTTGATTTAAACACACAGATACACACACACACACACACACACAC^CAA^ 
TTTATTTTTAAAGATAGATCCTTTTATAGGTGAGAAAAAAACAATCrGGAAGAAAAAAACCACAC 
TTGGCGTTTCCCAGAGTCATCTGATTGGACAGGCATGGGTGCAAGGAAAATTAGGGTACTCAACCTA^ 
CTTATCCCCrGCCCCTTCCTTTAAAAAACTTAGTGACAAAATAGACAATTTGC^ 

TTAGGAAGTGTTGAAGGGAGGTGGCAAGAGTGTGGAGGCTGACGTGTGAGGGAGGACAGGCGGGAGGAGGTGTGAGGAG

CCGAGGGGAAGGGGCGGTGCCCACACCGGGGACAGGCCGCAGCTCCATTTTCTTATTGC 

TTTGGAAATATTCACATCGCTTCTGTGTATCTCTTTCACATTGTTTGCT 

ATACTGCCATGTACTAGTTTTAGTTTTCTCITTAGAACATTGTA 

AATTTTGACTTAATGTGATTACTGCTCTATTCCAAAAAGGTTGCTGTTTCACAATACCTCATGCTTCAC 
AGCGGGCAGGTTCTGCCTGCTTTGGCGGGCAGACACGCGGGCGCGATCCCACACAGGCTGGCGGGGGCCGGCCCCG

CGGGGGCGTAGCATCATAGTAGTTTTTACAGCTGTGTTATTCTTTGCGTGTAGCTATGGAAGTTGCAT 

TATAACAAGTGTGTCITACGTGCCACCACGGCGTTGTACCrrGTAGGACrCTCATTCGGGA^ 

TCAAGTTTTGGGTATGTTTAATCTGTTATGTACTAGTGTTCTGTTTGTTATrGTTT^ 

AGACTCCAAATCTCAATGAAGCCAGCTCACAGTGCTGTGTGCCCCGGTCACCTAGCA^

GCTGCGGGCCCACGTGGTTGGGGCCCTGCCCTGGCAGG<3TCATCCTGTGCTCGGAG<5^ 
ACCCCTCCAGAACACGGCTCACGCTTACCTCAACCATCCTGGCTGCGGC^ 

TGTCTGTCGTGATGGGGCAAGGGCACAAGTCCTGGATGTTGTGTGTATCGAGAGGCCAAAGGCTGGTG^ 
GCGGAGTCTGTCCTGTGACGCGCAAGTCTGAGGGTCTGGGCGGCGGGGGGCTGGGTCTGTG CATTT 
CCCAGCACCAACATGTAACCGGCATGTTTCCAGGAGAAGACAA^
AGCGTGGTGCCTGCCTCTTTGCTTCCTGGGCTGGCCGTG

GGCTTGGCGGATGTGGGAGGCCGCAGCCTGTCCTGTGCGCTGTGGGAAGTTCMCA^ 
TCACGCCACCCGCCGTGACAACCAAGAATGTCTCCTGACACrcCCACAT 
ACACCCCTCCCAACTGGGAGGAAGCCCTCAGCACAGGTCT 
ATTTGCAGGCTG CCTGG CTGGGTGAGCCCAC CTCCAACCACGCG AGACAACAG CTCCGG C CTGGGTG ACGTG AG CGGTG CC CATTG

ATGGGGAACATCTTCCCCCTCTTCCTTGCCCCACCAGTTTGTCTTCCCGGGTTATT^ 
TCGTTAACCCTCTTCTGGCGCAAACTGCTGTTTGCTCTGGATGAATCATC 
ACCGTGTCAGGGCCATGATCCGGGGTGGCCITTCACTGGGATCGTGGGGACCTGGAGG 
CCTCTGGATTTCAGGTGCACGTGACTGGACTTAACTTCAA^ 
TCGGACCTTGGTCACTGCTCCTGAACCCATCTGTGAGGCrGGTTTG^ CCTCATCC CAAGCTAAGTGGAAGCTCAGGTCCCAAGCCA

CCGATGGGTGCTACTTGTCAGCTGCAGGTTGAA 

ACTCCTCCCAGGGCTCAACAGTGGGGCCGGGGTGGTCG<5TGTGTTGGCTCCACAGGCGCCT 
AGGGAAACGCITGGCCCCTGTAGGTCTCCACCAGCCTCTCCCCTGAGGGTGGGGGCT 
TTGAGTGGGTGGGGGAGCCTGCAAGGTGCCCCTGACAGGTCACATCAGAAAGAGCT 
GGTGGCCACTCGGGTGGCTCACAAGGCCCAGCTCCTCCTTGCTCCTGGGCAAATTACT

CTGGGTTTACATTCATCCCCCGAGAGCACATGGGTGTA 
GTTGTTTCACGGAGCAGGCAAGGGAGCCATCGGATCCTGT^ 
CATGCAGGGAGAGGAGCACATGGGTCTAGGGATCTACTTTAGTGTTT 
TAGGTCTGGC^GAGCCAAAAGGCAGCGACATGTCTACTGGGAGAGATGGAGCTGAGCGCGGGGCT
GCCGGGGCCCTGGGTGGGTCAGGTGGGTTCACAGCCAAGTGTGTAGAGAGGGCTTGGG 
CACAACCCATTCTCTCTGTCTCGGCATCTGTGGCATCCCGTGATGGGTGGGTCTGTACACA 
GGGGTGTCTGTACACCCCTCACCCCTGGCTGTGCC^ 

GTTCTGTACATCCCCCAGTGATGGGTGGGTCTATACATCGTGGTTATGCCACGAGCTCCAAGG 

GCTAACAAAGACATACCCGAGACTGGGTAATTTATAAAGAAAAAGAGGTTTAATAGACTCACAGTTCT

ACCATCACGGCGGAAGGCGAAAGGCCCGTCTTCCATGGCAGCAGGCAAGAGAGAATGAGACT 
AAACCATCAGATCTGCTGAGACTTGTTCACCACCATGGGAACAGTATAGGGGAAACT 
TCCCTCCCACAACACGTGGGAATTATGGGAGCTACAATTCAA 
CGTCTGGCCACTCTAGGCCCTTGTTCCCAACTCTCAAACTCCAC 

GAGACCTCGTCCCTTGTTGGAGTATCTGCGCTACCCAGAGGGTGATGGTAAAAACTC

TCCGCAGTCAGCTCTCCCTGCAGCCCCTTCGCAGAAGCCTGTCCTCACACGCCCGCCACCCCT 
CTCCCTGCAGCCAGACAGTCCTC^TCCATTCTTCATC 

GTCCCTGGGACTCAGCCCCATCTCTCCCCAGCCTTGACCCCAATAAGCrrCGATGT 
GCATCCAGCTTCCTCTTCTGCGCTTGGCCCACCGCCTCGGTCCACAGGAACCTTCATCTCT 

TGAGGGCTTCCCCACCCCACAGCAGGCAGGTGGTTCTGGGAGTCTGGGATTGTCATGCCTGGGCACCTTCCT

CCTCCAGGACGGAAATCTCCCAGATTTCTCTCTGCCTCCGTCTCCCCACCCTGGCATACAGTGGGTCCTAGGA 
GAAGGGAGGTCGTGTCGGCTCCTACAGGGCAG CCCCAAG CTGCTGGGCTGAGGTGCAGAGCCATGGGTCCAGCCTGACCCGCATG C 
CGGGAGCTGATTCCATGGCTGGAAGGGTCGGGGGTTCCCCAGGAGCCCAGACAGCA 
TGAGGGGACAGAGGAAGGAGGCTGGGCAGAGTGGGCACGTGGACTTCGGACCGCGCTTGT^ 

GAGGGTGGTGCACTGCCCATGGGAGTCCAGAGAAGCCAGGTCCAGTC

ATGCAGCGTGACCCTCCTTGAAGCACCCCAGG AAGCAGTCACAGCCTCGCT CTGCCCAG CATTGGGGGCCTGGCCTCTGAATCTGG 

TGGGGAAGGCTGTGTTGCCTGCTGGCACCCGCTCTATCACCAGTCCCACCATAAGGCTAGTCCCCAAAGTCAG 

GCCAGCGAGGGCCAGCAAGGGCAGGCCTGTGCAGGGGCTGCAGCGTGGGGTGTGCGTCCrc 

CAGGGCTCTGTCATCCCAGG AATGTCTTTG ATCAAGAGTTTATG CAAGACGAGGAAAGCAAGCCCAGGAC CTCTAATCCGTTCCCA 

CAGTCCCCTGCCCGCCAGGGCCCCACAAGCCC^

AGCTCTGGGGCCGCTGGCCTTGGGGCCCACATCCGGGAGCATTGTCACCGCCACTCACGAGGGCA 

GAGAATGCAGAGCCCTCTGCTGAAGGGGCTTCGGTGGGCCTTGAGCCT^ 

ACCTAGCGGCAGCTGAGACAAGCCTCTCAACCGGCAGCCAC 

ACCCTGGGCCCTCTGAGCCTGTCGAGGCAGTCCTCCTTGCCTTTGTTTAATCACGGGAGGAA 

GCACTGTGTGCCTTGTGCTCAACACCGAACACAGTTTCTC

CTCTGGTGGCAGCCACAGACACAGCCCTGCCCTCCCAGAGCCT 
TGGCAGGTTTTCTGGAAAGGGCCACAGAGAGAGGCTTCAGGCTTTGTGGGCC^ 

G CCGCTCTGTG AATG AGTGAG CATGGCTGTGTGC C AAT AAAG CTTT ATTT ACAAAAAT AGG CAG CAGACTGGATT CAGGCTGAGGG 
TGGTAGTTTGCCGACCCCTGTTGTAGATGAGTAAACCG AGGCC 

TGACCAAGCCAGCCTGGTGGGTCAGCCTCATGGAGAAGAAAAGGCT

CATTTGCAGCCAGGCCCCACGCACACAGCTCAGGGCCCTGCTTCGCCAGCT 

AAAAGAAAACTCACCGGCATGTATAGGCACACATTAAGCACACACGTTTTGGAGTAAAAG^ 
AGCTTTCCAGCTGTGCGGCCTCATTTCTCTGGGCCrrCAA 
ATGGGGAGTCGTCTCCATCTCCTACTCACGACAACGACCAGCACAGA^

TCTGGGGTGCTGAGGATTCTATGGCTCAATGTGTGTGAAGCACTGCTGAGCACCGAGCTTGGGGTGCAGTGATCACTCAGGGAACG 
CGATCATCCCCTCGGCGCATCCX3CATAATCACCATCAGCATTCCATGCACTGCCCAGGTCTGG 
CGGCCGCCTGGGTGGTGGCACCAAAGCCGATCTTACTTCCCTCCTTTGGCACATTAAGTAGACA^ 
CGGATGCCCCCTCCTGGTGTTGACAGCACCCAGACTTTGCTCCAGG^ 

CCCATCCCCAGCTCCAGGCCAGGTCTTCATAGGCTGAAACCCAACTGCACATCCACACCCACCAGAAGAGCA

GGGTGAGAAATCAAATAGAGCCATTGAGTCTTGAGGAGACATTTTCTGGAGCTTCTGAA 
ATTCCCAGCAGGGAAACACAAGCCCAGGGGTGGCACTGCCCCCCTGCAGCCCTAAGGGAAGACTGAGCTGCTC 
GGGGAGGCTGAGCGTAAGGCCAGCCCTGAGGGAAAGCAGGGAATGGGTCTCCAGACAAACTGCTGG 
ATCCCCCTTTTGTTCCAGAAGCCAATCATTGTCCTTCATTAAGTCAGTTTGATATGGGCACTGTC 

TTAACTATTACAGCTTGTGTGTGTTTTCTTTTTTTTTTTTGAGACGGA

CTCGGCTCACTGCAAGCTCTGCCTCCCGGATTCACGCCATTCTCCTGCCTCAGC 

ACCACGCCCAGCTAATTATTTGTATTTTAGTAGAGACCGGGTTTCACCGTGTTAGCCAGGATGGTC 

TCCACCCGCCTCGGCGTCCCAAAGTGCTAGGATTACA^ 

AGATCTAACATGTTATGATTGATATAAACAACAGGAGCGACTGACAAAGTTACCATGTGATCTT^ 
GCTT AGCTGGGAGTTCCACAGGGTCAGGG AATGACT CCTCTTGGGG CCACCAGTGCCCAGAAGTATGCCTAGAACTGCTCTGTCTG
AAGGGACAGGTCCCTCTTCCATGGCACACACrcrGTGTGTAACAGAGCACACGTC

CACATCCATATCCCAAGTTCTGAGGCCGGATAGTGCCTGC^^ 
ACCCTGACTCAGTCCTCTGTGCAGGGCAGCTGCGCCTCCCAGJUVGGCAGAO 
TTTCGGGGGCTCCCGCCACTTGCCCTCAGTGGCGGC^ 
GCCAGAAGTGTCTGGATGGTGTGGCCGGCA.CCCTTACCCTCCA^
TTCTGTGGCTGCTGTAATGAATTTTCACACACTCAGCA 

ctgcagtgcccgctctctcccagcttctggtggt^ 

HUMAN SEQUENCE - mRNA
gcagtagcagcgagcagcagagtccgcacgctccggcgagggg

agcgcggacccagccaggacccacagccctccccagctgcccag 

tggaaaccatccgccgcxk:gtaccccgatgccaacctcctc^cgaccgggtgct 

gcgccctcggtgtcctacttcaaatgtgtgcagaaggaggtcctgccgtccatgcggaagatcg 

ctgcgaggaacagaagtgcx^ggaggaggtcttcccgctggccatgaactacct^ 
agagccgcctgcagctgctgggggccacttgcatgttcgtggcctctaa

tgcatctacaccgacggctccatccggcccxsaggagct 

CGCAATGACCCCGCACGATTTCATTGAACACTTCCTCTCCAAA^ 

GCAGTGCAAGGCCTGAACCTX3AGGAGCCCCAACAACTTCCTGTCCTACT 

TGACCCAGACTGCCTCCGGGCCTGCCAGGAG CAG ATCGAAGCCCTGCTGGAGTCAAG CCTGCGCCAGGCCCAGCAG AACATGGACC

CCAAGG^CGCCGAGGAGGAGGAAGAGGAGGAGGAGGAGGTGGACCTGGCTTG 
GGGGCCCAGGCAGGCGGGCGCCACCGCCACCCGCAGaSAGGGCGGAGCOSG 
GAGCATTTTGATACCAGAAGGGAAAGCTTCATTCTCCTTGTTGTTGGTTC 
TTAAGCAAAAGAAAAAGATTACCCAAAAACTGTCTTTAAAAGAGAGAGAGAGAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

HUMAN SEQUENCE - CODING 

ATGGAACACCAGCTCCTGTGCnGCXSAAGTGGAAACCATCCXS 

GGCCATGCTGAAGGCGGAGGAGACCTGCGCGCCCTCGGTGTCCTACTTOUVATGTGT 

AGATCGTCGCCACCTGGATGCTGGAGGTCTGCGAGGAACAGAAGTGCGAGGAGGAGGTCTTCCCGCTGG

CGCTTCCTGTCGCTGGAGCCCGTGAAAAAGAGCCGCCTGCAGCTGCTGGGGGCCACTTGCATGTTC 

GACCATCCCCCTGACGGCCXJAGAAGCTGTGCATCTACAC 

TGGTGAACAAGCTCAAGTGGAACCTGGCCGCAATGACCCCGCAC^ 

G AGAACAAACAG ATCATCCGCAAACACGCGCAG ACCTTCGTTG CCTCTTGTGCCACAGATGTGAAGTTCATTTCCAATCCG CCCTC 
CATG^TGGCAGCGGGGAGCGTGGTCTCCCGCAGTGCAAGGCCTGAACCTGAGGAGCCC
CACGCTTCCTCTCCAGAGTGATCAAGTGTGACCCAGACTGCCTCCGGGCCTGCW 
CTGCGCCAGGCCCA£CAGAACATGGACCCCAAGGCCGCCGAGGAGGA 
CACCGACGTGCGGGACGTGGACATCTGA 
Table 8

MOUSE NOMENCLATURE 
ICSGNM Ccnd3 
Celera mCG15576 


HUMAN NOMENCLATURE 
HGNC CCND3 
Celera hCG16683

MOUSE SEQUENCE - GENOMIC

NNNNNNNNNNNNNNNNNNNNNN^ 

AGGGCTGGGATAAAGGCGTATACTACCACATACATCCCAAAATACATATAAAAATTAATAGAGCT 
GTGATAGTCTCTGATTTGGGGATGACACCTTTCTT 

GTGTAGATGAAGTCTAGCCCAGTGTGTAAATGAAGGCrAGCCTGGTGTGTAGGTGAAGGCTAGC 
GAACTTTTACTTGAAGAGTGAAAGTGATTTAAACAATG^

AGGACTGGTT CAGT AGCTT AG CAGTGTCACT CAGG CG AGGGATT CATT CAG AT AATCCACT AGG CC CCTTTCTTCATTGCAG AGAT 
GGCAGAGGTGGCCCCAGCATCJVTAGTAGAGTGGTT 

CTGAGGCAAGAACATTGCCAGCCTAGGCTACATGGTGAGAATCTGGTCAGGCAGG^ 
AAAACAAAAO^AAAGCGCAGGGCATATAAAACAGACCTTGTTTGCT 
GTTCCCTCCCTAGGAGCTCACGAGCTGAGCATAGTGCTACACAC^

AAGATCAGGAGTTCCAGGCCCAATCAAGTTTAAGTCTAGCCTGAGGTCC^ 
TCAACTGTACATCACATGAGAGGCTAGCITGGGCTACGT^ 
ATGGTTCAGGAAGTAAAAATGCATGGTGGAGAAGCTTAA 
TGACTGCTCTTCCGAAGGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCT 

TCCACACAATAAATAAATCITTACAAATAAACAAACAAACAAAAACCCCT

ACACACCAGGAGGTTGTCCACAGCJTGCAAACACACACATATC 
TGGTAAAGTGCCTTTAATCCCAGCACTCAGGAGGCAAAG^ 

GTTCCAGGACAGCCAGGGCTACACAGCAGTGAGGCCCTGTCTACAAATAAATAGACA 
AACAGATCAGTTACACAGTAAGGCGTATAACAGATCCAGT^ 

ATGATAAAATATAATGGGGCTTGTAAGATGACACAG<^GGTAGAGGTGCTTGCTGCC^

AGAGAACTAACTCTCTCAAGTTGTCTGCTGACTTCTACXSTACATATGCATG^ 
AAATGAAATAAACCAGTTATAATGGGGAGCTAGCTATAGCTCAGTAG^TAGAGTGCCT 
GTCCACAGCATTGTATAAACTGGTCAAAAGATGATCATGGTTGTAATCCTCGAATTCAGGAGAT 
CCGTCTTCATCTACAAAGTGAGTAAGAGGCTAGCCTGGGCTGCATGAGACCTTATCT 

GTTTTCCAGTAGGCACAAGGTTGTATATTCCTGCCGTCT

GTATTCACTGTGAATTCTAGGCCAGGTAGACCTAGCAAGAGTTGC^ 

AACAGATTCCAGTCTGATGTACACACACATATCAAACCGAGCGGCTATTTGTTCA 

TGTTTTTTGTTTTTTTGAGACAGGGTTTCTCTGTGTAGCCCTGGCT 

CAGAAATCCACCTGCCT CTGCCTCCCAAGTG CTGGGATTAAAGGCGTGCACCACCACGCCCGGCTCTATTTGTTCAATTCTTGA 

ATGAGGCTAGCTCTATCAGGAGAGTTCTGACCTAGAGGGGCATGGGACITGTCATTTC^GA^

AGGTACCTGGATTGCTTAGTGGCTTGTTCTATCGGAAGCTATTGTACCTGCCCACAGCAAGGTG 
ATCCTGAGTCTGCCTACAGAAGCCITCAACCAATGGC^ 

ATCATGATACAGACAGTTGTAAGCTGCCATGTGGTTGTGGGAATCGAACCTGGGTCCTCTGGAAGAGTAGTCAGTGC 
CTGAGCCATCTCTCGAGCCCTTGCCCAGCCITTTTTCTTAAAATTGTATTTTTAAAA 

TGAACATGTGGATCCTAGGGGTTGAAATCAATTTGCCGGGGTT^

GGGAAACCTTCCTGGAAGCAGGATGCTACTCAGTATTTTGATCACTCATAA^ 
ATGTTGTTTTGTCTTGTTTGTTTAATATTATATTCTACAGAACATTATAT 
ATTTATTTACTTATTTTTGAGAAAGGGTCTCACTAAGTAGCCCTGCCTGGCCTGGAACT 
CTCACAGAGAGCC!ACCCACCrrcrGCTTCTGGAATGCAGGGACTAAAGCCATGAGCTACCATGCCTAG 

GTTTTTGTTTTGTTTTTCAAGAGAGGGTTTCTCTGGGTAGCTTTG^

CTCACAGAGGTCCATCCATCTCrGCCTCCTGAGTGCTGGGATTAAAGGCATC 

AGCGGATAGCTGAAGATGCTCTTGAGCTTAAAACTCAGCTTCTTTATTTTCTTTATTTACTTA 
TCTGGTGTAGCCCTGGCTGACCTAGGACTCTATTTGTAGCCCAGGCTGGCCTTGATCTCACAGAGATCT 
G AGTGCTGGG CTGAAAGGTGTGT ACTGTTGCACCAGG CTAGCGG CTTTTTTTTTTAAAAAAAAG ATTTATTTTATTTATGTATATG

AGCACATTGTTGCTGACACACCAGAAGAGGGCATCAGATCC^ 
CTCATGACCTCTAGAAGAGCAGACAGTGCTCTTAACCATTGAGCC^TCTCTCT 
TTTCTTTTCTTTTCl M i'Tl'ClT'I'TCTTTTTCTTTTTCT 

TTTGG AGACAAGGTCTCATTTTT AG CTTAAGTTGG CCAGGAATT C ACTCCAT AG CCT AGGCT G ACCTTG AACTTGGGG AG ATT CTC 

AAATTGGGATTCCAGGTCTGTGCTACCAGGCCCAGCTTATACAATTGTGTTTATTT

ATGCGCTGGGGTATTTTGCCTGCATGTATTTATGTGT ACACG CATGCCTGGTG CCTG CGGAGG CCAGAATG AGCCTCAG AGCCCCT 

AGAAATAAAGTTACAAATAGCTGCACAGGTGCTGGGAAGTGAACCTGGGGCCT 

ACACrCTTTCrAGCCTTTAATTTCTCAATTGAGGTAAATGTTATTAAAA 

CATACAAG AAGC CCAGGGTTCAGCCTCAGC CACTATTATTATGTTGAT AGTTGTAAATACCCATACACG AG^ 

TACAAAGCCAAG CCAT AATCAGTTGGACAGTGGTG^CGCATGCCTTT AATCCC AGCACT CAG AGGCAGGCAGATTTTTG

AGTTCGAGGCCAGCCTGGTCTACAG AGTGAGTTCCAGGAAAGCCACGGACT 
ACATCCATAATCACMCTCAACTACCTGGGCACCCACCCCCCTC 

TGGGG ATTGAACCTAGGGACT AAATGAATTCCGAATAAGTGCTTTCCCAGTGAGCT CT CAGAGAAGAGTGTGTG AGAG 

AGTGGAGAGGGTAATTGTTAGTATATAAGTAACCTTTTTGCTTTTC 

CAGTAGTCCAGGCTGTCCTTGAACCTTGCAGCAATCCTCCTGCCTCAGGAGGA

TTGGTTTAGAAACTTICCACATTTGC^TGTTCCCTTTC 

C CGGT AGGGT ACGG ACTT TGTT ATT AC C CAACTGG ATG AAAAG AAAAACAG ACACG AG AG AAC CAGT AGGTTTCAAACT CACTC CA 
GAGTGGCCTCATTCCGGTGTTGTGTCCACCTTCCTATTTAGGGGTG^ 

CTCTcrcTcrcTCTCTCTcrrcrrcrcTiri^ 
ACACACACACTGGGTCTGAGAGTAAGGGTGAGGT^

TCCAATGATAAAGAGGTGAGGGAGTGGTAGGAGGAAGGACGCTCAGAGTCCTTC^TCCT 
TTACATTCCAGCTGTCCTCTAATTTACCATTTCATTGAGTAACITT 
CCCCTGGTrATGGGGAGTGTGAGCTGGGCTGGCTCl 1 1 rciTrCCCTGTCTCTTTGT 

TAGTTGTT TG i r r i ^TTGTTGTTGTTTTG r i U ' ri ^AGGC^ 
GCTGCCCACCTTGCTTCCACXrrTCTAAGTGTTGAGATTCC 
TGCTGGGAATTGAACTTAGAGCTTCTTGCATCTTAGCCTAAC^ 
TTCTGTTCTTCGAAGGGCAGTGATGOTGAAGCCGGATA 
AACCA.CCCTTCATGTAGCTGGCCAGCTCCTTGGCCCTGAAGGCCC^
AGGACTCAACTGTCTCTCTCTCTGGGTGCCCTTTCTCAGTAGGTC^ 
AGGCAATGGAATGAAGTGGTCAAGAGTTGGTACTGACTGCTGCTTTCAGATCT 
TCTAACTCAGTGTGCTTAATTTTTTGTTTCCTTCATTCTGTCT^ 
CCCTGTTTGTTATTCTGAAGACACTCTCATCCT1TTCTATTCCCATCGGTC 
CAGCTCAGAAGNNNNNNNNNNNNNNNNNNNNNN^

GCTGAACCAAGTGGATTCTGGGAATATGGTGGCTGTTGTCTTGGCAACAGGTGTTT 
TGGAGGTCCTGATACCAACAGT 

AGGTCCTGATACT AACACAAAAG<SO^GG CTTTCT CT CTCTCTTCCTGG AAGCTTTCTTCCC 

TTTGTTT CC C AAACTCT AAT AAAATGGT CTT CAAG CTT ATT AAAACAAAAAGAAG CC ATG GGGG TT AG AAAGGTAGCTCAATGG TA

GAGCGTACCTACCTAATATGACCAGCATCGATTTCTCCTC 
CAGCCTGCITTTACCAACTCCGAGCTATCTCACCrCrrCCGTGG 
ACACAGGGCAAAAATAATATACTTTCTGTAAGGCTGTTGTGAAGATAAAATGAT^ 
CATGGTATGTGTACATGAAGACCTGTTACTGTTGTTC^TACTCAC^ 

TTTATCTCTGCCCTTCAGGACAGGAAGTTTGGTGGTGCATTCCTGTAATCTC^

AAGCTGGGGGCTATTCTGGTCTACCTAGAGTCAGAGCTACATAGAGTCAGG 

GT CATAAAACT AACAG AATGTT AGTGTGGGT AAGGCATG CAATTGGG AGGT AAAAG CAGG AAGAT CAGAT CCAGCCT CAGTT AT AT 

ACTAAATTCAAAGCTAGCCTGGGTGCCGTAAGACCCriTGTCTCAAAAAGTGTCCCCrC 

TATAAACATGGTGGTACGAATTTGTAATCCCAGCACTTGGGAAACTGAGGCRCGAAGCTG 

AGACAAACTCTTTCAACGAGGAGAGAAACCACTGGGGCAGTGACCTTCTCT

GAAAAGCCATCCCCATATGTGGATTCTGACCCCTTGCT 

AGTGCATAGTCCAGTGTGTAGACTAGTGTGTAGATGGATCTCACTGCTTATATATTGTCCT 

TGAGCAGAGAGAGCAGTGTCTTAGTGGTGGGCTTTCTTCTTTCTTTCCCT 

TTTCCCTTGATTTTTGTTTTTGAGACATACT 

GAACTCAAAGAAATCTGCTTGCCTCTGTCTTCAAAGTGCTGGGATTAAAGGGGTGCT

CTTCCTTCCTITCCTTCCTTCCTTCCrrTCCTTGTTATTTGAGTC^ 

T ATCATTTGG TTTG CTTGG TTTTGTG ATAAAAT CTT AT AT AGC CC AG GCTG CT CCTGAACT CATT AGGAAGCAGACAGTGACCTT A 

ATCTTCTGATCGTTTCIXKiGTTCTGCAGTGCTCGG 

TATATCTXSCAGCCCACCCCCTGCCACCGTCTGTGTGTGTGTGTGTGT^^ 

GAGAGAGAGAGAGAGAGAGACTCTAGGCAGGCCTTGAA

TGGGTCTGAGATTACTGGCTTGCTCCACCIAC^GCCAGCACCAAATGGCT 
CCGTTGGTGCAACGCAGTGTCCCATTCAGCAGACCTGAGAAACTGCATTTCTTCTTTTGCT 
GAAGGTGTGTGCGCAC^TGCTACATACAGCCTTGGTGTAGAAGTCAGAGGACAGCTT 
GTGGGTCCCAGGGATCACACTCAG^TTGCCAGACrTGATGCCAGGATAGTTrAC 

TGCTCCCAGATGCTGCTGCAGATCTGCGCAATTCGTTAAA

TCTTTGCAGCTACTATTGTGCCTCAGTATCATGGTCTC 

GGAGTGACCCAACTATGTACAAAGTTATGACCACATTCCCTAGAGCAATGAAATATGACGTATAC^ 
CCTAGTTCTTTCCTGGGTGGCTAGAATCGCGACGGGGCCCATTGATGG^GAAGAGGGCACCCTTCC^ 
TTGGTCAGAAGGCJVCCCGAGGGCCCGGAAGCGTGCGGGCCGGAGCCCAGCGGACCAATGGGGCTGCGGGG 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNCC^ 

ACCCGCGGCTGCTTGGGGACCAGCGTGTCCTGCAGAGTTTACTCCGCCTGGAGGAG 

TGCGTGCAAAAGGAGATCAAGCCGCACATGCGGAAGATGCTGGCATACTGGATGCTC^

TCACCCCACCCCCACC CCCGGCTCTGGGACAAATTGAACTTTCCCTGTCGG AT CTTTGCGATCCCGACCACAATCTGCAAGATGCT 

TTGGCGGCCCAGACTACTGGGGCCTGGCCTTTTCACACAGGCGCCCTA^ 

CCTGATCGGCTTCTTCTCTGATTTCCAGTCACTT^ 

CTTTCCCCAGTGCCTTTCCTCGGTGTCTCCTGCTT^ 

TTCCGGTCCACAG<K3AAATGAAAAGTGCGGGAGACCGCGGCA 

CGGTAATGAGGGTAAGGTCCCAATTXrrCTCTAG<3TGACCCTTCCGTCCCCCTTCCGCTCGCCGTGGGGG 
TTCGTGAGCATGGGCTTCTCTTTTACACCCCCTACrCCGGGGTGGGG 
AG<K3C!ACATCCTCCTCGGTTATTGATACTTGTCGCTCTAAGTTTATCCCT^ 
TCCACCCCCCACCCCCACTTCTCCACGCTGCCGTGTTTCACTTG<3TCCT^

TGAGGAGCAGrccrGCGAGGAGGATGTCTTCCCTCTGGCTATGAACTACCTGGATCGCTACCT 



ATCTATACGGACCAGGCTGTGGCTCCATGGCAGTTGCG<K3TGCGTGCGGAAATGCT 
GATCGCCCTATGTACTAGTAAAAfiAAAAGCCAAGCTCACT 

CAAAGGCCTTGTAGTCTGGTCGTGGTGCT^CGGTTGCAAG

CTCCTAACTCGCCCCCCCCCCCCAATAGCAACAAGTCCAATTGC^ 
CCGTGAGTTATTTATTGGC^GCAGCAGGGCATGTCTGCACACCC^ 
TTGCATG AG CCTTTATCCCCGTGTGGACTTTGTGATGG AAAAG 
AGCGCTGAGGTGTTTTCATCCCTTTCCCTCCTATCCCCGAAGACACCGCTCTTTTCT 

AAAACTCGAGCAGCTGCTCTC^GCCC^TGGAAGAGTTTCTCCCTTTGCTGCTCAG
TGAGCTGCAGCTGCTTCCGATGTGATAATGATGCCGGTCCCCT^
GCTGCTGTCACTCCTGTGGTTTGGGAAGTGGCTCCTTGGGGCCTCTTGCT 
JVAAAAAAAAAAAAAAAAAAAAAAAATCGGTTGGGTTCTGCCCTGC^ 
GAAGGGAGCTTGAACGTCATCTCCCACAGGGTCTGGGGGAGAGGGTGGGTGAGCTG 
CCAAGCCTTCTCTGACCCTGAGAGGAGTCTCTGGGCCCCATCTCCT

TGGGAAGCTAGGGTACTGGGGCCCCTGACTGCTGTGAGTCGGGGAACTGGATTCCT 
GGCCTGAGTTGGGAATCCGGGAGGCCCCACAGCTCTTTGTGATGAATCTAGTTTCCAG 
TTGAGGCTGGGAGGTAGGAGACTTACTCTTTGTGAAGAGATTA^ 
AGGCAAAGCCCTCCTTCTCCTGCTCGAGGCCCrGAGCTGAGAGTGGCCTC 
GTCTAAATCCACACAGCAGAGCAGAGGCCCCAGAAAGG
TGCCAGCTCCTCTGATCTTGGGGTAGGXSCTGGGGTT^ 
ACACTCCTGGTTGCCATTGGGCTGCCCCTrTAGAGGCT 

AAGAAGAGGGAC^GCTGTCACTGGGGGGGTCCTGGAAGATACTGGGTGGGGCTGACATTTCT 
GAGTTCCTTCCTGAGATAAAACAATGAAGAAATCCTCCTGTGTGCAACTCGGAA^

TGAGTAGACTGAGGTTGGGGCTCCTAGCCTCCTCCrrAAGACCTCTCAA 

TGGATTCATCCTGTTTGGCTTGGATGCCCTTGTCTGGAGACAGGACCCA.CCCGTG 

GGGCAACCTCCTCTTTGAGGAACCTTAGGAJGTGGCTTCATGCT 

ATTTCCGAAGCCTGTGGAATTTAGAGTTGGTCTGTTTCAACTCACGCC 
AGGTCGTTCTAGCACCTAGTAACCATCAATAGATGCTTAAGTGTACGGCCCTAGTTCTTC

CAGCAGCCCCTGTlAGCCAGAGGCAGAACrrGTGACCCTTCTGTCAAATTCTCGGCCCCCT 

CCCTAACCCCCAGGAGTGGGAGGTGCTGGTCCTGGGGAAGCTCAACTGGGACCTGGCT 

TGATTCTGCACCGCCTGTCTCTGCCCAGTGACCGGCAGGCTTT^ 

GGTAAGAGCCAACCTGGGG<3AAATTGTGGGGTTTTTTGGGAAGGATACG 

GTGTACAGTGATCTGGGAATACTGAATAAAAGTTACTTGCCTAAGGTCTTCATAGCA^

CCAGGTACACCTTGGGTAGATGGAAAGTTTGTTGTTTGCT 

GGTGCATTCCTCAGTCAGATTGCCAACACTGTGAGCACACACCAACW 

TCAGTGTCACCCCATTGACCACTGGGGGTTCCCAGGATGCAGCCT 

CCTTTGCGATGTATCCTCCATCCATGATCGCCACAGGCAGC^TTGGGGCAGCCGTGCTAGGC 

GATGAGCTCACAGAGTTGCTGGCCGGGATCACAGGCACTGAAGTGGTAAGTCTCAGCCCCTCGCCCCC

AAAAAAAAAAAAGCTGTTCCTCTGAATCTCCAGCAGAGGGCACCATCCA 
TTCTCTGGCCTTTTATCTCTGGCTCCAGTCTGGAGAGGAGTGGGTGGGGTGAAGGACACT 
ATCTCTTGACAAGTGCTGTGTCTCGGACAGGGGTGGGGGACTTGGTGTCAG 
TTGCCAGTTTCTGAGAACTAAAGACCTGATTCCTGGGCTG 

CTTTCCACCTGCTTGCTGTCAGTGCTGTGAGGGAGAGAGAGAGGCCCAAGGATGAATG

TGCCTGCGAGCCTGCCAGGAACAGATCGAAGCTGCCCTCAGGGAGAGCCTCAGGGAAGCTGCT 
CAAAGCCCCCCGGGGCTCTAGCAGCCAGGGGCCCAGTCAGACCAGCACTC 

CAGGCCCTCTCAGGCC^CCAAGCAGAGGAGGGGCCCCTGCCACCCCCTCCCTGCCTCTAGAACAATCCATGCT 
GAGGGGGGCTCTCCTTCCCCCTCACAAAGCCCAAGGGGCCAGGTCCTGCCTATCCCCACAGTGTGCACTAAGG 

TGAGGGGCGTCTTC^TGGCCAGTCAGCTCCTCTTCCTTCCCACT

CAAACAGGTAAAATCCACACACCAGCATTTCTTTTGAGTCCCTCTTCTGTCCGGGGCTCCAACCTTCTCA 
AGTACCTTCCAAAGGTGTTGGCCCTTGCAGGCTCCTTCGG^ 

GTAATCCGTACTCCAGCTGCTCTTAGAGGGAACAGCCTAGGCTTTGGCCAGAGCAAGAACCCATACACTGTTGCTTT 
TTAGCTTCTGTGATTGTGGGGTCTTAAGGGTGCCGTGGTCATTTTAATTTATTGCT^ 

GCCTGTACCCCACAAGTGGTGGTAACCCTGGCGGTTGCTCTTTCCCTCCC

CGATGCCGTGGAGGTGACAGAGCTATCTGGAGAGAGGGCAAGCCCTAGGGTCACAGGTCrTTCCT 

GTGGCCCATTTCTATCATGCTGCCTTAATAAAGATTTCGGAATAAAGGTGGCTCTTGTGATCCCTGGGGTTGTG 

CACCAGGCAGCTGTCTTAGGOU3AGCACTTTTC 

GACGAATTACAGCGCTAAGGAGCAGCCAGCATTACCCAGAGGGCTCATCGGGAGAGAGGGCTC

TTCATACGACATCAGGCATGGTGCTTCCTTAGCACACTC^ 

TTTGGCTCAG CGACAGCACTGG CTACTCCTGAG AGCCTGGACTCAATTCCCAGCAGTTAAC CATTTTAAACTCCAGTCTCAGGGAT 
CTGATGCTGAGGCAOVAAGCTCAGAATGTATTAAAAAAACAAAACCGGATGAATGGGG 

AGCAGGGAGGCAGGCCCTTCCTCAGTGGTGCTGCAAAGGAGCAGCCCTGCATGGGAGAGGAGAGGAGAGGAAGAGGGAGGGAGGAA 

GGGAGGGAGAGCTTGAGTGAGCTCAGGGAGTAATTTGGTTGGGTTGGGCTAGGTTTGTTCTCTTTGCT

GATGATCTACTGTGTAAGCAAATATGAACTCAACAGACCAAACCTTCTCAGAT 

AAAAGTGAACGTGATACAGCAAGXjTGACTTCAAGTCAGTCAGCTGTGATGCAATGGTCTAAGGGTTTTTG 
GGTAGTGTTTCACAG CAGTGGCTGGCCTCAAACrT ACAGATCCATGGTTGGCTG 
ATTAAAATAATACAAAGATTGTAAGCCTAGCCCTCAAAAAGTTGAACCATCAGGCAGGT 

TCTACAGAGTGAGTTCCAGGACAGGCAGGGATACACAGGGAAACCCTGTC

CAGTATTCTTTGCTCATTAGTTATGCAGACACAGGAACCTGACCTGGCGTCTCAGCCAGCCTA 
GGAAGAAATCCCATTTTTTCCTGATTCTGTTGTGTCACTGTCCCCTAGCAAGGGCCTGCAA 
CTGCCACTTTTCCAACAGACACCTTTCCAACATGGCAGCAGAATCTGGCATCCCTAGAGTCGGCCCT 
AGTCAGGGGCGCGTGTGCATACATTTCCTGTGAAGAACAAAGGAAGCAAGAGTCTAGAGACAC 

GAGGTTTCTGTGGGAAGT AAGAACC CGCTCTTCTG AG CTGGGG ACAGGG AGG AG CCAGCAGTGACTCC CACCTCTCCCGTGGGTG A

TACACAGTATCTGGAGAGAGCAGACCTTATACCAACTGCCAGGTG 

GCTTCCAGAAGTGAGCAGGGACCAAAGCCTAAAATGGCAGAGAAAGCCAACCAGATGCAGTCCTCCCT 
GCAGCCATAGTAGCCCTTCAGTGTCTTGACCTAGGTCTTTAACTATACCAG 

AACACAGGTGACTGTTCTCATTCCATGGTGACACCACCATCCTCCACGT

AGGGGACAGCTGCGGGTGGGGCTGCAGCCGAAGCAGCTCCAGGAGGGCCTCCTTCTGCT 
GCGCTAGTGTCAGCAGGCACTGGTGCCAGAGGAOUMCAGCTGGCGCTTTTCCGTCCGG 

TCG AGCACGCGGTAGGGCAATGCGTACTTCTTATCCAGCAGCAGGCGCAGGAAG ATG CTGTTGGCACCGCTGTATTCCATTT CTGC 
GATCTTCAGCATGGCCGCACTGGAAGGGAAGGTGGAGTCACAGGAATTAGGCCACA 
TTACTTC^UGTGTTCCCAAGGGGGCCACCACAGGCTGCTCACCTGTTGATTACTAT
CTG C CC CACC CACACCT CTG C CTCTGG CT CGG AG AG CAAG CAAT CCCTGTGTTT AACT AC AG AT CATGT C CAAATG TCTCACGATC
CGCACATTTTAGCACACTTTCTTCCTAGATGCCCATTTACAC^^ 
CTCTTGAAGACCTACTATTCTGTTGGAGTAATCATACACCCACTGACA 
ATTCGCACCCTCACGACCAAGTTCCGGCCTCTCTATCACACGGCrrCCTCT 
AGTGCAGCACGGGGATGGAGCACTTG<riX^TGAT<Kr^

GGGATCAGGATTCCTAGGCAGAGTCAGATGCAGTGGCTTT^ 

CATOVAGGAOrCCCAATGATCTCCCAAGTAAGAGCCCCACCCXCT 

AGGCTTGAACAGGGCCITCTTGAGCGCC^TCT 

GCAGGACAAGGTTGTAGAAGCGCTGAGCCATGCGCTCCTTCAAGTTAGAGGC^^ 
CAGAGAGCCTGACCCTGGCCCTTTAAGAATCATCGCGGATGGTGCA

TATAGTCAGAGCTGTAGCATGGGAGTGAAAGCTCTTCCCACTGAAGAGTA^ 

CTCAGTGGGTAGGAGCACCCGACTGCTCnTCTGAAGGTCAGG^ 

ACAAGATCTCACGCCCTCTTCTGGAGTGTCTGAAGACAGCT^ 

AAAGAAAAACGAAAGAGCAGGGGGCTGGAGAGATGGCTCAGCGGTTAAGAGCACGGCCTGCTC 
CCCAGCAACCACACAGTGGCTCATAGAC^TCTATAATGAGATCTGATGT

CTGTATAGATAATAAATAAATCCTAAAATAATAATAATAATAATAATAAAATAAAAACCA^ 

ACTAAAAGCCAAAAACCCATAAGACTGAAAGTGACTGAGTTCAAO 

TACACAGAGAAACCCTGTCTCAAAAAAAAACAACAATAACAACAAC^ 

CATACAAGAAAATGGAATCTAAAGTCAAACTTGAATATC 
AAGGGAGCAGCAGGAGCXSAGTCCAAGGGAGGCCCATGGGCAACT^

CCAGTTGGACAGTGCAGGGATGACCTTAAAAGCCTTGG^ 

AAATCGCCAACTGTGCTGGGGTCCTGGCCCTGTTG CTCACTCATCTC CAG AACCATCTAGAAAGCCTCGTTCTCCGGGTGACCGGA 
GTTTAGTAGACTTCCAG<K5TCAATAAGAjGACAGTGCAAGAACCTGAGCCT 

CTTATTTGGTGGGGGCC^TGGATGAGAGCCCACCCCGAGTCAGGATATCCCAGCA.C^

CCTCGGTAAACTTCCAGGACCCGGGGATCCAGCTGAGGCATAGG<5AAGCCCGATACCT 
CTTCTCCGTCAACTTCTCCATGATGATATCAGCCAGGG 
ACTTTCCATGAAACTGTAACX3TCCCCTCAACCTGTTTTTT 
CCTTGTCTTAGAACTGGCTCTGTAGACAAGGCnX3ACCTCAGACT 
GTATGCCTGGCTCreTTTATTAAAAAAGAAAGGCAGTAGTGGTGCTC
TTGTCAGTTGAAAGGTCAGCATGGTCITCATAGGGAGATCTAGGC 

AAAACAT CCCCAAACAAAACAAG AATGCTT CACAAACTTG CATGT CCTCCTTGCATGGGGGCCACGTCCCTGT CACTGTT CT AATT 
TTAGTGTACGGGAAGCTGGAAGCGAGCAATGTATAATTCCCCTG<3ACGCTC 

TTCCCCT CTATGTCCATTTAGCCGCCTCAAAGTGCTG CAG AGCACGGGCCCTG<3ATGTCTCTGCATCTGACGCAGAGCCAGAGTGG 

TGGGGACGTTAGGCCCTTAGGCCCTGAAGTCAGCTCCGGCTC

AGCCTCAGTTTCCTCATCTCTAAACTGGGGAAGTAAAAGGACCCCCCTGG 

GGCCCGACTCTCAGCCAG AG CCAGCAGGTCCTG AGGGCTGTTT CCCACTGCTCACACAACGGGGAGCAACT 
TCTGAGTGCAGAGTGGAGCTCTGTX; CCAGGCAG CGCTGGTGCACACCTTTCATCCCAG CGCTCAGAAGGCAGCGG CAGGCAG AGTT 
TTGTGAGTTCAAGGCCAGCCTATCCTACTCACCCAGCACTTC 

TGGTGGCAGGTGCTTTTATTCCCCCGCACTG^ 
ACTTTCTCAGGGAAAGGGT AAATTTT AG TCTTT CGACTTCT 

TCCACCATGTTGTCACTTCCGTATAGCAACTGCAGAAAACATTCTGCTAAGTTTG 

GCCCGAGAGCCTTTGGGGGGCACATCACA.CCCCATGACCATTCTCAGAGCT

T CTCTG AAC CCAAG CTG CT CATCACCGT ACT ATTAATG AT AGT GGTG TACT ACAC C C CACTCTG AAGAAT AAACTGGTTGAG CCCC 

TTTGCCCTAAGCCCAGTCCTCTCAATGAGGCCAGATCCATCTGAATGGAACCAAGAATCTGACCAC^ 

TGTTTAGAAGTAGCATTTATAACAGATGGGGGTACGTTTGTGG^ 

AG AG ATAGCTG CTATTGG C CCCAACACTGT AACTGAGATGG AAAAC C CTG AGTT AAAGACTAAGCTATGGGCTGG CG AG ATGGCTC 

AGCAGGTAAGAGCACTGACTGCTCTTCCGAAGATGCTGGGT^

GATTTGGTGCCCTCTTCTGGTGCGTCTGAAGCCAGCTACAGTGTACTTATGTATAGTAAATA 
GTTGGCCAGAATGAGTAGAAGTCCTAAAATTCAATTCCCAACAACCATATGATG 
CACATACATAAATAAATTAATCTTAAAAAAAAATAAGACTAAGCTATGAGAGTAACG 
CCCAC<rrATCACAAGGAACCGGTAATCCAGAGGTTCCAGATCAGATGGGCAAAG 

TCTTGGCAATGGTCAGACCAACAGCTGTGGTCTGGATGA^

GGAGGATTCTTX3TTCATGAACATTTCAATGGCACGCTCAT CTGGTGGTCCACC CCGG CCAT 

TTTGGCTGCCTTCTCCAGGGTAGGCCACTCCTCATCCTCCTCATCTGATCCATCCTGTGGCAATC 

AGCCGCAGGGGCTTGTGAGTCAGTTCCCCATTCCACGTACAGCACA^ 

TTGTCCCCAGAGACTTGGAATCTTGATAGAAAATACAGCACACGAAGCX3AGAGATGATAAGGCCA 

CCCTCAGTGAATGGATGACATGTCCCCCCTCAATGACCAAGAAC^ATCCA

TGACCTCAAGGAATAGGTACTGTTGCTATACCCAGTTCAAAC^ 
GTCCTGACTTCCCACTATGTTGCTTGG&CAGTGGAACTG 

GG CACTTGGATGAGTACAJGAGGCTGAGG AGGCTCTGGATGGGCAGTCTC C 
TGTTAGAGTGTTG^CACTAAGGAGCCAGAAAGGAAACGGAAAGCCAACCT 

CTGATCAACCAACACATTTAAGTTACTGGCTTAGGTAGTCGAG^^

CACGCACCACAGTGCTGAAGGCTGAGCTTCTC^TGTTGGGTTAATCCCGTGTG 
TGTTAATCTTTGTTCCAGCAGAGAGCTGAGTACAGTCGGCATT^ 
AAACITCTTCTCCATCACCTTCTGATTC^TTTAATCAAAAGC^ 
C^CAGAGATAGGAACTCTGGGAAAGAAATCAGATGTGAGGGGACTCAACAG 

AGAG CAG T AACG AG CCC CGTGGTTG AATGT AGACT AGT AT AAAG CAATT ATGTTGT AT CAG CG AG TT CGG AACAAGGCAG CT AAGG

CCAAGCTTTC^TAACTAATAAGAGGTCTCTGTGTCCTCACITG^ 

TGACCATGGAGTCrTGGCTGGGTACTTCCTGAGTGCTG 

ACATAGCTGGCAAGGCAGTTGGGAAAGAGTCTAATAAACC^ 

GTGGACTAAGGGTATATTATATAGAGCTCCAGAGAGTACACAGAGCATACCCAGTGAGAG 
TTACTTTTAATGTTTGAGTGCTGAC-I^eAcGTAwTC^
TCCCITGCAAATGGAGTTACAGACAGCTATAGCCTGCCATATGAATGCCAGAAAC
CTGAAGAGCTGAGGACCAACTCTTGAGAGTTTTCCTCTGTTCTCTA^ 

AACAAAGGAAGACAAGAGCTATGAAATACGACGTAGGGAAAAGAGAGAAAATCAGGAGACAGTCACTC 
GTTGTTGTGATCATTTGCTTTTTGTATATGAGTGAACC 
CAGGATTTGACTGGGACCTGGGGAT AGCTGATTAGG CTAGACTGGTTG CC CAGT AGG CC GAAGAGATCK^

TCAGAAACTACAAGCTGCTGTGAGTGGAAGTCAGGG CTCATGCTCTCTCAAAG CATGGACCTCACCAGCTGTG AG CATTATCTTAT 
CCTGTCCCTAGCTTGTGGTCAGACCTGAAACAGAGACAGACCAGACAG^ 
GTCACAGAATTCAAAGCACGATGTTTCCACAGTGATCCAGGGG 
AAAGTGGTCTGGCTCTGAAATACTCTAAAGACAACATCACC^ 
GAGCAGTGGAGCTAATGTGCTAACAACCTCCAAGTAGCAAACAGAGAGCAAGTTTC

CAATCTTCTTCTTTTATTTTTTTTI 1 ' I CCTT CT CCG AGACAGGGCTTCTCT AGGT AG C CTTGG CTG TC CTGGAACTCACTTTGT AG 
ACCAGGCTGTCCTTGAACTCAGAAATCCACCTGCCTCTGCCTTGTAAGTGCTGGGATTAAAGGC 
GGACGCAATCTTACTAGAGTTAGATACTAGTATTCAACTTGGAGATGTCAGGTAGACCACTGGA 
GACX5AACTCTGAGGGAGAAAAAACTGGAAGTAGCTGTGAAACAA 


MOUSE SEQUENCE - mRNA

AGCGCCAGCGCCAGACCCGCCCGCACCCCGCGCTCTCCCTCACGCGCGTCGCTTCTCCT 
TCACCCGTGAGCCATCGCAGGATGGAGCTGCTGTGTTGCGAGGGCACCCGGCACGCGCCCC^ 
TGGGGACCAGCGTGTCCTGCAGAGTTTACTCCGCCTGGAGGAGCGCT^ 
AGATCAAGCCGCACATGCGGAAGATGCTGGCATACTGG^



GCTGGCCTCCAAGCTGCGCGAAACCACGCCCCTGACTATTGAGAAGCTTTG 

TGCGGGAGTGGGAGGTGCTGGTCCTGGGGAAGCTCAAGTGGGACCTGGCTGCCGTGATTG 

CACCGCCTGTCTCTGCCCAGTGACCGGCAGGCTTTGGTCAAAAAGCATGCCCAGA 

CTTTGCGATGTATCCTCCATCCATGATCGCCACAGGCAGCATTGGGGCAGCCGT

ATGAGCTCACAGAGTTGCTGGCCGGGATCACAGGCACTGAAGTGGACT 

AGGGAGAGCCTCAGGGAAGCTGCTCAGACAGCCCCCAGCCCAGTGCCCAAAGCCCCCCGGGGCTCT 

GACCAGCACTCCCACAGATGTCACAGCCATTCACCTGTAGCITGAGACAGGCCCT 

CCACCCCCTCCCTGCCTCTAGAACAATCCATGCTATATCTGAAGCCCGAGGGGGGCTCTCC 

CCACTCAACCAGCTTGGCTGTCCTGGGCCATGATGGTCAG 

CCCTCTTCTGTCCGGGGCTCCAACCTTCTCAGTTGCCAAAACGCCCCAGTACCTTCCAAAGGTG 
GGCATCCCGATAGAAGCTTATGAGGAGCTGCCCTAGATGGC 

GGCTTTGGCCAGAGCAAGAACCCATACACTGTTGCTTTGCTTGCTGCTTAGCTTCTC 

CATTTTAATTT ATTG CTTTG AATACAACTGT AAG AGGGT ACAGTG AGGC CTG T AC C CCACAAG TGGTGGT AAC CCTGGCGGTTGCT

CTTTCCCTCCCTTCTTCCCCTCTGCTACTGCTTTGTGG 

ACCCCTCCACCTCATCTTCATCAGAGCAGGGTTAGGTTGGGATGGATCGATGCCGTCGAGGTC 

AAGCCCTAGGGTCACAGGTCTTTCCTCGGGCCACAAGGTTTGGGCTGGTGGCCCAT^ 

GAATAAA 


MOUSE SEQUENCE - CODING 

ATGGAGCTGCTGTGTTGCGAGGGCACCCGGCACGCGCCCCGGGCCGG 

GAGTTTACTCCGCCTGGAGGAGCGCTACGTGCCGCGAGCCTCCTACTTCCAGTGCGTGCAAAAGGAGATC^ 
AGATGCTGGCATACTGGATGCTGX3AGGTGTGTGAGGAGCAGCGCTGCGAGGAGGATGTCTTC 

CGCTACCTGTCCTGCGTCCCCACCCGAAAGGCG CAATTGCAGCTTCTAGGTACCGTCTGCCTGTTGCTGG CCTCCAAGCTGCGCGA

AACCACGCCCCTGACTATTGAGAAGCTTTGCATCTATACGGACCAGGCTGTGGCTCCATGG 
TCCTGGGGAAGCTCAAGTGGGACCTGGCTGCCGTGATTGCGCACGAC^ 
GACCGGCAGGCTTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTGTGCT 
CATGATCGCCACAGGCAGCATTGGGGCAGCCGTGCTAGGCCTAGGCGCCTC 

CCGGGATCACAGGCACTGAAGTGGACTGCCTGCGAGCCTGCCAGGAACAGATCGAAGCTGCCCT

GCTCAGACAGCCCCCAGCCCAGTGCCCAAAGCCCCCCGGGGCrCTAGCAGCCAGGG 
CACAGCCATTCACCTGTAG 

HUMAN SEQUENCE - GENOMIC 
CACACCACTCCACTCC^GCCTGGGCGACAGAGCGAGACTCTGTCTCAGAAAATAAAAAAA^



CCCCCACCTAAACTTTCTTTGGATGCCCTCTTCTTCCTGCTTCTCTCTTCTTACTCCCACT 
TGTGTGTGTGTGTGTGTGTGTGTCGCTCAGGCTC^AGTGCAGTGACATC^ 

GCCATTCTCCTGCCTCAGCCTC CCGAGTAGCTGGGACTACAGG CG CCIX3CT ACCATCCCCX3GCT AATTTTTTGTATATTTAGTACA 
GACGGGGTTTCACCATATTGGCCAGGTTGGTCTCGATCTC

CAGGCGTGAGCCACCGCGCCTGGCCTACTCCCACTTAATCTCTAAACTCCTGCAATCCCAI'U'TCrrTCA 
AATTCAGGTGATC^GAACC^CCAATCTCCACCAGTCCCAGGACAGCCCACCCTTCCTGTCACA 
TGGGTGTTGCAAATGCCGAGGTAAAAGTCnTCTGTCTTCATGACATCTTAGGAT^ 
CTTTAGACTGGAAGCTTAAAAGGGTGGCTTCTCGGCCCAGCGCAGTGGCTCACXJCCT 

GGGCGGATCACGAGGTCAGGAGATCAAGACCATCCTGGCTAACACGGTGAAACCCCAACT

GG CG TGGTGGGGGG CG C CTGT AGT C CCAG CT ACTCGGG AGG CTGT GGCAGGAG AATGG CGTGAAC CTGGG AGGCGG AG CTTG CCGT 
GAGCTG AGATCGCG CCACTG CACTCCAGCCTGGGTGACAGAGAG AGACTCCATCT CAAAAAAAAAAAAAG AAAAAAAAAAGGG AGT 
CTTCTCATCTCTGACCCATGTCCAGCATGTCCCAGGGCnTCCGAAGGCCn'GCCT 
TTGTTAATGACCACAGCTATCTTTATTGACCACCT^ 

CACCATATGAAATAGGAGTTGTTACCATCCTCCTTTTGCAAATGAGAAGGCT

CACAATCACACAGCTGGCAAATAGCIAGAGCTGGGCATCAACCTAAATCCAGAGCT 

CAGTGCT ATCTCCTG CCTAG AATGAGG AATGCTTGCCAG CCAAG AACCGAAG ACAGCTCTCTTTTTGCAGGT ATATTGCTGT AGCA 
GGTTACCCTTCCACACCTGAGGCCCTGTGTGCACCTCAGTCTTACCTC 

GCCT CATTTATT CAG AACC CAG ACAG CC AG AAAAAGG AAG AAAAAG CAAATGCTT AAAT AT AT AGGTTGT ATT C 
TTTTAATCTTGCTCTGTTGTCCAGGC
ACCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCCGCCACCACACCTGGCTA'lTTl'Tl 1 1 l"l'T , l"l"ri'GATATTTTTAGTAAAGA
CAGGGTTTCATCGTGTTAGCCAGGATGGTCTCAACCrCCTGACCTTGTGATC 
AGGCTTGAGCCACCACACCCGGCCCTGTAGGTTGTATTCTTATTTTATGAG^ 
TTCTTTTCCCAAGCAGCTTATAAGCTAGAAATTGCAAACTG 
TGCACGGTTTTGTTACTGTTTTTCAACTCAAGTAATATAGATATTTT

CTCTGTCG CCGC CTCC CAAAG TG CTGGGATTACAGGTGTGAGC CACTG CG C CTGGCGG AAATTT CTTTT LTTTTGAG ACACAGTTT 
CATTCTGTCCCCCGGGCTGGAGTGCAGTGGCACGATCTCGGCTCACTGCAACCTCT 

CAG CCTCHTGAGTAGCTGGGATTACAGGCGTGCCT CTAATTTTTGTATTTT CAGT AG AGATCX5GGTTCCACTAT 

GTTGGCCAGGCTGGTCTTGAACGCOTGGCCTCGCrGGTCTTGAACGACTC^ 
GCTGCGATTACAGGCATGAGCCACCACACCCAGCCTTTTTCTTTTC^

CTTGAACCCCTGGCCTCAGTGATCCTCCCAACTTAGCCTCTCAAAGTGCAGGGATT 

AAATATTATCTTGGGGAAACAGGGAATCTATGGCCACACTCATTACTATGGTCT 

AGCTTrGGATGGGACATGTGCTATCTACTGTCTCTGGCCCC^ 

TATGCTGGTo rrrr ' rci i n mi 1 1 i c i i " rrri ttt tt t t tgagatggagtctcgttctgtagtccaggctggagtgcagtg 

GCACGATCTCAGCTCACTGCAACCTCCGCTTCCCGGGTCCCGGTTCAAGCAATTCT
ACT^GGCACGCGC CACCATGC CCAGCT AATTTTT^ 

ACTCCTGACCTAGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTG^ 
CTTATATCCAGTTTGTTTTATTCATTAACTCAAAAAAGATTTA 
GGGAAGCTGAGGTGGGAAGACCACTTGAGCCTGGGAGATAGAGGCTGCAGTGA^ 
ACAGAGTGAG AC CCTGTCTCAAAAAAAAAAACAAAAA^ ACTG AACAT CT ACT ATGAACTGGAT AC CAT CCTGT

TTTGGTGGTGTGGTGTGGGAGTAGACAAGCrcTAAACAGAATATGCAAGC^ 
AAAAATAAAGCAGGGAAAGAGCATAAGAAGTGTCAGGGCTGGGGAGCTG^ 
TAAGAGGAATGCAACTTTTGGACTTCCTGAAGGAAGTGAGGGAG^ 
GGAACAGCACGTGCAAAGAGAACATGTCTGCCCXKjTGTGTTCCATGAACCGTGTGGCC^ 

ATGGGGTGGC^GTATAGATGCTGCAGGGTCITTGGCAGTCACTCCAAATGAAA

GACTGGAGGGCCAAGGGTGGGT(ZAGGGAGACCCXK3CAGGAGGCTACC^ 
GGGTAGAAGTGGAAATCGAGAAAAGAAGGGGATTGGAGATGTATTTTTGACGTAGAA 
TGGGGGACCAGTAAGGTCAAGGAAGAACTGAGTTTGGACAGAGGAAGTAACAG 
TTGAAATTATTAATAGCTCCTCTGTGATCTGCTGATCIAGAGTGTGG 

CCAAGTGTTTGGCCTGAGCCATTGGAAGXSATGGAGTTGCCATTTTCTGAG

ATTAGGATATCAGTTTTGGAGAAGTTGAGTTTAACTGGCCT 

AAATGGAAACT AAAAACAGG TCAGTT CAC AC CAACTCTG ATTTGGT AGTG CTG CCT AAAG TG CTGTGTTGGG AT TTT AAATGCCCA 
TCCAGGCTCTTTCTTTrCTTTTCTTTTT^ 

CAGCTCACTGCAACCTCTACCTGGTGC^TTCAAGCGATTCTCCTTCCTCAGCCT 

CATGCCCAGCTAATTTTTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGTTAGGCTGG

ATCTGCCCXSCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCC^ 
GGGTTTTGCCATGTTGGCCAGGCTGGTCTTGAACTCCTGACI^ 
AGGTGTGAGCCACCGCGCCCAGCC^AAGAGTGCTCAATTTTTACTGATAGGA^ 
AATATTGTACAAATATTACAGATGCTCATAATAGTGACCTGGAATACTGTAAGCACCGAGAT 

AGCTAAATTATATGTAGTTTGGTTTACTGCrrrri"r'l'U M rrr

GAGTGCAGTAGTGCGATCTX^GCTCACTGCAATCrCCATCTCCCAGGCTCAA^ 
GACTAOVGGCGTGCGCCACTGCGTTCGGCTAATTTTT^ 
CTCAAACTCCTGGCCTCAAGTGATCTGCCTGCCCCTGCCTCCCAACGTC 
TTTCTGCITTTAAGTCCCGCAATCACAACTAACTTTr^ 

AATCAACCCATTTCCTCCTCACAACCIACTCTGTGGGGGGGTACTACrc

AGGTGGTTTAACTTAGCCAAATTCACACAGTGAGGGAGTGGAGTGGCAGAGGGGTGTGGCTCCAGG 
ACGCTAGACCATGTGTTTACCGTTAGAAGCCAGGTCTTCAGGATTTGCTAGAGATGCAATTCT 
AACTTTTGCTTTTCTTAGAGGGGTTATGGATGCATGTTCATAACAGAAA 
CAGAAACTACTATCTCTGAATGCTTGCTTTGTGCCAG<5CACAGGTAAGA 

GGGCTATTATATTATCCCCACTGGATAAAAGAGGAAGCAGAGAGGTTCCAGTAATTCT

TTG AACTG AAAG CAGT CTCTGACTCCAGAACC CTGTTTAGCACCCCTGTTCTCAGCCC CATGGCTCGCTTGGCTCTCCCCAGCAT C 

TCAATGGGTCTGAAGGTAACTAAGCCATGGCATGTGGACTGTGGGGTGATGGGCCT 

TCAGATGCGGACTTTTTTTTTTGGAGACAGAGTTTCGCTCTTGT^ 

GCCTCTGGCTCCAGGGTTCAAGTGATTCTCCTGCTCA 

GGCATCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCTGGCCCAGATGGGGGTATTTTGAT 
AAAATGTATCTTTCCTCCTTC-GGGTTTCTTATTTGGCCTGTGACATTCC^ 
TTTTCTAACTGGATGTACCTGGGTAGAAACTCCTCATACrATTGGCCCT 
CAAAAACAAAAACAAGAAAAACAAGAGGAGCCTCTATTTCITTGAG^TGGAGGTAACT 

CTCACCTGCCTCCCC^GTGGCCAGTCCTTC^CCCTGAAAGTCA^

GGCAGAAGTATCTCTCCTAAAGTCCTGCTCACCAGGAGCTCCCGTCCCCATACTACAGGTTCA 
GTCrTATGTGGCCCTCCCTCAATTAATAAATCAGCAACTAATTTGCCAGG 
AGCTGAGGCAGGCAGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACAT 
AAATTAGCCGGGCATGGTGGTATGCACCCGTAATCCCAGCTACTCAGGAAGCTGAGGCAGGAGAATCA 

AGGTTGCAGTAAG CTGCACTCCAGCCTGGTGACAAG AG CAAAACTTTGTGTCAAAAAAACAAAGAAAAC CAAAAAACAAAG AAAAA

CACAAAAAACCCTTCTATTTGTTAAAAAAAAAAAAATCCACCGTGAACCAAAAATTAGTAAAAACAA 
TGCAAAATGTATGATAAC!AAAATGTTAAGGAAGGTCATGTGCCGTTATGGTTCACTGCAGCCT 
CTCCTGCTTCGGTCTCCCTAGTAGCTGGGACTACAG^CTTGTGCCACCGCACCCAGCT^ 
TAGGAGTCTTGCTTTGTTGTCCAGGCTGGTCTTCAACTCCTAGCrTCCAGTGATCCTCCrGCCT 

GATGGGACATTTTTATACATAGTGCCATGTTACTATAAATGAG

ATTTTTATAACCAAAGTTAAAAAACCAAACAAAAAATATGAAAAGGGTTAATATCTTTC 

AAGAGAAAACAAACATCCCTATAAATGAATAAGCTAAGGACATGAATGGGTAATGTACATAAGAAATGTAAATGTCTAGTAAT 
CCAAAAT AG ATTT ATT ATT A AAT AAGC CACTTTCACTCTCT AGTTGG CAG AGTT GTTTTG AAAAAT AG AT ATGT AATG ATGG TG 
GAAAAGATTGGTTTAACTATTCAGCAGGAAAATTTGGCAAT^ 
GGAAATTCCACTTCTAGAAATTAATTCACTTCTAGAAATAATCATGAGTGTC
TTATGTCTAATAGAGAAGAACTAGAAATAATTTAAATT^^

GTATATTATGTCCCTATTACAAATTATGTCCTAGAATATTTAATAGCATGGAAAAGTGTTAAC^ 
TTACAAAACAGTTTGTGATGATTCCATTTAAAATGTGTGTTTATT 

GGGTTATTT CATGG CAAATTG CAAATGATTATTT C CT 'l 'Tl ' IT 1 GTGGCTT ATTTGT ATTTTTGAAGTTTTCTr ACAATCT AAAAGAA 
TATTTTATGATATGAAAACTACAATACAATTTATAATATAAGAAAGAATAATTCGGC 
AGCACTTTTG<^GGCCGAGACCGGCGGATCACGAGGTCAGGGGTTCAAGACT 
AAAAATACAAAAATTAGTCAGGCATGGTGGTGCGTGCCTGTAGTCCCAGCTACT 

GCAGTGAGC C CAGATCG CAC CACTGCACT CCAG CTTG AG CAACAG AGTGAGACTTCGT CTC AAAAAAAAAAAAAAAAAAAAAAAGA 

ATAATTAACAGAAAATGGTTAGACACTTCCTTACTGTCTCCTAAGTC^ 

CCCATTTGGAGCATTATTGGAGGTCITITTCGGCCTCTTC^ 

TTTGGGAGG TGGAAGAAATG CAACCGGT AGATCT CACAGAGT CTGTGCAAG AAACTGATT CAATGAGAAT CT AGTTTCTCCGTCCA 

CAGTTTCTCCAAACAGAAACTAAGGCCGACTTTAG 

ATGGCCTTTCaSTTCTGTTATATGCTGACTTAGACTAAAGCTCT 

TGATTCAGOTTAGGCGTGGGCCTGAGAGTCTGCATTTCTAACCAGCTCCCA 

AGTTTAACTAGCAATGGTGTAGAACACAGAATCTGCAGCAAGAAGG 

GACAGTTGCCTCGGTTTCCGAGTTTTCGTGAAGATGTAGTGAGT 

GCT CTG ACATT ATTT ATCGCATT C CTT AGAGCAAG CAGCCGG TGAAGT AGGGTTTGACGAATGAAT AAGTGAATGAATGACCITTG 

GAGAAAAATTGTTTCCTGGGTCACTAGAGTCCGAGAAGCAAAAT^ 

TTCTCTGCACCCGGTGGTCCAGAGGGCCTG^ 

aggggagagcgctcagccaaccctttccgtccggggcgccgc^ 

TGCTACATCGTCGCGAGGGGCGCCGCGCCTGTCAGGGAAGCGGCG 

gtgccagcgccagcgccagacccgcgccccgcgctctccggcccgtcgcctgcot 
ctgttcgctgcccgagtatggagctgctgtgttgcgaaggcacccggcara 

GACCAGCGTGTCCTGCAGAGCCTGCTCCGCCKX5AGGAGCGCTACGTACCCCGCGCCT 
CAAGCGGCACATGCGGAAGATGCTGGCTrACTGGATGCTGGAGGTACCGCCCGGACGCGTCCCCCTCCC 

GGACACACCGGGGCCGCCCTGTCGGGACATCCCGGT CC C CACAG CAAT CTG CAGG ATG CT CCGAGGCT AAGACTCC CGGGGCCTGG 

cctttcactagggcacccacgccagctgcct cccttg<rrttcccgag ccct 
ggcaccctaactgtgcctctcttcc^gccgagcaaacggc^^ 

TCCCTCCGCCTCCCGGCCTTCTCCGGAG^ 
A^C^TTCCGTCCCACTT^ 

(HrCCCCGTCAAGGC^AGAGGGGAGGAGGCCGCGCAGAG 

G^TCCTTGTCTCTCTAAGTAGCTTACCCTCTCTGCCGTTTTTGCTCCCTAT 

TGTGTTCCCCACGCTCCAGACCCAGCAGTGACTGTC 

CCCTGGC CATGAACT ACCTGGATCGCTACCTGTCrTGCGTCCCCACCCG AAAGG CGCAGTTGCAGCTCCTGGGTGCGGTCTGCATG 

CTGCTGGCCTCCAAGCTCCGCGAGACCACGCCCCTGACCATCGAAAAAC^ 

GTTGCGGGTGCGTGTGTAACTCCCCCCACCCCCGACACGTATTGTCTCCCCACTTTCCAC<5A 

GCTTTCTGCCCCCTGGAGAGGAAjGCAAGG^XSATCAGGA 

TGATGGTTAGTATCACTCCCGCCCAGGCTTC^ 

TCCTGGTCTTTCCCCCAGATAGCAACAAGTCAAATTGCATGTCTAC^ 
GAGTTATTTATTGGCAGCAGAAGGGCATGCCTGCACACCCCCACTG^ 

TTTG CCTTGGTTGAGTCTTT ATCCTCZATGTGGACTTTGTGATAGAAAAGCCT CCCTG ATTGCTT 

TAAGGAAGGGCTGATGTATTTCATCCCACCTTCTTTtSCTTCCTCTTATATCCC 

CCTCATTCTGAGGAACTGGAGCAGCTGCTTTXjGGCCTG^ 

GACAGACAC^AGGTGAGCTGCATCTGCTTCCGATGTGATAATC^TGCCCCCTCCCCTC^CCCGCCCT 

ATAACCCTGTGGGGAGCTGCTGTCACTCCTGTGGTTTG<3G AAGTGG crCCCTGGGCCTCrTGCTGGAAACTTGGGCrrGGGCT CAGG 

CCGGGCAGGAAAGTCACCGGTTCTGTCCaSCCra^GAAGGAGCCTC 

TTAGGACATGCAGATTGTCrCCTAGGGAGGCrGG<KK5AGAGGGTG<3 

AAGTCTTCTGCTGACCCTGAGAGCAGTCTCTGGGCCCCATCTC 

GGGAAGCCAGGGTACTGGGGCCCCCGACrcCTCTC 

GGAGGCCAGAGTTG<K3AATCCAGGAGGCGCCACAGCTCTTTGTG^ 

TTCCAG^XjCCAGAGAC-CAAGATGTTTTTGGAGC^GTTGC^ 

GGTTTGTGTC CCCACGTATCAGGGTTTTTCJAGTACAGGCAAAAAGTACT 

GACTTCCTAGAAGGTCAAGGTTG<3AAGGGACCTCTGAGAACTCCAGTTG<5TCCA 

AGGAAGG AGGGCAAGGTCCTGCCTCAGCATCGCACTG AG CAGATG AGTGTAGAACCTGGGTCTTCTAG CT AG ACCAGGTTGCCTTA 

AAAGTCAGGAATGTX3ATGGCCGCTGCCACCTACTGAAGCTAAG 

CTC^TTGCCTGGATATGGGGAAAGGTGGCCACTTTCCCTCAAA(^CAACCC 

GCCTTCCAGATGCTTCCTTAGAAGCTGAGTCCCTTCrGTTTACTGCT 

CGTTGTAATTTATCTGTCTCACTTCTGGCTCCAGGCCAGGCTATTTTCT 

TGACACCTAAGCTGGGGCCAAGAGAGAGACAGCrrGTGGiCTG<3GGCCTGGAA 

GGAGTCAAACCCACAAGTTCCTTCCTCAGATAAAACAATGAAGAAA 

ACCCATGACGAGTGGACItKjGGTGGAGCTCCCAGCCrTCTCCCGAGACCTAGTACTGG 

GCTTGATCCCTCTGGAATCATTTGGAGCCATCCTGTCTG^ 

ACGGGTTCCAGCTAAAGGGGGTGGGGTGCGGTGCTATACCTCAGGACCCATCCCrrGATATCCT 

GGGCAATCTCCCCTTTGAGGAACCTGAGAAAAGCCTCTGTGGCT 

GTGGACTTGGCTG<X3ATGCAGAGTTGCAG<KX3AAGCTAGT^ 

GCTTCATCCCAG AAGAG AG CCCAGTTCATGGTGATGCTGTGGTATCTG CTG CTTCCGTCTTTCAGCCCT<K5CTC<1AGAAATCCTCA 
G AGCCCTG AATG AAGT CAGGAG AAGGCCTGGATTATTGGCCCCTT AAACATAACCAGACAGGATTAAAAGGGATAAT CCATGTAAA 
GCATGTTGTCCTGGCACCTAGTAAATGATCAGTAAGTGCTCAAAGCACATGCCTCCTCCT 

TTGCTCCCTCCTCTTCTCTT^ 

TCAAGTGGGACCT<K5CTGCTGTGATTGCACATGATTrCCTGGCCTTCAT^ 

TTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTGTGCTACAGGTAAGAGCCCTATGC^ 

AGAAGACTTATAAC^TGAGAC^TAACTGAAGATTCTGGTTTATACTGTCAAAATTCCTTTGTGAAAATTATAC 
CCCAGTCTCAATGCAGAAAAGCAGGTTATW^

AGAGAGAAGTGACTTGCCCAAGGTCTCACAGTGAACAGGTTGGGAACAGAACCCCAGTCT 

CCrrGCrGATATCTTGCCCCACCCACTGCTGGAGTTTTCCAGCATGTCCT 
TCGGAAGTCCATGCTGGCCGTTATCCTTCAGGTTTCCTTCTCTCATGGTT
TACCCGCCATCCATCATCXJCCACGGGCAGCATTGGGGCTGCA 

AGAGCTG CTGGCAGGGATCACTGGCACTGAAGTGGTG AGTGCTGGGT AG CTGGGCAGCAG CCT CT CCATT CATAAAG T AGTG CT AT 
GCCCTGAGCCTCCAGTAGAGGGAAACCTCCAACCCCAACCIX^ 
ATCTCCTGCCTTGGTCTGGGGAAGAGCAAGTATCTTGGGTGGGGTGAAGG^ 
GTGCTGGAGTGGGGGTGGGGGGTTTGGTGTCAGAT

ACTAAAGAGCGATTCCTGGGGCCGGGCTGTGGCCTAACCTCCCCAGACTTCCCCATGTGT^ 
CTGCCAGATGCTATGGCGGGAGTGTTTGCTCCTCACTCTTCTCCCATGTT 

AAGCTGCACTCAGGGAG AGCCTCAGGGAAGCCGCTCAGACCAGCTCCAGCC CA CCCCGGGGCTCCAGCAGCCAA 
GGGCCCAGCCAGACCAGCACTCCTACAGATGTCACAGCCATACACCTGTAGCCCT 




GCCCAAGGGATCTGGTCCTACCCATCCCCGCAGTGTGCACTAAGGGGCCCGGCCAGCCATGTCTC 
TCCTCCTCCCTGCATCTGACCAGCAGCGCCTTTCCCAACTCTAGCTGGGGG 
ACCAGCATTCCTTTTGAACGCCCCCCCCCCACCCCTGC^GCT^ 

TGTTGTCCCTTCT AGGGTTATTGCATTTGGATTGGGGTCCCTCT CATGAT AGACACAT ATGAGGGGGAATAGT CT 

AG ATGGCTCCTCTCJ^TACTTTGG AGGCCCCTATGTAGTCCGTG CTG ACAGCTGCTCCTAGAGGG AGGGGC CTAGGCCTCAGCCAG

AGAAGCTATAAATTCCTCTTTGCTTTGCTTTCTGCTCAGCTTCTCCT 
TTATTAATTGCTTTGAGCACAACTTTAAGAGGACGTAATGGGGTCCTG^ 
TTTCCTCCCTTCTGCrrACTGGCAAAAGGATCTTTGTGGCCAAG 
TGTCCCTCTGCCCCATCCTCCAGCAGGGAAAATGCAGCAGGGATGCCCTGGAGGTG^ 

CCCTGTTGACACAGGTCTTTCCTAAGGCrcCAAG^

AAATAAAACTGGCTTTGGCTTCTTGGATTGGTGTGTGGGTAAGTCTATTTCTC 
TCTAAAGCAGGGATC^TTGAGCCTTTTCTGTAAAGGGCTGGATATTTA^ 
CAGTTCTGCTTTATAATGTGACCTATGCGTTCCTAAAAATCACTA 
GAAAAATGGGGCTTGAAGCACAACACTCAAATGAGTGACACAT 

TGAGAAATTATATCACAATAAATATC^JCCTTTTATCTTGAAGACA

ACACAGGTGGGGAGATGAGTGAGGTAGCTGGTACATGTTTAGGTGTGAGTTTTGTGTACTACTTAG 
GTTTCCCATATTCACCTACrAriTCTCGAGGATGAAATCACACACAAGCAAACATG 
ATATATTAAATGGGTTGGAACAAATGCGCAATTTCGAAAGCAAATAACAAGGGTTATAGCAGGA 
TAGCACAAAACCAGCCACAGAAAACATGTAAATGGGTGAGCATGGCTGTTCCAACAA 

TCATGTAATTCCTACCTATCACAAAATACTGTTTTAATTTTTTCCCCATTTA

ACAAAAATGGGCACTTGGGCTGGGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGG 
GTCAGGAGTTTGAGACCAGCCTGGCCAACCTGGTGAAACCCCTTCrCCACCCCGTCTCTACT 
TGGTGGCACGCACCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCTCTTGA^ 
CGAGATCATGCCACTGTACTCCAGCCTGGGCGACAGAGCGA 

CCTAGTTTACTAGTCACTAGTCTAAAGCTTTCATTGGCTGAAAAGCCAGAAAGG^

TTCTTGGGCCCAGCCTTATTCTCTGGCCTTTTTTTTT 

CGAATCCTXK5ATCACCGCAACCTCCACCTCCCAGGTTGAAGCGATTCTCCTGCCT 
ACGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGAGA 

CAGGT AATCCACCTACCTCAGCCTCCCAAAGTGCTGGGATTACAGG CGTGAGCCACCAAGCCCAGCCTCTCTGGCCTTTCTTGACT 

CTTATCTCTGCTTTCCCCAAGGAGTTTTCTCTTGTAGTCCTTTCATCCACTC

GGCTGCTGTAGCCTGTGATGGGGGAAGGCAGAGGATGTGTCCCCnXKSGGCTAAC 
GAGAT CAAGGX3AGGTGGGTCCTATATAAACAGGACAAGAATGTCCTCCGTCCT 
TGACTACACCAAACACAGAGGTTCAATAAATATCTGTCTTTGCCTTTATCCACAG 
TCACACCAAGCCCCAGCTGGGX3ACTGGGAATGCTAG 

GAATGGAAGAGGGAACCAGCCAGACCCTCAGTCCTTCCTGTCCTGGACTC

GATCTGGGTCTTCAGCCATTAAAG CTCAGTGTCATCTT CAGTCACCAACGGGGGTCTTGGTGT CCTTCCAAACCCCTTTGGCCAGG 
ACAGCTGACTGTTTTCCTCACTCCACGGTGATGGGAACATCTTCCACATCGCGGGGGA 
GGGCGATAGCTGTGGATGGGGCTGCAGCCGGAGCAGTTCTAAGAGGGCCTCTTTCTGGTCT 
GGACCAAAGTCAGGAGGCACTGGTGCCACAGCACAGGCAGTTCACGCTTCTCT 

TCCAGCACCCGGTAAGGCAGTGCATACTTCTTATCCAGCAGCAGTCGCAGG

AATTTTCAGCATGGCCGCACTAGTGGGAAAX3GGAGGTGGGGTAGAGTAAGCAGGGAA 
TCTATGCTTTCCCCTTTTTTTCTTCCCCTCTGCTGGCCCTTACACTATGGTATTCCCC^ 
TATACTCCCTTGACCTCAGTCACCATCTCTACAGGGATCCACCCAAATCrCTA 
CTAGAGTATGGTCCAAACATCTCAACGTCTCCACCCTAACTACTTTCTTCCTGTGTTTCCT 

GAACTTACTTTATGCCAGGCACTCGGGCCCTGTTGTTGCACAACTTACATTGAGGGGGATAATGA^

ACAAGCACAAACCCAAGGCTTTCCTTAGCTCCTTCTCCACTCTGCCCATTCCCAAGTCCTG 
TGACCAGCCCCCTTCCACCCCCAGCAATACTACCTGGAGTGCAACACAGGGATGGAGCACT 
GGCTTCCCGGAGGGTACAAGTGCCAGACTCGCAtAGTGGAATCAGGATCCCTAGGGAG 

GATGAGAAAAAGGAGTT ACAGCAGGAAGCATTCCCACCCAG CCCAGGG ACAGCT AGCAATTTCCC^ CCTGCTGTGTTCC 

CCATAGAAGCTCCAGCCCTTCCCTTTCTTCCGTC

GAGAGCCATGTAGAGATGGAAGTTGAGTCGTTTGTATTCAGCAACGTCATCTCGTACTCGAGGGAGCAGGACAAGGT^ 
GCTGGGCCAIGCGTTCCTTCAGGTTAGAGGCAAAAATCCTAGTGGGAGT^ 
ACCG3GCCCATTAAATATCACCATGGATAGTGGATACAAGAGGTTTTAAAGTAAAATAT 
TGGGAGCTGTGTGCATGCAACACCCATCAGCTTAGAGAGAGAATAGGAGGGGCAATGAGA 

TCTGTGGGAGAGCCAGAGCTCGTGCAAACTGATGGCTACTCAGCATCAGTTCGATTCTGCTAGCCC^

CAGACTTCTGAGGGTCCTGCTACAGGGTTGTATCCCAGTGGCT 

CCAGT AGCT ATAAAAG C CT CTC C CCT AGAAATTCAT CACGGAGGGTTCCTAACT GG CT AACTGGGT CTTGGCC CCAAAAAATG ACA 
CAGGAAAAGCTAGAAAAATGATGCTATAAAGTAGCACAGAGAGGGCCCTGGGGCAAAGCTCAAATG 
AAAGCCTGTCCGTCCCAAACTCAGCCTGCACCTATAAACCCAAGGCCCCCGAACCCCC^ 
GGCAGCTGCAGTCCAGGCCTCCGGCTCTGTGACGTAGAGGATTTGCT
GCAGTTTTCCACrrGCGGTACTTAGATWVTACCT

TGGGCTAGGAACTTGTCTGGGTGCTTTTAATATATTAACTCATTGCATCCC 

CATTTT ACAG ATGACCAGTGAGAGTTC CAGT AAC CG ACC CAAGGATG C CAAG TCATTT ACAGGCAG AG CAGG AAT CTG AACC CACA 
CACTCTGCCTCTTAGGTCTGTTTTCTACTCCCATGACTTCTTTAGTATGC^ 

GG^CAGCTGGGGCATAGGGAAGCCCGA

ATGTCAGCCAGGGTGCGCCTAAGAGGGGAAGTGCCAAGAGGAGGGGAGAGAGGGGGC^ 
CTCCAGACTCCCCACAGTCCCCTGTGAGCACCGCCATACTGCGATCCTGCTGGGATATGCAGCTTAGAGTTCT 
TTCATCCACCAOVGACAGAATCACTGCAGGGTGCAAAGGAACTGAAAGTCAT^ 
TTATTATGAACCTCAACTCTGAAGTCAGTCTGCCCAGGTTCAAATCCTG^

TTTACCTCTCTATGCCTCTGTTTCCTCATTTGTAAAATAGQGATAATAACGGTACC 
AAGTTAATGTCTGTTCATTACTTGGAATGGTCAGTACACAGTAAGCGTTATCTATGGAAATA 

T CAG CCTCCTTAAACAGTATTGTTGGGTG TATCAAACAT AATCTT AT CTTAATG ATTCAGAATT ACCATCG CCTTG AG CTGT CAT A 

T AAT ACT AG CATCTTCAGAGGCTGCTGAAGTGT ATTT ATTTATTTTTTTTGAGAG AGAGT CT AC CT CTGTTGC CCAGGTTGG CATG 
CAGTGGCACCATCATGGCTCACTGCAGCCTCTATCTCT

CAGGAACATAGTACAAGGCCC^GCTAGTTTTTTATTCTTCTGTAGAGATGGGGTCTCC 

CTGGGCTCCTGCCITGGCGTCCCAAAGTGCrAAGA^ 

TCAGTTGGGCGCAGTGGCTCACGCTTGTAATCCCAGT^ 

CCAGCCIXK3CCAACATGGTGAAACCCTGTCTCTACTAATAAAACAAA 
TACTCGAGAGGCTGAGGTTGCAGTGAGCTGAGATCATGCCACTGCCCTCCAGCGTGC^TC

AAACAAAAAACAAACTGCTCCTCTITTTCTGAGGTCCTACTCAATGCCAG^ 

CAACACTTATAGGACACGGAAACTTATGCAGACATGGGAACT 

TGGGGGTAAGAGTCAAACTCAGGTTTTTGGAAGTCTrGAGTTA 

CCTCTGATTGTTGAATTCATGTGGATCTAGGCTTGCTTTCriGAAGATGCC^ 

ACTTCAAGGAGCTTGTCTACTCTATTTACrACAAGCTGGAAAAGCA

ATGAGGTGCTGAGTGAGAACCTGAGAGGCTCTCTGGGGGGACAAGCCC^ 
ATCAATCTGGGAGAGGGGCTCATTTTCATTTCTTCATTCTGCA 
TTTATAGTATGATTAGCAATGAGATGCATTAGCCCAAAACAGCCTAAAGTATGAGGG 
AATGAACTGGTTGAGGACCCTTATCCTAGACTCCACTCCCAAGGGCCAAAATTTATGGGCTCAGGAG 

AATGCCTGCATGTTTTGGAGCTAGCATTTATTATAGATGGGAATAAATTAGTGGTCCTGAC^

ACATTTGGCAATGTCTGATGTCATTTTTGGTTTTCACAACT 

CTACTAAACCTC CT ACACTG CACAGGACAGC CT CCACAC ACACACAAAATT AC C CAGCCCAAAATGT CAAT AATGCTG AGGTTG AA 
AAACCTTGGCTTAAATGAAGCTTAGGATATGAAGACT 

CAAATG CAAT AGG ATGGCT CTCCCCAGAGGTGT AAAAGGGGCCTGG AACTGTTTC C CTGG CAACTGGGC AGTTGT ACAACAAATGG 

ACATAAGGACACAATGTTTTTTGAGGCTC<K3TCATTCCTACTCACAGTGGTCCTTTGAGG

CCTACAGCTAGCAACTCCAGGGCCCTTTCCAAAGGTAGGGAATCATAAGTCTGCCAGGGAACTTGT^ 
CAAGGACTCATTATCCCAAATCCCCAGGCCTACCTGGCAGG^ 
GTCCACAACCACCTCTGCATGATGGCCCGCTGCTGTCATTGTGGCAGCCTTCT 
ATCCAT CCTGAGG CATTCTTGGACCTAGAAGAGGGGAGACTAAATGGATGAGCACAGC^ 

TAACTTTAGCCATTATGTAGTACAGGCCTC^CATGTGGTTGTGAT^

GTCCCCAGGGGCTTGCAATCATGCTTGAGAAACAAAATGTATATACAGG 
ACATACCATACTCAGTTATGTAGAAGGAACTTTTTTTTTTTTTTTTGA 
GGCK5GATCTC!AGCrCACTGCAAGCrCCGCCT 
CATAAGCCACCAGCCCGGCTAATTTTTTGTATTTTTAGTAGAGTT^ 

CCTCGTGATCCGCCCGCCTTGGCCTCCCAAAATGCTGGGATTACAC-GGCGTGAGCCACTGTTCCCGGCCT

AACACT AT AC CCATGTT AACTGTTT AATCCTT AT AAACCCAAG AAGTGG O^CANNNNNNNNNNNNNNNNNNNNACGGTGGCTCA 
CGCATGTAATCCCAGCACTTTGGGAGGCCGAC^CGGGCAGATCACGAGGTCAAGAGATC^ 
CCCCIATCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGTGG<5CGCCTCT 
AATCACTTGAACCCAGGAG<3TGGAGGTTGCAGTGAGCCAAGGTCACGCCACTGCA 

TCAAAAAATATATATAAAATTAAAAAAAATAAAATAAACC^

AAGAAGTGGATACTATTACTATACCCAGTTTCAAATGAGGACATTGAAGCACTGA 
CTGGGTCTAGACTGCATTCTTCACCACTATTCn'ACACATATTCGGGTTTGATT 
CACTTAAAGGATGAGTGGTGAGGACAGTTAGGAGCTGGATGGGCAGTTTGGGAGCAGAAGG^ 
AAGGCTTGGATATAAGAGTCTCCTAGGCAATAGCAAGAAATAATGCTA^ 
AAATTCAGGCAGTAGAGTTCAGATTTCAATCCACCAGTCCATTTAAATACATGAAGTTGGCTGGGCG
TCCCACCATTTTGGGAGACCAAG^CGGGTGGATCATGAGATTGGGAGTTCGAGACCAGCCTGACT 
TACTAAAAATAC^AAATTAGCCGGGCTTGGTGGCGCATGCCTATAATCCCAGCTACTTGGGA^ 
AACCCGGGAGGTGGAGGTTGAGGTGAGGTGAGCCGAGATCGCGCCACT 

AAAAAAATAAATAAATAAATAAATAAAAATAAATAGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTA 

GCGGGCGGATCACGAGGTCAGGAGACTGAGACCATCCTGGCTAACACGGTGAAACCC^

CGGGTGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTt^GGCAGGAGAATGGTGTGAACCC 

GTGAGCCAAGATCGTGCCACTGCACTC CAGCCTGGG CGACAGAG CGAGACTCCGTCTT AAAAAAAAAAAAAAAAATGAAGTT AGTC 
CATTTAAATAACTTCTGTTTTTGGAAGGCTACATGTCTGGGCATCTG^ 
ACGTATCAATAGGTCATTAACAACAAAGGCAGTGTGTGATAAACACTAAATA 
GGTCCCCCAAATTTGTACTTTGTAGGTGGTTCTCCTTAGTGCCAGACCCTAACT
CTCACAGGCACCTCAAATCTCACCCGTTATCAACTAAACTAAT^^ 
AGTAAACGGCACCATCGTCCACCCAATG CTAAG CCTGGG AGG 

HUMAN SEQUENCE - mRNA 
GCGCCTGTCAGGGAAGCGGCGCGCGCGCGCGGGCGGCGGGa^CT^^
CGCCCCGCGCTCTCCGGCCCGTCGCCTGCCTTGGC»CT^ 

TGCTGTGTTGCGAAGGCACCCGGCACGCGCCCCGGGCCXKK3CCGGACCCGCGGCTGCTG 

CTC CGCCTGGAGGAG CG CTACGTACCCCGCGCCTCCTACTTC CAGTGCGTGCAGCGGGAGATCAAGCCGCACATGCGGAAGATGCT 
GGCTTACTGGATGCTGGAGGTATGTGAGGAGCAGCGCTGTGAGGAGGAAGTCTTCC^ 

iGTCTTGCCTCCCCACCCGAAAGGGGCAGTTGCAGC^ * Gv.vjCGAGACCACG
CCCCTGACCATCGAAAAACTGTGCATCTACACCGACCACGCTGTCTCTCCCCGCCAGTTGCGGGACTG

GAAGCTOUlGTGGGACCrGGCTGCTGTGATTGCACATGATTTC 

AGGCCTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTG 

GCCACGGGCAGCATTGGGGCTGCAGTGCAAGGCCTGGGTGCCTG CTCCA CTGGCAGGGAT 
CACTGGCACTGAAGTGGACTGCCTGCGGGCCTGTCaGGAGCA^
CCAGCTCCAGCCC^CGCCOUUUSCCCCCCGG 

ATACACCTGTAGCCCTGGAGAGGCCCTCTGGAGTGGCCACTAAGCAGAGGAGGGGCCGCTGCCAC 
CCACACCACATCTAAGCCTGAAGGGGCGTCTGTTCCCCCTTCACAAAGCCCAAGGGATCTGGT 
CTAAGGGGCCCGGCCAGCCATGTCTGCATTTrcGTGGCTAGTCAAGCTCCTCCTCCCTGCATCT 
TCTAGCTGGGGGTGGGCCAGGCTGATGGGACAGAATTGGATAC^

GGCTCTCATGTTTTCAACTGCCAAAATGCTCTAGTGCCTTCTAAAGG^^ 

CCT CT AAAATTTAATGCATG ATAGACACATATGAGGGGGAATAG TCT AG ATGG CT C CTCTCAGT ACTTTGG AGG CCCCTATGTAGT 
C CGTGCTGACAG CTG CT CCT AGAGGGAGGGG C CT AGG C CT CAGC CAGAGAAGCTAT AAATTC CT CTTTGCTTTG CTTTCTGCTCAG 
CTTCT CCTGTGTGATTGACAGCTTTG CTGCTGAAGG CTCATTTT AATTT ATTAATTGCTTTCAG CACAAC 
GGGGGCCTGGCCATCCACAAGTGGTGGTAACCCTGGTGGTTGCT

AAGGAGCTGCTATAGCCTGGGGTGGGGTCATGCCCTCCTCTCCCATTGTCCCTCTGCCCC^ 

GGATGCCCTGGAGGTGGCTGAGCCCCTGTCTAGAGAGGGAGGCAAGCCCTGTTGACACAGGT 

GCTGGTGGCCCAGGACCATCATCCTACTGTAATAAAGATGATTGTGAAATAAAAGTGGCTTTG^ 

HUMAN SEQUENCE - CODING

ATGGAGCTGCTGTGTTGCGAAGGCACCCGGCACGCGCCCOKX3CCGGGCCGGACCCGCGGCT 
GAGCCTGCTCCGCCTGGAGGAG03CTACGTACCCa3^ 
AGATGCTGGCTTACTGGATGCTGGAGGTATGTGAGGAGCAGCGCTC 
CGCTACCTGTCTTGCGTCCCCACCCGAAAGGCGCAGTTC 

2 5 GACCACGCCCCTGACCATCGAAAAACTGTGCATCTACACCXfACCACGCTGTCTCT 

TCCTAGGGAAGCTCAAGTGGGACCTGGCITGCTGTGATTGCACATGATT^ 
GACCGACAGGCCTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCT 
CATGATCGCCACGGGCAGCATTGGGGCTGCAGTGCAAGGCCTGGGTGCCTGCTCCATGTC 
CAGGGATCACTGGCACTGAAGTGGACTGCCTGCGGGCCTGTCAGGAGCAGATCGAAGCTGC^ 

3 0 TCTCAGACCAGCTCCAGCCCAGCGCCCAAAGCCCrc 

CACAGCCATACACCTGTAG 
Table 9

MOUSE NOMENCLATURE 
ICSGNM Wnt3 
Celera mCG19162


HUMAN NOMENCLATURE 
HGNC WNT3 
Celera hCG27972

MOUSE SEQUENCE - GENOMIC

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NKNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
3 0 CAACCAGGCCCTTCTGGGGCCAGTTCCCTCCGGGCAGATAGATCAGCTGGCCCCAGG^ 

GAGTGGCAGCTTTGTGTGATGGAATGAGGCTTAAACTCCAGTTGGCAA 

ACACAAACAAGCACCACTGTGCGCACACAAGCAAGCACCTCTGTCTGCACAC^ 

CACTGTGTGCACACAAGCAAGCACCACTGTATGCACACTAGCATCACT 

CAGCACCACTGTGTGCACACGAGCACCACTGTATGCACACA^ 

3 5 ACAAGCACCACTGTGTGCACCCAAGCAAGCACCACTGTGTCCACACAAGCAAG 

ACACAAGCAAGCACCACTGTGTACAAACAAGCACCACTGTGTGCA 
ACAAACAAGCACCGCTCTGTGCACACAAGCAAGGACCACTGTGTACA 
GTGCACACAAGCAAGCACCACTGTGTGCACACAGGCACATGTGCATG^ 
CCTGAAATCTCTGTGACTGCTTGGAACATGCTAAGAGGTCCATGGACACCACCCAGTCCT 

4 0 GATATGACCCTCACCTATAGCAAGTGTCCGCAGAAGGCrGAGGTAGCCACTTCTAGGGACCCACAGCT 

CTGGTCTGTCCTGCAAGGTAAGGACATCAAGGATAATGACCATTAGTAATGGTGGAGAAACGAAGCATCTAGTCT 
ACGGCCAAAAGGACTCTTAATTAGGCCACGTTAATCTATA 
CAGAAAATCGGCCAGGGACCCCTTGGCTCCCCACCCACATCCTCCCAGCTCT 
CAGCTTTTCTGGGCCCTGGACTGTCTACCTTGTCT 

4 5 TCGCCTTCOTCCTCCCACTCCTGAAGAAAGCTTCTCACAGACCAAAGA^ 

GAGGCCGTCACGCTTCCTCCTTACTGGGAATATCTGGGTCGCAGAGGCCTCCGTATGACCC^ 
GGCTGGATTGGTGTCCCCAAATATGTCCAATACTTTGAGACGGGGTGGGGGTGGGGCCTCAT 

GG AGGGAACTCAGTG CCACCTTTGG CTG T AAGGAACAGC CT C AATTGGGATT AGGAAAAT ACACCAT CTTG AAT CAT AAACAAACA 
AACAAACAAACAAATAAATAAATAATCTTGAATCACAAACAAACAAACATT^ 

5 0 CAGTGGCTCAAGAGACTCAGTTTCTCCCACCCTGGTCTTAAGTCTCTACTTTGGTCTCAGTCAG 

AGATATAACAATACCCCAAAGCGGGGAAACTGGACAGCAATTTAGCGGACT 

TCCGTCAGCTGACTCCCCTTCGTGTCTCCTTTGAGTCGGTGTAGCCCAAGCCCTTGCAAA 

TCTGAGCCAGGTGGGGGAGAACAAACTTGGTTCCTCATGTGCCTAA^ 

TGTGTATATATATCATCCTAGTCAGGATTTCTACTGCTGCACAAAACATCATGACCAAGAAAa^ 

5 5 TTATCTTATACTTCCACATTGCTGTTCATCCCCAAAGGAAGTCAGGACTGGAACT 

GATGCAGAGGCCATGGAGGGTGCTGCTTACTCGATTGCATCTCCrAGCTTGCT 

GCCCAGGGATGGCACCCCCCACAGTGGGCCCTCCCAGCCTTGATCACTAATTAAGAAAATGCCTTA 

ATTTCCTGAAGGGAGGCTCCTTTCTCTGTGAGAACTCCA^ 

CATAAGTAACTCClAGTTCCAGGGATCCrGACCTCrGCAGGCATCGAGCAGGTGGATGG 

6 0 ACGTGTAAAAGAAGATAAATCCAAAATAGTAATAATAATAAAAAGTAACTCAAAAGAACTTTCCTCCT 

CCAGGAACCATTTCAGAGCCAGCAGGCATGACCACTGCAGGC 

GGGTTGTACAGTACGAGGCAGAGCTGGGCTCTAAGGCAACCACTCAGACAGTGTCTCCCTGGACAAA 
CAGATCCTGAG AGGCCAGCAATCCT AAGCAGGGTTTGCCCAAGGACCG CCC CC CACC^ 
GGATCTTTGCTG^TGAAGTAGCGAGGGCCCCAGCGACTGTCCACTGAACCATCTACTTGT 

6 5 ATTTCTGGTCCAACCTCAGTTTCCCCACGX3TCGCTGCTGTTGGTCATA 

GGGTG CAAACCAAGCATGATGGGCAATTGTCTGCCCT ATATCCATGGATCAG ACCAGGCT TTT A 

AATAGTGTCCCCACCAGACTCTTCTCCCAAGAGTCrrGGCTG<5CTCTTAATTT^ 
CCTCTTCCAGGAGAAGGTAAGCCTCATAGACTCAGAACCCTGGGGCTCAGGCATGCTTAC 
CTTAGTGAACAACTAACTACACAGGACACCTATGCAAAGAACACCT^ 

7 0 ACACACACACACACACACACACACAGAGAGAGAGAGAGA^ 

AGGGGCTGTCAAGTCTCAGATGCCTCCAGGTGTOVGTGACT^ 

G^^TGTTACCTTGAGGAAAACTCAACCCAACACTGGGGTGGACAGGTC 

ACATAAACCCAGGGCCCTCTCATCrGGGATCTCAACACCCTCTCTAAGTCT 

TG AGTTCAGTTGGAt AAaGGGCAGCACACAGCACACAGG C7TAGGTGTG CCATGTGTTAACTCCATCC CAGTGTT AGGGGCTTCTG 
GCCAGAGCCTAGGGTAGGGGTTGGCCACTTSATGG^

GGGGCGGGGTGGGTGGGTTGGCCTCTGATACTAGGACATGTTAGGGGG 

GTG TTTTTTT GTTTCTTT G TTTG TTTG TTTTT TA ACCATC 



ATCCCATAAGCCAGTCTGCTGACTGTCTCTAGG^^ 
ACACTTCATCCCCAGGTAAGCTCCGAAATTGCACAGTAGATTAA^ 
AAGATCCAGGCTGCATAGGACCTGAAGGCTCACrCACTTCTGCTTTCCAG 
1 0 CAGGGAAGGCTCTGAGCTCTGGGACAGACACCTGCATCTCTTTG^ 
AGGCTTCGACTTCTGAGTCITCACTTAGGATGATGAGGCCT 

CTC CATG AT CCT CCT ACTT CCAAGG CCACCTG CTGT CTGGTCTC CT AGCTGTCTC CAGG AAACTCACCCG AAATCGTTCG AGAGGC 
TCCTAGAGTTGTTAAGGGAGACCCTTAAAAGCTCCCTTCTCCTGGTCAGAGACC^ 
GAGAATCTCTGCTTCAGGTCCCTTGACCTTCGCACCTGATGCCTC 
1 5 TTAGGCTAGGTTGGTTCTGAGCCCCCACCCCCATCC 

CTGAGTGGGAGATTTCCCCCTCCCTCCTCACGACAGACTGGCAGCCGAGT^ 
CTTCGAGGTGCCTCAGCAGGGGTCCGGTGGAGCAGCGGAGAATACACACAGC 

GGCTTCCACGTGCC^CATCrrTTGCTGCAGGTCCTGGGTCCCAGATGC 
2 0 AGGGCGAGCCTCATATCCACCCTCATTAGCTGGTTTCAGAGTAAAAATAATC 

GCCACCCGCAGGCAGGAGGAAGAAAATTAAGCCTGTGTTCAAGGA 

TCCTCTGGGGAACTTTCAAGTGGGGGGGAAATCCCCACGCAGGCTrCTACT 

TGGTCCCCTCCrCCTTTCCTTCTTGCACCTTTGCCTGrrTTGTTTCCTACCCT 

2 5 TTGCCTGTAAGAGGCACCTGCATACACACAGCCCTTCAAGGGACGACCAGTT^^ 

CAGGCAGCCAACATCTCACCCAAAGCCACCAAGAGCCTTTGCCCCCGCTCT 
ATGACCCCAAGAAGCCCTCCAGATTAGCTCTTGCCTC 

CAGAAACACTAATTTTGGACTCCCCCCCCCCCCGCCCCACGCTCTGATGCAG^ 
CCTATGGGGAGCCCCCATGGGATGAGCAGCAATCAAGATCTCAGCTAAAGGCTACCT^ 

3 0 CTTG ACAG AAG CCTC^GCCCCGGGGCTTTGCATTCCCCGATAGTGGCCAACC CCCTCCTACTGCCCTCCCAGGTGAATCAGAAACG 

GGGCCTCAGAATCAGCCGTGCCCCTTGGGGAGGAATCTTCTGGGAATGCCCGGCC^ 

CAACCCCCACCTCTTCCTCTGTCTGGGCGTTCACCCCACCCCCGCCCCTGTTTTCCGTC 
CTATCTCCCCTTCCTCCCCTCTTTCATCTCTCCCTCTACCCCCCTCCAGCCGAGG 

3 5 AGCCCAGGCGGCTCTCTCGACCCTTGGTGCCCCCAGGGGCCAGTTCCTTGCTCTCCTTAC^ 

AGACCAGCAGGGGCAGTGGATCCGAGCTGTGGGCACCAGCCTGCCGGATCAAAGGCT 

TCAGTCCCATCCACCCCTAACAGCAAGAAAGCCCGCCIATTCCTGGCTAGG 

CAGTTAAGTCCCACCATGGCITCTCCTTTGGCTTC 

CCGGGTAGAGAGGCTGAGGTACCTAGGTAAGGGGACTGAATCCTAACAATCTAGGTATGATTGGGCT 

4 0 GTGGAGAACTGGTAAGAAGAGTGCCCCTTGGAGGCCTGAACAGTTCCCCTTGTATA 

GTGTCTGCACGGGACAACGCTTAAACACCAATCGCCAGG 

TCGACATAGGGGAGGTAGATGTTCCTGAAGAGCCCTCGCCCCCAACCACTAATAGACAGAGCTATACCGTC 

ATGCTTCCATCCTAAGATCCAAGAAACTGCAGGTACTGCTGAGGTCCGGCCAGCACGCCTG 

CCCGCAGGAAATGCAAACGGCXK5CTGGTGTCAGTTGCAGGAAATGCAAAGGCTGCAGGAGG 

4 5 AAAAGACTCAAATGACACTAGCTCTGTCCCCCGGCGCra 

TCGGACACCCTGGAAGCGACCTGCGGCCGGGTGGCAGGGGCCGCCAGCTCGCGAGCCCGAAAAGGCT 
ATTTTCCTAGCCCCTGATCCCAGTCGCGCGATCTGGAAACAACCCCITTGCT 
AGGGATCACAGAAAAGGGAGCCACCCATCAATGCCCAAGCCGACCCCTTCACCACGTGAACGC 
AACCTGGCTCGCGTGGACTGGGAAGGGGGTGTCCCAGGCCAGCAGAGAATCCAGCGTGCCG 
50 CGGGGCGCAGGCTTTGCCCAAGTTTGCCGGGAGATGCCCTTCCCAGAGCCGGAGTTGTGGGCCGGGCGGGAGGGGCCCGTGATTG 
CAGGCTGAACTACAGACTCATCTCITACCTTAGGCCGCGGGCGCTGATTGGCTGCTCG 
GCTGGGCTCCGGAGGGGGGCGGCTGCGGGTGGAGGTGCGCCTCTGACAAGCCCGAAAGTCAT^ 
CCAACTATTGGGGGCGTCGCTCCCCCTCTTCATGATCGCCGGCAAACTTCCTCCTCGGaS 

CGGGCTGCTACTCGGCCT CCTGCTCAGTGGCACCAGGGTCCTCG CTGGCTACCC AATTTGGTGGTAAGACTTGC CTCTTGTCTGCC 

5 5 TGCGGCCCCGATCTCTCGGCCGCACCCGGGGACGGTGGGAGG CG CTGGCCGGGTCTGGGCTGCAGCAGGCCTCGGGCTCTTTTCTC 

CTTCCTTCCCGCTAGCCCCCCAGCCGTCCACCCCTCTTGGATCTTCTGCTGGTGTCG 
ATAATCACCATTTTCCCTGCTACCCGCCGCAGAGGCGG 
CTCCCACCCCAATTTGGGGGTCCCGAAGAACCTTCGT 
TTCTCCATTTTAGGGTCTCCGTGGGTAAAATTATCCTGTCCTTTTTCGCCA 

6 0 CC CACAATACTCG CGTTTG C CTT CAT ATTTG AG AC CTT AAGGGCTTTGGG AATC CCAAGCAGGGGACTT AGGGGGT AT^ CAGT AG 

ATTCTCCCCACTCCGTTCTAGATTAATTTTGAGCCTCTGAAGTGACT 
TGTATTCGTTCCTTX3CTTGTCCATCCCGGAGAAAAGCGAGGTGGGGGCTC 
TGGGTTTTITTGAGCGAAAACCTAGTTCTCCATGTCAAATTCGTT^ 
GGGGGAATGCTGTCCCTGAGAAATACGGATACAAAAGGCATTTTAGTAAAT 
6 5 TGGGTGTGTGAAGGGATTCACTGGAAGATAAACCGCCTTTTGGAAGCCCCAG 
CTGCTCCTCCTCTGTAGGTCCTCCGGTGGAAGACACTAAAACCAGTTrTCT 

TGCATT CCGGAAAACTTT AAT AAAAGAAC CCT CCC CTCCTGGG CAG TGG CT AG AGCC Cnt^GGGGGGTGGG AGGTGAGGAGG AGG A 



CCCCCTCCCTGCCTGCTACCTGCAAGCCTGCAGTATGAGGCTCAGAGCTTATAA 
7 0 ATGCTTTGGGGTGTCTGAAGGCCAGGGGAGGGGGCAGTCATGAACCCAGG CACCCAAGCCATGGCCTGC CCAAGGTTCTGTTTAG A 

GGGTCCAGGAGTTGCTCCTTAGGTTGATAAATACGCCAAGGACAGACCATCT^ 
CTGTGTGGAGTCTATAGAGCCTGTGCATGCTC^GGGACAGCCAGGGATC 

TGCCTCCGG ACTTTC CTTCCCCCAGCCAAGGCCAAACTCTG CCTTCTCAG AGACCCTGCGGTGGCGGGTGCATAGGAAGAGAGGGT 
GCCTGGAGCCCTAGGGATGGCATAGAGGAAAAAGGAGAGAGTGGGCTTGCTAGTTCTGTGAACT 
75 AGTACCAGTAGGTTAGATGGTGGTAACAGTAGCCTCTGTGTGTAGTAATC^ 






AAATATTAGGAAAACCTGGAGGGCAGAGACAGGTCCTAACATA^ 

GCTTTTGTCTGAGCCATAGAGAGTATGTATGGGTATTTTTGAGATGGGTGGTTT 

TAATACAGTGTGTGCTCTAGGTCAGTGACCCTAAGTGTCTGTGTACCTGTATGTCT 

CCAGG CAAGTG TTTTGTGTG AAAC CTTTTGGTGTGTGCCT AGGGG ATCTG CGGCCCACAG CAGGGT ATG CTGTAT CTGCAT ACAAG 
5 TATCAGCAAGTGGCTGGTTCTGTGATGTGACACAGAAGTCGAGAAGTATCCGTTGCT^ 
CTGTGATACTGGACATGGCAGTATTTGTGTTT 

CCTTCCTGACAGCGTAAGGGGGAGGCCCAACAAATGTCTTTGTGCGTGAGATC 
TATGAGAGTGAGCCTGGGAGGCCTGGCTGAGGGCTGTGT^ 
TGTGCCAGTGGTTATGCGTCTGTGGTCATGTTTGCTGACACCTGACTTOT 
1 0 CATCAGTGGCCCTGAGGGGAAAGACAGGCAGGAAGTGGAGGTCTCAGAAGGTGGCTGCA 
GGGTGGGGGCAGGGCCTTCCTTGACACCATCAAGCCTrcCTTC 
TCCCITTCGTCTTGGAGCTGGGGCATGAATCACCTTCATGGGGCTGGAAGTAGAGA^ 
CCTCTGCCCCAAGGCCGGCAGATTCCAGGCCTTTGAGTCCGTGACTGCCTGCCACCT 

CAGAGGATTTACAGGGGAAATGG CTTAGCTGGGATTTCrGTCCACATCACCTTCAGAAAGGACC CCTACTCCCA^ 
1 5 TGCTTTTCCTTGCAAGGGCCCCAACTCrrGAGACAAGGTGATGAGGAAGGGGCTCTGAGG 
TGCTCCAGAGCCTCCCAGCCTGGCAGTGGCTCCATTGGAGCAG 

CTCCATTGGAAGTGCCCTGAAAAAGGAGGCCTGGTTGGCTGGGTGGTGTTTGGC^ 
TGATTCCTCCCCACCCCCCCTTTTGGGGGTCACTTTTCTCACTTTTTAC 

2 0 GGCTCCAAGCCTGTTAGGGCGCCTGGGTTCTCATTGCACCACT^ 

TCAGTTTCTCTATGTGTCACATCCTGCTGCCT^ 
ACGTGTTCAGTCAAACCCCCGTGGGGGTCAGTGGGCTGGGCCC^ 
GGTGTTAACACTTGTAACAGGCAGGAATTACAGCTGGCTGGGGCT 
AGACAGTCAGTCAGTCAGACAGACAGACAGACAGACAGACAGACAGATGGGAGATACC^ 
2 5 GGTCGCTCTTCCAGAGAAGATCTTGCCGTGGGGGAATTCGTCTGGGGGGAAAAAAGA^ 
GCCCACCCCTATTGTGGGCGGGGCCTTGTGCTACAAAGGGA^ 

GCGGACTGGACTTTATAGGGAAGAAGGCACGTGGTTGGGGTTGG^ 
CCCCTGCCCCCTCCCCCCTTACAATCTCCAAGTGTCTTATTAACAGCTTTGGGA 

3 0 TTAACATAGATTCTGAGGTCTTGGAGTTTTGGATGT^ 

CCTTCACAAAGCrGGTACCTGCTGCTC 
CTCTGCTCCAACCTCCATTAAAGAGTCAAAA 

CACCCTCTCAGGGCAGAACTCACTGCTCACAGGAGGACACAGTTGCCTGGAAGGTTA^ 
CCGCGAAGAGAAGGCCATACCCTACATTTACCCAGCATCCCCCTGTTTTGTTTTC 

3 5 TTTGTTGAGATAGGATTTCTTCATGTAGCCTTGGCTGTCCTAGAACTT 

TCTTCCTGCTTCTGCCTCCTXXX3TGCCGGGA 

TCTATATATTCCCTCTGGGGAGAAATGTTTCAGCCTGGAAGTGGAGAGCCACGTCTGA^ 

TGACCTAGGGTGACTCTGTCCTTCTTCAGCTCCCTGCCTCTCTCTTGACTATACT 

GGGGATATGATGGGGACTCCACAGCAGATACTTGGGACCT^^ 

TGGGCTGCTGGCAAAGTGACCTCTCCCCGTCT 

ATCAAATTCAGGCCCCAGAATGCTGCCGGAATGCAGGAAGTACAGTTCAATGTTTGAACCT 

TCTCTTCAAATAAACCCT CCGAGATTGGCCCCCATGGCTCTCA'l " l"r CT TCTTGTTCCATGTGCTTGGCACTGGTGCCCACAGGCTT 
GTAACTTCCTTCAAAATCAGAAGTTCAGCrrTTTAAAATATTTTTTATTATA 

4 5 CAAGAGAGTCGGATCCCCTGGAATTAAAATTATAGACAGTTGTGAGCTGCCATGTGGGT 

AAGGACCAGTGCTCTTCACCACTGAGCCMCTCTCCAGCCCATC 

TAGGGGAGGTAGGGTAAGGTGGACAGAGGTGGTCTGGGGTCTCCCCT 

CTCTGTCCTCTCCAGCTGGAACACAGAGCTCCCGG^ 

CACGAGGTGAAGTTG CTTCTGGAG CTGGGGG ATGACTCAGAGGGTG ATTTATAGACGGAAAGCAG AAAGCTTGGCTCCT 

5 0 GCCTGCTGCTTAATTACATTCTTTGGGGGGTGGAG^ 

TGGTACCTGTGATGTCCTAGCACCAAATAGGGTTCTT 

AGGAAGAAGTGGGACAGAATCATCGGAAGGACAGAGTCCCGTGGGGACCACC^ 

GGGATGCTGAGAAGGATCGCTGAACCTCCACCCTGCTTCTC 

ACTTCCCCTTCGCGTGGTTGGCTGTGAAGACCTTCA 

5 5 AGACCAGGCCTGGAACTCAGAGACTAAACTCTGCGTCTTTGGGTTAGATGACCT 

TGCAGCGGTTCTGGTAGCATCAGCCTCGTGCTGGAGGGAAGGA^ 

TGCCCATGCTGACCAGTGTAGTGACTCCAACACAGCCCCAGATACTGAATTTCTGGGTCAGGGGTCC 

AGGGGTAAGGAGGCCAAGGTCTGCAGCACCTGGCTGCCT^ 

TGGGGACAGCAGACAGCTCTCCAGTTCCCCCTGAGGCAG^^ 

6 0 ATGTCAGGTATACTCACACATCTAAAGAAGGATGGGGACCCAGAAATGTT 

GGTTTCCAGGTCCGGGATCTGAACTGTTGGACATGGTCTTCCTATTGCAAAGTGATGCT 
TTTTCTGTGCCTAGTGGGAGACTGGAACTGATTCCGGGACAGAGTAG^ 
GCTGAATTGCCAACCTTCCCAGACCTCAGTTTTCT 
AGCTCAGGATGTCTGTTCTTACCAGAGTCAGCTCAGCAA 

6 5 AGGTGCAAGAGTATCTGTAGTGGATCCCCAATCTCTGAGTAGAATCTCTGGGAGCAGAGCCGTTG 

TGGCAGGTGTGCTCCAGTTTCAGCTCTGGAGGTCCACGAAGAGC 

GTGGAGGTGAGGGCAAGGTAGGCAGCTGGAAGCACTCTCTCAAAGCCCGAGGCCATCAOT 

AAGGAAACCCACAGCGGCATAGCCCTGTTCCCTAACTGAGCTAGAGAGACTTGAAGAGAATGTCCTGTGTCCCTGTGTTGGCAAGG 
GGGATGGGAGCAACAGGACCCTGTGACCAGGTTTGTTGGG^ 

7 0 CTATCGGAGCCAGCCGGGATCAGCAAGGAAAACCAATCTTC 

CTGCACACCGCCTCTCTCTAGTTCCTCCACACTTCATAACGTGGCTTTCCGAGGGTGTCATTCTC 

TTCTGATCT AGACCCCCTTTGTTTTATTTTGG CCTCT ACCTGC CT ATACAACATGCCTTTCAGAACCCCTCTCCCACTCCTTCCTC 
AAACCGAACACAAAATAGTACATGAACCTCTGCATGCCTCCTTCCTCTGACCT 
rrAta^G CTTATTTCTGCTCTGCCAAGCACAGGACCT 
75 GTGTCICTAAGGCAATGGGACCCTCCTGAGCAGACTGCCGCAGCCGTGCT 
TCCTTTGTTGCTCCTCCTCAGCCCAGATTGAA^
ATATAAAAGTGTCAGGACCTAGGGACAGGTGCCTTTGCATC 
AACGGTGAGCAGCTGCTTTGCACAGACCACATC 
GACTTCCrAGTTAGAAGGAGGAGAAGCTGCTTGTCNNNNNNNN^ 
5 NNNNNNNNNbmNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNmWNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNl^^ 
10 NNNNNNNNNNNNNNNNNNNNNN^ 

GGGAAGGGGGTAGCTTATTCATGTGTGCATTTGAGCATGGAGCCAGAATAC^ 
CITTGGAAACATAGTCTCTCACTGAGACATTAGGTTCCCTGATTCG^ 

TTCTG CCTC CCT AG CTTTGGG ATT ACAGG CACAC CACCACAC C GAACCTTTT CCCATGGG TTCTG AGGAATGAGCTCAG CAATTAG 
ATAAGCCATCTTCCCTCCCTAGCCTCAGAATTCATGCCCATAGCCTAGCCCTC 
1 5 GACTTTCACAGCTAGACATTCAGCACTATT^ 

pVTCAGCCATTTAGGAACCCCAGAGCTAGACTTGATACTCGATGTTTGCACCTC 
TTTtKTCTGATCCAGTGTTTTCATCTCCATTCTGCCATTT 

CCTGGAGCATCTAGAGTTCACCACGCTTGGTCTCTGTGTAGCGGCTACTGATGCTCCAT 
GTTGGCTGCCAGCTGTGCTGAGTTCTTTCTGAAAAGAGGCAAGGCAGAGGAAAAT^ 
2 0 AGC!AGTATGCCTGTGATCCCTGCACTCCAGAGACTGAGGCAGGAGGATTGCA 
CTGTCGACACAAAAGAAAGAAAGAGGGAAAGGGGTGGGTAGGGGAGAW 
GTTTTGAGTTGGGGGAAGCGCATCCTGTATCTTCCCCAG 

TCTAAGCTTGAAGGTGTCATCCCACCTCTTGGGAATCTCAAAGATTGTAGCCAAGGATGACC 
GCCCCAGTGCTGGGCTTAGCCCGCACCACTGTGCCTGGGTTTATTTGGTGTTAGGGATTGAACCCAGGGC^ 

2 5 AAGCACTCTACCAACGGAGCTATGCCTCAGGTTGGGGACTTTGTGTCTGCCATGGC 

CCATGCCTGTCTGTGGTGTTCTCCTTGGGAGATGACTAA 

GCCCCCTTTCCATCTCCCCAGCAGCTATTGCAGAACTACCACCCTGTAAAAAT^ 

CAGGAGACCACACTCAGG ACCACATAGGGTGGCTGGCCTCTCTCAGG CCAGACGTT CCCATTGGCAAAGCTGTGGGGAAGACCCCA 
AGTAGACCTGGACTAAGCCAGATCACTGCTGTGGCAGGAACCT 

3 0 TTTGACCCCAACACTTGGGAGGCCAAGACAGGTAGATTTCTATACCTTCAAGATCAGCCTGGCCTATG 

TCAGGGTGATGAGAGACCCTGTCTCAATAAACAAAAATAGAAGTAGAGAGG 
CTGGCAAGGCTTCAGTTCACCTCTATGCTCCTGCCCTTAGTT'GGCC 

TCCAAATCTACTGTAGGTGGACCTCCATAAGGTCCAGAACrrTTCCAAGGATGACCATGACT 
GCACCCTGTCTCCAGAAAGTGGGCAGCCAGGGAGCAGATTTTAATCATCAGCGAGGAATCCATCTGTCCT 

3 5 TCAGCCTGGTGTGCACAGAGGGAACACATCACCTCTGTCAG 

TGAGCCCTTGAGAGAACCACTTCAGGGCCTAAGGCTCCTCAGGCTGTGGACT 
GGCCCTCCAGGGTTCACAGCCCCTCACTGATAGCCAGGAAGCTCCCTCAAGGTCTATCTAGAC^ 
ATTTGGTCTCTGGGATGTGAACATCACTGAAGGGTCTCAATCTGCCCTTGTGATTGGATATGTTGTTCCTGGAGTCC 
CTTTGGAGGTAACCTGAAAGTGCCATTAGCCCAAGTTATGAGGGG 

GCTCCAGGCATCCCAATATCTGGGGAATCCTTCCCAGATGACTGTAGGCCTCCTGAGCTCTCTG 
TATTAGAGAAGATGACAGTACTCATGLAGCCAAA 
AGGTGTCCAGCATGGTGAGCCCTGTAGAGGCTGAGGGCTATT^ 
TGATTTTGCTTACTATTTAAATTTTGTATTAGTAACACACA.CACA 

4 5 AAAGCACATGTGTCCTGTGGAAGTCCAGGACAAGGATCTCAGTTTG 

TTACGTCTCTGGT ACCTGACAAACACTCAG CCITCATTTAATAATCTCGTCCCACACACTCTAG CCAGGATGAAGGAG TTGGATGG 
GG AAATGGGTCTGTCTAGGTTCTCCCAGC^CAGAAATCTCGGAGAAGGCTTGACC^ 
AGCCTTGCAACATGGGCTCACCCTAGCAGGAGAGGAGCCCCITGTTTAGCA 
ACCTTCAGGGCCCTCTTTGCTTCAGGGCCrCATCTTGCC^ 

5 0 GTGACCAGAAGGGCAGCGACTGCATCAAGGAACACACCCA 

CCACCCTGCACCITCCAGCCCCAGCCTGGCTCTCCTCCrGGCrACTCTCA 
GCTGGGTGACCTTACTTTCTCTGAGCCTCAACTTTTCCCTCTC 

G ACAGACAAGGTCTGGCGTCTACACAGTGAGTAATGCTCCCATTTGC CATCATTAAT CTCGCTGTCTATTTTGTTGTTCTCGCCAT 
GCATCACCGCTCTGCGCTTTTGTGCTCGCCCCTGCCTTCre 
55 GGTACCATTGTCTCACCAGCACCTGCTCCACCTGAGCTCCTCC^ 
GCAGCACGGGACACCACGCAGCATCrGGGGACAGGTCTTAAGCT 
C!TTCCTGCTCCAGGTGTCTTCCTCCTCCCAATGTCCTCCTCCT 
ACTTCTCAGTCACATCTCATTGGTATCTATCTTATCCTTCCAAAGAGGGGACCAGCT 
TGGTGACCCAACTGAGCTTCATATAGCAACTCTGGGGTGGA.GAGCACA 

6 0 GTGACTCAGGACCCCTCCTa^GAAGCTCCGGTGCGGCTTTCGCTGCT^ 

TCAGCAGGGCCCATCCTTGAGCTTCTAGCCITTCGTGGACACATAGCTCTCT 
GGCACGTCTCAGACTTGTTGGCTGGTGGCTCTCAGACrTTCA^ 
TTGTCAGGAAAAGTTGTGAGGTCCTGCTACATAGGGGCTGGGCTGACAGGGACr 
AGCGAGGGTTGTCCAGGGACTCAGTACCGGGAAAGAATGCAGGTCATCCTGCCCT 

6 5 CGCCTTGTGATGGTCTGCCCCAATTATCACCTrAlTTGAAAACCAAAATGGGAGTGG 

GGACCCTGGTCTGACCCrcTGTCCTGGTGTCCTGCTCAGCTCCTGGGAAGAGGTAGGCAAG 
TTACTGGGGCTCCTGGGCTGGCCTGTGCAGGTTTAATATGGCTC 

ATGGGTGG CCTGTGTGTGACTG AT ATG CAT C CTACACAG ACAC CAGCTG TTCTG CTGAGAATTGGAG AATG AAT ATG CTTTTTATT 

7 0 ATATAGTACrTTATGCACAOUAGCAAACACATCATCTATGG^ 

GAACCAGAACATCAGCATCTGTCCTTAGGCTTCTCCCTCCCCACCCCATCACC 
CGCCATTTTATAAAGAGAGATGGTGGAGGGGGTGGGGGGAATGTCTCTCACCC^ 
AGAAAGTCCCTAAAGATTCTCCTGTTGACCCTTCCCCA^ 
AACTCAGGTCTTAGCTTACAAAGCTTACTGATGGAGCCACCTCCT 
7 5 G CTGG ATG TATGGG C CCACAC CTTT AATCC CAGCACTGCAGAGG CAG AGG CAGGTGGACCT CTGAGTTGGtf\GT"T C CAG GACAG C CA 
GGGTT ACACAG AG AAACACTCTCT CAAACAAACAAAC^ CTG AGAT AAG

AATGCCCACAATACATTTTTCZACTTTGCTTGTTAAAAAAAAAAA^ 

TTCTTTGTCTTATTTTCGAGATGGGGGGGGGG 

GCCTCGAGCTCACAGAGGTCTCrrGCCTGCCAAGAGCrGGGATTAAAGGTG 
5 TGTTTCTAAGTTCCTCATTGTTCCTGAGTCTTTCTATATTTCCTTCAT^ 
CTTTTTGTGTACACAGGAGTCTCTCrGTGCCCATGGAGGAAAG 
GGCTGTTGAATGTTCTTAGAGTGATGCCTAGCCATTCCCAAACATGCTCCT 
CCTCTCTTTGCCCATGATCTTCCAAAGCTT^ 

GCCCCCACACTGCACATCCTCCAGCTCTTCGGAGCATCATTTGGATGGA 
1 0 TATGTGATCCTGCTGGGAAAGGCTTCTAGGGACAA 
ACCACAGAGGTGACCCTTCCTTTCATTTC 
AGTTTGGTTTTTATGGTCTTTATTTTCCCAACTGGGAGGCT 

GAGGAAGTTTAGTCTAGTATCTCCATCGCTTGGGGGTCTCGAGATGATGAATGAAAAAGCATAAG 

AGAGAAGCAGTTGGCrCGTTGCTCTGTGGTCCAGCTGGGTCTCTTGTCCT 
1 5 GGCCCGTGTCCAACCCTGCCTACAGTGTTTCTACCCAGCC1TCCTCCAACACCCACG 

GTCACTGGTTTGGGGGCATCTGTTTCATAGGGAGGCCTGAAACCCTTGCAAGG 

TCAAGAGTGGCTGCTTACCCTCTGGCTGGTGGATGAGTGCAGTCGGAG^ 

CTATGACAGTCTGCAGGCTCTGGAGTCAGACCAGATGAGCCCTCATAGACCTG 

TCTTCTCTTCTTTCCACCCACAATTACTCTACTGACTCTGGGAACAGG^ 
2 0 TGTAGACCAGGCTAGCCCCAACAATTATGAGAACCTC^ 

CTCTGATCTTCCTACCTCTCTCTACAAGTTCTGGGATTAAGGGAGCAC^ 
TTGTGTGCATGAGGC^TGAATTGAGCTACAGCCC 

GCCAAGTAAATAAGACTTGCCTTGAACTTGTGGTGACTCCCCTGCTGCAGC^ 

2 5 ATGTGTGACATTGCTGGGTTTTATTTTGATCCTTTGTTCATCCCTACAAACTA 

GGGAATGGAGGGAGCGAACCTACAGGTAGGAACTTGGGAAGCCCAAGGGTAGCCAGGACTTTCA 
ATCCTGAGACCAAGATCTCTTTGACACAGCGTGGCTAGAGCGTCCTGTGCT 
TGGCTTACTGCCGTGGTCCTCZAGCCTCCrGCACAGGATGGTCTGCGCTG 
CCTGGGCTTAGAGCGCCAGGAGGTCAGATAGCAGGGCCTGAGGATGCTAGAGGCCC^ 

3 0 CCTTGAAGTCGGGGAAGCTCCTCAGACCTCATTAACCTCCTATTGTAAGGAGCCTC^ 

TCGCACTGGTCAGCCirAGTTCTAGATCTGTCTTCCCTGGCCTCCCAACAG 
GGTGAACGTCTGATCTGACACACGGATGCCTGGGGACGTTCA 

GAG CTGTAGTCCCCTCCACAAGGGAGAACAGAG CAGGGGTGGTTCAGAGGGGTGG AGGCACGTAGGTGAGATTGCCAATTGGGTGT 
CCAAGAGGCAAAGATGTCATTGAAATGACAGAGCCAGAGAATCTAGTAGAGGGAGACTGA 

3 5 ATTCAAAGCAATTCACAGACCACTGGTCTTCTGAGAGCTCGGGTAGTGTGAGAGGGGAAGG 

ATCCTTT CTGCAGCGGTACAAGCTAGAACGCGTGGTAACAGGGTGGTATGAATTTCCC CAG AGGTGTGGAAGGCTCCTGATTGGTC 
TGATTCCTCAGCCTCCTCGTGAGACACTCCATCAGGTTGGACCT 

CAGACAGTCTCCACATTGCTGCCTGAGGGATCTGAACTCAGTAGCTGACrrGTCACCCCTGTCT 
TGGTGTCCTACTTACATCCCTATGTCCAAGGCTCCTTCCTTTTGAGAGCA 

4 0 CCTTCTTGGTCCAGGTCCCACCCACTGGGAATACCCTGGATATACCTCTTTT 

TCGGTTGG AAGG AAAAAATGATTTT AAAGCAAT AACAGCAACAACAATG C C AAGTG TG GTGGTAT CT ACCTG TCATCC CAGCAT CT 

CGGAAACTAAGGCAGGGTAATTTCCATAAATTAGCATAGCCTGGTCTGCATAGTGAGTTCCAGGCCA^ 

AACCTATCn'AAGGTAATATAATTTAATATAATATACCACrCCCCTATGTGTTTTG 

TGGGTTTTCTCAG CTCTG CTTGGAAGTTGCCTGTGTCCTTGCTATTGTCCTTT AGT CCCTGTGTGCATAGCCCTGTTATTAG ATT A 

4 5 TACAGTCCTCCGGGTGAGGGGCCATGACAG CGTTGAGTTAGTGTGGGACACATTTTGGGTGCAG AATAAGTGAATT CTGCCAGAAT 

GAGTCATTAGCGGAGAGTGGATGGAGGCCGTTAGACACAATGGCAGC^GCATAGGAGGCGAGATGAAGCCA 
CTTCTGAGTCTGCCTTCTCGTTGGTGCAGTGCATTAGCCTGAACCCCTTGGAGCTTT^ 
TGGAAAACAGCTTTGCAGTCTCAAAGGTGGCCAGCrCCCCTGAGAAGCCCA 
GCTTGCTATCTTGGAAGAGGCCCAGTCACCTAGAATTCTCTGTGGTTA 

5 0 AGCTCTTTTGTCTCTTGAGGATGCCTGGGACATGGCCCTGCTTTGATACAGAGACA 

CATTCAAGAGAGAAAATACCAGTAAGAAGGAAGGCTCCAGTGTGAGTCCAGCACTCAGCCTGAACAACAAG^ 

CTGTATGATCCCAAACATATTACCCGGCCTCTAGAACCTGTGTCATCACCTGTCAAGTGGA^ 

GCAAATT AAAGG ATGT ATTTCATG T AACGTGCTTTAT ATG AT AC CGGG CA C AG AGTCAGGG CTCAAT AAATGCTGG CCTT CACAAA 

5 5 TAATAACCATTGTGAGTGCCCTGGAGATGGTTTCACCCC 

TCTTAACAACTGTCTCCCCAAATAAGTGCACCCTATAT^ATCTTATTCACCCTGAGTGTGCACCTG 

AGTGTGGGTTCTGAGGGAGCCTTGGCCACAGGTCTGTGTTGGCATCTGCCCTCACGCGC^ 
TAATTTCTTCAAGCTTTGTGGCACATGCCTGTATATCTGTCAATTCAAAA 

6 0 ATATGTCTTTAATCCCATCACTGAAGAGGCAGAGGCAGGTGGATTTCTGT 

AGATCAGCCCCGACTACATACATTCTCACCAAAGGGGGTAAGAAAATGAAAG 

TATCTCTATCTCTATATCCATATCTATATAATTGGAAAGGGC^ 

AAGTCTTGGGTTTAATCTCCAGTACTGAAAGAAAAAAAAAATCTA 

AGCCTCCTGGGGTGACAAGATTGGGCACAGTX^ CCTGC AG AATGTAT CCTTTC CCCG 

CTGGGTTATGCACAGGAACTCAGAAAGATGTGGGTCCCTGTA^^ 
CCAGCATGTCTGGGAGCTCCTGTACTCACCAACATGTAAACCCCACAGCAAGA 
GAAAACTCTCGGAGGAAGAGAGATGCAAGGAGGAATGTGGGCAAGGGGGTGGGGTC 

7 0 TGAAAGTTTTTTCATGGTGACCTTCCTTTTCGAAT 

GAGCCGCCTGTGATTTATACCTGGCCTATGTAGATGAGGGAC^ 
GGCTGCTGCGGCTGTCAAGATGGAGGCTTGAGAAACCTGG 
ATTCTGGGTACCAGGATGTACGGCGTAGGGCGGAGCAGATAGGGCCCAGCGCT 
TCTTGGCTTGCTCCTTGAAATGAAGCGAAGACCCACTGAGCACCCAGACTTCT 
75 CTGTGTGTAGCCMCATTGAGGCTGAGCTGCCCTTCTCTGAGTGCTAGAATACCT 
GAGTGGGCACTCACAAACCCAGAGAAGAAAGGCTAGGCTTGGTGGAGAAACT
TGTGGGTGCGGGGCAGGGGGTGCGCAGGAAGGTGTTGAGG CTCGCT 
GAACTTGCGCTCAGCCTCCAGAAGGAGGGCGGGGGAAGTCCAGGTGCCGGC 
GGGGACGAGGCGO^AGGAAGACCTCCTGGGAATGCAGTGGAA^ 
5 CCGGGTGTGGCGCTGCTCCCGCCCCCATCCCTACCCGCCTCACGCTCCAGCGCGCACCTC 

CCACAAATGAGTTGGGAGTTTTCTATTGTTCTTCAAACTTTTATTATGG 
TTAGGGCTGGCAGAGCATGCTACAAGGGCCTGATGTTCGTTCATCATCACAAAGC^ 
GGGGGCAGGAGATGTCGCTCAGTTGGTGGAGTGCTTAC^ 
1 0 TGGGGAGTCAGAAAGGTTATAGCTAGTCCTGGGTCAGTCAGC^ 

AAAAGTAGAGGAAAATAATATAAGGAGCAAGAGTTTACACACCTTAT 
TGTCAAACAATTACAAAGAAAATTCTATTCCGCATGAGCGCT 

TGCTGGTTAACAAGACGTCTAAGAATTCCCGATGATTCAGGTTGCCTAGGTGTCCTCCT 

C^CAGGCTCCACTCATTTCTTTTCCCTTGTCTTCATTTTAATCTC^ 
1 5 AGAGAGCTATGGTTTGCAGTCTTTGATCATGTGACCTGTTCT 

TGTCTCTCGTCITCCrGACAACTGAGGGTTACTTAGGCGAACCGTC^ 

ATTGCGCCATCTCCAGCCATCCAGCTCTCCTGCTTCTTTGGAGGTGTTTAATAGG^ 

CCAGCGGGTTAGGACTGGAGTCCATGTGTCTGGGTTTTGC 

GGCCTCAGCAGTC^CTGGGCAGTGCTCATCCTGGGCCCTCCAAAGGGCATC 
2 0 GCTCTGGCTCAGCAGATCTGAATGTCTGGATCTGGGAGTTGGTAGTTCTTATAGATTATA^ 

CTCAACATCTAGCACACAGCTGGC CCAGTGT AATTGTCCCTGAAATG AAGATCTC CTGGTGG CC CTCAATGAG CCCCTA 

GAAAGATGGTTTCTGGTGTGAGCTTCCAGCCAATA 

GTGCAGCAGTTGTGAGAAAAGTCCAGAGAGCCAGC^GGGTGTCTTCTC 

AAGACTGGGGAGGTCAGTGGGAGCCTCTGGGTCCAGTCCAAGGGGGTCCTGGGCCCA 

2 5 TCACCGGTGAGGGAGGGAAGTTGTCCTCTCAAAAACCTCAACT 

TTCAAGATCTATCTTCTTCTGACAAGCTGAAGAG^ 

TTGAGGGGTGACTAGGAGCTCACCGTGAGATGAATGGCCAGGGAAGCTGTTT^ 
TCTATGAGAACTGGGAATGGTTCTGACCTAGGAACCAGCTGT 

TCACTGCACGTGGGAGACTGCCCTGCCGGAGAGGGTCAGC^GGGAGGAAGTGGTACGCCGCT 

3 0 GG CTTAAGGAACGTTTATAGTTCTGATTGGCAGGTCAGTAGGCAAAGAAATAAATGGCAGA 

AATCAAGCTGAGCTTGAATTTCAAGCCGAAAGCACAGGCCAGCCAGCTCTCAGCCT 

TTCCTACCCCAAACGGGACTCCTCACACTTGGTCAACTACAGATT^ 

AGAGCAGGAGCTX3TGCTGGTAACAGAAGGCGTTCCAGTGTGCT 

GGAGATGGGGCTCTGGCTCGAGGTTCAAAGGCCTGGACCATCCTCCTTTCACCTTGAACCAGTGAAGGAC^ 
3 5 CACTTAAGGGGTGTGTGTGTGTGTGTGTGTGTGTGTGTTGAGAAAAAGAAGTTAATTGTCCCACCAACCTTTGGCTTTGACA 

ATGACTTGATGCTTTCATGAGT AGCCTCTCAGGCAGCTGGCTATGGGAC CATTTACAGGAGG CTTCG AGTGGCTTT CCAGGG CCTC 
TTACATTCTGTCTCTCCTCCAGCAAGACft^ 

TGATCAACCTTCACGCAGAAACCCCCGTTCTAACTAACCATCGCTT 






AACTGTGGCCTCTGGGGCCACAAAGGCAAGGATCAGACCTGCTCAGTGGGGGTGGGGAAGGG 



CCTTCTTGGGGTCTGAGGTGAGAGGCCCCTGACCTGGACCrGCT 
TAGAGAAGGTAGACTAGGCCAGGCTGCAGAAGCCCCTGTCTCCTTCrrTTCTTGCCTT 



4 5 GCCAGGAGGGACAGTCCTTCCTCCGTTCTCCACTGTGATCAACTCCCCCTTCCTATGAAATGGGGTGGGGGTGG 

AGGACACCTCAAAAAACAGGTTTCAACATTTCTVCAGGTTTCAACATTTTTC^ 
CTTGCAAGCAAGCCCTCTTCGGCACAGAGTAGATGCCGGTACATTCCAGTCACGCTG 
GCTTCTTAGATGGAGGCAGGATTGATACCACTTCCCTCTGATAGGGTCACCCCTAAGAGTGACCCCCGACT 
TATCATCTTCTTCGTCCCATAGATGCCCAGAATCCCGTCTCTTTGGGGTCCCCAGCAAGTG<!ACTGCTTCCT 

5 0 GGGCCCTGAAAGGAGCTTCTAGAAGTATCGGGTG AAG ATG AGTCT CT^ CCAGCA 

GAGCCAGT AAAGTTCTGTTGTTTG AGCTCCACCGGGAC CTCAGGAAGAGAGGAAGAAGGTCACCCTGCCCTGCTCAATGG CTGCTT 

CACCCTTCCTTGATGTTCTCCAAAAGACTCTGCTTCCCTGCT^ 
AAAGCCCAGAGGCTCAGAGTTCAGTCTGAGAAGCACACCCGGAGGCTGGTCC^ 

5 5 TGGGTGGGAGGCCACCCTGGGGGCAAACTCCACrrCTGTTTGTCAGAAAGGAGTC^ 

CTCTCACCT AGGGAGG AAAC^GACAAGG AACAGAGCAGAGGGAG A CAGGCTGTTTG CCTC 

TGTAAGTGCTGATGAAACGGGAGCCAGGAAC^CAGGCTTCAGCCCTAGCCGTGTGAGGTTTGGGCACACCA 
ACTCCTCCCTGCCTGGTATATGATATCAACAGTCCCAACCrGAGATTTCCAGCT 
TCAGGTCTAGAGTGCCTTTCCACTGTACCTCCTGAGCAGCTATCCTCAAGAGGAAATGCCTC 

6 0 . TGCACTTGGCCATGGTCTTCAGCTGTCACTGTCCCCAAGGGT^ 

GGTCACCCTCGTTCAGACCTTCTACATCCCTTCTCTTTG^ 

ATGGCCAGTGGGCCCATCOVCCGCCATAGCCTGTTAGCCTCTCTGG 

GATGTGTTGCCAGGTCCTCGAATCTCCTTAGTGCCAACTGTCACACC^ 

GCCCTTGTCCC ATTGTAGGTGGGAGATAG ATCAACAAACCAAAGT CCAGCTCTGGATG CTTCGAGCTCTG CCAGCTTGCC CAG AGT 
6 5 GGTATAAAACCTGGCAGGTGGGGCCCCTTGCCAGCCCCCCTCCTCCTGAATAGTCCAAGACGTCAGGTGAGCGATA 
TGGGTACACCAGCTGTTGTCCCCAGCCTCCTAGTTCGAGGCTG CT 

GGTGTGTAGCTGCrCCTGATCACAGCAGTGGGGACrTACTGGCCCCACCCTGGGAGACTCIAGTGTTCT^ 
GATGACTGAGGACATAAGTGACAAGTAGCCAAAGATCATCTCCTGCTCACGTGTGGCATCCA 
TGAAGAGGGCTCGTTGGCCTGCCGTTGACAGATGTCTCCTGGCCTGGGTGCT 
70 GCACCTGGAGCTCrcCCAGX3TTTGGAGT(^TCTCTC^ 

AGGACTTGAG CCCAG CTGCCGAGACC^AACCCTCCAGTTGCTT ATCCTCG CAGGCCCAGACT 

GCCCTGGGCTCCACCCITTTCCCTTCAGCCCCrGTAATGTGAGCTAAAGTTATCrCTG 

ATATAAGGGATGGAGAGAGGTCCGAGGACTTCCCATGACTGTGAAGGGCAGCATCT^ 

GTCACTCGGAAACAGCCCCTTCTTGGTCATACGGTGCTGGGAAAGGTGTAGTTATGTGCAGAGGGCCCTGTGTGTTCCAGATCA 
GGCTCACACAGGGCCCGTAGAGACTCTAGAAGCTGAGAGAACCCTCGCCCACCAGACTCAGCT
CTCTCCCTTGTAATAATCAGCAGCTCTAATTTCCCT^ 
TTCTGGCTTCTAGAAGTTTCCATATTGCCCAGOAGGTAGAGTTCT^ 
AGAAGTCAGCAGACAGAATACCTGGTTCTGTGCCCT^ 
5 CATTTTATTTCATGAGAGCCTAGGCTTTTGGATCAGAATCTTGTGGTTATCT 
GAAAGAGAAGGGCAAAGAATCCGGTGCTCCCAAGAAGAGTAAGACT 

CCTGGGATTCACAGGAAGGGACAGAGAAAGAGCTTGTTGGCATCGAG^ 

CTGAGCTTCCCCGTCACCCTAGCTCAGACCAGTGCCTTTCTGAAAGTTCTCTC 
1 0 AGCGGTTGGGGACTGACTGCTTTTAAAGAGGACCCAGGTTCAAATCCCAGT^ 

CCTGACACTCTCACX3TAGACATAGATGCAGGCAAAACACCAATGCACATAA 

AAAATTGAGGGATTCAGGTAGACTCGGTCCTGGTCATTTCAGCTATTACT 

TCAGGCAACTACCACTATTATTTCTAGAACATTCTCTTCATCCTAAACAGTCATTACTGTG 

TGGATCCroTGGTATATAACAAATTCTCTTTTTAAATGTTGT 
1 5 ACAAAAATAGTTCACCATTAAGGCAG CCTGAACTTGGTGACACCC CCACCAGCATTCACCATCTCCCTAAGCTCTAAG AGCCTGTT 

GGGAAGCTCAAAACTTAAAAGCAAACAAAACAAGCTAGGCTGGGATACAG^ 

GACAGAGCCAGCCTGATCTCAGCAGGGACTCAGTCATC 

TTGTAAGTCCCTGCAGCAGCTAAGCTGCCCTGAGTTCT 

CAGGCACAAAGGCCTGAGTTTGTTCCCCAGTACTGATTTAAATACCATAAAATCGTGAGAAATG^ 
2 0 CCTTTCCACTTAGGGAAGTTGGTACTAAACTGGATC 

GAAATTCAAGATCATCCTTGGCTACGTAGGGAGTTCAAAGCCAGCCT 
AAAAAAAGTCTGTATTTCATGAAAAGTCTTATATATGTATATATACTTCTACATTT 
TCTTTCGGAACTGCTGCAGATCTTTTCTCCATTTTCCTTCTAAT^ 
AATGGTTATGTOTAGGTTGGATGAGCACATCTGCAGTCCTGGTACTCAGGTG 

2 5 ATGAACCACAGACTATGGCCATGTCTCAGAGCAACAATATCT^ 

TAAGGTCCCAGGCTCCCTCTGCAGCCAGGAAAACTGTCTGATCTTGTCACCATC 
GACAAAGAACTGGGTGTCAGTGGGAAGCTGACAGGGCCATGGGGATGTTTACAGTCT 
ATCATCAGCCTAAGCCTTCCTGACATCTATGGCTCATCCACCTGGCT 
TGAAGACGAGCTGTCCCreGGGCTGAAGACAACAGAACCTTGGGTAGAGC^ 

AGCCTGGTCACCATCTCAGATCCTCACTGGAGGAAGTCCCTCAGCGGAACTGAGGAGTCTTGTCCTGTG 
G CTGCCCGAGTGGGGGCATCAGGCAGG CTCTTCGGGGTGTCCTCT ATTTTCTTAACT 
TGAAAAATCAATAGCTATTCCTCTTTGTGTGTGTGTGTGTCATGCTGCAA 
AGAGGGTACGGGGACACTAGCAAAGGTTACTTCTAGCTTGGGAGCTCACAGAAGTC^ 

3 5 GGGTATGGGCTGCAGAGAGGCACTCAAGGAGAGGGCTCAGCATGCGTGAAGGGTGTGAAGCTGT^ 

CT AAT CAAAGCCTCTCACAGTGGGG AGATGG AG AAGAT CAGGAG CCT CT CCTGTTG ACTCTC C C AGGGG CACAAG CTCTTCTGT AA 

CGGGGACAAGAAGTGGGTTACACTGATATGTCTGCAGCAATGGAATC 

CATGTCAGGGGGGACAGCTGTCCTGCAGAAGTCAATCCCAAGGCCATA^ 

AGGCCCTAGCTAATTACCACAGGATATCCAGATAGAGCTTGTTGAGACCTCTGACCGGAGCATCCT 

4 0 TGGTCAGTTTGAACATGGATCCAAGTCACACATGGClTTGCrrACGCTCAGCCTGT 

GTCAGGACTGCTGGTGAAG AGACTCTGCCGTGTCTTGG AG CCTT CCTCCrCGTGCTTGATGGGAAGTGG CTTT AGAAATTCTGATG 
TGCTGTATCAAGTCAGGGAACTGAGGCTCCGGGAAATGAGGTTGGCGACCCGGCAGGGACCAAG 
GTGGCTGCCATTGCTTGAGGGATCCTGATGTGCATGGCACTGAGCTGTACATGTTTACT 
AAGAAAGTGTGGTGTGGAGAAGTTAAGGAAATCGCTTGGATTG^ 

4 5 GACAGGCTTGGAAGGAGGAGGGAACCTAACTCTGGGTGACAAG 

GTAGGGCTTAGGGATGAGTGGCAGGCC^GGGCTAAGTGCTGGCTTAGTGTC 
GGGCCTGGTTGGGGGCAGGGGTCTGTAGGGAAGGAAGAGAAAAGACAAAAGACATGCCA 
CGACCCACAGGTCCAGGXIAGCTTACTTAGCTCCCTTGCCTCTGTTTCCCC^ 
CTTTACCTCATAGACTGGTTGCGAGGATTAAATCAGTTAAAGACG 

5 0 CCATGTTAGGTGTTTTTGCGGGGCTCCCCrATGCCTTTATGTGCTGTGTGGCTTO^ 

AACGTCAGCCCACCAGCGGGTAGGCAGGCGAGGCCTGAAGCCACCGAG 
AGGGAAGGCTTGCCTGGGAGCCAGGTAGTCTAGTTTCTAACA 

TGTGTGCTGGGTTGTTATCAAGCAACTCAAGGAAGTTAGTGAAGAGTTGGTAGTGAGCCGTGATGGTGTGACACT 
GGTCACCTCATTCTTTTTCTCTCACTCITGGCCAG 

5 5 CCTCTTGCTTCTGGCTCCCAAGTGTTCAGGGATTACTACCCACCTTCCTAGCTTTC 

GGGAGAGCTCAGTGGGAGATCAGTTGTCTGGCAAGCTTGAGGCTTTGAGAGGTGGGGTT 
TTCAAGGCCACCAATTCAAACCCCCAGTTCTGCCTACT^ 

CTGAGGCCAGAGGCTGCTGGAAGTGGTGCTGCTGTGTGCCATGTTTTGCTGGAGCCTCT 

TG AGTGTGGGGG AAGGGTGTTGGG AGGCAG CTCACAGCAT ACTCTGATTAGACTCTGATG AGCCATATACCTGTTC CCAAGAAGCC 

6 0 TAGGGCAAGCTAGTTTGTTCTGGAAACCCAGATTCCTCATCAGGGGACAGAGATGATCW 

AGTTCTAAGCAACTACAGAGTC CCTC AATGGCCGCGTTCGGTTCACCCATGCCAACGAAG CTTTTGTGTCTCTTCTG ACCTACCCC 
AGAGTTTGCCCCTGGCCCTCTTTCCAAGGCTCCTGCATCCAGGACTTGACCTTAG CTAGTG CCT AGGTCTCCTTCTTCCAGGACAC 
TCTGCCCTTTATTATGAGCCTGGAAGAGGGAAGGAAAAGGTAGAGTTTCTGGTAGT^ 
TCCCCCCCGAGTGTACTTACCGGCGCTGTAAAATCCCAW 

6 5 AGAAGGTTGGGGGGATGCCTGCrGAGAGTTTCCTTTTGCTCTTGAGTTAAA 

GGGGAGGTGGCACTCTAGAAACTTCTAGGGAACTATAAA^ 
ACAAACAGGCTCTTGGTGTCCTCTGTACTCTTGGGGAA 

AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAACACrGTGCTAAGACTTATGCT 
CACAATGGCCTCTGCTTCCTTTGTCCTTTATATTTGGATTCCTTCCCATGAATGGCT 

7 0 G AGTCCCTGGG AGATTGCTCTGCAGGAG CCCTGC(IATCrCTACCAGGGCACCGGGGCCCAGGCACTGG& 

TCCAATTCTTCATTTCCAACCCAGGGTTATTTTTGTGGCACAGTACAGACCT 

CCACATG CCC^TGTC!AG AGGGACACTGCTTGAACCGGTGCCGTG CCCTTTGTGGG ACACCGTGTCCC CAAGGAAAGTCAGAAAGCA 
GCAC^CAAGGTGCCAGGCTTCTC^GAGCCACGGATCAT^^ 
AGGCCACGTGTTCTGTGTCTACAGTGCC^TCrCCAT^ 
7 5 TCCCAGCCTCIGCTCTCCGGCrCC^TCCCAG^ 
CGTAGCAGAAGGTGTCLAAGCTGGGCAT^
GGGTT^TGGGGACCCTGTCCCCTTCCTCC 

AGTGGGTGGGGT C CATCTCTGGACAGGACAGG CTGTGTCCACATCGGCT ACAT CTTTT CT AGTGCTACAGGGGGGTGGGT AG CCCT 
5 GAGGTGCCCCAGTGTCCTGAGATTGGAGGGCAGGCACCAAGGAGC^ 

TTTGATGTAGACATGTGTGACATTTCCCTTATGCTGTGTTAAGTGCT 

GGCCTAGGTGTTGTGCCATG<3AAAAAGTCTCTATGCTCTGTCAAAGCT^ 

CCTGAGAGTTCCTGTTGGTTCCCATAGCCTGAAAGCAAGACX3CGTCCATTAAC 

GAAGTCTGACCAGCTGTGACCCTAGTGGATGAAAGCCXMCTm 
1 0 TGCTC CT AAG AAGGGGACTTTGGCT CT AG AACCTCTCCTGTC CAGGAATT^ CTGGG CTGGC CTCGACTGG AGTTT 

TGCCAGCC<lATGGGGGCGAGCCrrcCTCTGAGAAGGTTCCT 

GAGGGTGGGCGCAGAGTTTGGGAGAGGGGAGCTATCCCCCCTCC^ 

AAAGTGTC1X3TACAACCTACAGGCAGAAAACGCAGGCTATTGTCGTGGGACA 

TGTTTACGCTTCTAATACCAAAGCTAATTATTCCTTGTTTAC^ 
1 5 TGGCTTTTCAAAACTGATCTCGCCCGCCTGTKK?TCTCAC 

GCCAGGCGTCCCGACGCCTCCCCCAAAGCAACACTGGCCTTCTGATCACrCCTC 

GCATGATACCCGCTCGCAGTGGGTCTGATGTTTAGATTCGCAGCAG 

GTTTGTGTCCCAGTGTGCAAGGAAATATGGGGCTGAGACAGTGGGATTTGATCAAG^ 

CCGCGTCCCGGTACCCTGTGGGCAGGCGGTGGGTCCTGGCCACT 
2 0 CX3CAAGGCCGCGGTCCTCCGCGGCTCCGGAGGGTTCGA 

GCCCTCTGGCCGCCGCCTCCCGGGGATCGCGCTGCCC^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNCCCAGGCTAATCCCCGCCCCGCCCCCGCCACCCCCACT 
GGGTCGCCGGAAGCCAGGGGCGCCCTCCCCTGGGGCGAGAGGGC^ 

2 5 GGCTGGACTGCTGAGCCAGCTCTCGCTCCX5CGCTCCCCGCCGGGCTGCGGGAA 

ACGAGGGCTTCGTGTCTGCAAAGGCTGCTGTGGCTACTGAGTATCTGGTTCCCT 

GAGATCCGGCTCTGACCACCTGTGCAGTATAGCACGACC CGCTCTGGGAACCT CCTCGC 
CCGGCCCTGTCTGTAAAATAGGGTTTAGAACATTGTA^ 

3 0 CTCGACCCTCGTGGCTTCGTAGGAGGAGGGCAAGA 

GAGGCACAGATGAGAAATCTGCCTACGGGGAACTGAGGTTCGAGTGGAACGAGTGGATTAGTAGG 
CTCCAGAATGGAGGACTCAGAACTGTTAGGAGGAGTGTTTCTG^ 
CCAACCTAACTC7TTCAGAGAGCATTCCGGTCCCACCATAGACCCCTCCATCTTAGTAGC 
TCCACAAAGAGAATACTAACTAGGGTCTGAACTCTTTAGAGC^ 

3 5 TTTGCGCTCTTCCCCAGCCCTCAGGGCAAGGCTGTGTATGTACCCTAAGGCTCC^ 

AGCCACCCGTGAATCGGCCTTCGTGCATGCCATCGCCTCGGCTGGTGTCGCCTTCGCAGTCACACGCTCC 

CCACCATCTGCGGCTGTGACTCACATCATAAGGGGCCACCTG^ 

GGGGTGCTGGTGTCCCGGGAATTTGCGGATGCGCGGGAGAACAGGCC^^ 

4 0 ACCCCATAGTAACCCCCTGACCACCCTCCTTCCT 

AGTGTGTGTATTCCATGTTGTGTATGTGGAAGTCAGAAGGCAACCTGTAGTGGTCAGT^ 
GAATCATGCTTGATGCTGTTCCTGGATCTTTTGAGAAATCT^ 
GCTACACTGGTTGGCAGGCTGTGCCTGTGTACGTCrrGTGATGTGGTTC 
CATAGCATAGCCCTTCCTATGCTCGTGGAACACTTAGCTCT 

4 5 TTTGCACCACTTCTATGGATGAGGTGAATGTGAAGCCTGCTGGGGCCGGGGGTGTG^ 

CTGGCACCAAGTAAGAGTAGAGGAGGGTGTGGTACAGAGGGGAAGCTTTCTCCOM 

TTTCCCTAGACCATCCTGGACCACATGCACCTAAAGTGTAAATGCCACGGGTT 

GGCCCAGCCCGACTTCCGTGCCATTGGCGACTTC 

AGTCCCGAGGCTGGGTGGAGACCCTGCGGGCTAAGTACGCGCTCTTCAAGCCACC 

5 0 TCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGCTCCTTTGGTACCAGGGACCGGACTT 

TGGCTGCGATCTGCTGTGCTGTGGCCGGGGCCACAACACGAGGACGGAGAA^ 

GCTGCTATGTCAGCTGCCAAGAGTGTATTCGCATCTACGATGTGC^ 

GTGGGACTGCCCTCTCCTCCCATCCGCTAGGCCTGCT 

AGAAACTAGGCTTCCCACGCTTCCACGGGAATGGTCITGGGGAGCGACTTGACGTATTTA 

5 5 TGTGCACGTCAGCCACAAGCAATTGATTAAAATTCTTACACAAATTAGCAGTGATGAGCCAG 

CTGTCATTAGTGAGCATCTATTCGGCCAGAAGAGAGGTTGAGGTCTGGTCCCAGCCATGCCC 
TCAGTTT'CCCTACTGTCTAAAGGAGAGGTTCAAATGTCAGAAACAGGGGC^ 
AGGAGGCTGAGGCATCTGGATCTGAGTGTGAGGTCTGGCTGGTCTCCATAGAGAGTTCTAGGC 
AGACAGTCACCTAGTAAGTCTCTCCTAACCCTGGGGCAGAGTCATTTCACATCACCCATGAACT 

6 0 CACAAAT CCT CCAG AGCTTGGG AAGGTGGGTGGGGGAAG CCAG ATCTG AAG CTTGGGTGG AG CGGGAG CACTCTCACCAAGTTT AT 

GGAGGGAGACAGGAAAGAGGAGAGCTCCCAGTTCAGAGCGGCTGGGGTGAGTGACCGTGGGATAAAA 
CAAGCTTCCCAGAGGTGGCCTGCATGCGTGGGAGGACCCCAAGAAAGGTCAGCCTGGAAGCAGAGTGATCCTAGAGGC 
GGGCCAGTGTGACCCTTGTCTCTGTCCCCTCCCCTGAGAGGGCTCTGCTAGTGAGTCTGA 
AACCAACAACTCTGGCTTCAGCATCTGTTACCTTCCCCGAGCAGCTGTGAGGAACGT 

6 5 GCTTTCTGTCCATCTATGGATGGCCAGCGCAGTGTGTTCCACCCAGGGACCTrGGAGCTGGTGACACAC 

GTCCCTCTAGAACCATACAGGAGGCGTGGGTTTTCTTTAT 

CGCGCCCACTGCTGAACCTCGCTCCTCATCTGCGCAGTGATGGCAGTATGGGGATCTTGATGGAT^ 
GGGTGTGAATGCTGCTCTAAGTGTCAGGGTTGCCATCAGCGTTAGCACGGGGTTGCTGCAAAGGAA 

7 0 TGCTTTTCAAAGGGATGGGAGAAAAGTCAGCAAGTAGGAAGGTTGAGGTC 

TTCACTCTGACAAGAGTTAGCCCTGCACCCCCCAATATCAGATAGTACATACAT 
GCTTCTTATCACTCAGCTAGAAGGGCCATTTGTCCT 

CCAGGGGTCTGCTTCCAGAAGCTTCCTATGTAGGCTTGCATAGGCCTCAGCATTG 
GGTCCAAGATACAGAGCCCATGACTCCACTCCGTCCTTGCCCATGTGGGCAGCTGAGATCTTCCA 
AGAAC^CTTACACCCCTGAATCAATCTCCAGTACT

G CTG TTTTTCT AGT ATCATGCAGACATGTTTTT AAGAAGCT ATG CCCA CATGGGG TGGGG 

GGTGGAGGGGGGAGGAGGACTGGACTCTCAACCTCACACTTCCTGATAAGAAAGATCCCT 
GAGCCCAGGTTTCCTCCTGTGTGAAGTGATAAGGTCT 
5 TCTGGGCTCCTGGTTTGGAGCCATAGGCCAGGAGATG^ 

TGATGG AAGC CAAGGGACAAATGGGCAGGGAGACAGCAGGCAT AGAAGGCT 
GAGCCCAGCAGGCCACATGCCCTTCTCiGAGAAG 
TCCATTGCTTGAGGTTTGTCCACTO^ 
CTTTTTATCCCTATGTCGGAAAGGCTATGAATCTAGTCA 
1 0 GGGACCTTAGGCATATCCAGGTCGTCTTGCTATGCCTGGTTCAGCTCTGCCAGC^ 

GGGGTGC CTGGGGCGTGG CT CAGT ACCTCAGATC CCACGGGAT AGGG CTGTACAG AGTGAT CATGCTCAGG AAAG CACCCTGGGAG 

GGAGTTCACCATAACTAGGTAGAGTCCGTTTCTTTTGGACAATTCACGG 

GACTCX5TGGCTCAGACTGGCAGGCITTTCCGTCTC 

TAGGAGGAGGTTTGAGTTGTGTGGATGCAGGCTCTGTTCTTCCTGAGGATGGCATGTC 
1 5 GAGGGAGGCAGGGCTGGCAGCGTAGGCAGCCTGTGATCTTT 

CTTCCTGTTTATTTCCCAAAGGGCACTGGGAAGGGGTAGATTGTGCGGCTGGATCC^ 

TAGATCCAGGCCAGCCTTCGGCACTGGCCAGCAAGGAGCATGGACTGTTGCC^ 

CCTCGGACGGACGGGCGGCTTCTTTCTCAACTAACGTCTCTCCCCCTGCTCTGGATG^ 

ATGGTTTTACCAGGGTCTGCTGGGGACAGACTCGAGGCTTACCTTTGCACATGTTAAAG 
2 0 CCGCAACAGAACAGGCTGGGCTAGTGTGAGCTCTTGGCCTGGTGGGAAGGACAAGACCATGG 

CTACTCGTGACATTCCAAGATGCCTCHX3AGGTGGGAACTGTG 

ATTTAAATTGGACATACCTTGTCGTTCTGAGAAAAGCCATAGATAGGTGTAGCT 

AGTGGGAGCAAGATCTTGAGTTTTGAAGACCTCAGAGTTCT 

CCTGTTTTCGCrAGCCCTGTTCTGCCCTGGAGCGACAGTCAGATCTCCACGCCCCTTTCT 

2 5 ACGCGTTTTTTTTTTTTTTTTCATGATGACCTTG 

AGAAAGAATGGGCTGCTCTGCCCTTCTCAGCCTTGCTAA^ 

CTGAAGTAACCAGAAAGACAAGAGATCCTTAACAAGCCCTTCTTCCCACTTGT 
TGGATTAGATAGATATTCAAAAGAGGCAAGCAGCGAATGGAGGCAGCTCCCAGCTCTGTTCCCG 

3 0 TTTAGTAAGGTGGGTGGGGCTGCACGGATCAATCCATCAACTCCGTCTTAAGGAGAATCAGAAAGAGGAGATA 

GGGCAGAACAAAGAATTTGTCCTTTCCCGCTTCTGTCTAGGGTCTGCTAATGCTGGCTTGACGAG 
TTGTGCAGTTGGCTTGCCAAGCAGGCTCCAGTAGGCCCTTGCCTGCACTCTCTACCATGTC 
CTCCCATCCCrrTCCTAGCACCCCAAATGCCCCTTCCCATCTCTCCTTCCAGAAGTTGG 
TGAGGACACTTGAGCAGAACGGATACAACAATTTACAAGTCTCTTCATATCTATGTATTCTAT 

3 5 TCCGGGGCGTATTCAAGTAGCTGACAAGTAATTATTT 

TAGCATCCAATGGGAGGTCCCTACCAACCTGCTGTATCCAAAGTTTTGTAAAAAGTTC 

ATT C AAAAAGT CTCTTTTT AT AAATATT ATTT ATT AT ACAATG T AT AT ACCTTTG AG TT AACT AAG ATT AT AT ATT AT AT AAAT AT 
ATATATATTTGGAGAAAATCTATTTCATCATGCAGTTTTTTTCTGTTAAATCAT^ 

GTACGATGTGTGGAGTCTCCAGATTCGTCCTTAGGTCAAGGGTATCTGAATGTTGTGCTAATTAAATGGCC 

4 0 TCCTCTGCTCrCTAAGCAAGATGAGGTATGATGGACAGCAGATGTGACATGAGCCTTAAA 

AAACAAGGAATTGCCAACTTGGGGAAGAACAAACCTGGTCAGCAATTCATTCATAGACTTGAAAAA 

GTCTCAG ATCTTTCTACAAGGTTCAGCCCAGGTCAGTGGGTCACAGGGC CTGCCATAGTCTTACTCC CTTTCTCACTACATACCCA 

ACTCCAGTTGTTGGAGAAATAGGAATTGCTCAACTGTAAACAAAGCTGGACCAGAAACAGATAACAGCTGAACTG 

GGTAGGTAGGTTGACCAGATGGTGTTCTGCCTGTGCTCTGGGCCAAGTGACAATGGAATTG 

4 5 GCAGAGCTCTGGGAAGGGCACTCTCTCCTTGGGCTGGAGTTTCA 

AGTTTTAGGATGATGGGTGGTCCTTCACTrCTCAGACCrCCCTCTAAAATATCCCCTTCAG 

ATTAGAG ATAACGTATTCAC ACTGGCCATAGGAACATGAAGCAGT CTGCTTCTGACGGTGGGGACTG CCAACACCTTCTGGCC CTC 

GGTTGACCATGACCGCCTGAGCTTCCATTTGGTGCATCTTTTGCCCCTAGAAATCTTTTTCCACCTATGT 

AAAAGCAGAGCTCCTGGCTTCAGTGCTGGCCATCTTTGACTGCATTTCCCTTGAGCTGA^ 

5 0 CTGGGTTTTATTTTATTTTATTTATTTTTTTGACTGTATAAG^ 

TTTGTCTCCCATGATCATTGTATTCCATTAAAGAGAGTACAATTTATAGCTATG 

AAGTTT CAAGTTGTG C CT AAGTCGTGTGTGCACCTTTT AATTT CAG CAT CTGGG AGG CAG AGACAGGT AG ATCCCT CTGTTCTTGG 

CAGCCAGTGCTAGAGACGAGACTTAGTGGGTCTTAAAAAAAAGG 

TTTTTTTTTTTTATTGAAAGCAACACTATTCTG 

5 5 AGAAAGACAGCAAGGGAAACGAGGCGAGGCTGTTTTGGGTGGAGTGCTGGTCGACCTGTGTAGTGCT 

ATGGGTGGTGTGGGTCTAGATTAGGAAGTGAGTTCAGCAAAACGTATGGTGTTAAGCCATGACTG^ 

AAACAGCTATCTCAT CACTAATCAAT CACCAAATGTTCATT ATGGAG ACTGTGCTGG AG AATCATAAATACATGTCATTACAATTT 

TTCCTCGCTACCTCTGGGATTAGAGGTGTGCACCCTTCCTCCCGATCT 
60 GCATCACGACACACTCACACCCCCTGGGCTGTCACTGGGGCTCAG CCTCAGCAGTTGTCAGCAAACG AGTGGCTTTCAT CTTGC AA 

TAACTTCTAAAATTTAAATGCTGCTTTTGAACTCTTAGCAAACC^ 
GCCAGG CAAGCAAACTGAAGTCCTX5G C CTTGCCTTCTGTTTGGCC^ 
GACCAGGCTTTGATTATTAAATGATAACTGAATCTCCTCCATGATC 
ATTTTTAGGCAATAAAAAGACAGCCTTAAAAACACATCTTAGGCTCGATAGATGCCCCA^ 

6 5 GGGAGTGGGTGGGTGGTGGAGGAACACCCTCATAGAAGCAGGGAGAGGGAGGATGGGATAGGGG^ 

TGGAAAAGGGCTAACATTTGAAATGTAAATAAAGAAAATATCCAAGAAAAAAGAAAAAAGTATTCAAAA 
AAAACCACATCITAGATTCTAGCTACAGTTATGCATGTTTTTGGATAGTACATG 

AGTAAGGAAAAAATGAGCTCTTCTTTGGAAGATTTAGTATTTCCTGACAGCCGGTGTGTGTGTGTGTGTG 
TCATCAGAGCCCTCCACACTGTTCACAGGGCTATGCTGTGGGCTGGAGGTAGGAAGGCCT^ 

7 0 GGGGGCAACTG CTGTCCCCCAAATTGCTGG ACATGCCTCTTCTCTACACAGCATCCATTTCT CTTCCTGGTGCAAGGACTGGG 

GCTGTGGCATCTAGTATCTGAGAGATCCAGAAGGTATGAAGAACAGC 
AAGTCATTGTACCGCAAAGCAGCTTCCACCCAGCTCCTGAACAACCGA^ 

GTATCAAACGTAAATATGAAGATTCTCGTACATTAGCATATACTAATGAACTTTTGAGGGGTTTGT^ 
CAGGGTTTCTGTGCAGCTCTGGTTGCCTGGAATTCACCCTGTAGACCAGGTTGGTCTTGAACT 
7 5 CTCTAGAGTGCCTGGATTAAAGGCATGCGCCACGACTGCCCACrc * ^ *CxC 
TGAATGAATGGTTCCCAACCTTGCTAACGCTGTTTGCCTTTAATGCCGCT
GTTGGTATTTCATAACTGTAATGTTGCrCCTCT^ 

TAAGTTGAGAACrACTGTCCTACAGGGTCTTGGGACTCACTATGTAGAGTAGGCT 
GCTGGGATTAAAGGCATGTGCCGTCAGCl^GCATGCGTATCTGTGTGCCA 
5 GGAATTGAACTGAGTCCTCrGGAAGAGCGGTAAGAGCTCTTACCCACTGAGCCAT 
GATTTAAAAACAAAACAAAAAAAAATCTCCTATGCT 
CCCATTAGTGATTGATCCTGTTCCATATAAAATACTGGTCT 
TTGCATTACACAATATAATATCTTTTTCAAT^ 

AATTCATCTACGACTTTGTTGGGAGGAGAAAGGCAGAACCGCAGACAACA^ 
1 0 CCACGGGAATCACAGTTTCACAACACAATTCACTCTTT^^ 

GACGGCAAGACTAGGAGATGATTTGCCTTCTACTGTGGGGACAGTGTCAGAGTA 

CGGGCGGCTTCTGATGGCTGTCTTTGTGTTTGAAGCAGCAG 

TGACGACCTAATTGCATTCCTTCTAACCCACTGATTAGGCAAACCACCCT 

GTGTTGAGTACGCTGGAATGGATATTCGAGGGCn'GAGGG 
1 5 ATGTGCTGCAGACACTACAGATCACTTGAGATACAAGGAGGATGAATTGTAACT 

TAATAGGCCTCTAACGTCATCCTAAGGAACTGACCCACTTAGAGCCGGACGCTAATTAGA^ 

TTTTTTTTCTGTTTAGGGATGTACAGATTTCCATATGTATCTTTC^ 

TATCTCTTACCACAGAGCAAATTAATTACACTCAGAGCATATCACAGCC^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNITONN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNh^ 

2 5 TCACT AAGAAACACTCGAGG CTTC CTGTT CT CT CT AC ATTTTC CAC CATTTGTTTGCT ATTTGATGGG CACTTTCT AAAAGAAAAT 

GGTCACATTCCGGGCACTTAAAAGCAGGTTCCAGTGAATCATGGACCATCAAGTTTTCT 
AGCAAGCCAAGGAGAGCAGACCAACACGACAGCCT 

GAAAACAAGGCTATACAGGAAGGAAGGAAACCATCAGCGGGCTCATATGCGCTGGGGAAGACTTGGGACOT 

ACTGCGTGT ACTT ACGCG CCTT CTT CTCT C ATG AGGG CCAAG AATTTCCT CACACGG T ACTCAGG ATCCAT CTAAAAAGCAT CGGA 

3 0 AGAAACAAATCAGACGCGCGAAGCTGCCTTTTCTCCAjCAATGTC 

gtcgttcagaaccatgtcaccattccctagttgttgtttcgaagcataataaagtgggttatgttatt^ 
cccaacacggagaggctactggtcactcacaggcacctcagccaagcaccccctgcaccc^ 

acac cagccaggtgt caccacaacact agatcg tggtcact c acgg aggggtccag acatg c ctgggtgcattcatggtg ct ataa 
atcatgagagtttaaatgctaaaattcatatatcttattgaaggaggaggatagaacctgag 

3 5 gaccaaaatacagttagaactttttgttttgtttttcgagacagggtttct 

agaccaggctggcctcgaactcagaaatctacctgcctctgcttcctgagt 
tagttatgactttcacatactcactgaacrrggcattcttagctt 
agtgctgttggcctccccrctccgtacacctggcctci^ 
atggttcaagtcaccattactaattatctctc 

4 0 ttggcctgactagggtgtgtggggatctggaaaagaat 

gaaacacctcacttttaaagcatgttgcagggcagaagtct 
caagggctcagctcgcgcagccgtgggcaagagtgcactca^ 

aaaacatggaacgtaagtttcaaatgtatcaaagttaaggggtctvgacracaaagaactaagaa 
gagtcattttttttccccctctatgggactagaattacacatgaatcaattgtc^^ 

4 5 gctcgctggagtcagtacatgcaggtcttcgcccatt 

atggaattaagagtcaagagaaaggggaggaaaaaagcaacct^ 

catccttaacaccgagcaagttttaacaaaatcgagtcc^ 

cctctccactcgggctctgcccctgctcctgaggcagatctaaagagaagtcact 

gttcaaactccaggcctatg aagtgtcagtcx5ctcaagattaggccg ag cgccccactct cactttctacctcacacacataagta 

5 0 TTAACTTGATCACTGCGCCG CAGGCATCAAAGGGAG^ CATAGTTTCTCCT ATGCAATTA 

TTTTAAAAGATTGAACTGAAACTGGAAAAAGAAGGGCAGAGCTGACTTCTC 
ACCTGTATGTATGCCTTTACC^GGAATAAGCCTGAGCAGGGTGCTTAGTCT 
TAAATTGACTTAAAAACAGTTGCTGGAAGCTTGGGAAACTTTTGGTTTCTGA 
TCAGAGAGAGAGTTAAGATAATTCTCTCCATATTTCT^ 

5 5 ATGTGAGCAGTGGGCGCGCAGGGCTCCACAGTGACACACCTCCTCAGGTCTG 

CCTTTCATTGTATCTAGTGAAATTTATTGAGC^^ 
GGGGTCTGAACACACGGAACTCAGTCTAOICAAGTGT^^ 

CCAGAAGGAGAGAGGXSAGGTGCAGTCCACACAAGAAAGTTCCAAACCAAAATCA^ 
GAGTCAGTTAGTTTGAGATTTCTAGGGCACTGGGCIATGGCGGTATGAGCCTTTAATCCTAGCACTTC 
60 GGTCTCTGAGTTCGAGGCCAGCCTGGTCTGTATAAAGAGCT^ 

AAATAAACATTTTTTTAGGAGTTCAGTGAGGGGAAACGCACAGTCACAG 

ATCTCCCAGAGCTCTTAGCCATCAGTCCTGTCAGATGCTCACCCTGAAGGACGG 

CACTGACAGACTCTTCTCCCATCTGCATGAGACT 

TCATCCTAOUVCATCCAGTCTCCCCTCACTGAAGGTAAAGTTTAAATTATCACCAGGA 

6 5 GCTGTAGATGTAAGCCCTATCCTAAGCTGTTCAGGGTCT 

CCAG^TCACCTCTCTGCAGCCTCCAAGATGTTGAGGAGCCTAGGGCAGGGGTGGGAGTG 
ATGACGGGGC^GTTGTAGTTTTAAATTCGCTTTG^ 
CACGCTCTGCrCTGCrrCATTTATTTCrcAAGAGAAAGAGGAGAACA 
TGGTGTGCTACAGCCATTCCTOTGACTCACGCAGAGACGGGA^ 

7 0 TGATTCCTATCCAGACTTTCTTCCCTTTGA 

AGCAATAAACAACAATTAGACACTTTTATGGTGTGCAC^ 

ATGAATTGACCCACCTAGTCCTGTATGACCATAAAAGACAAGGTCTTATTCTGTAGCC 

TCCTGCCTCAGCCTCCGCGGTGCTGAGACTACCTCATCTAGAAGAGATAAATCTTATGCTGT^ 

TGAGAGACGCGTTTCCTAGGGAAAACAGCTCCAATCTATGATAAATACAAGTTAAACGTGTTAGCAT^ 
TTGGCGTCTGCTCTGTTGGGAGCCGACGGCAACTCTGAAGTTGTGGAGCC^

CTTACATCACCTTTTCCATCAGATCCCATCTAGTC^TCTG 

AATTAAAGGCTTTATCATCTTCAAAAAAAAAAAAAAAAAAAAAGGAAACATT^ 

T e n i i c ri r rrrci i' ittggtagttccaaagg 


MOUSE SEQUENCE - mRNA

cctcttcatgatcgccgccaaacttcctccrrcggcgcrn3cttctaat^ 
cagtggcaccagggtcctcgctggctacccaatttggtggtccctggccct 
tgctctgcggctccatcccaggcctggtccccaagcaactgcgcttctgccgcaatta 
ggtgtgaagctggccatccaggagtgccagcatcagttccggggccgccggtggaactgtacca
ctttgggcctgtcttggacaaagccacccgtgaatc<kk:cttc^ 
gctcctgcgctgagggaacctccaccatctgcggctgtgactcacatcata^ 

TGCAGCGAGGACCCCGACTTCGGGGTGCTGGTGTCCCGGGAAT^ 
GAACAAGCA<3AACAATGAAGCAGGCCGAACGACCATCCTGGACC^ 
GCGAGGTGAAGACCTGCTGGTGGGCCCAGCCCGACTTCCGTGCCATTGGC^

ATGGTGGTGGAGAAACACCGTGAGTCCCGAGGCTGGGTGGAGACCCTGCGGGCTAAGTA 

GGACCTGGTCTACTACGAGAACTCCCCCAACTTTTGTGAGC^ 

ATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTGTGCT 

TGCCATTGCOTCTTCCACTGGTGCTGCTATGTCAGCTGCCAAGAGTGTATTCGCATCTACG 
CAGGGCACTGGGAAGGGGTAGATTGTGCGGCTGGATCCATTC^

CGGCACTGGCCAGCAAGGAGCATGGACTGTTGCCAGCTGCATGTGATAAACGACCTGGACCCAGCCG^ 
CTTCTTTCT CAACT AACG TCTCTC CC C CTGCTCTGGATGGTGT ACGG CTTT ACAG AG^jGGCT 

GCTGGGGACAG ACTCGAGGCTTAC CTTTG C ACATG TTAAAGAAAAT AAAAATG AAAAAAAAAAATCTACCGCAACAGAACAGG CTG 
GGCTAGTGTGAGCTCTTGGCCTG<3TG<5GAAG<3ACAAGACCATGGCGAGATTCTGTGTC 

GATGCCTCTGAGGTG<K3AACTGTGAAGTAGGACAGAGCCCC

TTGTCGTTCTGAGAAAAGCCATAGATAGGTGTAGCTGGGATGTAGTGATGGGGAGGCCCCTGGCCAACAGT 

AGTTTTGAAGACCTCAGAGTTCTGGGCGGCCTGGGAAGCCATCTGQAGAACAGACT 

GTTCTGCCCTGGAGCGACAGTCAGATCTCCACGCCCCTTTCTGT^ 

TGCCCTTCTCAGCCTTGCTAACCAGCAGACACCGAGGAGAGCAGCGGGGCACCTTAGA^

GGAGGGTAAAGAGTCCCACTTCCTTTGTGTTAGAAGGCAGACTACCCTGCGTCCTTTTCT 
CAAGAGATCCTTAACAAGCCCTTCTTCCCACTTGTAAAAGGGATAGCCTATCT 

AAAAGAGGCAAGCAGCGAATGGAGGCAGCTCCCAGCTCTGTTCCCGACGC^TGATGGTACTGGCTGGGTT^ 
GCTGCACGGATCAATCCATCAACTCCGTCTTAAGGAGAATCAGAAAGA 

GTCCTTTCCCGCTTCTGTCTAGGGTCTGCTAATGCTGGCTTGACGAGGGGTCAGCCACTTCTTTCC

AAGCAGGCTCCAGT AGGCCCTTGCCTGCACTCTCT ACCATGTGACCATG AGCACT CTCTAGGG ACACCTCCCATCC CTTCCTAGC 

ACCCCAAATGCCCCTTCCCATCTCTCCTTCCAGAAGTTGGAAATCAAGTCAACT 

CGGATACAACAATTTACAAGTCTCTTCATATCTATGTATTCTATATTAAAAGTGATAAAGTCATG 

GCTGACAAGTAATTATTTAATAATAGT ACATG AG CGCATTGTAATT ATCCTCG CCATAGTCAGGTAATAGCATCCAATGGGAGGTC 

CCTACGAACCTGCTGTATCCAAAGTTTTGTAAAAAGTTGTAGAAGTTGTTGATCT

ATAAATATTATTTATTATACAATGTATATACCTTTGAGTTAACTAAGATTATATATTATATAAATATATATATATT . 

MOUSE SEQUENCE - CODING 

ATGGAGCCCCACCTCCTCGGGCTGCTACrrCGGCCTCCrGCTCAGTGGCACCAGGGTCCT 

GGCCCTGGG CCAGCAGTACACATCTCTGGCCTCCCAGCCTCTGCTCTG CGG CTCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCT

TCTX3CCGCAATTACATCGAGATCATGCCCAGCGTAGCAGA 

CGCCGGTGGAACTGTACCACCATAGATGACAGCCTGGCCATCTTTG<X5CCTGTCTTGGACAAAGC 
GCATGCCATCGCCTCGGCTGGTGTCGCCTTCGCAGTC 

G CGG ATGCGCGGGAGAACAGG C GAG ATGCCCGCT C AG CT ATG AACAAG CACAACAATG AAGCAGGC CGAACG ACCATCCTGGACCA

CATGCACCTAAAGTGTAAATGCCACGGGTTGTCCX3GCAGCTGCGAGGTGAA 

TTGGCGACTTCCrCAAGGACAAGTACGACAGTGCCTCCGAGATGGTGGTGGAGAAACACCGTG 

CTGCC^CTAAGTACGCGCTCTTCAAGCCACCCACCGAGAGGGACCTGGTCTA 

CCCAGAGACGGGCTCCTTTGGTACCAGGGACCGGACTTGCAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCT 

GCCGGGGCCAGAACACGAGGACGGAGAAACGGAAGGAGAAATGCCATTGCGTCTTCGACT

TGTATTCGCATCTACGATGTGCACACCTGCAAGTAG 

HUMAN SEQUENCE - GENOMIC 

TG CCITGCCTTGGCCTCCCAAAGTGCTGGAAATACAGGCATGAGC CACCGTGCC CGACCTGGGCAGAACTACATTCTGG ATGCATCT 

TAAACCf CACCTCAACCCTACACCCTCCCTTTTGTTGTTTCCTTTCT

CTCCAGTTGCAACCTCACCCCCACCTAGACAAACCCTC!AAATCAAG 

ATCCCAGCACTGTGGGAGGCTGAGGTGGATGGATCGCTTGAGCTCAGGAGTTCAAGACCACCCT 

CCATCTCTACCAAAAATTTTAAAAAATTAGCTTGTTGTGGTGGTGTATGCCTGTA^ 

GATCATTAGATCCTGGGAGATGGAGGTCGCACTGAGCCAAAATTGTGCT 

6 5 CTCAAAAAAAGAGAAAAGAAAAAGAAATCCIAAAGGAAAAAAAGAGAGAGAC^ 

TGTTGGC CAGGC CCGGTGG CTCAT ACCTG T AATTCCAGCACTTTGGG ATAGGCCG^ CGG AT CAT 

G AGGTCAGCGGAT CGAGACCATCCTGG CT AACATGGTGAAAC C C CGT CT CT ACT AAAAAT AAAAAAAAATCAG CTGGG CGTGG TGG 
TGGGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGGGAATGGTGTGAACCTGGGGCGCGGAG 
TGCACCACrGCACTCCAGCCTGGGAGACAGAGCAAGACTCTGCCTCAAAAAAAAAAAAAAG 

7 0 AGAAAAGAAAAGAAAAAGAAATCCTAACACTCATCCATCCTTAAGTAAAAGCAACTCACCTTGTAAGTCAGT 

CAACrACCCTAGACTAGACATGATGGGTGAGGAGGATACAGATAGCT 
AACGAAC!AAGGAAAATGTATTATTTCATATCACAAGAAGTTCCAAAGC!AG^ 

GTCAGAGATGAAGTTTGTTT CTCTCTCTTGGCTCTGT CTTCCTCTGCCGGT CAG CTCTCATCTTCAGGCAGCTTTCCTCATAGTCAC 
AG AATGG CTGCTGCAGCTCCAGGTATC CCAACCCAAAACAACAATGTCCAAAAG CAGG^ AAAAGATATTTCTT ACTGTGTGTCATT 
7 5 TTAAGAGCCCCAAAAACTCTTCCCAUAAAC CrlCTTG CACCCTT CAGCTGACCTCC CT CaGGCACATGT CACCCAGI 7 TAGGCCAA 
TCCCTGGC^VAGCAGAATGGGTCCCCGTGATTGACTT^

GAACAAACTGGGGGCirrCTGCCATCCTGGAGGAAGGGACCACCAGCAGTGTGA 
AGAAGTTCAAAAACAGCAGCTGAGGCTGGGCATGGTGGCTCATGC 
TTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTC 
5 GCTTGCATCTGTAGTCCCAGCTAGGAGGACTGAGGCAGCAG^ 
AAGATCACTCCAGCCTGGGTGATAGAGCGAGACTTC^^ 
GAGTGGATGCTCACrcATCATTTTACATGTCCCAGG 

TTTGAGACAAGTGTTATT ATACC CATTTTG CAGTTGATG AAGCAGGT A CT CAGTGT CACAT AGT AA 

GAGGCAGAGCTGGGATTCACTGGGGTTGTGGTAGACCTGGGC^ 
1 0 ATCCCCATTCAGGCCTTATCTAGAAGCCAAGACAAAGGTGCATAAGAGGTGGGTCCTGCTC 

TCCCTGCTTAGCCCAAGGGCCCTCCCACGTCTGTGC^ 

CCCACTGGGCCTCCATCCATGTGATTAGGCTCCCAGGA 

TGGCCATACTGGTCTGTGGAGAAGGTGGTTAGTTGGCAGAGTT 

TCTCCAAGGATATTATCAGGCTTGTTCTATTCTATCTCCCT 
1 5 GAGAACTTCCACACCTCTGTTTTGTTTC XSTTTTGTT TTGT ^ 

GCACX3GTCTTGGCTCACCGCAACCTCCGCCTCCTGGATTCAAGTGATTCTC 

ATGCGCCACCATGCCCAGCTAATTTTGTATTTTTAGTAGAGACAGGGTTTTGCCA 

TCAGGTGATCCGCCCACCTC<^CCTCCCAAAGTGCT^ 

AAACATTTACGTATTTGGTCCTCTGATGAATGACTGTC^ 
2 0 CTCATCAGTGCTCAG CTGCCAAGGGGCAGCTGCACACAG AGTTG CTCC ATGAAGATGCGTTGAAGAAATGGACCAGGTGAAATTCA 

C^GAGCACCTACTGTGTGCATCAACCGGAGCTGGGTTATAAGGAAGAAGGCAGACTCCTTGAGG 

CTTGTCCACAGCAGGCATGCAGTCAATGTGTGCGATGAGAAATACAGA 

CCATCGTGACTAAGGTTGGAGACAGAAGAGACCAAAGATGX3CTCCAAGCCCCT 

TGCCTGCTTATGTCCCATACACAGGK5GAGTGGAAGGAAAACTCAGGAGCTCTAGTTC 

2 5 CACACAGAGGTCTGG CTCT AACTCCTCAGGTAGGATCACAGCCATG CTAGG AGCTGGCCTTG<^TTAG AGCCCCAAAAGCCTTGGG 

CAGGCCAGCCATGTCCAGGGArGGGGCTCAGGGTTATGAGGCAATGGCAGAATAA 

AGGGAAGX3AGAAAGAGGTGTGGCACATGGCTGGAGAGGCAGCACG 

TCCCAACAGGCTGCCCACCCTTAATCTCATCCC^ 

CAGGGTAGCTGGGG TGGGTT GGAGTTG T AGGGG CCAGGG CCACT CAAGAT AGGGGCACTGAGGAC C CTGAGGGG C CACAGGGAGC C 

3 0 CAGACCTGAGTCAGTGGCCCITCTATCTGAGACCTCGGCCCTGCCTGCTCCCTCTCCT 

GGGGCTGCTAAGTGCTGGATCCCTCACCACTGXX3GGTTCAG 
ACGGACAACATATGCAAGAAACTGAGCAGATATGTCTGCT 
CCCAGCACTTTGGGAGGCCAAGGCGGGCGTACCACCTGAGGTCAGGAGTTCAA^ 
CTACCAAAAATACAAAAATTAGCCAGGCATGGTGGCGAGCGCCTGTAATCTCAGCTACTTG 

3 5 GAACCCAGGAGGTGGATGCTGCAGTGAGCCAAGATCACACCACTGCATGCCAGCCT 

AATAATAATATGTGTAATCACATATTAATCACATCAGTGTCAGGGCTATGCrGGAAAATAAAGCT 
AGGGCTAGGGAGTGGCAAGGGGTCTGGAGGGGCCTCTCTGAAGTG^ 
TTATCCCGGATAAATCCTTCTGGAGAAGGCTCCTGGGCGCCTG^ 
ACCTGAGAGAGAGGCTTACTCCTTGCCTCAAGAGAAGAAGGAAACCCAATATTT^ 

4 0 AGCACCATTCTCATTTAACGCTGACAGCTCCCTACAAGGTGGAAAGTGTTATCTCCACTCCACAA 

TGACAGGCAACCGGCCAGTGCCCCCACCCCACCCCATTC^ 
TGGGGAGGCCAGGCATGGTGGCTTGCACCTGTAATCCCAGCACTTTC 
AAGACCAGCTTGGCTAACATGGCGAAACCCTGTCTCTACCAAAAAAACACAAATT 
CAGCTACTTGAGCGGCGGAGGCAGGAGAACCACTTGAATATGGGG^ 

4 5 CAGCCTGGGCGACACAGTGAGACTCTGTCTTAAAAATAT 

TCTACTAAAAATACAAAAAATTAGTCAGGCGTGGTGGCGGGCACATGTAATCCCAGCTGCT^ 

TTGAAC CTGGGAGGCAGAGGTTGCAGTG AGTCGAG ACCGCGCCATTGCACTAC^ CACCTTTC ■ 

TATATATATAATATATAATATATTATATATAAATA^ 

AATATAATATATATAATATATAATATATAATATATGTGTTTATTATATAATACATATTTATTATAAATACATATTTATT 

5 0 CATATTTATTATAAATACATATTTATTATAAATACATATATATAT 

CAGGAGGAACTACTGCCACAGGGAGAACCTCTCAGGTGCCCACAGCGGCCCCTGTGATCTCGGGATC 
CCGCCTGCCGCTGGCAGCCTCCCATATGTCAGCGGAAGAGGGAG^ 

ACCGGC ATCAC CTGGGCCCGCTCGCCCTCCGCCTGTCTGGAG AGG AG AGGGGAG CCTGAAGCCATGCACCCCTTATCCCCAGCTAA 
GCTCCGGGATTGCGAGGTAGATTAAGGGCATTCACTCAGGAGGGCACCTGCTTAACCAAGATCCT 
55 AGCAAGAATTGAGGATGCCGGGCCCGCAGGGAGCGGAGTGGGAC 
ATCAGCACTGCCGCGAGGGGGCGCGCCCGCGCCGCTGGGCTCT 
TAACATCTGGTGCTTACCTACTATGCGCTTACTCGGTGCAGGTOVGGGGCrrCCTC 

AACTAGGCTGGGCAGTGGAAAG AAATCCGATTT ACTCCTGCGCCCACCAC CCACGCTAGGG CCTTCCTG CTTCCAGGCTCTGCTGT 
CCAGCCTTCCAGTGGCCTCTAGGGAGCTCGCCTAAAACCGCTCrcCGGTTCCTACAGCT 

6 0 TCCAGGTCAGAGGCCTTGGAGCCCCCTATGTGCTCCCTTTGCACTGTTCCTCCATAAGC 

GCACTTCCCCACCTGCTTCCCCGCTGC^CGGCTTCGCCTCTCCCTTTACCCCCCCAGGT^ 

GCCCCATGAGGGCGGGCGGGGGCCTGGCCACCGGATTCTTTGTCTACACGGTGCCTCCGCAGGGG 
TACACAGCGGCAGACTCTGAGCCCCGGGGGAGTGCACATCAAAAGGAACCTCCGAGCCAGCTAGAGAG^ 

6 5 GGGGTCCCCTTCCCAGGTCTAAGGTCCCACACATTCACA(^CACAACA^ 

TCCACCCCGCCCTCCCrrCCTGGTCTTGGAGTGAATTTAACCCTTTCTCCACCAAGTGA 
TAGAGAGAGGGTGTTTCTATCTGTGATGTGTGGTCAGGGGC^^ 

GCAGCGCAAGGGGTGCAGGAGGAAGGAAATTAAACCTGTGTTGAAGGAAGGACACCTGGG 

7 0 TGTCTGAGGAGCTTTCAAGTGGCGGAAATCCCCACACACACCTCCATTCACTAGGG 

CTCTCCTCTGTCACTTCCCAGGCGCACATGGCCAGCTCCT 

CTGCGTCTGAGTTCTCCAGGCTGCCTGGAGGAGGCATTCAGGTACACATGACCCCTCA 
AGGATGCTGGCTGGGCATATCTCAGCCAACACACACTCCCACCGACAGCCT 
TGTGGAGTCTTGAGGCCCAGTAATGACCCCAAGAAGCCATCAAGTC 
7 5 CTGAGTGGCCAGGGAAGGTAGCCCTCCCCAGCTCrCAGCACCACT a C 
CAGCAGGCAGCTCTGATCAGAAAGCCACCTGGGAGCC^

G AGGGAGGT ATG TGGAAAGCTCTCCACTGC CAG AGC CTTGG TCCTG AGGCTCTG CATTCATGG CAATGGT AAC CCT CTTT CCAGGT 
GAATCAGAAAGA.GGGCACCCCAGAATCAGCCCAGCCATGAGGAGCATCTCT 
CC^CTCCTGGTCAGGTTCACGATTTTTCTTCGCCTTGCAGCTCCTTGGTC 
5 CGGTGOCTCAGCCTCCCCCCGACAG TTTTC T T GGTTTCTCT 

CTCGGATTCTCCTTTTCCTCCCCTTCCTTCCTCTCTTTCAACCTCC 
GCAGGAGCCATGACAAGCCCAGGCXJGCTCrCCCGA 

CGCGTGCGGAGAGGGCTCGCCGACCAGAAGGGGCAGCAGGTTGGAGCTGTGG 

TGGCCGCCTGGCGTAAGACTTCCTTCAGCCATGGCCCTCACCTCGT^ 
1 0 GCTCATTCTCTGCCCTTTCTGCCACAGTTCCCTACACAGCGGTCCCTC 

CTGCGCAGAGTCCTGGTGCCTCAGGTGGCCTGGGAGGGGGCCTAGGGAAAGGCACCCAGGTG 

GACCATGATGTCAATACCACCCAGGGCTGAGACCTAATTCCTCGCTCTGGAGAG 

TGGAAGGACTCCTTTCCTGTGGGGAGGGGTGGGTAACTGGAGGCATTTGGTGTCCGCAC^ 

OVGGCTGGAGCTCCCTATCAGTAGAACCGGGGCCTCAGGCGGGACGGTCGGGCAGTGAG 
1 5 CCAAGGACCTGGGCCGCCCTCGX3CCAGAGCTG CGCGTTTCACCGGT C 

CACGCCTCCGACCCGGGCCAGAGGTCG^CCTGAACCCCTCAAGGAGGAGAAAGTGACCCGACTCCGA 

GCCGCTGGTGTCAGTTGCAGGAAATGCAAAGGCAGCAGGAGGTCCCGATACCGATCTGATCCCTGACA 

CGGCTCCGTCCCCCGGCGCCGCCCGCAGCCAGACCTTCGAGCCTCCGCGGACTTTTCCGCACGTTCCTCGGACACCCGGGAAGTGA 
CCTCCCGCX5GCCAGGACGAGAGGCCAGAGAGCGGGCGGACTCGGGAGGCTGGAAGCTGGGAGGGGGTGCGTTCCCCCACC 

2 0 TCCTGACCTGCCCGAGCGGGKjCTTCTGXiAAAC 

GGTCCCCGCCAGCAAAGGAGCCACCCGTTCATGCCCACGCCAAGCCCTCGCCATGTGAACXSCCCGGGTCAA^ 

ACCCTGGCCAGCGGGGGACCGCGGCGGrrAGCAGAGGGTCCGGCCCACCGCGTCCCCCCGAAG 

TTTGCCCZAAGTTTGCTGGGCGATGCCCTTCCCGGAGCCCGAGTTGTGGGCCG 

CAGACTCATCTCTTACCTTAGGCCGGGGGCGCTGATTGGCTGCTCGCTGACATCCTCAAACCCGGCTGCT 

2 5 AGGGGGGCGGCTGCGGGTGGAGGTGCGGTTCTGAGAAGCCCX3AAAGTCATTT 

GGCGTCGCTCCCCCTCTTCATGGTCGCGGGCAAACTTCCTCCTCGGCGCCTCTTCTAATGGAGCCCCAC 

CGGCCTCCTGCTCX^TGGCACCAGGGTCCTCGCTGG CTACCCAATTTGGTGGTAAGACTCG CCrCTTGTCTGCCCGCGGCCCGGTT 
TCTCCGCCGCGTCCGGGGAAGGTGGGGGCGCCGGCTGGGGGTGGGCCGCTGCAGGGCT 
CCCCCACCCGCCCACCCCTCCTGGATTTTCTGCrGGTGTCGCCTTC^ 
30 CTGCTCCCCGCTGCAGGGGCTGGAGGACTTTTCCCATCTCAGAAGGTTTCTT 

CCATTTTGGGGTCTTCATGGGCAAAACCATCCTGTTCCTTCACTGCCACCCCACCTCT 
TAATAACCCCCCTTTGCCTTCCTATTTCAGACCCC 

CAGATTCTCCCCACCGCTCTAGATTCATCTTGAGCCTCTGAAGTGAATATGGTATTTGCTGA 

3 5 TTTATTCATTTCTTGCTTGTCCACCCTGGCAAAAGGGGGGTTC 

GACCAAAAATCCAATTCTCCATGTCACATTTCTTCCTTTTGGAAAACCGAGGACCGAGGAG 
GTCCCTGAGAAATGTGGATACAAAAGACATTTTAGTAAATAAGACT^ 
TGGGGGGCCTGGGTGAGGAATAAACATCGCTTTGGGAGCCCCAGCTCCAGGAGTCCCCT^ 
CTCCCACCCTCCGCAGGTCCCAGGGGCCAGAGGGGAAGACACTGAAACCAGTTCTCTAAATTACACTCOT 

4 0 AGCCCCCTACATTCCGGAAAACTTTAATTAAAGAATCTTCCCCTCCTGGACATAJGG^ 

GAGGAAGAATTTGGGATCTGTGAGTGAGAAGTGGGCTGTGGCCCCCGGGGTGAAAGTCAGAAAG 

CGCCTGCAGGTCCCCAGACTCCCTGCCTTTCGCCAACCCAGAGTCCAGGGTAGGAGGAGC^^ 

CCCCGAGTCTTCCAGTTGTTTGGGGATGGCAGCATTrGTGTCT 

GGGTCTCTGTTTAGAGGGTGCTAGGAGTTGCTCCCAGGACAGCCAGCCAAGGGGAAGTCTTT 

4 5 GGGGGGCTGGGCTAACCTGAGGCTTCCAGCACATAAAGAAGTCCATGGGGTGGCCAG^ 

GGCCAGTCTGGGGAAAGGCTGCTGCCAACACCAGTCCAGCCTCACCCGGCCATTCCTGACC 
ATGCCGTTTTTCGCCTTGCTTTCGCAGAGACTCTGGTGGGTTGGGGGAGGGTAGGAAGGAAAGGTTTGGCGGCC 
GGACTGAGGAAGAGAGACCAGTCTCTCCGGCCCTATGAACTAGAAGAGGGCACTCTTCCAGTTGGT^ 
GTATCTTGGGGACCATCTAGATAGGGATTGCCCAATAGGAAGAA^ 

5 0 AATGCTAGACACATGGCTTGTTTGGGAATTTGGTGGGGGACAGGAATGCTGCTTTGTCTA^ 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCGTGTGTTCTGGGTCTATC 

AGGTTGTGCTGTGTGTGTCTGCACACXSAGTGTGAGCAAGTGGCTGGTG 
AAGTCTAGAAGTATCTTTGCCCATGTAAAGGTCAGAGCCTGTGTCTGCC 
55 GTGAGTOTGTGTGTGCGTGCGTGCGTGTGTGTGAGTGTGTGTGTGCGTGC^TGTGTGTGTGTGTGTGTGTACGTGC^ 

CTTCATGACAGTGTGGGGAAAGCCCC^CAGATGTCTTTGTGCATGAATATCAGTGCTCATGGCCATGACCTGATTGTGCTTGTAAG 

GAGGAGAGTGAGCCTGAAGTCCATAGCTGTGTGAGGGCTGTGTCTGAGTCCGCCTGTGATCCTGTCCCGAGG 

CTGTGTGG CTG TG AATATCTTTGCAG ACATCT ACCTG T GGGT ATG AATG ACACCTG AG CTT GGGTGTTTGGTGGG CAGC CATGGGC 

CTGGAGGGAAGGTCCAACAGGAAGCAAAAGTCTCACAAGGTGGCTGGAGGGGGCACA.GCT 

6 0 AGGGGCACAGGCTTCCrTGACACCAGCTTGTCCTGTC 

TGGTCTTGGAGCTGGGGTGGAGGGTGAGTCACCTTCTCAGGGTTAGGGCTGGGGCTGTAGGGTAGATGGGCCTCT 
CCATTTCTGCCCCCACTCCCCAACCCAGCCCCCCGCCrrGGCAGTCAGATTCCAGTACTTTGAGCATCTAAGCCC^T 
CCTCTTTGGCCCACCCAGCTCATGCATTCCCCAGGAAGGATTTACAGGGGAAGTCATGTAGCT^ 
TCAAGCATGACACCCTGAGTCCACACTGCTCAGACCCCAGTCCTGACAGAAGGTTGACCAGAAGG^ 

65 ACTGACACCCCTGGAAGGAAGAGATGTTCTCCAGGTAGTGACAGAGCCGGGACAGGAA^ 
TTGTCAGTCCCAGTTAGCTCCTGGTGGAGAAACTTGTCTTTCTTCTTCTTCTTG 
ATTAGGGCCTAGCCCAGCACCTGCTACATTCCAGCGATGGACGTGCAGATGGAGGGGGCCTGCTGGA^ 
CGCTGTGTGACTGAGGACCAGTGGCCACCCATCrcGGTCrGCTTCCTTACACT 
AGCTTGGGTGAAAATGTTTTGCAGAGGACAAGGGCAGTGAGCTGGGAAATAGCACACCT 

70 TCCCACTCGAGCACCTGCCTAAGACCTACCTCAGTTTCCCCATCTGTCAGATGTCCTGCTGCC 
CCCACCTTTCAGAGGCTTCTGAACATGTTCAGTCAAAC^ 
GGGGAGGAAGGCCCCAGGCCAGGTGTTAACACTTGTAACAGG 

GGAGCTGAGCTAGCTGGAGGCTGTTGATCCCACAGACAGACAGACAGACAGACAGATGGGCAGATACCAGATCGGATGGCCAGGCT 
TTAACCGGGTGGTCTCCCTTCCAGWCAGACCTTTCCCrrGGGGGGGA 

7 5 AGCCCTCCCX3 CTCACCCCG CCCCCATTATGGGTGGGGCTTCGGAGTCCAG AGGGAGCAAAATTGC 
TCCGATAGGATCACATTTTGCTTAAAAAGAAGAAAACAG^

AGAAGAACCTGCTGTAATATGTATCTGGATGGCATGGGGGAAGGGGACGCTCTCAGG 
ClAACAGCTCCGGGGCCIAAGAGGTGAGAGATGGGTGGAGTCrG^ 
CTGTGATGTCTGCCCTGCAATTAATCCTGGCCTGGTGGGAAGACC^^ 
5 GCCAAGTCCTTTAGGAGGTGCTTGCTGCAGCTTCACCAGTGGCAAACCCAGC 
ATACATGCTCAGTCACTGACAAAGGTGGAGAAACTGAGGCAGACAGCGAG 
CTAGCAAGCATCTCTCATCCTCCTCACTACAGAACTTAATGCT 

GGATGCAGCTAACAAACCCTTCCAGGGAAAAGGAAGTGGG 
1 0 GAGCAGAACCTTTTCCACCTACCCTGATGTGAGCCACAGAAACCTTCT 
TTCCTAACTCAGGGAGGAGGTATGCCTAGCCCCACCCC^ 

CATGTTATCTTGTATGCTGGAAAATGGTGAGAGTGGGGA^TGGAGAGACCTATGTCT 
TGTGACCTAGGATGAGTCCCTTCCCTTCTCTGGGCTTCATTCAAGTCCCTG 
AGGGGTCCCTTTTGCCTCTCAGAGTCCCAGGGCTATGACRGGTTCTGG 
1 5 TCCGGGCCTGTATTAGACAGTCCACCCGTTTCTTTCTGTGGCTCCATCTGACC 
CAAGCCAACCTGCTTGCACTTCCAAACCTCGCCCCAAACACT 
TTCATGTACTACCTGGGTAACTAGGGCAAGTTAACCTCTGCAAGCCTCA^ 

T ATTTCATAGGGTTTTG AAAATG ATTG AGTG AATACTTCTT AGAATG AT AC CTTGCAT ACAGGAAGCAT ATCGATGTT AG CT ATTT 
TCACCTCCT ATCTGCAAGCTATTGGCTCACTG AAGCCCTGGAGCC AAACTTG CCTCTTCCAAGAAGCCCTCCCAGATTTG CCCCAC 
2 0 CTGG CTCTGTTTTCCCCTTT T CTTLT 1' 1" 1' 1'G C CTGTTTG C C CCAC AGTGCTTT AG CTTT ATTCTT AAGTTGAGGGTG CTGGG C CAG 

GCTAATCTCACTTCAGCTCTTTACCTGCTCAGGAAGGGGAGGGCA^ 

ACCC CCAGAGTCTGCCAG ACTCTTGCTGGGACCT AG AG CC AGGGGG CTGG GCAC AGAAGGGGTGGGGG AGG CAGGCTTC C CAGGGG 
G AGTGGCAC CAGGG CCGC CCAGAG CAG C CTG AGC AG AAGTTGCTT CTCT AGGAG CTG AG AG ATG ATTCAG AAGGTGATTT AT AGT C 
GGAAGGCAGAAAGCTTGGCTCCTGGCAGAAGCTGCTGCTTAATTAC^ 

2 5 GAGGCAGCAGGGGCCAGACITCrrGCCTCCTCCCCATCCAGGCCCTCCCTGTCCCTTTGGCTG 

TTTATCCCXpCTGCCAGAGAGGGAATTCCCTCCTCTCAGTGAGAGGCTAGGCCAGG^ 

GTGGGAAAAGAGGAAGGAAGCTGGAT C C CATCTCC CGGGACTTCTC CAAGGGAG ACTGGATGG CG ACGGGCTGGAAAACGGG CCTT 

CTGGGAAGGAGCACTGGGTCTCCTCCCTGCTTCTGGGCAGCGGAATAGCTAAGGCAC 

TCTTTTTTTCTTGAAACAGAGCCTTGCTCTGTCC^ 

3 0 TOAGTTCAAGCAATTCTCCTGCCTCAGCCTCCCG AGTAGCTGGGATTACAGGTG CGCAC CAGGATGCCTGGCTAATATTTGAATTT 

TTTAGTAGAGATGGGGGTTTCACCATGTTGGCCAGGCTGGTCT 
AGTGCTGGGATTATAGGCGTGAGCCACCGCACCCAGCOtfa^^ 
ACCCTGTCCTGCTGCTGCCTGGGAGGGCCTGCCTGC^^ 
AGAGAAGGTCCTGCGCTCAGAGGCTGAAACCCGGTTCTTCAGCG 

3 5 GCACAACGGTTCTGCCAGCAGGCACGGCCCCATACCCTCATGTC AAACTAAT AT CTCAG AG AGTC ACCC CAATGCCGCTGGGTGCA 

GCrACAATCACTTCCGCGCCATTGACAGGAGCCCCCATTCTGTTTCCATACC^ CTG AGTTG ACATCAG CATTTCCCAT ATTAACA 
CT G ACC CATGCAACAAC C CGCC CGATG ACACCTAAGG CCGCTT AGTT C CCTG CAT CG C AGC CGTG AATC CCAAATCCCTCACTGCT 
AGTTCITAAGGCTGG AAAGGTGT AGGG CX3GGCAGGGGT AGAGTTGCT CG ATTT ATCrcTTTGCAGCCTGGCC 

TGTACAGCCGGGAGGAGTTGGCAGGGGTCAGGATGGGGACACTAGACAGCTCT 

4 0 GCCT CATTATGTGTGTGGAGTGAATTCCAGAAATGTCAGGCACACC CATACATCT AGGGAATG AAGGAGATGCCAGAGCGGG CTCT 

GGGGCGAGAGGGGCCAGTGTGAGGAGGTGGGCGGCTTCCGGGCAGGGAGTTAGGGCACCTGAGA 
TATTAGGCAACTTCCTGAACTGTTCTGAGTCTCAGTAACTAGGAGAGTTC 
TGCAGTCATGGAGGCTGTATTCCGCGTTCAGGTCACCCAGCTATTTGGACCA^ 
TGGTCTCAGGGAAGGGTGGGGGGTGCAGAGCTCTAAACrcrACACACAA^ 

4 5 CAGTTTTCTCATCTGCAAATTGGATGATGATAATAACIACGACCACCTC 

AATAATCTTTGTGGGC^TGCCCCX3TGACCAGTTG<3CCTCAGACCTGAGGCCTCTGTGCC 

OlGGACCCCTAC^GGAAGGAGCrcAGGACTCCTTCCT^ 

TGTGAGTCCTAGAAAAGAGGAATAAGAAGGCTGGGCATGGX^CTTAC^ 

ATCACCTGAGGTCAGGAGTTTGAGGC CTGCTTGGCCAACATGGTGAAACCCTGT CTCTACTAAAAAT ACAAAAAATTAGCTGGGCG 

5 0 TGGTGGTGGGAGCCTGT AATCCCAGCTACTCGGAAGGCTGAGG CAGGAGAATCG CTTGAACCTGGGAGGCGGAGGTTGCAGTG AG C 

CG AGATTG CAC CACTG C CTT CCAGCCTGGGCAACAAGAC CAACACT CTGT CT CAAAAAAAAAAAAAAAAAAAAG AAAG AAAGAAAA 

GAGGAGAGGAATAAGAGCTAGTCCCTGCTCAGACAGTGCTAAGAATGGACCTAAGAGCCCCCCTGAGCC 

GATATTTCCAGCAG^GGAAGAAGTGGGGCTGGACAGGGGAGTCCTC^ 

CCTGGGAGAGGGCACAG AGG CCTCAGGTCTGAAGCCAACTGGTC ACGGGG CAT ACCCAGGTTCACAGCT CAGGG CATGGGGCTTGG 

5 5 TTATCCTCCTGGCCTAATGOUWVAACCACTGGCCAGACCCAGAGTGAGATT^ 

GTGAGGGGGGCGAGGGCAAAGGGCTGCCAGCTGGAAGGGGCTCTGCTGAGGCCTGTAGTCATC^ 
AGGCCTAGCAAACTAGGAGACAGGCACAAGCCATTCCCTCCA^ 
GTTGC^GAGGGGGATGGGAGCAACAGGACCAAGCACCCAGGTCTGT 
TCCCACTTATCTGCCAGAGCCAGCTG'GGCCGGCAAGGAAAACCAATCTC^ 

6 0 TGAGTGAGCAGGATCCCGCCTCCTCTC(^GCTCCCACCAATCATCTTCACAAGGTGGATTC 

ACACCTCCCATGGCTCCCCTCTTCCTATCTGGTACTGAGGCTCCCTATGATTTCACTT 

CAAACCCCCTCGATCCCCCCAATAACCCACTTCAACTGGAATACTCACCrCT 

TCTGACTTTAAAAGGCAGCCCAACCrrGGGGCCTGTTGACCTGGGGTCAGTCT 

CAGGT CATGT AG CT AACAG CC CTTGAACTATGG AGGGTCCIT AG AGGGTCT CTCAAGG TGG ATGGGCCCCTCG AC CTG AGCAGG CTG 

6 5 CTGTCCCAGGAGTGGAGATGAGCCTTTTCTCCACAGTGTCAATT^ 

TGT AAACGTTTC CTATTTG TGT ACT ATTTCCC CCTC CATGTTCACCAGTT CACACG ATG CTTAAGTGT CT AAGTCATT AATTTT AA 
GATTGCAATTGGCAGAAGTTGTTTCTCTATCTAGGAACAGGTTAATACGTTTGTTGACC^ 
ACATCATGATTTATCTGAATCTAATTCTTGCCAGTTTATAGATAGGGAGTCCCACAGTGCAGCCT 
GAGCCCACACCCTGTTGGACCTAAAGCCTTTTTCCTTCTA^ 

7 0 G CTTGTTCAT ATT CT ACTTTGTT CCC CAAAG^ ATGGTTTTGAG AC AAGTTT ACACTTGACCTT C CACTCAAAGG CTGATGCTTTGG 

G CTTAT AGAT ACT ATTTG TGAACGTCTGGGAAGTCTCTT ATTTGGGGATG TGTC CT CTTCACTGG CCACCTTTC CACAAGCCATCA 
GGCTGTCCTCAAGCTGGGATTGCACAGTTG CTGACTAGCAGTGGCCATGTGAGG ATTTTGTTGC CCCrGGGATGGGTAAGGGGCGAC 
CTTGATCAAATGACTGCCTCCCGGGCCTCAGTTTCCTCATCT^ 

AGCACTAAGCTCTTTCCTTCT ACTT A CTTT GCAT CTGGT CT C CATCATAT ACGTGCT AGAAAGCAGCTGGGCA 



CAGCTTTATTTCTCCCCGAGGAGCCAGCATG^ 

CAGAGGTCCACCTGAAGCCACATCCTTGCACCCGTGTGGACCTTCCCTGATCCTGCT 

ATGTGTTTTCCGTCATC^CCTCTTGAGGGCAGA 

XTGTCTTTGGGATACACATTTGCTGTCAACCTC^ 

AAC^TTGCTCTGGGAGGGAGTTGG^CCTGACira 

TTATTTATTTACTTAGAGATGGAGTTTTGCTCTTGTTGCC 

CTCCCGGGTTCAAGCAATTCTCCTGCCTGA^ 

TTITCAGTAGAGATAGGATTTCACCATGTTGGTCAGGCTGGTATCAAACT 

AAGTG CTG AG ATGACAGGCGT AAGC CACC AGGCCCGGCCTTCTTTCTGTTTCITT ACTG TCAG ClTi" 1 C CC CAAGTCT ACAAAGGT 
CTTGTCTTCAGAGAAGTCX5CCTCriTTTTrrTCCT 

GACnSTCTCTCGTTCCTGGCCAATGACTATGTCCACCCCTACCCCTGGCT 
TGGAAGCCGACGGTGCTATTGCCACCATTCCACAGGTGACGGAGTGCCCAGGTAG 

GTCCCCAGCAG CCATCGTGAGGGCCAAGCCCCACTCTGCAGATGGAG CCCCAAGG AGG AGCAGAGGGGAGCGG CGGCCAGGGATCT 
GG ATT CTGGAG CCTTTCT CTGCT AC CAGTAAGGATGGAGT CTGTGACCT CT AAAGCAG CCTCGAGAAAG
CAG CCTCTTCATCCC CATG TT ACCTGTGAGAAAACTG AGG CTCAGCTGGG CGCGG TGG CTCACGC CTGT AACC CCGGCACTTTGGG 
GG ACAAAGG CAGGCAGAT CAC CTGAGGTTGGG AGTTTGAGGCCAGCCTG AC C^U^CATGGAT AAAC CCCAACTCT ACT AAAAAT ACA 
AAATT AGCCGGGCGTGGTGG CACG TGC CTGTAAT C CCAGCT ACTGGGGAGG CTG AG AAAGGAG AATCG CTTC AACCG AGGGCTGAA 
GTTGCGGTGAGCTG AGACCTGGCCATTGCACT CCAG C CTGGGCAAT AAAAG CGAAACT CTGTCTCAAAAAAAAAAAAAAAAAAAAG 
GAAAAGAAAAAGAAAAAAGAAAAAAAGAAAACCGAGGATCACAGGACCCAA^ 
GCTCCAGTGGCAGCTAGTAATTCCTCGCTTATGATGTCCCGGGT^ 

CTCCCGACAACCCCAGGAGGCAGAGACTCTTGTTATCCCCATTCTACAGATGAGAAACTGAGGCCCT 

GAAGGCT C CATGGTTGGTAAACCC CAG AG CTCT AAACAAG AG ACAAACTG C CTG AAGCCAGG C CCTTCAGTG ACACCACAC CACAC 

CACAGCCAGGAAGTTCCTCCTTAGATCTGACTTCAGTCT^ 

TGTCAAGTTTCAGTTTGGTTTGGTTTGAGACAGGAT 

TTTTTTTAATTTTTATTTTTATTTATTTTTGAGAC 

TCACTGCAACCTCCACCTCCCCGGTTCAAGCGATTCTCCTGCCTC^ 

CCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGATTTCACCATGTTGGCCAGGOT 

CCCACCTCAACCTCCCAAAATGCTGGGATTACAGGCGTGAGCC^ 

TCCCTACATTACTCTTGCTTGAACTGGACTCAAACAATCCT 

CAGTTTGCACTTGAGGACCAGGGTTCGTCTCATCCCTGAATCTTCCTGTGTCT 

ACTGACTTCAAGAATGAAGCCTACGTACCTTrGCGGTGAGTGTTACIAGGTCTTA^ 

CCTGAAGAGCAAAAGAACAAAGTTTCCACAGCATGGAAGGCGA 

CGGGGAGGCAGCTTTTATTCCCTCATTTGTCCCAGCCCATGTCCTGCTGATTGGT 

ACAAACCTCTAGCTAGCTACAGAGCACTGATTGGTGCATTCT 

GAAAAGTTCTCCAAGTCCCCAGTTGACCCAGGAAGTCCXX5CTGGCCTCAT 

ACCCATCCTCTACTCACCTTTAACTACTrGTTCTTCCT.CTTCTGATCTCTTACCA 

CCACAGAATTGCCITTAAATCAGGGAAAGAGGGGGCCTGCCAT^ 

TTCCCTTCCCCATAGGGCAGGGAGTCTGGAATCCAGCATGCCTGCCCTCAAGAGGCTTATGACG^ 
CAAAATAAAAACAGGCTAGTTGTGGTGGCTTGCGCCTGTAGTCCC^GC 



TAGTCCCAGCTGCTGGGGAGGCTGAGGCAGGAGAATCGOTTGAACCCAGGAGG^ 

4 5 CACTCCAGCCTGGGCAACAGAGTGAGACTCTGTCTCAAAAAATA^ 

AGAAGACAAAT ATGCAAGTCAAGGTCAATG CAGGTGCAAGGATGG AGCTTGCCCTGAGTTTCCTCACAG CCAGGACATAAGGGAAA 
ACACGCITTACCrrGAAAGGAGAAGGCATGCTGGGTCCTCAG^jGAGAAATGCCCG 
CCIATGAGAATTTAGGCTCCTGGAGTATCTGGCCCAGACAGCCGTG 
TCTGCCAGGGCAGGCCACACTGCCTTGTGGCCAAGGGCTGGTCAA 

5 0 TGGCTGGGGCCTCCAGCATGGGATGCAGAAGGGATGGGCTVGGCTGCAGGCCTGGGGCTAGATCTCCCATGTC^ 

TGAACTTTATCAGGTAACCTAGCCCTCTAGGTGGCAAGTAAGGTTTTTGCCAAGATAG^ 

TTTTCrrTCTTGAGACATAGTCTTGaVCTGTTGCCCAGGTTGGATTGCAGTG 

CAAGCGATTCTCCTCCCTGAGCCTTGCGAGTAGCTGGGAT^ 

5 5 GATTACAGGTGTGAGCroCTACGCCTGGCCAAGTGATTAACTTT^ 

CTTTCACCCTGCAATAATTGGTACAGAAGACTTCTGTGACCAAATGCGTGCATACAC 
TCAGCCAGCGTCCTCTAATCCAATTCAATTCTCATGATCTCCACCrrGGACATAGC^ 
CAGGACAGCCTCCCACTTCCAGTAGCAATCCCAAGCCCCAGGCTGCT^ 
CCTGTGACTTCCTCCTTGGGTTCCATTAATTTGCTAGAGTGGCTCGCAGAAC^ 

6 0 ATATTACAAGGATACAGATGAAGAGGTGC^TAGGGCGAGGTAT 

ACCTTCCAGGAACCTCCACGTGTTCCGCTATCCGGAAGCACTTGAA 
GTATCATTGATTAAACC!ATTGGCCACTAGTGATCAACTTAATCrT 



TGATCTGAGGTCAGGAGTTCGAGACCAGCCTGACCAACATGGTGAAACCC 

6 5 TGGTGGCZAGGTGCCTGTAATCCCAGTTACTGGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGG 

TGAGATCACAACACTGCACTCCAGCCAGGGAGACAGAGTCAAACTCCGTCAAAATAAATAAATAAA^ 
TTAATCCrGCCTGGGTCTTrTGGGGTAACCAAC^ 

T ACAAAAAG ACAT CACTTTGGAGTTTCT AAGGATTTT AGGAG TTGT A<X CTGGG AAACT AG GT CG AAG AC CAAAT AT AT ATTTT AT 
AATATCACAAACATG CGTACAGGCG AG CACACAACTTGTCAGTGTACAGCTTAATGAGCTGGATAGAATAT CTT ATGTTT AACAG T 

7 0 TTGTG AACCTGGCATGAAACTrrTCAATACTGGG CACATCK5TAG ATGAGGCTTCGTTGAGTGGTCTCAATGCTACAG CCAGGC CTT 

AGGG CTTGGCGGGG AAACAGTTCCACTAGGGTCGAACTGGCACAG CATCGGGAAGAAGG CTTGCCTCCTCTAATGTGGCTGCATCA 
TCrCCATCCCTCCCTTGGCAGACCCACAGTGGCACTGCCCTCCCTCACAGCTGTGGT 
CTTGGACCATGTTTTAGCCTTTCTCAGCCTCAACGGATCT'rLTTTT 
TAATTCAATTCAATTCAATTTGTTTTTCAAGATAGAGTCT 
75 TCCAGCCTCOTCCrCCCAGGTTCAAGCTArrCTCCTGCCCCAGCCrCCCA 
GCTA ATT T T T CT A T1 T TT AGT AGAGATGGGGTTT CACXATGTTGG CCAGGCTGGTCTCCAGCTCCTGACCTCAGGTG ATGTGCCCA

CCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCCGGCCACTTT^ 

GCAGCAGCAGCACUVTATGACTGTGTTGT^ 

ATGGGTAAGCACATGGGCCTGGGTACAGCAGTCCTGGACTCTAGATGGTTCTGC 
5 CCTTGOTTTTCrCTCTGCAAAATGGGGATAACGAGGGCTATTGTGAGGATGA 
AGGTGTGTACTCAGTGAGTTAGACACACATTTTGCTGACATTAATTTCATG 
CTCCGTGCCTCTGTGCTACCCCCTGCCTTCTCTTGGGATGACTACAAACTCCTC 
TCCTCTGCGCAGCCTTCCCTCAGCACCGCGGCCCCAGCAAAGGCCC^ 
CTGTGGGCAGGACCAGTTCTCCATCATCTTTCAACCCATACCACCAAACATAGCACA 
1 0 TGTGGGATGAAAGGACACTGTTGAGAGTCACCTGCTGGTCCCAGACTCTTGT^ 
GATGCTCTGCCAGGAAGACCAGCTGGTACTTGTTGTGTATTTTCCTGTATTCT 
CCCAGTGTAAGAGTCAGCCCCTGAGGGCAGGGATTCTGCTCTGTAGGCCTCTGCATCACTA^ 
GTTGCAGGAGCACAGTTAGGAATTCTGTTATCAGGCCAGGCATGGTGGCT 
GGGCGGATCACCTGAGGTCAGGAGTTCTAGACCAGCCTGACCAACACGGTG 

TGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGTGACACAGTGAGACT 

GTTAGCACTTGGCCCAGGAATCAGACTCCTTCCCAAGCAGGCTGCAGrrCAG 

CCCTCACTCTGTGATCTCCCAGGAAGTCCGGCTCT 

AGGCTCTGTGACAGAACCACCTGTGGTGCAAATCCCAGATGTCCACTTCATAGCTTCT 
2 0 AATCTGAOTCTCAGTCTCTTCCrCTGTATAATGGCAATAAAAAA 

CAG AAACTGGCTAGG CGCAGTGGCTCATGCCTGTAATCCCrACATTTTGGGAGG CCAAGK5CGGACAGATCATTTGAGGTCAGG AGT 

TCGAGACCAGCCTGGCCAACATX3GTGAAACTCTGTCTCTACTAA 

CCGAGCTACTGAGGAGGCTGAGATGGGAGAATCGCTTGA 

CCAGCCTGGGCAACAGAGTGAGACTCCATCTCGAAAAAAAA 

2 5 TAAAGATGTTCTACAGGTCATCTAATGTAGCAGGGTTATAAAAATATTCTCTCCCAGGCCAGG C l'l^ 

AATTCCTTCCAAAACAGAAAAGGCTGGCTTCAGGGAATCCCAC^ 
AAAGAAGOSGGTGGGGAGGCATGTGGCAGAAGTCTGTGACTTGGT 
GGTATCCTGTTCAGCCAGAGTACTCACAAAGGCCTCCAGTTCTCAGGAAGGGAAGGGT 
TTTTTTTTTTTTTTTTTTTTGA^ 

3 0 CGCCTCACAGGCTCAAGTGATTCTCCTGCCTCAGCCrCCTAAGTAGCTGGGA 

CAAAGTGCTGGGATTACAGGTGTGAGGCACTGCGCCCAGCCTATC^ 

IT AAT AT CAC CCT AGTT AATGGCTGAGG CACTGG CTGTGT AAG C CCTTG CT AAG CACAGTGAGTG AG TGTGGCTGG AAGTGCAGGA 
GGCCTGGGTGGGATGCAGGCTG ATCCAGGGCTGGCAGTCACTGTGTG CATCCAACCTTTAGGCGAGAG AGGGCTG CTAAACTCTGA 

3 5 GAAGGAATTCAGGGTG AAGAAGAGCTGATGTGCGAATGGT ATGGCTGGCACAAG CCT AAGCAGAT ACCAGCTATTCT CCTGCAAGC 

TGATCAAAAATATTTTTATTAC^TAATACrTGACATATACAAGAGAA 

GAGCCTCACTATGTCCCTGAAGGACCAGACCGTGACTAGGCCX3TCGTGTCCATCTGTGGGTCCTCCT 
CTCTAGATGTGACCACATCCTGAATTTTACCTATTTATTCCATTACTT^ 

CTGTG AT CC CAG CACTTCGGGAGGC CG AGGCGGGAGG AT CACCTG AGGTCAGGGGTTCG AGAC CAGCCTGG C CAACATGGTG AAG C 

4 0 C C CAT CT CT ACT AAAAAT ACAAAAATTAGCTGGG TG TGTT AGC ACACGC CTG TGGTCCT AG CT ACTCGGG AGG CTG AGGCAGG AGA 

ATCG CTTGAATCCAGG AGGCAGACGTTGCAGAGAGCCGGGATC ACGC CACTGCACT 
AAAAACCAACAAACAAAAAACTGGTTTTCCCAGGTAATAATGCCTAAATC 
ATCX3TGTTATACTGCATGTGGCCCTCTATGTTGAGCTATTCTCTCTCTGCATTGTC 
CKTTGTAGTTCTa^CATGCTCACAGTTGCATAATAGTCCATCTTTAGAATATACA^ 

4 5 GCATCTCGCTTGTGCACAGGAGGGAAAGGTGTTC!ATGGAGGTACCT^ 

G AGCTTTACAAGATGATGC CAAACrGTTTTCCAAAGAGGTTGTGCCCATCTGTG CTGTCATCTTTG AG A1TGTTTTTCCCATGGCC 

C^GGCCCTTCirCCOSCCCGAC^CAAGCCC^CT 

TCCCTACTGGAAAACTCTTTCCTTTCCCTCCTCCACT 

CTTCCTGGGAGTCCTCTCAGCCTCACTTCCACTCTCTCCCAGGTTC 

5 0 ACATCGCATAGTGTGTTTCTTATAACATCAGCCCCACATAATCCTGTTGAGAAAGGCTTCTGG 

CACAGG CAACTTC^GAG CCT CTTGG TGTTTATTGTT CT CATTTG AAT ATT AGCAG CT CATT AGCAT AGTT G CAGCACTG AG AGG TCC 

TGCCAGTGTGACCCGTTTTTATTTGTTTAACTTGTTTTCCAAAAATA 

AAAATGGGATCXiAGAAGATTCTCCTGCCTATTTCAG^ 

TCTGACTTCTGTTGTAATCATCCCGTGTAAACTAGTTCAGTCCAGACCACCCTTTCTGGAG 

5 5 TGCACTCAGGAATTAATGAATGAAGCAGTCAGCCAATCAGTC^ 

AGGGACTGATCAGGCACCGCTCATCTTGCACCAGKKX3ACA 
GTAGTTGCATTTTTAGGATCTGTCACCCAATGAGGCTGGGAGCCCCT^ 
CCCGAGTGGGACATACAGATG^rGATCTCTCATCCAACTCCTTCTGAGAGG 
GAGCCZAGACCAGATGTGTCCCTAATTACCTCGGTAATGTGGGCCAATCT 

6 0 ACATACCTTCCTTACAGAATTATTATGGGGTTAATATGCGGCAAGCACACAAACAGTGCCT 

ATTAGCACTATGACAATGGCAACC!AACATTTATTGTGCACCrACTATGTG 
CAATTCTTCCTCCCCGCA.GCCTCITTGAAGTGGGGGATGTTTTTCT 
GTGGTAGCAATAAGATCAGGATTGGGGTCAGCCAGATCACAAAGCCCTC 
ATGGGACJUjCTCCTCTGTlTCTlT T TACTGATAATTACCCTGCCGACAACCTGATC^ 

6 5 TCAACAAAATGGAAATCCATC^TACTGCAAGGGTTAGAATTTAACATGTGTGCACACGTG 

TGTGAGAGAGAGAGGGAGAGACAGAGACAGAGAGAGACAGAGAGCAAGAGTGCAC^ 
TGGGGGTTACTAGCACCTTCATGACAGACCAAGGAAGACAGAGATCCCC 
CGCTTGTGAGTCAGCAGTGCCXjCAGAACIAGCAGGGTGCTCCC^AAC^ 
CTGTGTGCCGCXSAAATCnTTGGAGGCrGGACACCT 

7 0 CCTGAGGGCTCGCAGCITTGGATCTXiAGAGGCTTA 

TTTGCCCACGGGCCGGCTGGTCnTCTCAGJVCCCCAGAGGCCAGCTCCTCT 
CAGGGTCAGCTGTCTGCTTTCCACACTOCCT 

GGC^CGCCGGTAGAGGGGCTG^TGCGCTGGTGTGGGGTAGGAGGTTTCGGATGCACAT^ 
CAAGCCTGGGGATTCATTCCTGGTAGGGAACAATTCCCTCAGCAC 
7 5 ACACAGGTGAGGTAAGCAGGTAAGAATTCCCACCAGGTGTCTCAGAAATGTCCTC 
AATCCCACTATTTTGGGAGGCCGAGGCGGGCAGAT

TCT CTACT AAAAATACAAAAATT AGCCAAGCATGGTGGTGGCTG C CT AT AATTC CAG CT ACT CGG AAGGCTGAGGCACG AG AATCA 
CTTGAACCTGGGAGGTGCAGGTTGCAGTGAGCTGGAAT^ 

AAAAAAAAAAAAAAGAAAAAGAAAGAAAGAAAGAAAAGAAAGTCCTGGGAAGCTGGTGG 
GAAAGGAGGGAATTC C CAGG CATCTAT AGACCAAT AGGACTGTCGAAAGTTTG ACTGGTGTGAG AAT^ 

TCTTCTTTGCCT 

CCTTCTAGCTCTGACAGGTCCTGTCGGGGGATCCAGCCTGCATATAGGGC CTGGG CTTCTTT CAGCCCCATAGATGGTGACCCCTC 

TTCACCGCTAGTAAGATTGAAGCTCATCAGCAGGACCTCAGGCCCTGGCCT 

TGTGACATCZAATGTCCCAGAAATGCCCATCCrCTTCCTTTTCTCT 

ACCCCrGTTCTATGTCCTCCTTCACTTGCTCTCCCCCACGCCCCTCCCACCAGAGAG 

TCCTGCTGTTCCCTATGAACATGCCTGTTAGATTTCACCGTGTGACACTGTCTT 

GAACACATAGCAGGTGCCGAATAAATGCACTCTG 

TAGCGATAGCATGGGATTTGGGGTCCAACTCCACCTCCACTTCCTGGCCGATGGCAATCTC 

TCTACTCTCAGGGCTGTTGAGAGGATTCATGGAGAAGCTGTGTGAGGAAATC 

ATCTACCCTGGCCTCTCTCTCAGGAACTCTTGCCGACATT^ 



CCTTGTCTGAGACACAGAGAAATGGACCTCCAGTGCGGAAGATCTGCACCA 
ACATTCGAAAGAAAAAAGACTGATAGGCGGGAGCCCAGCATTTGAATTTACA^ 
2 0 AGCAGGACCTGAGCACTTCCTTCCCTTTCTCCTTAGCCT^ 
TCA TrrriTTTTTTTTrn ' T T m 

ACTGCAGCCTTGACCTCCTGGGCTCAAGCAATCCTCCCAACTCAGCCT 

CAG CT AAl 'Trr T ri"rrrri"ri 'l ' AATTTT AGT AG AAACAAGGT CTTGCTTT CT CGGCT AGGCT AGT CTTGAACAC CTG AG CTCAAGT 
GCTCCTCCCGCCTCGGCCTITCAAAGTGCCGGGTTTACAGGTGTGAGCC^ 

2 5 GTGCCACCATTTCCTAGCTATGTGACTTTAGACGAGT CACTTGGCCTCTCTGAGCCCATGTCAT CATCCX3TTAAATTGGGTTCTTT 

ggcaaattaaaggatgtattctatgtaaagtgctttaca^ 
ttaataatggccactggccaggcefigatggctcacacctgtagt 
gtagttcgaggctagcctggtcaacatagtgaaactctgtctct 
atagtcccagctactcgggaggctgaggcaggagaatcgcit^ 

tactacctVacttagagca 

caccitatcttattcattatgggtaatcacctgtaaaagcgttctttatttcat^ 

atgacgtcatcccccctctggaggacacctcctcccacccatg<3tgatttgagccttct 

ccttaactggggctaatgtgoictgacattttgccttttgtgt 

3 5 taaatgttcgctatttctatgtatgtgtaagagagagttgtttc 

gctggagtgcagtggtgcaatcacagcttactacagcctcaa^ 
ctgggactacaagtgtgcaccaccacacccagctaatt 
ctcaaattcctggcctavggtgatccccctgcctcgaaattctgaagtgct 
agattttatcttttctacacaacgtgtgtttcccccagaagaaatgcct 

4 0 gaaattacagaaactccctgtgccccagciacccritaccagggcatgc 

ggaaggaagggagtgggagaggaggtgaggctctgcagagcagagtggaccatttct 

ggtggctcacgtctgtaatcc cag cactttgggagg cttgaggcgggcgg atcacttgaggtc^ggagtttg ag accagcctggcc^ 
ggagaatcgcttgaacctcgaagacggaggttgcagtg^ 

4 5 CTGTCTCAGAAAAAAAAAAAAGAGTAAGGAAAGCGAAGTCCTGGGTTGG 

TGGAGCCCCCAACCCAGTAGGGAAAAGGCTTGCTGCCTTGAACCCAAAGAGTGTGAGCATCCTGTAC 
ACCTACTCACAAACAGATACACCCCACTGC^CAGCTAAGAAAGCTAA 
CCAGAGGAAACX3AGGAGTCAGGCCTGGGCAGGGGGTTCCAGATGACGCATC 
TCTTTTATCCTGTCCTTTTCCGATGTAOTTGAGCGTGTGTT^ 

5 0 CTGCCAAGGTGGACGGAAACGGAGGGGCGTGCCCAA^ 

GTGAACATGGAGATTTGCTGAGAGGCCAAGGAGAGAGATGATGGGGGAGGGCT^ 
TGGAAAACAGGGGACGGGTTGATCCTAACAACTTGGAGGGACTGGTA^ 

CCCAGGAGCCCCAGCTGTGGACCCAGGGTGGGGCTGTGGGCAGGTCCCCAGCCCTGTCTGTGAC 
TGAGGGAGGGAGGGAGGATACCACAGAGGGGAGGGAGCACCTAGACCACTT 

5 5 GTTTCT AAG CTCAG CAGGG CCACTT ACT ATGCAG C CAGGG ATG AG AACCAATGG C CCACC CTG CTGAGTG CTGG AATGTGT AATG A 

AGTCCTGCGTGGGTCTGTAATGAACCCAGATTCGGCTCCTAACAGGTGCTGAGTGCATGTTTC 
GAGAGGACAGAGCTAAAGCAAGGAGCCTGCGGAGGAATGGGTGGC!AGGGACTGATGA^ 
GTTGAGGCTCGCTCTGGACTGCCTCGCCTGGGGATGGAGAATGGGGCTGCAAGG 
GCCTCCCTAAATAGAGTGGGGGAAGTCCAGGTGCTGGGCAGGGCTGTCTTCACCTGCCCATCCT 

6 0 GCAAAGAAGCCCTCCTGGGAATGTGGTGGAAATACCCACCTTGAAAGGTAGAGAGTGGA^ 

TCGCAGCATCCCCCGTGAGGCCTCCCAACCTGCAGCTGCCTCCTGCGTGCGCATCTGCCACTGCTGGC 
CCCAGCTGGAGGTCCAAGCTGGAGCCCTGGGCAGTGTCCAGTGCACTGTGGCCT 
TGGCAAGGCCCTGGGTTCAAACCCTGCTCTGAATTTGTCCAGTTGCTTCCCCATGGTGT 
GTTCCCTGATAACAAGAGGTTACCTCTCAAAGCTTGGTTAGATCAGATTAAACGTCTTTG 
6 5 GGGCCTTCCTGCTGCATCATCTCATACCATTCTACTGTCACACTTCT 

CCAG AGG CTCTGATCCTGCCACGCTGGTGTTGGAG CCTGGGTGTCTGTATTTTTCAAAACCACCCCAAGTGATCTGCCGTAGTGTA 

TAGCCTGCATCCCTGACCCCAGTAGCCAGCTGGTCAGGGACTGTCCTTGGTTTTCTGAAGAGC 

CACTCCAGGTAGGTGCCAGCCCTGGTGGCCAGTGCTATCAGGTGTGAGGCAGGGAGGG 



7 0 CATAAGGCCTC^CCCAGCGTGAATGTTTATTGAATAAAGACACACACCAGGCT 

TAAGAGTGAGCTCCCAGCCAAGCAGCACAGCTTTGTGTGTTCCCCACTGGGCrrGCTTCTAAG 

GTGGAGCTAGCAGCTGTGTTAAAAGGCCCAGAGAGCCAGC^ 

GTGGGACCTGGGAGAGACGAAGGAAAGGACTGGGCACTCAGTGGGGCT 

GGAGGTCCTCG CAGGG CCACTGCACCTGCTGAAGCAGGTGCTAG ATT CTACATGGAAACGAG 

7 5 ACTCACCATCAATGGAGTCGGTGGACAATTCACTTGAGTGCATTGAAGGTTACCTGTGACGGATGTATAAGAG 
CAGAGGGAACAATTTGAATTGGAGACATTAAAAGC&GATTTAA^

GAACAAG AGGGCTGGG CAGGCTATGTCTTGTCTTTT ACCAGAACTGGCTTGAAGGGT AT AG CAATG TGGGG AC CCAGCAAAATCCT 

GACTTAGGGTTTGGAGGGGTCAGGAAGGGGAGAAATGTTACAAGG 
5 TGAATTAGAATTCCJVCTTAGTG<3AGTCGGAjGGCAAAG^ CTGAGCTCAA 

CTTCAAGGCCCAGCTCGAATGTCACCTCTTTGAAGGGGAC^ 
GAAGTCCCAAGTGGGACTTCTCACACTCTGCAACTGACT^ 

crrm i m itx-i'I'itttctgaggc^gagtctcactctgttgcccaggctggagtc 

TCTGCCTCCCAGGTTCAAGCGATTCTCCTGT 

1 o ttttctgtttttagtagagacaggcgtttcgccatgttg^ 

gcctcccaaagtgccgggaatacaggcgtgagccaccgc^^ 
gtactgaatgaggcccagccttcagatgatcaggtctggg 
tgggagtcagcaggcctgggctgtgctctctgcactgcct 
cttaatttcccc^aggagctgaactaggtgatgtcaaaggtcccgctt 
1 5 actaacatgostggccatttacagaatgctcctaaagcccttcc^^ 

gcaagaaaactttcacaggtggaga ctr ' rr c ttcaacaggtctttattggctgggcacgctggctcacac ctgtaatcccagcact 
tggggaggctgagg1xk3gaggattgcitgggcccagg agtt 
aattttaaaaacttagcagagcatagtggcatgtgcctgtagtctcagctact 

AGTAGTTCAAGGCTGCAGTAAGCTGTGATTGTGCCACreCAC^ 

2 0 AAAGAAAAAAGAAAAGAAAAATGGCCTTTACTGAACATCTATCTTTGTACT 

GTTTCTAGAATGTTCTAGAATGTTCAGCATCCCCTGAGTTGCCAAGACT^ 
CTGATGGGCAGAAATGCTGTGACCCCAAGTAGGCCTCTGGGCCTTGATGACATCCTCAGTAACA 
TGTCTGAAGGCAGAGGGCTGGGAGCCTACCTGAGCACGCACAGGGCATCTG^ 
TGGAGAAACGGACCCAGCTCTCAGCATGTGTCCTTGGGAGCTGCT 

2 5 TTGGTGCCTCTCAGGGAGACGCTCTGGGGTCTGAAGAGTGAGGTCCCTGCCTTTTACCCGTCTCTGGAC^ 

CCCTTCCCTGTGAATCCCCTGCCTGAAGGAAGAGACCCACAGCCCTGGGGT^C 
CCTGGGGAGGATGGGCTGGGGCTGCTGAGCAGGCCAGGGGTGCAGGACCCCTCCATCCCCCGCTTCT 
CACCCTTGACTCCTCTCACCTCTTGTCCCCTCTTCCCTCCTGTTTCCACTCT 
ATGCCCCGGACCACAATTTGGCTGAAGGCAACAATCCCTCTCT 

3 0 TGTGAAATGAAGGGGTGGGGGGGTGGGTAAGAAAGAAGGAAAAAGCACTTCATGCTC^ 

TTT ACAGG TTTCAACATTTTTCAGG CGGTCTTGT CACATCTTAT CAG ATCAGC CAG AGC CG CAG GCAGGCGCT CCAGG AG CTGCTT 
TGGAGAACAGAAGCTGCAACATTCCAG CCCAGG CACTGCCCCGCCCTCCACACCCCTTC CCTCCCTTCCCTTTTCTCCCCAAAGAA 
ACATCCCACAGGGCTCCCTGGGATGGAGCACAGGGCCCGCAGCCTCTTCCCTCTGACT 
CTTTGAGCTTCTGCCCGCACCCTTTCCTTCTCCCCACGGCTGCCCAGAACCATGGTCCTCCCGG 

3 5 CTGAGCTGTGCCACCTCCTCGGAAGGCCCCTCGTGGCCCTGAAAGGGAACCTTCAGAAGCTGCGGG 

CGTCCACAGCATGTAGAAGTGGTCATTTGGCCAGCAGGG 

CGGAGGGGAAGGAATAGCTTTTTCCTTGGTTATCTTTTCT 

TATCCCCTTCCACTCAGTCACCCCCAGACCTCACCTTTCTGTTCCTGCTGCA 

CAGGTCCAGGAAGGGCTGGCCAAAGCCCAAAGGCTCAGAGTTAAGCCGAGGCAACGTGAGGGACAGACC 

4 0 <^CAACAGGGAAGAGCTGGCTTTGTAGCTTGTAACA 

CCAAGGGAAGGCAGCTGTGCACTGAAGGCTTAGGATGCATCATAAGAAAAGCAAAAAGGGG^ 
GTAATCCCAGCACrrTTGGGAGGCCAAGGCGGGCAGATCACGAGGTCAGGAGATCGAGACCATCCT 

TCTCT ACT AAAAAAT ACAAAAAAATT AGC CGGGCATGGTGG CGGGTG C CTG TAGTCC CAG CT ACTCGGG AGGCTG AGGCAGG AGAA 
TGGCGTGAACCCGGG AGGTGGAGCTTGCAGTG AC CCG AGATCGCG CCACTGCACTCC AGCCTGGGCG ACAGAGCGAGACT CCATCT 

4 5 CAAAAAAAAAAAAAAGAAAAGCAAAAAGGGGAAAGCGCAGAGGAGGTAGGGGCACAG 

TGCATCTCGTGGGGACCTCGTCCGGAATATGGAAAACACAAGCCAGACCTGGG 
AGGTTTCGGGAGAGCCCACCTTTCCAGCCTCCTAGTGTGAGGTTCAGGCCCTC 
CCTCCCTGGTGCCTTCCACCCCTTC^CATATGGCATCAATGCTCCCACCGTGACGCTGA 
AGATGGGGCCTAAAGTATAATTCAACTTTTAGCTCCTTACAAGCATCCCGAGAT^ 

5 0 AGCTCTGTCTCTACGTGGTGCCTGGAAGTCCCTCTGGCCCCACGCA 

TCCCATCTTGGTGGACACCATCTCCATCTTCCTGGTCATCCrGGCTCAAAACTT^ 

CCTC^CACAGACCCCGGTGACCCAGACACATGGGTCTTGGCCC^ 
CCCAGGCCCO^GTCTCCTTAGCATCCACGT^ 

5 5 ATAACTAACAGGATGATATTCCAAAGCCCAGCTAAGGAAGGTC 

GCTGCTTCCCCTCCTCTGCTTCTCTGTGTGGCTCCTAGACCCrGGCT^ 

CACCTGGCTCCACCTGAGGGCCCCTAGCCATCCTTCTCCAGCTCGGTAGCTGCTCCACA 

CACTTGCCCCCGTTCCTAGAGATCGCrCGCCCTGACC^ 

TGGGACCATGTCTCACTCATTTAAGGCACCCACAGTGCCCAGACCTAGTAC^GCTCAAGAAATGCTTGTTGG 

6 0 GCAGCCTCCTGGGCTGGGTGCTCCTCAGGTGTGGCTTCATTAATCTCC^CGT^ 

ATC^CTCAATTCGTGCXSTGTTGAGTCAATAGGCAGTGAATGACTTTC 

GGCACTTCCACTGACTCCACAAATCACCAGGCCCCAGTCTTGACAGCTGG 

CCGGGCAGCTGCTCCTTCATGGTTTTATCTC^GACCTTC^ 

CCTGTCCT AG ACATAGG CAAGGGTGCTCCCACCCCTGCTGGAGATGCAGG CGAG CCAGAGGGCCCCGTTGAGAGAATGAACTAGCC 

6 5 CCGTTGAGAGAATGAACTAGCCCCTAATGGCAAGGCCCAGAGACATGTCCCACTTCT 

GGOSCrCTCCCACAGCCACAGGACACAGCCCCACCT 

ATGTCAAGAAGGTCGAGACCTGTGCTCAGTACCCGCAGGGTGCACATGTGTTCCAGGAGGGCGAC^ 

TGAGCTAGGTTTCTCTGGTTTTGTCACAAGCTCCCTGCCAAGCAGCCAGGTGAGCGGCT 

CCACAGCAGTGATTGCCTCGTCACTATGGGTGTAGTTGGGTGC^ 

7 0 CCTGAACTCXSCTAGGCTTCCTGTCCTGACAT^ 

AAAGCCAACGCTTGGTTCCTCACTTCCTCCTCTTTCCTCAAA 

TCTC^GCCCAGGAACAAGTTGGGCCTCTGTCCAGTACTGCCCAGGAGCCAGCAGGTG^ 

TGACAGG AGCAC CGTGG C CC CCACCACAGGG AG AGG AAAATG AC CACAGG AGAGGTGATTT CTG ACTCCTTCT C CTCACC TTC CCT 
CCTCCTCTCCCTCTCAGCAGCCATGTCAGTCATTTTAAGAGACACT 
7 5 G CTGGCACAG ATGGTT AG ATTTCCGCATGGAG AAAGGT AAGGG CAAAGGG C CT ATTT CTG CAAAG AAGG CATGACTTGGGG AG ACT 
TGGCCCTGGAGAGCACCCTG AGACCGGAGGGAGGAGGGGAG CCTTTCCCTGGGCCGCAG CCCCTGGGTCCACCTTCTGCAGCCCCA
CCCATGTGAAGCCAGGTGGGTCCTTTTTGTGCCCAGCCCCAGAGAAGCTTCTGA 
CTGGTGGGTAAAAGAACTGTGAGCCCTGGGTTCTCCTCCCACCT 
TCTGGGCCTCAATCTTCCCGTATAGAGGAAGAGGGAATTAGACTAGACCCXJGGGCTCT 
5 GTGCAAAGTTGAAGGCIXXSGGATGGAAGCACC^^ r AATTTTG CT ATTT AAC CATTTT AAAGTGT ACT AT 

TCAGTGGCATTAAGTACATTCCAATGCrcTC 

AACITCCCACTCCCTCCCTTCCAGAACCCCTGCCAACCTCTGTTCrACTT^ 
GTGGAATCATACAGTGTTTGTCCTTATGTGTCTGGCTGCTGTC<3CT 
AGAATTTCATTCCTTTTTTTTTTITTGAGACA^ 
1 0 TGAAATCATACAGTATTTGTTCTTCCX3TGT(nX^ 

GAAGeATCAGAATTTCATTC CrT ' ri T 11 I I 1 1 T GGTTGGGGGGACAGAGTCTTG CTCTGTCG CCCAAG CTGGAGTGCAGTGGCGCA 

ACTGGCTCACTGTAACCTCTGCCTCTCAGGTTCAAGCAATACTCCTGTCTCAGCCTCCT 

ACCAGGCCCGGCrAATTTTTGTATTTTAGTAGAGATGGGGTTTCACC^ 

ATCCACCCGCCTCAGCCTCCCAAAATGCTGGCATTACCGCCACCACACCCGCTTCCTTCCTT^ 
1 5 GTATGTATGGGCCACCTGTTGTTTATCCATTCATCTGCTGA 

GCTGGAGCGCAGTGGCGCCATCTCAGCTCACTGC^ 
GCTGGGGCTACAGGCGCCTGCCACCACGCCCGGCTAATTTTTTGTA 

2 0 AATTCTTTTGGGCGTATACCAGGTGTGGCATTGCTGGATTATAGGGTGATTCTATC 
TC^TATGTTCTGTATGTAGGGGAGAG^AGGGAGATTGTGTGTGA 
CCTAATCGATCACTCAGGGGAGTGGGTGCTGGCAGCACAGTGGGTGT 

GCCGTCACTGGGTGGACGGCAGCATCTGAGGCCAACAACCCCAGTGTCAGATCAGGCAGGACGCACT 
GTCTCA.CCATATCTTAGGTGCGTAAGTCAATGCAGCATCCAGGCTGACCTGAATTCA 

2 5 TCTATGGCCAGGCATGGTGGCTTACGCCTGTAATCCCAGCACTT1X5GG 

ACCAGCCTGGCCAATATTGCGAAACCCTGTCTCTACTGAAAATACAAAAATTAGCT 

CT ACTTGAGAGGC CG AGG CAGGAGAATCACTTG AACTCGGG AGG CAGAGGTTGCAGTG AGC CAAGAT CACG CTTTGT ACT C CAG C C 

CGGGCAACAGAGCAAGATTCTGTCTCAAAAAAAGAAAAAAAAGGTCCT 

TCTGGCCACAGCAGGCGTTATCAGGGTAGAGGCACAG^ 

3 0 CTTGCCrrATTTGTGGAGGGATATTTGACTCGAAG 

CCATCCGCCTGGCTGCCCGTGTGTGTCCTTGGCCCAGCTGGGCA^ 
GTGGCAGGGTCCTAGGGAAGGGAAGGGCCGGGCCAGGAGCTGGGTATAAAT^ 
CTCCGCCCrCCCCCTCACCAGCrGTCAGTGCCAGGO^TCrCT^ 
AGGAACTTCCTGAGCGGAACCAAGGAGTCTTGTCCrcTGT^ 

3 5 TCAGGCTCGTCCTTTGGAAGAGTTCTTCATTCCTTTCCT^ 

GTGGGCTGGGGTGTGTGTGTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGATGGGGTAGGGACCG 
GAGAAAACACTACCACTACCTCTCTrrcriTGGAGACTGAGACT 

GGATAAGAAGGTTTGAATTAGGTACAGGAAAGATTCAGGCGGAGGAGAGGTGACTTCTGT 
GGTC^TGACCTAGGAATCAGGTTTCGAAGGATGGTGTAGAATGGTTGCC^ 

4 0 CTCAGCATGTGCAAATGTGCAGAAGTGTGAAAG&GCCCCGCTC 

CTTTGGCACAAAGAATCGAGGAGC^TGAGAGTCACTGTC^ 

GGGGAAGGGCAGCC!ATCCTG(IAGAAATGAATCCCATGGCGAC^GC 

GCCAGAGGCCTTGCTAATTACTCGCAGGATCACCCCATCTGAACTGAAGCCTTTGAATGG 

4 5 GCACTCTTTGAGTGACCTTGAACCCTCATCCAGATCACAACTGTCTT^ 

CATCCrTTCCAGTGTTGGATAGGAAGGGGCTGAGATTTCTGATGTGTTCTACC 
TGGTGAGAAGGCAGGGAAGAAAGTGCTGGGGACAGCTCC^ 

GTGCCATGCACTGGATTGTGTCACTTATTCCTCCTGTCGTGTCTATTTTACCGATAAAGAAACT 
TG CTGAG ATOVGCAGOTAGG AAGTXX5TAG AGCTGGATGGG ACCCTGGCACC ATC 

5 0 AACAGG AAGGAG ACAGGG TTTGGTGG AAAG AGGG AAAAC AG CTCAT CATG ACTGAGG CTCAGG AATTGTTCT CAGAGGACATCACA 

CTGGGAGTCAGACAAACTCAGGGCTTGAATTCTGGCT^ 

TCATCTGCAAAATG-GGGATAACAATGCCCIACCrCCCAGAGTTGTTGTGAG^ 

AGAGCIAAGTGCTACATAAATAGTTGTCATTAGTATTGCCAGTAT^ 

5 5 ATGAAGCCATGCAGGCAGTGTGCCAGAGCTCAGGTGCTCTGCTGGG 

GAAACCAGGAAGO^GGGAGCCCCTTGGAGAGCTGGGTGAGGAGCCGCCTTGCCAGCATGG^ 

C^GCCGGCCrGGCTTTCTCTGTGGATATGGCTGTGTGACCTCXCACAAGTTGGTCCT 

CTTTTTTGTTfTGAGACGAAGTCTCGCTCTGTCGCCCAAGCT 

CAGGTTCAAGCGATTCTCCTGCCTCAGCC!TTCTGGGTAGCTGGGAT^ CGTG AGCCACCTGGCATGGT CGGCCTCTGTTTTT 

6 0 CATCTGTATAATC^CTTTGTGCTCCCTTAGGGCTCCT 

AGGGTGCCCAGGCCTGCAGAAGGTGTCAAGCAAGGGCTGGGAGAGGGGGAGGAAGGAGGAGGCAGGAA^ 
AGGAGGGCAGAGGAAGGGTCTCCGAGTGCCTCTCACTCCCGCCT 

AAACACCAGAGGAAGGAGCGGGGACAGCCCAGGCTTCTTCCCTGGGTCTTGTTCCTGAGCTGCCCATCT 
AGTTGTCH'GCACTGAACTCTCCCTGAGTTTCCCCTCCAGTGATTGC^ 

6 5 GATGCTGCTTAAGAGCACGGGAGGGGCAGCCTGAGCCCAGAGGGAGGCTCAACGGGGAC 

TACTTAGCTCCCTGGCCrCAGTTTCCTTATCTO 
GGATTACATAAGGTAACAGACACAAGCACrGAGAACTGTGTC 

ACTCTAACCrGCTTCTAACAGGGCAGGGGCAGTGCTGTGTGCGCCGTGTGCCCATCAGGGCTCT 
CAGX3CATCTGCTCAGTATGGAGGCCCAGGGTGCAGATGGGCCAGGCCT 

7 0 GCCCAGCAGGCAGGGTCACATAGTGATGACCCCCATGGGCTCTGGAGTCTGAC^CC 

TCTGTGGTCrGGGGTGCTTGATCAACCTGCIAAGGCTAAGCC^ 
ATTTTGAGGATTAAATGAGGCAAACCGAAGGAAAAGCCTCTGCTC^ 
CGATAAATCAGAACTTCTGGGCCGGGCGTGGTGGCTCGCGCCTGTAATCCCAGCACT 
GTCAGGAGATCGAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCT 
7 5 CACCTGTAGTCCCAGCTACTCGG^AGGCTGAGGCAGGAGAATGGTGTGAACCCAGGAGGCGGAGCTTC 
TCACrcCATTCCAGCCTGGGTOATAC^
TCTCTTTAGAAAAAGCTGAGTGCITAAGACCTGGAATGC^ 
TCCACCCAAAATAAACAAATGGAAAAAGGAGAACCTGCCTCT^ 
CATGCCCCTGCACGGTGCCCAGGGAGCTCTGTGCCTTGGAGTCTCCA 
5 AGGGACCCGGGGGAAGGCACATGGCACTCTCTGACCCACCACAGACCAAAAAGCCG^ 
GTTTCCTAGTCCAGGAGATAGTGATGTTCTTCATGTTTGGGAA 
AGTTGCCAAATGGCAGATGTTTGGCTCGGCCCTGTGTCCAGCCCAGAAGCTTT^ 
TCCCCCATGGGGTCTTCTCCCGGCCCCTCTTTCCGA 

AC CAAAGC CTTGGTCAGTG CTGGACACAG AGGGAGAGGAGATGGG C CC CTTCTTC CAGCTCACACTG C CCCTGGTCAT AGAGCAGA 
1 0 TTGCAATTGTCCTTGAAGGCCACTGACACTCACGG^ 

GAGGGCAGGGAACGGAAAAGTCGTAATTCCTACCTTGGTTTTCT 

TTCTGTTGTTGACTATAAAACCTGAGAGAGTTGATGGGATATTGAT^ 

CACCrcGTGTTGGGTGTGGGCAGTGAGATCTGAGATCTCCCTTCTC 

GTCTCTCATGAGAAAAGAGGTGGAGCTCTAGAAACTTCTAGGAAACTGGAAAGATGGTC^^ 
1 5 GGTG<IAAGCCAGCAATTCCnTGGG<3AGATACTGGGAGGCTCT 

CAAGGGTAGTGTGTGTGTGTGCGCGCGCGCCCGTGTCCCATGAGTCACTGGAAGTGTCC^^ 

TGGAAATTGCCCAGCCTGATCTCTTCAATGCCCATCAGGGAAGCTGGACCCAGAACCT^ 

CCATCTTTCTAGTCCAAACCTCTTTTCATGCCTTTGCATTGCCCCC^ 

CAAGCTGGATGCAGCCAGAGGGCTGACTTGCGTGGGTCAGAGCTCTGCGCTTTG 
2 0 ACATTCAGGTGCCAGGGCACAGGCCCTCTGCCTCT^ 

TGTGCTTTGTGCCCACAGTGTTACTACCCTCTGTCTGCCTGCAG^ 

GCCCCTGCTCTGCGGCTCCATCCCAGGCCrGGTCCCCAAGCAACTGCG 

CCGAGGGCGTtlAAGCTGGGCATCCAGGAGTGCCAGCACCAGTTCCGGGG 

GCCATCITTGGGCCCGTCCTCGACAAAGGTACTGCCTTGGGGCTGAGG^ 

2 5 TATGCCCTTCCACCCCAGTCAGG^TGKTCTCTTCTCTGAAGCCAGGGTCCCTCT 

CAC CTT CCGAGG CGCTGCTGGGGG CTGGG C CGC CTTGAGATG CT CCAGGAT CCTG AGGTTTGAGGG CCG ACAC CAAG AAGG C CACA 
GACTTCCrTCCTGGCTTTTGCACAGTG 

TATGCAAATACCGCTTTGGGGTGGGGGTGAGAGCATTGCCACCCCTAGGTCCCT 
CTGCCTGGTGCTGTGGTAGCATTGCGTTGCCCCCAAGAGCCCCTCTGGGTTTC^^ 

3 0 CCAGGAATCATGGTGG^CTAGAGGGTGCACTCGCCTCTGGGTAACTTTCC^ 

CCTCAAGTGGACAGGGAACCACATGATTTGACTTTGAACCTGCAGGCCCT 

GGGGACTTGGGGGCTCATGGACTGGGTGGAAGATGGGCAGCTCTTTGTGGA^ 

GCTGGCCACAGCCAGAGTTGCTCTGCCACCCAAACCGCTGAGCCT 

G CCTATACAG CCAGAAAGAAAAGOGATCTGACGTTCAGGTGAAGGGGTGGGGTGCTGG^ 

3 5 CCATTTTAGAGTGCAGTTCCCCTTTACOSGTGCCTCAGGTGCTCAGG^ 

GGGACAGGAGG GT AATT AG C AAATCAATGG ATG ATGATTCTG TTT ACG CTTCT ATTGCCAAAACT AATT ATTCCTTG TTG ACAT AA 
AAGATTTGGCCTCTTAATTGGGGCCAAGTGGCGGCXSGT^ 

GGTGCCCCTCCTCGGGCCAGGTAGCGCTGAATCAAAC^CCCTGCGTCTAGACGCGCTC 
CTTCCCGTCTCCXK5CCAAGAGCCAGGGGAAAGCGGCTCTGGGAGCGCT 

4 0 GTAGGTTCTGGCACACGCCCCGGTCCCTAGATGGC^ 

TGGGAATTTGCTGGCGGCGGCGAGGAGAGCATCCCGGAAGACGCCGCGAGCGCATAC^ 
CTTGAACCCAGGTTCGCGCC<^GCTCCGGCCCCACGTT^ 

AAGGGTTAATCTCTGCCCCGCGCTTTGAGACCCGGGGCGCAGCG<^TCGGGCAGCCC^ 
CG^CGGGGTCGGCOKSCCGGAAGGGGCGAGGCGCGGCGGCC^CGCGGCGC^ 

4 5 GCGGGGGCGGGCGCCGGAGGGGCCTGGGCGGCCTGAAAAGCGCCCGAGCACCGCCCCCCC^CCG 

CGCCCTCTGGCCATTCGGCGCGGCGCGGCGCGTCCGCGGGGGCGGGGGGCGCCGCTTCA^ 
CCGGGGGCGCGGCCCGGCCGOKKSAGCCGCCCGTTGCTACGCGCTGGCGGCCGGCCCGGCGG 
CG<^TAATGCGGTGTGACACCGCGCGAACAAAGGCGGATTG^ 
GTCAGCGCCGCTCCO^GGCTAATCCCCGCCCCGCXCCrc 

5 0 GGGCG CCGCTGCCGCCXSC^AGAGGCCGCAGGCCGGCCGGAAACG AGGGGCGCCCG CCCTGTGAACCCG CCCTCGGCCCCTCTTCCC 

CGAGGCGCCCCCAGGGCTGGGGCGTCTAGGATTGGGGCGAGTGAGCGGCGC^ 
CGGGACGTCGGGCCCCCTTCCCGCCGTCCTCGGTCGGCCC^ 

CGCCCTCCCGTCGTTGGAAGTGGGACGGGCCCCTCTGTAGGTGGCTCAGGAGGGCGTGGG<^CC^ 

5 5 CACATTT ATGACCAATTTTGCCGACT AGGC CGGCCCGGCGTTTCACAG CCTGTGTCAGGTGGCGAGTCACCTCCGGCTCCCGGGTC 

CTGCCATGAGGAGAGAGATGGTTGGGCrGCAGGTCTTGCAGGCCTTGGCCTCCCTAATCGTCAATGTCC^ 
AAAACTTTCCATGTGAATGAAGCACATGCAGAGACGATGACCGTTGTGAGGAGGCT 
CTG CGCTGGACCCTGGGACAGAACCAGGG<3 AACTCCCCGG 
GCAGGAGAATGGGGATGGGGAGACTGAGGCACGGGCCACTGCCCTCCT 

6 0 TTTACAAAAAGGGCGACTGAGACCGAGAAAAGGAGAGGGGCCGTCCCTATGCATCCACGTGC 

T AGG CTGTAG ATTCCCAGGCCTCCCTCCTTACAGACCTGG ATCTCACTCAGG CGTGGGAGGGCTCAGGCCTGTGCAGGGTGGTCTC 
TGGACTTTGGGAAGGTGGAGGGCAGGGGGTGTGGGACCAGGACA 
GTGTCGGCGCCTGTTTCTAACAGGCACTGCTTTGCCCTGTGGCATTTC 
CTCATCGGTGAAGTG<^AATACCCAACTGGGTCTAAACCAGCCAAAGAGGGTGGGG 

6 5 GAGCTCTTCAACTCTTCCTGGCCTCGTTCCCCT 

GAGTGCTGCCTCCCCACGACCTCCCCGCAGCC^CCCGCGAGTCGGCCTTCGTTCACGCC^ 
GTCACCCGCTCCTGCGCCGAGGGCACCTCCACCATTTGCGGCTGTGACTCGCATCATAAGGGGC 

GG GCGGC TG CAGCG AGG ACGCTGACTTCGG CGTGTTAGTGTCCAGGGAGTTCGCGG ATG CG CGCG AGAACAGGC CGG ACGCG CG CT 

7 0 TCCCCTCCGGAGCTGCCCTC^CCCCCACCTCACCCCTTCCGGCAGCC^ 

GCCTTCCTCCCCACCCC^CAGCCACTTCGTCTGTCCACCCrCCTTCCTCCTTGGGCAGTTCCTG 
TCGGCCTGGGAGCATGGCCACCCCGTGGGATC!AGGTGCCACCCACCACCCATCCTGTG 
AGTCCTGGGAACCrcGCTTTGAATTCAGCCTCCACTCCCTTACTATC 
TCAGTTTCCCCATGTGTTAAATGGAGATAACTCCrATlT^ 
TCAACCTCCCAAGTAGCTGGGATTACAGGCGT
GTTGGCCAGGCTGGTCTCAAACTCCTGACTTCAGGTGATCCTC^ 
GCAACAACTCGTATTTCATGCGCTCTGAGAAGTGGACGTGCTGTCT 
CTATTCTTTAGGTCTTGCCTCAGTCCCCATCAGTTCTCTTCCTC 
5 TGGCAGAGAGGGAAGCCCTGGATGTGCCCCCTCX3TGGGGTCGGTCT 

CCACATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAGCTGTGAGGTGAAGACCT 
CCATCGGTGACTTCCTCAAGGACAAGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGA 
ACCCTCCGGGCCAAGTACTCGCTCTTCAAGCCACCCAC^ 
CAACCCAGAGACGGGTTCCTTTGGCA.CAAGGGACCGGA 
1 0 GTGGCCGGGG CCACAACACGAGGACGGAGAAG CGGAAGG AAAAATGCCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGCCAG 

GAGTGTATTCGCATCTACGACGTGCACACCTGCAAGTAGGGCACCAGGTAGGGCTCCGGGAAGCAGGG 
GCGCAGGGAATGGGGTTGTTTGCCCATCTCGTCTTCTTGACGACCCCCTTCT 

AGCCAAAAT AGGAAG CTAGG ATTTTTCAAG CATC CGTGGGAATGG TC CTTGGGCAGAGACTTGG TGGATTGCACAAAG C ACAG ATA 
AAAACCCAGTGTTGGCCGGGCGCGGTGGCTCACGCCTC 
1 5 GGAGTTCGAGACCGGCCTGGCCAACATGGTGAAACCCGTCTCTACTAAAACr 
TAATCCCAGCTGCTTGGGAGGCTGAGGCAGGAGGATTGCT^ 
GACTCCATCCAGCCTGGGCGACAGAGTCAGA 

ACATTAGGTGC CAGCAATTG ATTAACAT1T1T AC!ACAAG TT AG TT ATG A CAT AGCTG CT AGCTGT CATT 
CTTtSAGCATTATTTGGAGATGGGGGGCTGGAAGCTGGGAGACCTC 
2 0 CTTTTCTGGGCCTCAGCTTTCTTACTATAAAGTGGTGATGGTGGTG^ 

AAATGTCTCCTCTACCCCTGAGGCAGAGTCAGTGTGCATCCACCTTCAACTTAGAGGGC^ 

CAAC CTCAGGGAGGTGGGTGGGGGGAAG CGGCAT AT AAG C C C AT ATCCTG ACAGATCTG AAG CTTGGGTGG AGTGGGG AG CACT CT 

CACCGAGTTTATGGATGATGACGGGGAGCAGGAGGGATTCAAG 

GGATTTTGTTCCCAAGCTTCCCAGAGGTGGCCTGCATGGGAGGCGCT 

2 5 CAGCATGGATGAGGAGTGGTCCTGGACCCCCGGGGTGGAAGGTTTTAAG^ 

AGG AG C CT AGTGTGT CG AGGCATG AG CATGGG TGTTGG AATT AGG CC CTGC CCTG CT CCAG CT C CT ACCACTTGCCAACCGTGTGA 
CCCAAGCAAGAAATTCTTCATAGACCTGGTTCCrCATCTGTCATCCTCCCCAGAGATGCT 

GGACAGGGCTTGGTTAATTGGGAAGCACTTCCCTGCTGGGT CATTGTTGTCTGGGGTGCTGCTGGG CG AGAGCAGCCTGTGAAAGG 
GTAGAGTGCCCCCTCTTCCTGCAGTT CCACAGGGCACCT CTGAGCTACTGGCACATGTCT^ 

3 0 TGGACGGGCCTCTCTGAGATATGGGAGGTACTTGACGTCATGGAGAAGAGCTCTGGCT^ 

AAAAGCAGTGTGGGG AGTGGG AG AGG ACGAAGGTG AATCCAGCGC CT CCACTCGT CCACTCAGCTGTCTGGCCTGGGAGCGC CAGT 

GGCTGGAGTGAAGTGCGCAATCTCGGCTCACTGTAACCTCCGCCACCTGGGTTCAAGCGA^ 

3 5 CTGGGATTATAGGTGCCCGCCACCACGATCGGCrAATTTTTTTATTTTTAGTAGAGACGGGTTTCG 

TTGAACTCCTGACCTCAAGTGATCCACCCGTCTTGGCCCCTCAAAGTGCT 
TCAGTACTCTTCATCAGCCCTCTTTCTCCCCATTTCrGA 
ATAGATATGGTTTCAAGATGCTGTGTCCAAGTGGTTCCTACCCGGAACCCCT^ 
CCAGTGGAGGACACTCTCCC^AACATGGTGCCCrrTCT 

4 0 C^TTTGTGAACTGGGGATGAGAACTGTGTTCTGTGTAGGGTTGTCCTGTGGACTGGAGA 

ACCCAGCTGCTCACAGGGCAGCCACAGAGGCTCCTGGGCCAAGCAGAGCT 
TGCTCTG^AGCAATGGGCCCAGGCTCGGCCGAGACrCCGCTAG^CAAGACGGGAGCCT 
GACAGAGTCGTTAGCTGTTCTCrrTGTCTTCCCTGACAGTCTTGGCAGTGCAGG 
GGG CCCTCCACACG<3TGAGGAGAG AGCG CCTGTCAAA 

4 5 GTAGCTTTTACCCAGTGTCTCTGGCTAACCATTGATTTGGTGCCTTACAAGTGCC 

ACATGGTGAGCTGGAGCTGGAAAGGTTCTGGCAAGAGTCGAGTGGCCTGGCTGCGCCCAAGTCCTC^ 

agagacctttggatgtgtccctrcaatctggctgagcctgtttctgcctctgtaaagta 

gaagagctgggggttgactgggaatgcctcaaaccctttgtgagatatgactatgcaaat 

aatgggaaggaattgttcgtaattaagtaggctctctcccatttggacagct 

5 0 agtgcatgcctggcgtgagcccagcacagcagttgacttggagttg 

agtgacctgctcctgggcttcctgttgcccaaggccctcagctagaggtgggggaggtagaatttc 
aaagccttactgctcctgtgacrcatacgacctccctcaacagcaggcccaaagca 
cctcatcacctctgagcaccagggacacgttcagatccatcagc1agcct 
gaattatgtttctaattcttcctcttgattttccccaagggc^ 

5 5 ctcatgggaagcaggacctagagccgggciacagcccrcagcgtc^ 

ttaaagaagggggcttrctttttagttctctagggtctgata^ 

AAATGAAAAAAAATTTGACTCCAACAGAACAGGCTGGGCTAATGTGAG 

ttctgtttccaaactgctrgcttctggtgacattccaagac^ 

6 0 ctitttcttc^tccactcccattcaaatitgagctaatttctcattctgataa 

aggtccgtggcagttgttagagtaggatttggagtttggaagaactggcagctcagggtggcctggtcagccg 
catgtgttcttctcagtctgattttcrrctataaccctgttctgcacga 
cagtttccaccgtcaatgca gt ltrit l m rttgtttttttgttttttttttttttt^ 
taagggggcaagtcttctctagctcaaatggcitcctaaata 

6 5 ctgctcacctgcagattctctgtacaaataactccaggtagagcagttggact 

ccgtgagggagtagccatcaattcctgaaattccaactttgtgactagcagatg^ 
ccaatacggacccatcttactgtgtcctttcctctctggggccaatgtgagta^ 
cctccctcacctgcatgctgagatagcttccatccatgcagttccc^ 

CCTACTCCCTTAGGTGAAGCATCTCACG^CrGAGTTCr CCCTGAGG CAT ACTGGTCCAGCTG AGCGTCCTAG AGAAAGCTAGCAAA 

7 0 AGGGAGGCACATGGATTTCACAGTATGAATTGGTTCAACAACTGTCTTA 

GGTGCAGGTGGGCTTGCCAAGATGGTCGTCCAGGAGCCCGCCTTCACTTCTAAATCTGCT 
ATCTCACCCCCTCCGCCAAGTCTGAAATCCCCCTCCCCATCTCACCTTAGACTGAAM 
TTGCTTTATGTGACLAATACTTCAGCAGAATGGATACGAATTTTCAAA^ 
7 5 TAAAGTCATGTTTCTGGGGCGTATTCAAGTAGCTGACAAGTAATTATTTAATAATAGT^ 
GTAGTCAGGTAATAGTATCCAACCGAAATTTCCTACCAACCTCCTGTATCCAAA
TTTTGATTTTATATTCAAAAAGTCTCTTTTTATAAATATTATT^
TTATATAAATATATATATATTTGGAGAAAATATATTTCATCATGCAGTTTTTTTCTGTTAAGTCA 
CTAAAATGGATACATTGTACACTGTGTGGAGTTrCCAGATTAATGCTCAGGAGGTC^ 
5 CAGCTAATTAAATGACCAAGTGAAGGTTATTTCCTCTGTTTTCT 

ACCACAGTACATTTTT AAAGG CTT CTTTTGTTCT AAGTT AACTT CT AACAATG AATTTC CAGTTTT AG AAAACAACAACTTGTG T A 
CTAATTCATTTCACAGACTCAAGTATTTTCAATGTATAAATTAATGTCACA 
CCGCAGGGGTGGCCACTGCCTTAATCCCCTCCTCACCACGTTTTa^ 
TCAAACTTGACTGAAAATTGTCTGTCCCCTGCTAGCTCCACTGTGA 
1 0 AAAGCGTGTGCCCTGGGCTGTGGGCTGTCrrGGAACAATTGTCATTTGAAG^ 
GGCATGCTCTGGGGATGGGCACCCTCTCAGATCGCCACTTCAGAGGACCCA 

GATG CCACAAGTTGGTTTT ATGACAACAGACAATCCTTT AG TTT^ CACCTTTGG TTTATTCAG CAC 

ATTATTTCAGAGAATAACAAAAATGrrr'ri'lL-rTTTTATTTCTCCTGACCTA^ 

GAGAAATAATGAGAACAGGCTGGGTGGGGTGGCTCAGACCTGTAATCCCAGCACTTTG 
1 5 GTCAGGAGTTCGAGACCAGCTTGGCCAACATGGTGAAAGCCCGTTTTTACTAA 

CATG CCTGT AGTCC CAGCT ACTTGAG AGG CTGAGG CAGGAGAATCACTTGAACCTGGG AGG CGG AGGTTG CAATGAG CTG AGATCG 

CACCACTGTACTCCAXKTCTGGGTGACAAGGCGAGACTCCATCTGAAAAAA^ 

GCCATGAGCCGCCTATGGTGGAGGCGGGCTGCTCTGCGCroTATTCAGACCTGTGA 

ATGCACTCTGTTTTGTCCATGATGTGTCTTTAGCCCCTGAAAAGCTGAAACCACA 
2 0 CGGACACGTACAGGATACAGATCTCCTCTACATCTTACA 

AAAGCT AATG CAGG TTGACTACAC CTTTT CTGAAATG CTTGGG AC CAGAAAT ATTT CAGACTTTGGAAG ATTTG CAT AT AT ACAAT 

GAGGTATCTTGGGGATGGGACCCAAGTCTAAACACGAAATTCATTGATTTCA 

TATGAAATTTTTAAATAATTTTGTGCATGAAACAGTTTAGACTGCATTTTGA^ 

TGGAATTTT CCACTTG TGGCAT C CTGT CGGTGCTG AAAAGTTTT AG ATTTT AGAG CATTTCAGATTTTGG ATTTTTAG ATTAGGGA 

2 5 TGCTCX5ACTGAACTTATCTATCCCAGAATTCTAGG 

GCTCCCGTGTAGCTGGGACCATAGGCATGTGTCACCATGCCCAGCTAATTTTTTAATT^ 

TGTTGCTCAGGCTGGTCTTGAACTCCTAGCCTCAAATGATCTCCCA 

CTGCCTGGCCAGTACTTTTTCAATCAATAAGATTGCCAAATATT^ 

3 0 TATGATTAGTITTTAATTTTTAAGAGACAAGTGTCTATGTCACTCAGGAGTACAG 

CCCCTGGGCTCAAATGATCCTCCTCAGCCTCCCAAGTAACTAGGACTCAGGCGCACT 
TGTAGAGATGGGGTCITGCTGTGTTGCCCACTGGTCT 
GGGATTACTGGGTGAGCCATCAC^CCCTGCCCAGTCTTATTTGAT^ 
GTTTCTCCTTATGTATAAGTACTTACAAACGTCTAAACACAAGAGGACGTTGTT^ 

3 5 ATATCTTTCAAAGCAATATTATTTTATGAAGAAAGAAGGGAAGGAAAGCATGTGAGAC^ 

AGAAAGGTGGAAGGAAAGAAGGTAGGGAAAAAGAAAAGCAAGGAAACATGATC 
TCCCCTGGTGGCCCTGGGCATAjGGGACATGTTCTTGTGTGTC 

GCTGCTACACTGCCACrGTTTCTCCTAGTTTAATAGAGAACCTTCTAGGAGGCAAAAAT^ 
AATAGTGAGACAGAAGTTGCATGGCGTCTACGCTGGGAAAATTTOIAA 

4 0 GGATGCTTGCCAATCACIAGGCGCTATCTCCCGCATCTTTATGCCATGGTGCCCGTCO 

GACCC^GTCTCAGTACGGCTTGCCAGCTGATGAGAAACT 

TCTCATACACACCCTTC CCCAGTCACTTATTTTATTTTTCGAGACAAGAGTTTTG CT 

ATCTTGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCAATTCT 

CCACCATGCCCGGCTAATTGTGTATTTTTAGTAGAGACGGTGTTTCACCACGTTGGT 

4 5 TGATTG CCCGCCTCAGCCTCCCAAAGCTCTGGGATTACAAGTGTGAGCTACTGCGCCCGG CTGCCCCC^ 

GCTATGGACCAAGTTGTCAGGCAAGGAAAATGAAATCCTGCTTTCACCTTC^ 
{^CTGTTTCAACAAGTCTGTTGATCAGGATTTGATTATTGTACAC^ 

TTTT AAAGGGG AGG CTG CAATTTTT AGGAT ATGAAAAG ACAGC CTTAAAGTAAAAAC AT ATTTCTGACTTT AAC CACAATT ATGCA 
ATTTTTCTGTGTGACATTGAAAAAAAGTGTCCCAGT 

5 0 TG AT CTCAAAAACATCTTATG ACAAAAAAATAG CCGGTTAACAATTCGCCTAT AG ATGAAGAAACCT AAGGCAGCAGCTAGCKK5A 

GCCTCCTTTGCTTCTTCACAGGCCGTTCTGTAATCTCATACGCTCTGCTGTACAGCACT 
AAGAACCCACCTTCrCAAGGCTATGCAGGGACCAGGTCGGCCrCAGGGTCCT 

HUMAN SEQUENCE - mRNA 
GCGCTTCTGACAAGCCCGAAAGTCATTTCCAATCTCAAGTGGACrT^

CGCGGGCAAACTTCCTCCTCGGCGCCTCTTCTAATGGAGCCCCACCTGCrCG 

GGTCCTCGCTGGCTACCCAATTTGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGCT 

CCATCCCAGGCCTGGTCCCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGATCATC 

GGCAT CCAGG AGTGCCAGCACCAGTTCCGGGGCCG CCG CTGGAACTGCACCACCAT AGATG ACAGCCTGGC CATCTTTGGG CCCGT 
CCTCGACAAAGCCACCCGCGAGTCC^CCTTCGTTCACGCCATCGCCTC^

AGGGCACCTCCACCATTTGCGGCTGTGACTCGCATCATAAGGGGCCGCCTG^CGAAGGCTGGAAGTG^ 

GCTGACTTCGGCGTGTTAGTGTCCAGGGAGTTTGCGGATGCGCGCGAGAAC^ 

CAACGAGGCGGGCCGCACGACTATCCTGGACCACATGCACCTCAAATGCAAGTGCCACGGGCT 

CCTTGCTGGTGGGCGCAGCCTGACTTCCGTGCCATCGGTGACTrCCTCAAGGACA 
AAGCACCGTGAGT CCCGAGGCTGGGTGGAGAC CCTCCGGGCCAAGTACTCG CTCTTCAAGC CACCCACGG AGAGGGACCTGGTCTA

CTACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGTTCCTTTGGCACAAGG 

ACGGCATCGATGGCTGCGATCTGCTCTGCTGTGGCCGGGGCCACAACACGAGGACGGAGAAGCGGAAGG 

TTCCACTGGTGCTGCTACGTCAGCTGCCAGGAGTGTATTCGCATCTACGACGTGC^ 

HUMAN SEQUENCE - CODING

GGCCCTGGGCCAG CAGTACACATCTCTGGGCTCACAGCCCCTGCTCTGCGG CTCCATCC CAGG CCTGGTCCCCAAGCAACTGCG CT 
TCTGCCX5CAATT ACATCGAGATCATGCCCAGCGTGGC CGAGGGCGTG AAG CTGGGCATCCAGGAGTGCCAGCACCAGTTCCGGGGC 
CG CCG CT GGAACTGCACCACCAT AGATCACAC CCTC C CCATCTTTGGG C C C GT CCTCG ACAAAGC C A C CCG CG AGT CGG CCTT CGT 
TCACGCCATCGCCTCGKSCCGGCGTGGCCTTCGCCGTC^
ATCATAAGGGGCCGCCTGGCGAAGGCTGGAAGTGGGGCGGCTGCAGCGAGGACGCTGACTTCGGCGTGTTAGTG
GCGGATGCGCGCGAGAACAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAAC^UVC^ 
CATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAGCTGTGAGGTGAAGACCTGCTC 
TCGGTGACTTCCTCAAGGACAAGTATGACAGCGCCTCGGAGATGG 
CTCCGGGCCAAGTACTCGCTCTTCAAGCCACCC^CGGAGAGGGACCTGGTCTAC^

CCCAGAGACGGGTTCCTTTGGCACAAGGGACCGGACTTGCAATGTCACCTCCCACG^ 
GCCGGGGCCACAACACGAGGACGGAGAAGCGGAAGGAAAAATGCCACTC 
TGTATTCG CATCTACG ACGTGCACACCTGCAAGTAG 
Table 10

MOUSE NOMENCLATURE 
ICSGNM Batf 
Celera mCG5742 

HUMAN NOMENCLATURE 
HGNC BATF 
Celera hCG22346 



MOUSE SEQUENCE - GENOMIC

G CT AACAGG T ACCACATG CTTTCT AGGGACT AAGCATGGTGTT AGGCAATTT^ CATTTCTATG AT AT AT ATTGCT ATT 

AGATAGG AAAACCCAGGCATGGTTTT AAGCGATTGGCTGGCATCACATGGTT A CAAAGC CAG ACACGAGCTTGCAAGG 

TCAAACTCACAGCrAAAAGACCATTTGCCTGTTCTCTrGCTTCCTAGGCC^ 

CTTAGGATCCrGTGAAGGCCAGAGGCTGCAAACCAATTTGCTT^ 
1 5 cCITTTGCAAGGCCCGGTGTAGACGGGATTGACCCrC^ 

CACAAACATAGCTTTAGACATGCTGCTTCACTGTTAATCCAT 1T1 C rriTTATTATTGTAGCCTLnTrCACrTTTCTCTGTGGTCT 

GCACATGGATGTGAGCTCACTCACCTAGGCATGGGGGCCAGAAGTCA^ 

AAAATATATAATAGTTCTTCTTTGAGAATTTCATGCATACATACCATG 

TCCTTCTAGATCCACCCCTCACCTTCCTACACCCTTACACCTTG 
2 0 GCTGCCCAAATACACACAATCACTAGAAC71AGGTCAACCT 

TTGTCCCCTTTCCTGCAACAGTCCTGAGTCTTGGAGGGTGTATGTGATGT^ 

TCTCTGTACACTGACCAGTTCACAAGTGTTTGCATTAACCACCATCCGCTGTA 

CIATTAATCTATGGATACAAGGATGCAAACTGAAAGAACTGTCCATTTATC^ 

GTGCCTCCACCATGGGCTTTGCCGAGACTTACAGTACCAG^ 

2 5 T AGTTGGGTTAACT CCATAATGGTTGTGC CACT AAT ACTT AGGT AT ATTGTG C CATGTTGGG CATT ACTGTGG CT ACCAGAAT AC A 

CAGCTGGGTAANNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNIJNNNNNNNNNNNNIINNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
GTTTTGGTTTTGGTTTTTTGAGACAGGTATCTCTTTG 

3 0 CACAGCGATCCACCTTCCTCTGCATCCTGAGTGCTG^ 

TATCTGATGGJAG ACTAAAATTT AGT ACCTCTCTCTTGCCTTTTGAGGGGC CTGGTCTG AAAGGTAAAAG C 

AGATTGGAAGCTTCCAGAAGCAACTGAGTGCCTCAGGAGCTCTGCTC 
CATTGGCCTGGCTTTATCTCAAGGAGGAGGAGACAGAGGAG^ 
GAGCTTCCTCCCTCAGAGCATCTGGTGATGACC^GAGATGTGTCAGATCTTCTC^^ 

3 5 GGACCAGGGCCTGGCCrGCrAGGCCAGAGACAGCCCCCACAACAAAGACTC 

TTTTCTGCTAGCCTCACGAAGAGCGAGCrAATGATGGTT^ 

GCCACTGCAAGGGACrTTCCTCACCATCACCTC^CCATCCCCTCTGGCCCCTTCCCT^^ 
CTCAACTTGTTTGGGGAACTTCGTAAAAACTCCCCGCTAGCCATCTCATAAG 

T AAAAAGG TTT GTTTAAAAAAAAAAAAACT AGAG CAG C C AGGG G ATTTTTATT ATTTTTTTTGTTCAAATAACTAAAT AGCAGCCC 

4 0 ATATGAAGACTTCAAAGTTAACACATCTCAAGMTATCCCT 

ATGTCGTGAATCAACTCATGGCTTAGACAGGAAAAGTTGGTAACCTGGGCGGTCT^ 

GGGCATCAGTGCATGGGAGCATGGAAGGAGCCTTAAACTTCTCCCCTAGCCGGGAACCAGCAGCCT 

CATTTGCATGCGCTCTTAAGAGACTCTTGCTTATAATTC 

CCAGTGCCCCGGGTATATGAACTAGCCTAGCCACCTGGTTCAGAAGCACCAGTGAGAGC^ 
4 5 CCTGGAAGTTGCTAGATAACTTGAGGCAGATGGTGAGAATGGATC^ 



TTGCTGAGACACTAAGCTG^CCCCTGAGGTCACACACrGTCCCCTACTGGACAATG 

TCATGAGTGACCAGCATTAGGGGTCAAGCCAGCATTTCCTTCCTTCTC 

HNNNNNNNNNNNNNNNNNNNllNinmNNNNNN^ 
5 0 TGTTGTCCTGXSGTTTCTTATGTGACrGGAGCCTCCCTGACCTTGTCAATCACCAGCT 

TGATAGTTGGATGTTGCAGTGGAGCCTGTCTTTATGGAAACAGGGAAG^ 

TGTT ACACTTAAAAAGCAAAACCAAAGAGACC CTGTTTCGGATGACCTTAAG AAAC 

CAGTAAAGAGCACTCCTTGTTCCTGGTGAGCTCCCGGCATCCATATGATAGC^ 

CAATGCCCTTTTGTGACTTCTGCAGGCATCAGGCATGCGTC 
5 5 AAATAAATCTAAAAATATTTTTAAAAGACAGAAACCATTTGCGT 

ATTCCAATGCCTGCTATGGAAAGACTGJAATACGGGTTGTCAGTTTGACCTTAG 

TATCCATT CTCCTG AAAATCAGCAACCAAATCACTACTGCCATGGCTCAGAGGAGCTGGAGT 

CATGGGACAAATGGACAGAGAAGTAGTGGTCCCTGATTGTACAT(^^ 

TTATCATATTCAGCTGGATCCCTGCACATCGGCTTTTAATTTCACAAGTCT^ 
60 AGGATTGTACACTATAACTCGTTTGAGTCTTGGAATAGCCTAGAGGAATTTGGGACT 

GGAG AGG TGGG C CT C CAGTGTGGG CT AG CTCC AT CAT AG AAAG T ATCACCAGGAGGGT CTGG CGTT ATGGCT CAG CT GGAAT AAAG 

GTGCTTGCCACAAAGGCTGAGGACTTCAGGTGATCCACGTGGTAGAACGAGAGAATTAACTCCT 

C CACACCCACAGGG CACACAT CACCT ACAC^T ACACACACTCAT AT AT ACATT AAGACACCATGAC CAAAT GT AACT CTGGGGGG A 
AAGGGTTTGTTTGTTTCCTTTTACAGGTTGTAGTTCTTTAACCAGGGAGGTCAG 
GAGCTGAAGGAGATGCCATGGAGGGGGTGTTGCrTACTGGCTTGCTCTCCATGGCTTACTCAGCTTGCT
GGTGGCACCACCCACAGTAATCTGGACCCTCCCACGT(ZAATTATCAGAGGAGAAGATGTACC 
GGTGGGAACTTTTTTTTTTCAGTTGAGGTTCCCTCTTCCCAAATTACTCTATCT 
GTCAGGGTTATAAAGTCAGCAGAGGGGACAGCCAGGAGAAAAC^CGCTACTGCCTTC 



7 0 TCr»CTATGTAGTCTTGACTGGCCTGAAGCTAGCTATGTAGACCAGTCrGGCCTTGAACT 
CCCCACCTCCACCCCCAAGTACTGAGATGACAAGCACGTGCCACC^ 
AATAGGTGGGGGTCTTAGCCTGTGGTGGTATACACCTTTAAGATCAGCACTTGCA^ 

CAGCACTTGCCAGAGGCAGGCGG^TTTCT^
GGTCTTCATAGTGAGAGTCTGTCTCAAAACAAT^

GCTCACAGTCATCTGTAACGCCAGTCCCAGAGAATCTGGCGCCCTCTTCTGGCCTTGTGGGTA 

ACACATGCAAACAGAAACACTCCTACACATGAAAATTAATTAATTAATTAATTAAT^ 

AAAGAGGTGAAGTCTCTTTAGTCTCCCCTTAAGAGTGAGGTGTTAGTGCTTTGCC^ 
5 TGAGACATAAATGGCTCCCTCTATTAACAGCCATGTGGGGACTCCCACTTAGAAGTGAATCTCCTAG^ 

ACAANNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

lOJNNNNNHNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNiraNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNIWNNNN^^ 
1 5 NNNNNNNNNNNNNNNNNNNNNNNNNNC CCTG A 

AATGAG ACCCCTCCTTCCATACAGAAGACATCATTTGTATTTCT CTTCTCACTGGTTCCTAGTACTTAAACACTGT ACTGACT CGA 

CATCATTGAGTCAAAGCCACCACTGACTGCTGAGTGATAGACITT^ 

TTTGGTTGTTGGACTATGCTTTTCTTTCTATGAAAGCAGTGTCTCTGTCT^ 

CATAATAGGTGGGAAATCTGTCGGGTGAAACTAAATGCAATATAAATAA^ 
2 0 GGTTTAGAGAAGATGCAAGTCGTAACTTCTAGTTGTATGGGAGCTTAGTCGGACCTTA^ 

TAGGTACCGGACATGGAACTGTAAACATGAGACAGTCTTCAGCTTGAACAGGGGCCACATCT 

GATTCATGCTACCAGTATCrriTClTTTGTTTAGGACAATGAGATTACAATGTTAAACCCA 

CTCATTCACTTTTCTCCCCACAGGAAAATCTCAACTGTGTCTAAAGGGATCCCAGGA 

CTCTCACACTACCAACGTCTGCCCACCTGGGGGAGGCAGCAGGGTTGACCTTACAAGTCTTCAACTGGG 

2 5 ACTGTAT CTT CAG CC CACTGAACATTGAG CGACTTTG TG ACTTT CT.LTGGT CAAT AGG ATG TGG T AG ATGGGACTCTG AG CT AG AC 

CTGATCATGGATGTTGGCAGGTTCTGAGTTCTGCCTTTAGAATCCGGCACCTGCCAAGGGAACAACAGGC 
TGGGGTGGGGTGGGCCAGGGAGAACAGACCAGTCAGTTACCAACCAACTCAGCAGCTGA 
TGGCCTTCTGGAATCTTCTGAATGACCTACAAACTCAGCCAAAAGCATGCCTGTTTAAAGC^ 
ACAGTGGAAACTTACTGATAAACTGTCTCXSTTCAGGGGAAAAAG 

3 0 TTGATTAAATAAGAAATTCCCAAATGCAGTTTGCCTTCCAGTGAAGAAAATAAGTTCTAAA^ 

CTGAAACAAACAGGCAGTCG CAGTGGCTAGATCTGAGGATCTAGGG AACGGGGCACGGG CCTGTGGTTGGTTCTCTGGTTACTTTA 
GAATGAGGGAAAACTCCTCCCAATTGACTCCAAGAGCCTTGAGAGACGTCCCGGCT 
TCCCCTGCCACTCTCAGGACCTCTGATCCGGAGACACTGGGCTCTCTGGTGTTCCACCTCAGGTGTGTGC 
TTAGCCTCTCTGCTCCCCCTCAGGTTCCTGCTCACGTGTGACCCCTTCACCAA 

3 5 CTTCGAATTCTCCTACCCTGCTCTATTTTTATCTGTGACACTTGACACTCCCTGATGTATTATGTCAGCT 

CTGCTCTCACCACTAAATACAAATTCTACAAGGATTGACTITTATTATTATTAl^ 

TTATTATTTTGATAGGTTTGTCCTTAGAACAATATTGTCGTCAGGCAGAATTGTAATCA^ 

TAACATATTGTCAGCCCTCAGATTAATCGAGGTGAGCATTT^ 

GCGTGTCTTGCCTGCACATATGTATGTGCCCCGTGTCCATTTCCTGGTACCCGAGGAGTCAGGAAGAGGACCCCTGGAACT 

4 0 AACAGATGATTGGGCCACTGTCTGGATGTTAGGACTAGATCCCAAGTCCTCT 

CTTTCCAGTCCCCAAACTTTAAAAAAAAAT AGCCAAAACA^ CGACAAGCGTGTT CGT 

GTCATGGGACACATGCCAAACACCGCCACTCGACAGCATGCTTTACAGAGTGGGGGACATTCCTATTCA 

GCAGTAGCCCTGAAAGGTAGTAGGTGGGGCAGGTTTTAGGTGTGTGTGTGT^ 

GAAGTCACTGGATTCTCATCATATCGATCTGACTCCAAGTCCAGAGCCTCTGTCCCTCATTTTGACGTTCTCT 

4 5 TGACGTCACTGACTTCTGAAGACTCAAGGTCAACCTTGAACT 

GAGAATGAAAGTGTCTTGCTAAGAGGACAGGGGATAGATAATAAACTTCTTTAGCATCGATTCCAGAGACT 
CTAGCCCCAACGTCTTGCTTGGTCATCTGTAAAAGGCAGTGCCCTCCTTTCTTTTTGT 

GAAGG AGGAAGGCCGGTCATAAAACAGGATGGGCAGTTGCTTTCGGGTG AAAC C CAGAAAGGGTCCTCAGGCTTGTCG CTGACTGT 
ATTTAGATTTAAAAATCAAGGCCTGTGACCTGGAGGCTCTTTCCTGCTGCTCTCTC 
50 TGGCCGGGGATTTTCCCTGTGTGGCTGTTTTTCTTCTTTATCTTCATCAATT^ 
CGGAAGATACTGGGGGTTTGTGTATTTTTTTrTCT 
TTTCTAACTTGTTGACGTTTCCGCCCATGTGACTTCCAGCTTGAATT 

AGCAGCGAGATGTCCCTGTGGGGAGCAGTCCCTCTGCACCCGAGAGAGAGGAGGACGCAGGGGTCTGTCAGAGGTTGCTC 
AAGCAGGGGAGGTACCTGTGGAAGGTGGTGTGCTGGTGGCCCCCTAGCAGT 

5 5 GAGGCATCTGGGACtXSTGTGGGAGAGCCCGGAAGATTAGAACCATGCCTCACAGCTCCGACAGCAGTGACT 

CTCCTCCCCCTGGCAAACAGGTACCCCCTTCGTCCCCCTCCCCCACGCCCCGCATGCCTTGCCTTCTGATTT^ 

AAGAGGGGTAGGCTCCCTTGTCCTGCCCAGGGAGCTCTGTCTGGGGAAGGAAGTAGCT 

GACCGTGTGGTGTTTtSCTATACAGAGGTCCAAGGGCTGAAGCTTCGCATCCCGCC^ 

CGTCTCTGCCACTT AGGTTCCCACCGTGAAACTITCTAGGAAG CTGGGGG CAGGGTGAATATCACAGAGCTTAG AAGTATGGTCAA 

6 0 CCATAAGCAGAGGAATGTTCC AGT AG AT CGCTGCTTCGGGCTCCACCCAAATCCT AGAGTTGTAGCTTTCCAGG AAAAAACCAAG C 

AAGTGGCAAGACAGAACCGAGGTCAGGGAAGAGAAATATAGCJW^TCAAAGCAAGGTAGACAGGAGG 
CCTG CGGATTGCCTCGGTAACTCCAGC AGT CCX3TGGGAAGGGAGGCTAAAGCTACAACCT 
GGCTTCCAGATAGAAGAAAGAAGGCCATGGTCCAGGAGGCGTGTCAGAGACTTTCACTAGTAGCT 
GGGATGGAGGGAAAGATTCTGAGACTCTTAGACTCTTGACTO 

6 5 ATGCCTTTCTCACCTTGTGGTTTCTGGGTCTCTCTACCACGTTCCTC 

TAGACCCACGGATTAAACTTTTTTACATCTCATCTTAAAAACTATATTCTT^ 
GAGTGTTTGGCTCACACACATCCAGGGGGCrGCAAAACGCTACAA^ 

TGTCACAAGCCCTCACCAAAAGCCATACTGGTGAAGACCTCAAGAGGGAGATAGTCCCTCG 
GACTGCCTGGAAGATGGTACCCCAGTCTCTTCTTAAGAAGGAAATT^ 

7 0 AAGGAAGAATATATCCTTGCCCTCrCTAACCCAATGTCOSTTTGCTAGAATGACCTGGA 




GACTTCATAAAATGATTTGGTGGTGGTGGTGGTGGGGTTACCTC^ 
ATGTCTGCACAATGATGGACCGAATAAAACAGATCTGGGCGGCGCCCTC^ 
7 5 GGGGAATTCTCAGAAAGGATTGAGCATCAGTGCCAGAAAGATGTCATACCCCT 
GCCTCAGCTTCGTACTGTGTAAAATAGAAAAGACG^TGC

TGT ACAGAAGTTGG CACATGGT AAATG CTCAGCGGAGTTTT CAAAGACAACAGCAAAAGGAGCAT CAAGAAGGTTT AGCATG TG CG 
CACACATGCACATATATACATGCACACACACTCACACGC 
ACACACACACACAGAGTGAAGCTAGAACAGCTCrGGAGAGAGCAGAAGTGATC 
5 CTGAGGCTTCAATTTCTCTTCACTTTTGAGTAGCTTT^ 

TGAAGAAAATGTCTTTGGGTACAGGCCTTTTGTAAATAAGGGGAATGGACAATACAGAAA 
GAAAGGAGGGAGGAAACCAATGTCCTATTTTGACTAAGAGGAACGATCAGACCTACCGTTG 
TGATGATGTGAGGAAAGTTCAGAGGAGAGAGAAGAATCX5CATCGCTG 
CCCTrC^CCTGGTGAGTACCCACACTTCCTACCATGCTCCTGCCCTTAGGC^ 
1 0 CTCTC^TGATTCTGCTGTGGTAGAGCTAAGTACTGGGGGAGAGTAG 

TTCGCTCTCCCATCCATGTATGAAAACATTTAGTAGCAGAGAAGAGCACTGG^ 

GT AG ATGTGTGGG ATGTTGACAGC CTCAAGGCTT CATGGGAGG CAGAGC CACCTGTGTGG AAAGGG ACAGCAG AGAGGGTGT CGTC 
CTTCCTGAGCCACTGTGAGCAGAACACTGATGCTGAGAGGA 
ACATCTCATCCCTCTGGAAGGCAATGGTCTGCCACCCAAGCACACTC 
1 5 TTTTTCCCTGAGCATGGAGAAAGTTCTGTGTTGAGATTTACATTGTTAATGGTCTCTCT 
GCACITACTTGGGACAATCACCATCCTGCTCT 

CGACCCCGAGTCCTGACTGGGCGCAGTGTTCCCACCCTTCATACCTTTCCTCTTTACACT 
AACTGTTCCATCTTCATCGCATCCTTCTrCCTGCTCTTCCTCCTTC 

C CT AAAAG TGCAAGAGGTG CCAAACCC AG ATAAG CTG ATAATG AT CTG ATGTTT AG CATGG CTAAGGG CTGCTTT AATTTT ATCAG 
2 0 GGATAAATTATCGGGGGCTGGGGAAATGGCTCAGGGGGTAAAA 

ATGACAGTCAAATGGGCAAGAGGCAGAAGCAGCAGGTGGACATCTATGAAATTGAA 

ATAGCCAGGTCT ACAT AG AG AG CC CCTGTG TGAAAAGAG AAAAAAGAAAAAAAAGAAAG AAAAAAAAG CTG AAAAGCTT AATGG AT 

GTGGCAGCCTGACAGTGTAATCCCAGCTCATGGGAGGTGGA 

AGTAAGCTTTAGGTTCAGCAAAAGACTCTGCCTCGATTCTCAATACC^ 

2 5 CTGGGC CT CACACACGCATGTGCATCT ATGTACACG CGTGCCCACATACAAG AAAAAT ACATGGAAACACCACACGCACATAC ACA 

TGCAACAAAAGAATAAGTAACATACAGTTTTAGAAGACAAAGGGGAGGATATC^ 
CTGCCCTAGTCGCTCrcCCTGTGCTTCTGCTCTGTTTGATAGGT^ 
GGCAAATATGTGTCAGACTAACAAGAGAAAGAATAAACAGCACT 
CXteTACTCZAAACCCTTCAAGLATT^ 

3 0 AGTTAATATTCCATAGAGCAAGAAAGCCACACTTACGATAAACT 

CT CCTGT AAGCTTTGT AAG TTTG T AAT ATTAAAAG AAGGAAG CTGAGTTCAGC CT CTCTCTGG CAGGG AGGCCAG ACTTGAAC CAC 
CGCTCTGCCGACIX5CTGTCrGTGTGTCTGGACCCACAAGCATTCTTCATGGT 

AATG ACAGGAACTG CTT AT AGAG ACAGTGCAGGAGAAAAGCT GAG T ATGG TGC CTGG CAT ACAGTAAAT ACTCAGAAGATTCTGTG 
CAATTAAAGTTATAGTAATAGTGTAAGAGGCGTGAAAACTGAGACTGAGCAGG(ZATACATAT^ 
35 AATGTACGTGTATTTCCCAGTATGTATATATGCACATATTACATCCATGTATTGTGTGCCGTGTGTAT^ 
GTGTGTGTGTGTGTGTGTGTATCGTATGTATGTGAGTATGGGTGTGAGTGTGTGTGAGAGATGCATGC^ 
ACTTTTTCTCCACATCATCTCAGTGCTGAGAGACTTCTTAACAGCT 
AGAAGCTCCCCAGGCCACGCCCAGTCACCTCTAATAGCAGAGTGCTGCCTCCCT 
CTGCAGGCCAC(nTCCTGTAAACTCAGCCCCCTGGTCTCrGTTGCCrrAACTTCT 

4 0 ATTTTGAATACCTGTCTTGTATTCATGCATGGCGCGCTGA 

GTGTTACAATGTCCCCAGCATTGATTAACTGCACATAATT 
AGACACAGATGTAAACATAAGATAATTAGGATGGCATATCAACCTAGTCA^ 
CCTGGCAAGTGACTTGAAGAGACAAAAGGCCAGGTCTCTCATC 
TAAAGAAGTCTTGGGGGAAAATATGTAATTTGGAAATGTACCITGGCATGAAA^ 

4 5 AG AAAATTG AT AT ACAAATG TGATTT AT ACATG CAT CAAAAAAT AAAAAAATTTGG CTGGG CATCCTAATTGGG CT AGGCCTT CAA 

TCCTAGCACTTGGG AGACAG AGG CAGGTGGATCTCTGAGCTCAAG ACCAG CTTCATCTACAT AGTG AGACG CCCATCTTGAAAAAA 

GGATGAAGTTTACCCCCCCTCTTTCTTCCTTTTCTCTCTCTCT 

CTTCCTTCCTTCCrTCCTTCCTTCCTTCCTTCCT 

AAGACAGGGTTTCTCTGTGTAGCCCTGCCrGTCCTC^AACTCACTCT 

5 0 TCTGCCTCCCAAGTG CTG AG ATCAAAGG CAT ATG C CAC CACTGCC C^G CTG AAGTTTTTCTTT CATT CTTCAAAATATTAT AT ACA 

AAATCAAAGGCAAATGGAAAGCCAGGAGGGAATCATCTAGAAAATTCATGAG 

CTATGGCGAGGGAAATGATTGATCTTTATTAAAAAAAAAAAAAAAAAAAAAAAAGACTATAAAGTGA 

TGTACCTACACATGTAAGGATGCGCCAGGCCTCCTTGGGAAAATGCGAATGAGG 

AGGTGAAAGATGGAGCTGACCCAGGAAATGGCCAAGGGAGACX3GAGAGTGAGCA^ 

5 5 CCCCCTAAGACGGCAGCTTG AGCAGGCTCACTCAGGTGGCAGGGG CACCTCCCCTGTCTCCCAGCAGGGTG CTGTCCAGAATCCGT 

CCACACCACAAGCACAG CACAG AGGTTGTAGCTCTCTGAGCTG AGTGTGAGTCATTGTTGCGATGTGCTG CTTG 
GTTTTTGTTTAGTGGCACTGGGGATTGAACCCAGGGACTTGTACACCACTGAATC 
ATGGGGGTTGAATGAACACTATTGGTCCATTCTGAGTGTGCTGTTATGGGACTGTTAAGTC 
ATCGTTTTCTTTTCTAAAACAGTTTTTCTAAGTTATGGAAGGTGGGGTATGTGTACATGA 

6 0 GGGGTATTGGTCCCATGG CCAC^GGT ACTAGAAACTG AACTCGGGTC CTCTGC AAGAGCAGTAG ATGCCCTTAATCACTGAGCCAT 

CTGTCCAATTCCTCTTTAAAGCTTTCTATGTTGCAGTTATTTATTGTT^ 

GAGTCGGTTCTCTCCCTCCGCTGTGTTGGGTCCGGGCATCGAACTGAAGTCATA^ 

GCACGCCTTTAATCCCAGCACTCAGGAGGCAGAGGCAGGCAGATTTCTGAG^ 

GCCAGGG CTATACAG AG AAAC CCTGT CTTG AAAAAC CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG ACTTGG CAG CAAG CACT 

6 5 CTCACCCCCTGAGCCCTCTTACCGGTCCCTGGTTCCCAGACTCTTAAG 

CCACACTTTTTAATCTGTCTGAGCATAAGTATACAGGCAAGTTTCAAA 

CACACTG AAG TT AAAAG AC AGG AAAAG TGTCCTTTG CATTTTCAATC CTTTTTTATGTCAGTTAGGAGTAG CTAAGATTTGAGTGA 

CTCTAAGCTCCX3TGGTACTAGATGGAACACTTTAAAGAATGGCAACT 

CTTCTGGAATCTTCTATCATAGGGGGATTTTGAGTCAATCTCCTGG^ 

GCAAGTGGCGCCTGGTCTCCAGGTTGCCTGGCCATCCGTGCTGGA 

AGATGAGGCAGGATATGACTGCTGTGGGCTCCTGAAGTAATCTCACCACAGTTCAGGACAGG^ 

CATAAAACAGAAAAATCATGTAGACTGAGCTGATGCTGTGAAAA 

AAGAWrTAGCTCCCAGGGACACTGGGAGCA^ 

7 5 GTG AGGTTCCT C CAACATTT CT CATTCTCTGTCTCT AGG AGACCACATC CTG AGGTTT ATT AACAT AACAAATT CAT AGATCTGT A 
AAAC^TAAAACATOAAGGGACATTCTC^

ATCrCTGCCTCTCTGAATTTCTATTTAAAATATGGTCATTAGCTGGGCATGGT^ 
AGAGGCAGGGGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTTCCA 

CTCCAAACACACACACACACACACACACACACACACACACACACACACAAAGGTCATTGAACACCTGGCC^ 
5 CGTTAAGTGTCCTACCAGTGGGGTTGCACTTGATCTGAAAAAACAAATTCTTTGCATTAATC 
GAACTTCAGAGCCCTCCTTTGTAAACATAAACATCTATCTGTATCCTTGCATCTCCTCCT 
TAAACAATACACTCTGAAATTCTTATTTTTCCCTGATCATCTTAG 

TGTCTATCTGTCTGTCTACCCTCCCCCCTTTCTCCCCCTCTCTCTCTATCTCrCTATCTC 
TCTTTATATAAATATGTGATAC^TATGTATGTATAATATTTATACACTAGTTTTTAGAAGATTGTT^ 
10 AGAATAACCTTTGGACACAGTCCTTCTCACTATAACATTGTAAGGCAA^ 

ACAGCAAACACTTTATGGGTGTCATAAAATTAGGCAGATGGTTAACAGGATATTTAACAGACTGTGATC 
TTGAGGCTGACACTAGAAGTGGCCATGGAGATCTTTGTGTATAGACTTAGAAAGATGTCTGC 
CAGGGGAGTGTGCAGAGCACCATCTGCTAATGTGAAGCTTGCAAGGTTTGGGGA 
TTACTTCTGAGTGCCTGGGTTGGGGTAGCTTGCCTTCCTCCTGGTGCTCTT^ 

1 5 caatcagaaatagtcccagagtctttacaaatcagctattt^ 

tatccaaggacgatgggctgctccgttagggtgggagggccccagacttgaggaatgtgtggctgaact 
tcacaacccaacagctaccccactcaacaggtgtcaggaggctctcctcagc 

ggactggtccacagg at aaaggttc cttt cggcttgtttttccatagcctcacc^tggggccccatagtagctttcagaggaaatg 
ccactaagtgtggccacactgtcttgttcataggatcaaagtgt^ 
2 0 accaatgaggaaatcagtgtctttctcggggacacaaaaggaccc 
attcttagttttgtttttgtttgttttgtttg™ 

gg atg ctt acacaag act ct catt acttc caaaacatttcac caacagggag ag ac cat aaaagg ag tt acagtgg at aaagcct a 

GAGTTTAAAGAATT ATTT ATTT AT ATGTG CATTG ATTGGTGTT CTG C CTGCG TG CTTGTTACAAGGGTGGCAGATCTTCTGGACTT 
GGAGTTACAGACCATTGTGAGCTACCACGTGGGTGCTGAGAATTGACCT 

2 5 CT CACCAGCCT AGGCCTAG AATTT AAAGGCCCCGTTAATT AGAAG ATGT AAAGGCAGG A 

G AAT AT ATTGTC^TTAAATT AAATACATGATG ATTT AAT ATGT ATACT AAG AAGG CAAGA 

CATTCATCACTAAGTGGGCAGAAACTACACAGAAAGCTTACACAC 

GATGACAAAAACAGTACACATTGAGACAGGGTTTCTCAACCTC^ 

T ACT ATTAAGGTATT AAGG ACCTT ATG CAGCTGGGCAGTGG TGG CG CACGC CTTTAAT CC CAG CACTCAGG AAG CAG AGGCAGGTG 

3 0 GATTTCTGAGTTCGAGGCCAGCCTC^TCTACAGAGTGAGTTCCAGGACAACCAG 

AAAAAAAAAAAAAAAAG AC CTTAT ACATG CCAGGCTAACACTCT AC CG CTG AACT AT AACTCAT CTTTT C CCCACTCT ACATTT AA 

AATTTTGAGACAAGGTATCACAAAGCTACCCAGGCrGGCCTTGAACT 

CTGCCTCGGCTTCCAAAGCAGCTTGGGATTGTGGGCCTAC^ 

CTTGTGCAC CGCAAAATGTAAACT CTACCCAGTAGATGCCACTAGATTCGTCTCCCTCCAGTTATG ACATCTGGAACTTGCCAGCT 

3 5 ACCCCCTGAATAGAATAGTTACCCCCATTTGGTT'ITGATGTATT^ 

GCTTCATTAGAAGTTGACAGCTTGCACCAGTAAGCAGGC^ 

CTTGCCGCCGGGCCGGGCCGAGGCGGGAGGGACCCTTCCAAGTTGACTTTGCGGGTGGA^ 

GTGCITGCGACTGTGGGCGTGGAGAGAGCGAGTAGCGGCTGGAGCAGCCGTGCTCCT 

ACTTCCCGGACAGCGTTTGCGCCGCCCACCCCGTGGCCACGGGCCCAGGATCACATGGCTAGACCCGGAG 

4 0 CCTC CACTCTCTCCGGCCTGCGTTTTG CGAGTGT AG CGGTGG AGG CTG AATCGG AT C CCAC CAGAAACC CCAACAAGTGGG CAGTT 

GGGACT AGGG C GTT CACT AGGG CG TT AAGAAAT CTT CACAAAACT AG TTGG CACAATTTCTTCCATT AG AAAGCATGGCAT CCAGT 

GCCTCTCTGTGACTGCrrCTGC^TGGCTTCAGGTGCAGCAACGTTAC^ 

TGGTTATACCGTCTCACAACACAGTATGGATGCATCAAGGTGGCATTTCTA^ 

AG AATGTC CCAT AC CCGG CAAC CATCT CAAGT AAATTC CTG AT CATG CACTTG ATG AGGTGG CCATGAAACACAGTACTTGTGACA 

4 5 CTGTAGCCATTACAAAGTT AT AAAAAAGG C AAAAG ACAAATT ATCTTTT CACT AGGG TTG AG CTTCT ACAG AAAG AAGT AT ACACT 

TTGTTTGAAAAAAACCAGGATGCTAGTAAGTAGCGATCCGAGCAGGTGTTAGGAGCTAGGATTATC 

CTCTCGAAGGCTAGGCAAGGTGGTACAAGCTTTAAT CCCAGCACTCAGX3 AGG CGG AGGCAGATGGTTCTGATTTCGAGG CCAGCCT 
GGTCTATATAGCAAGTTCCAC^CCATCAGGGGTTCATAGTGAGACCCTATCTCAAATTAATTAAT 

5 0 TGTGTGTAAAGAAGATGGCCAAAGTTGAGAACTCTGAAGGAAACTGAAATATCGCTG 

CTTCTCAGCTGGAGCT CTTTAGGGAATGAAAAAGGAGTCCTAG CAAATATCAT ACCAGGACAGCAG AG ATGAACTTG AGCCATCCC 
CAGAAAGCTTGGGAGAAC CAAGGACAGTGAC CTGGATCACTCACTAGTGCAAAGGAAAAGATAG AGTTCAAAACCAAGCAGATTAG 
G ACCAAAAGGCCATTCTGTGTGCC CAGCATGG AAGCCTAAG CCATTGGTCTCATGGGCTrTT ACATT(^UVCTGT AGGTCTGGAAACC 
AATTCAG AATCTCACTAG CT ACACAT CATGAAAACCGCTTT GAC ACTGAGG AT CAACTG CAAACCACAGAGGAGGCTGGGAAGACC 

5 5 TGGGGAAACGGGGCCTGTGGAGAGACTTGGGGCCAAACACAACATGAAGCATCAGAAACGGAATC 

AAGGATAAACCAGATATGTAATACAAC^GATGGAAGATAGGAAATTGG 

CTCAACTGGAGA C T 1T 1T TAGGTAGACCAGCTTTTATTAATTCTTAGACAGTTTCATACATATAAGCAGTACATA 
CTGTCCCXACTCrCACCGATACATTCCCTTCTCCCTGATCTACATCCTCTGATACTGATACCT 

TGTATATGTGTATGTATATGTGTATATGTATATGTGTGTATATGTGTATGTGTGTGAATGTGTCTATCTGTATTATGTGTGTGTGG 

6 0 TGTGTGTCTGTGTGTGTTTTAATCCACTGAGTTCAATTAACACT^ 

G CAGTG AG TTTATTGG C CACATT AC ACTCAAACG AG AG CTTTCT CT ACATT AACTGTTGAC CAT AG CAG ACAG CTG CTTCTTTCTG 

GCCAAGGCTGATGAAAGCACTAAGTCTGTGGTATAAATATGAACAAGACAGTTTGACAGCTTG 

AGGTTArcTCCTCTACCCTACCACCCTACCATCCCATCCTC 

GGCCTAAGTCTCTTAAGCCTTGAATTAGAGAGAAAGTAAGAATGTTCATCTCTCCTGGGGAGAAAA^ 

6 5 TTGATTTGTCTGCTCAGCTCATGCAAAAGTTGGTCTTGGC 

AACCAACCTGCTGTCTTGTCACTTGTGAGAAAGAACTATGCCCTTCTGTCTGTGGGTTCATGAGAACT 
CCTCCTTGGGTTGGTCTGATCCGGGAAAGTCAAATACAAAG>GAGGG^ 
CTGGGCCTTTTGAAAAAGGAACCTCTTAGATTCTTTTGTAACTGTTATAA^

CCrCGAAGATGTCATAGGTCTACTGAGTAGACACCGTCAGGAGCATTTGTCACAT^ 

7 0 CTGAGCATTCACATTGTGCTGTCTCTGTCGCJ^CGCTGAGG 

AGCTCACCAATCCAAAC-GGACCTAGTGTGAGTGAGGTTTAACT 
GAAGCAC-GGGTGCTGGGACCGGGGCAATGCTGATGGAGTAA^ 
CTCACAGAACTGACTCTGGGAAGGTAACCTTTGACCrCCAAAGGCCT^ 
CACACATGTAATJ\ACAATTAATAAGTTTAGCACCTTAAGA.GC^ 
7 5 CTCGGGAAGCAGAGGCAGTCAGATTTCTCAGTrrGAGG 
GAAAC C CTGT CTCGG AAAAAAAAAAAAAACAAAAAAACAAAAAAAAG AGCTGTT CTG^ CAAT AAT CTATTGT

CTTTAAATGTCAAAGTATTCTAGAAAATTTTTAACTTAGAAATGTTCATTCAACTCCT 

ATATATTCTTAACTTAGCTCGGTGGTGGTGCACACCTTTAATCCCAGCATTCAGAGTGAGTTCCGG^ 

GAAAAACAAAACAAAGAAAG ATT AAAAAAAGAG AGAAAT ATGT CT AATGAGAT AAAACATG AAAT ACATTT AATAGTGATT AGTT C 
5 TGCTATTATACTCTTTTTAAGTATCTGCAATGAATATGATTCTCAATCT 
ATCTTGATGCCTACACCCCATTTGTAGCAATGAAATTGGGGCCT 
TGTGTGCTTG^TAATCACACAGCTTGAAAAGTGCTTTGTCAAAAAAAAAAAAAGTGA 
ACAGGGATGGGGCAGGGATGCAGGGAGACCTTCCTAAGCCCCT^ 
CCTGGGGAAAGTGACCAATGGCAGAGACTCTGGGCCACAGGCAGAAGTCCT 
1 0 AGCTATGCATGTCCTCGGTCCCAGCACCTCGTGGCTGACACACACT 

TGGTACCCTGGGAGCAAGCCATCCATCGGGATTTCTGTCAGCTACAAGTCAT 

GCAAGATGGCTGCC<^CCCTCCAGGCCTCAGTCTGTGTTGG CGGGG^K^TTG ATTTGTCCTTG CCCAGCAACTCTACCATGAAG AG 
AAAAGAGAAC CT CTGTC CCTG TGC CT CT ACCT AC ATATTACAGTCAT AATT ATG TCAC ACG CACACACACACACACACAGCC CCAG 
1 5 GTGCAAGG AAGCTGGGT ATTTG TGTCTCT CT AG CT G AAAACAGTGGGGT ATTTGGGG AGGACG CT AAGTGAGCCAACCTCT AGT AG 

GCCCATGGAAAAAATGAGGCCCAGGTAACATCCTTCCAACCCTGTGACTATGAC^ 
GCTATTGGTGGCCAGGAGCTGAATGAAGCCAACTCCAGAACCCAATGT^ 
GATGGAGGACCGGTCCTCTCTCTCCrGGGGAGCAAGCCCACCTCTCCATTG^ 
ATGTGTACrK3ATGTCTXMCACATGG 

2 0 TTTCCTTTGC^CAGAAAAAAACTACAGAGGGTATTTCATCCT 

ATAGCTTGGCACAGGCTGCTCTTTCAGATGGACrGCAGCACCC^ 
CCGACGCCTTCITCTGGCOTCTACGAGCACCAGGCACATCTGTG^ 
AAATGGAGGAAAGTCAAAGCACTC^AGAGCTGGCCATGAACAGAG^ 
GAGCCCCCCGCATGAGCTCAGGCTGGACCCCAGCCAGGC^GTGCCAGCT 

TTCCCACZACCTCAACGCCTCCTCCTTAGCCACCCCCAGCTC 
ATCAGCTCGAGGGAGGGAGGGAAACTGGGAGGTTCGCCAGTGAAC^ 
AGGCAAAGGACGGGTGTTTGTGCTGCAGCCTACAAGCTGTGTGGTCT^ 
CTCTTTTCTAAAAGGGTGTGTTATGCAGTGGTGTTACCAAGATTAAATATGA 

3 0 ATGAGGCATATTCCTTTAACCCCAGCACTCAGGAAGCAGAGGCAC^CTGATTTCTC 

CAGTTCCAGT CCAG CTAGG CT C CAAT AAG ACTCT CAAAAAAAT AAAAT AT GTGTGT AC CAT AT ATG CACAT CT ATTT AAATTGTGG 
TG C C^GACACAC CCAT AGG CACATGCA.CAAAAAT AT CIAAAA^ CG CT AGTCT ACT AAT ACAT CT AG 

GTAGCCAATTTCGAATCTGAAATATTGGAAAATTCAAGGGGAA^ 

CAACGGGCACCCCGTGTAGCCTATAACCACCAAATGGCAAGGTTAACCCAGATCTAGTTAGTTA 

3 5 GTCCTCCCTGGAGGACCCAATGTCAC^AATAGCAGGCTCCAGCTGAGGATC^^ 

GCTGGCATCCAATAGGAAAAGAGAGTCCTCCTGGGGCCACGCAGCCAAGACC^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNTOJNNNNNNNNNNNNNNNNN^^ 

GTCTCAATTGGTTT CCTTGGT AGG ATGGCTAT ACTG ACTGCAG CAGCCGTTGTGAGAACCCTGATGGG CCTAACCACATTCTGAAA 
GGTGAAGAGAGCAGk3CTAAGGTGGAGCAGGGAGGGAGGCAC^CTCGCTTGATCTCCACACT 

4 5 CACACACACCCTC^CCCCTCTCCCTCACTGAGTTCCGTCCCTTCTCACAC^ 

G G ATAAACAG CTGACC C CAACTGT CC CTTT AG CTAGATGAGTC CT CTCCAG AGG CC CTG CTG C CCT CT ACAG CTT AGAGGGATGGG 
GCCCAAAGCAGCCACTTCCTCCTACAAGTCATTCTTTCTCTAG^ 

TCTTGCAAGCAGCCGTCTGGCTTTAAGAGGTAAAGACACACTTTG CCTCATGGTG AGAGG CAG CAGGCAGTGGTTAGTCAGGTCTA 
CCCCAGCGTGGGCTTTGGCATCTCTAGGCCCAGG AGGTGACAG^A CCAGCGGG AGCTCCG CAGAG CGCCTGCTAGTG ACTT 

5 0 GCCCTGCAGTCACTTCCCGAAAACTAGGCCG^CCTCTAGGCGG 

GCCTTGGGGTTTGCCCAGGCTCTTCXK3AGCACXK3AACTTACAGTAA^ 
GGGCCAGGACTCAGGGCCCCAGTTCC(XTCTTAGCATACCGCTCCACCAGGTTCAT 
CTTCCTGXSTAAAAGGCTGACCTCCCTCrrGGGATGTGCCAGAAAACAAAA^ 
CAGCAGCAGCCAGGGATTGAGCATCCAGTTCTCGGCTTCCCTCAGCCCGCT^ 

5 5 CCTG^TTAGCTCAGAGAGCATTCAAAGTGTCGCTGAGGCTCAGCGCCTTCCCATATGACCTTCAAAAG 

TGAGCAATGTCAGGGTCCTCTGCTCCTCCACAACTTAAAAGAGAAACCCCGACTGTACT 
ATAAATCCCATCTCTCCTTrAGCCrGCAGCTCACrCCTGCTGGATCATTT 
TAACCACTATACTTTTGAGTCAACATTCTCATGCCAACAATTGATGTTATCAGCCAGTC^^ 
AGCTTAAACGGGCATTTAATACATGCTTAAATAATTAAAGCAGGCTTTAATTATTTGTAT 

6 0 CCCGGGCGTTAAAACGACCTAAGACACCTGTTAAAAGCTCAGATTTCC^GAACT 

GAGGCAG AT CCCTGAAAGCCCG AGGGCCACTGG CAGTTCCAGGTAATGAG ACATCCTCTCTCAAACAAAATGTGGACX5GTGCCCGT 

GGGAGTTCTTCTC^CCTCCATGTGCGTGTCACACATGCGTGCACCTGCACCCATACATGTGTGCACACACAT 

CAGATTCCAGAGTCCAAATTTCTTCCCIAAATCCCAGATCTGCAGAAGAGCT 

6 5 GCCTCTCAGGGGACTTGGGGGAGGCCCCCAC^ACTTGGCAAGGT^ 

GTTGTTTCT CAG CAACTTCT ATGC CC CAGTG CTGTG GTGG CTGGT AAATG ACG CAGTC C CCACAACACTGGAAGACATTT CTGGAC 
CAAGCGCITGTTGCCTCTGCACAGAGTACAGGGCTCCTCCTTGTTTGAAGTT 
TGTGGTCTI^XAGGATACTGAATGGAAAGTTCCAGAGATAAACAATTCATA 
AATGTTCCATTCTGTGTTTATTTATTGATAATCGCTTCCTATGTTTAATAAA 

7 0 T AGTGTCTT CTGGGTATTGTCCAAGGCTT ATGG ATGGTTTGGAACAGG ATCGCAA 

TCCATCGTAGATCCTCAATATTCAGAAAGGCTATAAGCACCCTAGC^ 
CACAAGGCACTTGGCGTGTGTTGGGGAGTAAGAGGCACAAACTGCTTTCCTCTACCT 
G CAT ACG NNNNNNNNNNNNNNNNNNNN^ 

TTTCTGAGGTTTGTCCCTACCAAACrTGTATGTGTATTTCATCC 
7 5 GGGGATTGTGTCACTAAAAAGTACTAATGTCTTTCTCAGTTTCTCTTCT 
CTTTT CAGTGTTG CAG ATTG AACCT AGGGCTC CATTG CTTCTTCACACTAGG CAT AT CT C CCAG CCCTTGAGAAT ACT CATCCATT
TAGAGACAGGGTGACAAGGGACTGTCTCTCTGTATTTCCTTTCT 
TGCITCATGACAGGACTGTGCCACCACTCCC^GCTTGTCT 
AGAGACCCCCTCTCTCTTATACACAAGCTCCTTTTCTAAGCTCCCC^ 
5 CATGTCCTTAAACCTTCAGAACTCTGATCTAAATAAACTACCCAGCCTCCTGGA 
AGGTGAAATTGCTATTCCTATTAGTCAAAATAGACAGCAATCAGCAGT^ 
ACAAATCTATGTGTATGTGTATGTGTGCACAGC^AATTCTCAGTTGCCCTCT^ 
TGTACCACCTTTCTGCOVTCATGTACC^CCAACCrTCCCCCGACCCCCGAC 
GTGAGGACCTGGAGAAAC^GAACGCAGCTCTCCGCAAAGAGATCAAACAGCT 

GCCTCACATCAGCTCGCC^CGCTTCCAGCCCTGACCTC 
TAAACTGAGGCCGGGCTGCCCTC^TACCTCTACCCAGA 

GGTCAGAGGGACTTAAGGCCTCCCAGGGAAGTATAGTCAATGTACTGGACTCT 
AAAATGACAAGTCAACCCTGGACTGTCATGAATGATGCCCAAAATACAC^ 
1 5 TAAATAAATATTTTCTAAAAAACCAGCCCTXKSAAAGCGGCT^ 
TTOK3CACCTGGTTTTCCTTCTTGGCTTCAGGGAGCT 
CCTGTTTGGGAAGCTGGTCTCITCAAATC^TCTTCA^ 

CAGCCGTGCCTCTTGCACGGTGCTTTCTCTAAATC^TGGAGCCAGTCACTCT 
GGAAATCTCAGTTTCTCCC^CTTGTACACCrrTCCTGGTTGGCT 
2 0 CCATGCCAGGAATATGGGTAACACAAAC13CAGC^ 

CTTAGAAATAGGACACTCAACCTGTACACTGAGTGTCCCAAGGTTGAGTA 
ACCAGTGTTCTTCATTTCCCTTTGATGTCCACCTGTTTCTCAGAAGTCCCCAGGCCAC^ 

TACCCATAACAGTG CCGAGAAGCAAGAACAGCCGTCTCCTCCAAGGT^ CTGACT 
GGCTGGGGACCAATGAGGTAGATAACCCAC<3GCCAGGGCTCTG 

2 5 GAAGCCACCTTCAACTTGCCIATGCTCTCATGAAGGGGAAGGAGGAGTC 

GTGACTCTGATTTCCCTAAAGCCTCATTTTGATGGTCCCTGCCCAGGTGTCTCTTACCC 
TCTGAGCATTAAACTTCATAGAGGC CTGACT AACCTGCCTCCTGCCTATGACTTCCTTG 
CAAATCATGGTGGTAGGGACAGAGTGGGCTCTCATCTTGGAAGCCATTGCCT 
ATTCAGAGTCTGCTTAG^ATATAGGCTGAGCCTGATTCAGAGTTTGCTCAGGTTACCG 

3 0 CCATCTCTGGAATCCTAATCTTTGGTCTCCCATCAAAACGGC^ 

TGGGATTTGTGACGGAGAGCGGCCTTCTCTTCAGTTCCCTACAC^ACCAGGATTTGGGGGTCCTGGCCC 
GCCCCAGCATGAAACCACTGACTTCAGGTGCCCCGTGTGTCATCT^ 

TGCAGCTGACCACTACTTGGCCTTTCATAC^CAGGCTTCCTTCCAGTCAGAGATGTCTCCCCTCACCCCT 
TTCCCCCTAGAAATCCCGCACAAGACACATGACTGGCCCGGGCATCTTCTGAATGAGACT 

3 5 CACGTGCGCTGTTCCCATCTCTCAGCAGTCIACTCCTCTTCCC^ 

CCTTT CTGATCAGCrTGGTGAAAGCAGACAGCAGAGGTG CAGTCTCTCTGCACGGGGACTGGG AGGGG CAGCCACTGAGTTCAGAC 
ATCACAACCC^GCTTAGCTCCTCAGTACAGGAATCAC^ 

AATGCAC^GGATCAGAGTATAACAAATGAATAGTATGTCATGTCTCTGTAGCTCCATCCAATGGGGTCATGCTATAAT 
AGTGATTGCTATATCTCCCAGTTTGTAAGACAAGCTAAAATCATAATTCCrCT 

4 0 TCTCTCTGTCTCTCrCTCrCTCrGTGTCTCTGTCTCTCTCTCTGTO 

CAATATGGTAGGGACATTCTGTGCCACTGAACCACACTTC^
CGAGGCACTCAGGATCTGTAACCTTGGCTG<3CCTGGCACriTAAATAGCTCA 
AGCCTTATGGGTGCTGGAATTACAGACATCTTCrACCACACrrCTTCT 
GATAGGCCAGATTTGCCCCTTACATCTGCTTGACCTGGGAC^^ 

4 5 TTCTCTTCTCTGGCAAAGATGACTTCACATCTGCCCTGATGTGCCAGCTTCAAATTTAGGAl"^ 

AGTTCAGGTAATGGGGAGACAGACCTCTAGTGGCAGTCAGTCTGTTCTTGCT 

TCTGTTCACACT CCAGCCCCAGG AAGGACAATTCTGGAAAGAACTGGAGCCATG^ CTT AGGG AGTCTGGGCATC 

CTGG AGACACAG AGCACTGCGGG CTCTGAGGAGGTG TG CCTC CTTTGTCCTAG GG ATT C CTTT C CT CGAG C CAGGCGTGG TGG CAC 
ACGCCTTTAATCCCAGCAGTTGGGAGGCAGAC^CAAGTC^ATTTCTGAGTTC 

5 0 AGCCAGGGCTACACAGAGAAACCCTGTCTCAGAAAAAACAAAACAAACA^ 

ACTAGAAGGAATAAAAAACCAGGAGACAGACCAGGTTACCAGCTGCTG 

TTCCAGCCCCTCTCACAGTTGCTCCTGTGGTTTTGTGACACAGGAGGTGAG 

GGCTGTCTCCATCCTCCTTCTGCCCAGGTTCCCTTCT 

GAAAGGGGTT AAGG AACCAAGAC CAACACT AACTTCCAGTT CTG CTTTGTTTTGTTT ATTT ATTTTT AG ACATGGAGT CTTT CTGT 

5 5 GTCTATGTATCTAGTCTGGCTTCTAACTTCAAATCCTTCTGC 

TACCTTAAGACCCGCCAAAGGGCAGTGCGTGACTCAAAGGAATGTC 
TTTGTTAGAATTGTTGCTATAGAGATGAGCAGTGTCTGCCCT 

GGACAGAGATGTGCCAATCCTAAAACTCTTGTGGGGTATGTGGCnTAGTTAAGTTCA 
AGAACTTGGGGAGGAAAGGGTTTATTTGGCTTCTATTTCCACAGCATTGTTTGT^ 

6 0 CAGGGCAGGAACCTGGAGGCAGGTGCTGATGCAGAGGCTTGGA^ 

GCTTTCTTTCTTTTCTTCTTTCTTCTTTCTTTCT^ 

CTTT CTTT CTTT CTTTCTTTCTTCCTTCTTTCTTTTTCCCTCCrr 

TCTCTCTTTCTTTCTTCCTTCCTTCCTTCTTTCTTTCTCTCTCCCTCC^ 

CTTCTTTCTTTCTTTTCTTTCTTTCTT^ 

6 5 TTTCTCTCCTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTC TTCT T TCT T CCT 

TCCTTCCTTCTTTCTTTTTCTCTCCTTTCITTTCTTTCTTCCTTCCTTCCTT 
TTCTTTCTTTCTTTCTTTCTTTCTTTCTCTCTTTCTTTCTTCTTTCCT^ 
TTCCTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTCCTTCCTT 
TTCTTTCTTTCTTTCTTTCTTTCTTTCTrTCTTAAC^^ 

7 0 CATATCCCCTTTCAACTTTGCAACCTCCCCCCCAACATAAAAAAC^ 

AAGCTCTAGTATGTCACAGTGTGTCCCACAGTATATCCTTCTGTCCAC^ 
GTCATTGGTCTGGTTAGAGATCTCTGGCTTCrGTGACACCATCAATAT^ 
TGCCCCGTGTCATGGAGATCCTGCAGCTTTGGAACAACTGG 
TAGGGTCTACCAACTCajGATCTrW 
7 5 CT CCCTTATCTACATCACCAGGGGAGCTTTGCACT 
CTGTTCTCATGACTTCTCACCGGCTCACCCACACCT
GCTCTCCTGAGTC^TGCAGCTGGTGAGGGGTAGGGCCAGCTCT 

GTGATAGGGGCAGGGCATGATCTCTGTGCCCATGCC^CCCCACGACAGTAGAGTGCTGTCCCTGGTGGC 
CTCATCTTGTTTTTATGCTGATATGTGATGTGAGTTAACTCCCTTAAGATAAACAA 
5 T AAGCAGAG CT AAG AAACCCT CTCTTAGAGAATTTCAT AGG AAAAGGCACAG ATGGATCCTG TG CTCAGGGGT CTG CCTGGTAG AC 
TCCTGTAAAAACGCTGCCAAGCCTGTCAATGACAACC7VTGGTTGAA 
CAATAAGATAATAAGTTCCATCTTCCTGCCTACATTGCAGCTAGACCTGG 
GTAGGTCTTACAAAAAAAGCTCTTTAAAAATTGAAGCTGTCAGGC^ 

AG AGG ACT AGGGTTCAGTT CT CAG AACCCAT AGGGTGACTCAAAG C CATCCAT AATTT CAATTC CAGGGX5 ATCTT ATGTCCrCTGA 
1 0 ACTATGAGAGAACCAGGCACACATGGTGCAAATACACGCAGGCAAAACATTC^ 

AAAGAGTGAGGCTATCCCAGTGAGCACACACATTC^ 1 T T T T TTT TT T TGCCTGAAAGTA 

GATGTAATACTAGAGAGGGAGCCACTATCITAGGACCATGAGCAAACCATGCA 

CTGATAATTTTAAAAAAAATGGGGGGGGGGGGCTGGAGAGATGGCTCA^ 

GTTCAACTCCTAGCAACCACATAGTGACTCACAACCATCTGTAATGG^ 
1 5 ACTGT ACT CATAT ATATTAAATAAATAAAT AAAT AAAT AAAT AAAT AAATAAAT ATTTTTC CAT AAT 

GGTTTAAAAAAAATACC^TTCTTAACC^TGCATGGAAGTGCATGCCTT^ 

NNNNNNNNNIJNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
2 0 NNNITONNNNIJNNNNNNNNNNTINNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNIWNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNbJNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNHNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNblNN^ 
35 NNNNNNNNNNNNNNNNNNNNNN 

TCTCCCTCTCCCTAATTGGACTCCCCCTCATAAAGTAAAAGGAACACCA 

AAT C AGG AAAGG CAGG CT AAAAGT CAAAAG AATATAAGT AAAAT AATT CAAAGT CCAT ATGACTT AATT CAAAAATCACAGTT AT A 
TGTAATTCTTATGGTATATCATGATTTTCGTGAATCTCTTAATATAGTTACTGACTCAATATTCAGAAA 
GAAAACTGCTGATTTAGAATTAACATCACTGTTTATTC^ 
4 0 ATCCAATCCCATACAGGTCTGCCAGGCCCTCTAGCACAAGATC^ 
TTCAGAATTTCATTTTAAAACACACTATTTAAAAGGGGATTT^ 
CTGCTTGTTCTTTGTATAATCAAACTCCGTTAACTGGA^ 
TATGTTTCATTTGCAGAGTTTAGAAAGATTAAAATATGTGCACCATAC^ 
AAGTTCTGGAAAGGATGGTTTTGTAATGACACATTTAACAGAAGTTATGGCAACCATGGGAA^ 

4 5 GGCTCCAGCATATGTCTCTAGTAAAATGAAGCAATATTTTTGCCTACT^ 

ACAGGACAAGCAGTTGTAGAGAGATCTAATTGTATTTTTGTTTTG^ 

AGC CCTGGCTGTCCTGG AATT CATTTTGT AG ACCAGG CTGG C CTCG AAC T CAG AAATC CGC CTGC CTCTG CCTC C CAAGT ACTGGG 
ATTAAAGGCGTGCACCACCACGCCCGGCAAAGGGAGTCCAAAAAGCCT 

ACTAACCAGGCCCGACCCTGCrrTAGCTTCCGAGATCAGACGAGATTGGTCACGTTCAGGGTGGTATGGCC^ 

5 0 TTTTAAAGGGATGCATAACAAATGAAAAGGGATAACAAAGACCCCCAGAGAAAAATTACATAA^ 

TTTGGTTTTTCTin'TTTTTTCTTTAAGATTTATTTATTTATTATGAGTACA 
CGAATCCCATTACAGATGGTTGTGAGCCACCATGTTGTGGCTG^ 
ACCACTGAGCCATCTTACCAGCCCAACTTTAAATTTTTTAAATGCTAATGAACAACAA 
AGAGACATTGGATTACAGGAAAAAAAAAAAAAAACGCTGAGCTAACCCAGCCTGTTTATATTAAAGAT^ 

5 5 AAGGAAGGAAATATGTTATGTTGGGGAAGAGGTTTTATCTTTGTTTTAACA 

AAAGATTAGATTTGACCAGAGAAGACTTTCCIAGAAGACCCTGACTTCAGA

MOUSE SEQUENCE - mRNA 

GCAGTCCCTCTGCACCCGAGAGAGAGGAGGACGCAGGGGTCTGTC^GAGGTTGCTGTTGGGCAAGCAGGGGAGGTACCT 

GTGGTGTGCTGGTGGCCCCCTAGCACTCAAGAAGGGGAGCCAGCTAGTGA^

AGCCCXK3 AAG ATTAGAACCATGCCTCACAGCTCCGACAGCAGTG A CCCCTGG CAAAC AGGACT 

CATCTGATGATGTGAGGAAAGTTCAGAGX3AGAGAGAAGA 

GACACCCTTCACCTGGAGAGTGAGGACCTGGAGAAACAGAACGCAGCTCTCCGCAAAGAGATCAAACAGCT 
GTACITCttCATCAGTGCTGAGCAGCCACmGCCCCTG^ 
GTGCCCATGCCTTCCACCAGCCTCACATCAGCTCGCCACGCTTCCAGCCCTGACCTTCTGGACAAGAAGGGC

GATCCCTTGGAGGGGCATGTAAACTGAGGCCGGGCTGCCCTCATACCTCTACCCAGAGGCCCAGTGGCAGAGGCCTGGACAAGTAT
TGAACACAAGAACTGTAGTGGTC1AGAGGGACTTAAGGCCTCCCA 
AGCCAATGTACTGGACCCAAAAATTGACAAGTCAACCCTGGA 
GGCAGGGGGTGGATAGTTTTCTAAATAAATATTTTCCAAAAAACCAAAAAAA 

MOUSE SEQUENCE - CODING 

ATGCCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCCGCTCT^ 

AGTTCAGAGGAGAGAGAAGAATCGCATCGCTGCCGAGAAGAGCCGACAGAGACAGACACAGAAAGCCGACACCCCT
GTGAGGACCTGGAGAAACAGAACGCAGCTCTCCGCAAAGAGA
GCCTCACATCAGCTCGCCACGCTTCCAGCCCTGA

HUMAN SEQUENCE - GENOMIC

ACCCTGGCCTGGAGATGTCCAGCTGAAGAACAGAGAATGCTTGGT^ 
AGGTAACAGGGCTGTGAGATTTGGGAGCAGTAACAGAGAAAGCCTC^
Glt^CTTTCGAGGGAAACTGTAGAAAGAGGATTTTTTGAT^ 
TGGAGTACAATGGTGCAGTCTCAGCTCACTGAAACTTCCGCCTCCC^ 
GGAATTACAGGCACCCGCCACCATGCCCGGCTAATTTTTGTATT^ 
TGAACTCCTGACCTCAGGTGATCCACCCACXrrCAGCCTCCCCAAGTGCT 
AAGGATTTTTATAGAAAGTTGGTATGTGATGTTATTCCC^

TTGCCCAGACAAGAGAGAGTGCAGCAGCATGATAACAGCTCACTGCAGCCT 
CCTCCAGAGTAACTGGGACTACAG&TGTGCACCACCAGATCCAGCra 

GCTCAAGTG ATC CT C CTGACT CAGCCT CACAAAGTG CTGGGATCATAGGTGTG CACCACCCCATC CGGT CTT ACT CTCITTITTTC 
AATATTTCTATTGGATGAGACTCTACTTGGATAATCCCAGTTTTATA 
1 5 TAGGCA.CTGTATGCTTACrrAGGCACrAATCATTGCACrAAGCAATTT 
TGAGGAAACTGAGGCACAGAGTTTTAACTGATTGTC 
TGAAACCCAAAGCTAAGAGGCCXSTTTTCGTGTTCCTGCTTTCCTCC^ 
GTGAGGGAAGGATG CTC AAAAGTCATTTGGTTCAGG CCTCACATTTTCAAAGCACT^ 

AGAT ATTT AG CCGGG C CTG AG AGATGAGGAG ATTTAC C CAG CITTTGAGTT ACTTT CCATTT ATTG ATGCT ACCAC CTCCAGAGAC 
2 0 CCAGGAATTAAATACTCATATAAATTCGATCATTTTCT 

AACCTTATTTTGATATATCTTCATTTTTAATCTGATGGGAAACTC 
CTCCCnTTCCTACTGTCTGAAAGGTGCAGGAAGAGGAGGCCCTGCAGGAA^ 
TGGCTGGATGTGCTATCAGCATGGCCATACCCACAGGTTGAAGTA 
GCATCTATCCGGAAATTTCCrrGAGCTTCAAGGGGGTTTATAAGGCA 

2 5 ATGCATGTTCTTTAACCCAGTGGCTCTCACCrGGGGCGATTTAGCATCTCCTCC 

TTTT ATTTTCAOUVCTTGGGG TT CT AC CAGCAACT AGTGGGTGG AG AC CAGAGAAG CTG CT AAACATCTT ACAATGCACAGG ATGG 
CCCACCGCCACAAAGAATTATCCGCCCCAAATGTCAAG^GTGCC^CAGTTGAGGAAATGTGTTCTAACCCT 
TGAAGCGCAGCCAC^CCCCTCCTAAGAGGCTTCCTCCAAGGCTGCAAAGCCT 
ATTTCCCCTTCIATCACCTGGCCACCCACTCTGGCCCCCTTACTGGAGGGCTCCGC 

3 0 GAAACTTTG AGAAAAATCC C CTTT AG C CATCTTCTG AG CAG TG AAG ATGT ACCATGTGTG C TriT'l CCACCAAAAAAACATT AAAG 

CG G CTAGGGGATTTTT ATT ATTTT AG CTC AAAT AACTAAATAG TGG CAG C CAACTC CTTT AAAGGCG ACTGTCAGC CCTCAAAGGG 
GCGCTCAGATGGAACTCAGAAAAAATTAAAAAAAGAGTGAGTT^ 

GGGACATTTG CATGTACTCTCAGGAGATTCTCCTTTCTTCTCCCTGGGTTT CAT CTTT A^ 

3 5 TG CAGGGCCAGCG CT CT AGGAAAGTG AAACTG CCTTG CAGAGC^IAAAGCAAC CCACTGTG<?CCC CGAAAG ACC CAGACCG CCAGCG 

GCCCCGGGCAGCCGCCGAGC^CACCGGCTCAC^CTTCCTGCATT^ 
GGX3TTAACCCAAAAAGATCAAGTGCGAGCTCAAGGCGGA 

GTTGGTGCrAATCAGGCCAAGGGAGCCCGGCrCCTTTCTGCCCCCAGGCAGGTC 
CTGTGTCTACTCCTCTTGGCCCAGAAGCCCAGCTGGACCCTC 

4 0 TCCTCCATCAAATCGCTGTGTCAGAGCCAGTTGGGGGCTTTTAAAAGACGCCCACGCTCAGG 

AATATCTGAGGCTAAGTCCCAAGCTAGAATTTCCCATCTTCCT 
CTGACTTCTGAAATGATTCTCACGAAGGAGAAGTGCTCCGTTCTC 
AGGCTTCTGCTCTGCGTGGTTCTTCTGCTCTCGTTTCCTTAACGATTGGGCC^ 
TAGATCTCGAAGTCTTTGAAGTGGGCAGATTCACGAGGCGT^ 

4 5 ACAGGGGGACTGAACTGGTAAGGATATAGCAGGTTTGTTGCACTTTTAAAAAATATGTTT^ 

TGCAGTATATCCTC^CTGCTGATTTTGAACAAGGATGATCCAATAGGCACT 

TGCTTCCCTAAATCCrTGGAGTTTCCTTCCTCCCTCCTC 

ACrAAAAGGTTATTTGCrTGGCACGGCAGGGAGTAGGGGCTG 

CACC C CT AATT AG ACATCATTTCAAT CT ACTCATGAAAG CAG CTTG CTG CAGG ACT CTG TG CTGG CCAT ATTT AATTAG AG CAGT A 

5 0 TAAGGTTTTTTTTTTTTAAGTTAAAAACTCATATTGTA 

ATTTGATCCrTATAACGACCCTATGATATTGGTGGTGTGATATCCTCCCGTTTTCrA 

TCTCTTG CT GAT ATCTT CT AG ACCGGGCTGG CTG CAGGGTCTGTTG CT CT CAACCAG AGG AAAT AGAACCATG AAAG AACAAGGTA 
GCK3TTTTCCTCCTTCTG CC AAACTG AGATTCACAAAGTGGACTCTAG CATTCTTCCTCTCTGCCATTCCCTCCCACCAAATTTACA 
CCTGGGG AAACAG TGGC C CAGAGAGGG AAGG AGAC CTTC CT ACAAT CT CAG AGT CAATTGGGGGCAAAGCT AGGAC CAGAATG CTG 

5 5 CCTCTCAACTCCC CCTCGACCTCATATCAT ATTGCTTACATC CAGCTAAATGTCTCCTCAGAAATCAGTTGGGGAAT ATGGTG ATT 

TTAAGTCCATAAATTCTTTATTTAATACTCCTCCCTCTAAGAGGTAGAGACTCATTTGCCTCCCCTCAAGTGTGG 

TGCTCCATTCTACTTTTTTCTTTTAGGTCAGCCTGTTGCCCAGGCTGGAGTGCAGTGGCTCAGTCT 

CTCCTGGGCTTAAGTGATTCTTCCACCTCAGCTCTGCCCAGCTAATTTTTTGTAGAG^ 

TCTCAAACTCCTGGACTCAAGCAATC CTCCTGCCTCGGCCTCCCAC AGTGCTAGAATTACAGGCATG AGCC!ATGCCACCCGG CCCC 

6 0 AAGAGTAGAACAAAGATGGAGGACTAGGTCATAAAAGACACTGCAACTCCTCT 

TCTCTCTCTTTCTTTATTTCTCTCTCTTGTTGTTCACTCTCCAGGGAGCCAGTTGCCA 
CGTGGGGAGACTCATGTGAGGAGGAACTGAGATGTCTTGCCAACATCCATATGAGGGAGCCATCTTGTAAA 

CAATCAAGTCTTCAGATG ACCAGAGCCTCAG CTGATAACTTGGCTGCAG CCTCATGAG AG ATCCTGAGCCAGAACCACTCAG CCAA 
GGTGTTCCTAGATTCCTGCCTCTCAAAAACTGTGTGAGATAATAGATGCTTATTGTTTAA^ 

6 5 GCAATAGATAGCTGAAACAGGAGGCTCTACATTTACTCTTTTT^ 

CTGGAGTGCAGTGGGGCAATCTCCGCTCATTGCAATCTGCGCCTCC 
TGGGACTACAGGAGTGCGCCACCATGTC CAG CT AATT ATTTTGTA^^ 
TCTCAAACTCCTGACCTCAAGTGATCTGCCTGCCTTGGCCrCCCA 

C CTTT ATT CTTAAAATGCCATAAATG CACAG AGTGTTTTGAGG AGTCTTTT ATC CC^ C CAACT AT ATTT CTTTT AAAAA 

70 TTTAGGGCAATTAAC^CATTTTAATGCATGTGACCCTCCCCGCCCCACAT^ 
TTTGAATTTTTTCCTGAACGTGTTTCCAGAAAGTTTATACCAA^ 

TACAAGGGTTCCAAGCCTAATAAAGTAATCTAAGGGCAGGAGGTTTGGAGGTATGGTTATTTCT 

TGGAGCTGAGTAGCATCTTGGCAGTTGTCTACTGTTTGAATACGT^ 

TGGGCAAATGATCGAACTCTCTGTGCCITA&TTC 

7 5 AGATTAAAATGAGATGTGGTAGAGCATGGTGCTTATAAGCATTGAACCAAATTGCC^ 
CTAGCTGTATGATCTTGGGAATCnTACCTAATCrC

TGCTTACCTCCTGTGACTGTCATGAGAATTAAATGGAGCCAATTCATATACTATTCCCGGTACA 

ATGCTAGGTATAGTTCnXXAGGTTGTTCACTG 

GGTATGAGGTTGCCTACCTAGAGX3TGGCGCTGTTTTCTAGTTTC 
5 AGT AT CCAG CAAC CACT AC CTT AAT ACT ATCTTT AGG CTTTAGG CTG ATGTTCACTGAGCACATGC CAG TTTT ATT AT AACTGTT A 

CTTGGTAGCTTCCTGGAGTTCTCCAGGATAACATCAGTGGGATGTGTATATCCCAAAC 

GAAAATCAAATAAAAAATATTAGCAAGGTAAAAAGTTTAAAATCCTGCAGGACAATATTATATA 

TTGGTTC^AGTATAGAAAAATGCTTAAGAATGATAAACCCCACAT 

GAAAGGGATGAAGGAAGCTTCGGCTCTATTCATAATTTATTC 
1 0 CTT AAAT ATTT AAAAAT AATGTAT AATTAAT ACT ATT AAAT AAGAC C CG TCTTTTT ATGT AAAAACTAATTTTG CATTTCTCT CAT 

TAGTCTCrGTTAAATAlTAAATCAGTACATTGATTGGGAATGACA^ 

ATGATAGGATGTTCAAAGAGCTAAGAACTTCATCCATACGCTGTAATACTTAGAACG 

TATTTGACTATGTTTTCTITTTTCTGTGAGAGCAGAGTCT 

TAAATGTTAATTTAAACCGAATGAAACATAAATATGACrATTAT^ 
1 5 TAAGTGCCATAAATGAAGAGCAAAGTGAATTCACTTATGCATTCATTCGTT^ 

GCATATGACTATAAACAAGAGAAACACAGACTTTGGTT1TTACAGGGCCCACTTCT 

TGCTGCCAGCATCTTTTGTTTGGATGGTTGCAGTAATCCAATACTGAAATCCTG 

TCGTAAAATCTCAACTGTGTCC!AGAGAGATCCTCTAAAAACCT 

TTCTCATCTGTAGGAGATTGTAGAAATTGCCACAATTAGTTAAATTTTTCCTA 
2 0 GTCTGAACTGGCCTTGTGACTTTCCTCXMCCAATAGAATGTG^ 

GG CTTCGAATGCTTCTGCTCCTACTCTTC^AATCCTGCCACTATCAA^ 

CATGTGGAGGAGAAATGCC CAAGGCAAAGC CAGG C CAG AC CAGTCAATTAT CAG C CAACCT AC CAG CTGTCTGAGGACACATGAC C 

GAGCCTAGCCGAGATCAGCCTAGCCCAGCCTATCCCCAAAGAACCTTT^ 

GAGTCATCGAAGTTTCTGGTAGTTTTTTGGGCAGCGAAC^^ 

2 5 AACTGTATCCTCAGTACTTAGTGCCTGGCACATGGTAGATGCACC^ 

TTAACATTCTATTGAGGGAAACAGGAAAAAGATAAATCAGTAAACGAATAAAATAAGTATAAAATTGG 
ATGAATAGTAGCTGAGGTAGAGGATAAGAATGAAGGTGGAAGGGGAAACCCTCCATGGGCTCA 
AAAATCCTGATGTTGGCCTCTAAGGCCCTCCCTGACCGGGCCATTACCTCACT^ 
TATCCAACTACCTGGCCCCCCTGCTCTTTTTCCAATACTCAAGCACACTTCCACCT 

3 0 TCACCATTCACTCCCTCTCTTTCTTCGAATCTCTGCTCAGGTATCCCCTCT 

AACCTCGAACCCTGCTTTATTTTCCTCCATAGGAAAGGAATTTTGTCTTACTT 

G AAATTCAG CTCATGG AT ACAAAAATCCT CT AGG AG C CX5 T AACTTCCTG T C AAT CCT AAAATTT AAT AAGGCAAATGTTTGGCTT C 

GAGCCATACTGGGAATAAGGTAAGGCCTTTTGAAAAAGCCACGATGCCCAGGTGAGGGTTGAATTGAT 

TGCGCTCATGGGAAGGCTGGGAATACCCTACACTTACCCTCTCGGTCATTTACTTCAGAG 

3 5 TCATTTGCTTTGTGCCACATTCCTAAGAGGTAGGCAGGGCAGGTTTTATCATCCTCTGTTTACAGATG 

TAGCTCAAGTGAGCTACCCAGGGCCACAGACATACAAAGTGGTAGAACC^ 
TATTTGTTTGACTTTTTCTTTCTTTCTTTCTTT^ 

TGAATCATAAAGATTCAGGGCCTGTCTTGAGCTGCTCAATATGGAAGGCATCTTTTCTCTAA^ 
GCACAGGCGCTTAATAGGCTTTCAGAGC!AGCAATTCTAGAACCTCCCTTGTGGC^ 

4 0 GTCAGGAAAGGAGAATACCCTTGGATCGTCTTATCTCCAGACCACCCCGAGGCTGGCTGGACTTAAGGGGTGAGGAAAGAC 

GGGTGAGCAGCTGCTTTCGGCTGAAAACCAGAAAGAAATCCCAGTGTTTTTGCI^ 
ATGGAAACATTTTCCTGGTTGTCCTCTCCTCTCTGCCCATGTTC 
CCTGGGCTGTTTTTCTCTTTTATCTTC!ATCAATTGAAGAGGGCT 
TCTTTCACATAATTCTGTAGGTCTTTTCCCAGATATGTATTTTTCT 

4 5 TCCAGCGTGAGTTACCAGAAACCACAAGAGAGAGAGAGAGCGTGCAAGCCCCAAAGCGAGCGACA 

CTCTGCACCCCAGAGTGAGGAGGACGCAGGGGTCAGAGGTGGCTACAGGGCAGGCAGAGGAGGC^ 
GTGGCCCAGGAGAAGTCAGGAAGGGAGCCCAGCTGGTGACAAGAGAGCCC^GAGGTGCCTGGGGCrGAGTGTC 
TTTCAGCCATGCCTCACAGCrrCCGACAGCAGTGACTCCAGCTTCAGCCGCTCT 
TTCTCTCTCCTACCTTCTGATTCTCCTGGGGGATGGAAAGA 

5 0 GTGGCTCGTTGCACGGGCACn'CTGTTAGACTTAGGACATGGAATTTGCTACTTUV 

CACCCATTCTTCCAAAGCCCCTTTTTCTCTCC ATTTTCCAAGGCT CCT ATCACCTCTGCCT CACTGGGGTTGCCACCCTAAAAAG 

CTTTCTAGGAAGCAAAGAGGAGGATGAACATCAAAGAATGCAGAGAAAAGAGTCTACTGTTCTCCAAGG 

GAGTTTTCAAGTGGGCCGCTTGGCATGCACAGCTCAAATCCTAGATATGCAGTTTCCC^GGA 

AGGGGTT AGGGAAAAGTTAAAAAAAAAAAAAAG AAGAAG AAG AAGGAGAGGAA CCAGGGC T AG 

5 5 CATAGAAGGATCTCTCTACTGGGGCACTGCCTCTGTTGCTAGCACAGCCAGCAAGGGGAG 

TTGCTTGTGGCTTCTGGATAGGAGCAAGGAAATGAATGAGGAGGTGGCTCTGGAAATTT^ 

CTTAGCTCTGGGGAAACCAGAGAGACTGAGATAGGGATTGTGGGAGCATTTGGAGTCTAGATTGTGACTTCCAAGTCCA^ 

TAGCTTGAGGATTACCAAAAAAGGAGTGCCCTGCCCATCTCACAGCIT 

GCTGTAGGAAGCCCTCCTCTTTCTCATCCGACATGTATCC^ 

6 0 TTCCTTAGCACAGATATCCACAAGGGTTTTCAAAATGCTGCAGCCAAAATAAAAGTATAT 

AGCCTCTAATAACGGAGCAGCAGGCTGGGATGAGTCTGGGTGGAGACCTCTAGAAGGCACAGAATCCAGTCTGCT 
TGTCAGAGTGGCTCTGACTGCCTGGAAGTTTGTACTTTTCTTAGGAAGGAAATTACTCATG^ 
TCCCCCTTAGAGGAAGAGCACTTCCTTCCTATCTATGCCAATGACCTCCTACCACAAAGGCCTGAAAGTATTG^ 
GCCCAGTACTTAGTCATCCCAGAGCTTTGCCCTTGGATCTGTC 

6 5 TTGCCAGGACTCCTACACCTACACTGCAGCATCTGGGAGAGAACTGGAGCTAACCTCTCT 

TGGGTGTAATTAGAGAACACCCAAAGATCCCCTTATCCTACCAAGCACCCATCTACTGCCCACCACAGCCAG 
TAACGTTCITTTTGGCATTGATGTGCCAGGCATTGTGCTGGGGCCATGGGGAATG 
GGAGGCAGATCTGGGATGCACCCTCAGCTCAGGAATGCAGCAGAAGACCAC^TGGGGTGCAG 
GAAAGCCATTGGGCTCAGCACCAGAAGATGTAGCACTCTCAGATGACCAGGA 

7 0 CACATATAAAATGGAAATGATGATATCAACTTCACAGCATTGGGGAAT^ 

TTGG CATAT AGT AGG CACTCAACAAAGT CTGAAAAAAAGAAAACAG CTAAAAG AAC AGCAAG AAG AG CCTAAAGGC CAAAAAG TGA 
GAGGGCGAGGGTGGAGCAGCCCTACAGAGGAGAGGTGATAAAAC^CACTTCATCGCCTTAACC^ 
ACAGCTCTATTGAGTCCTCAAGTTCTCTTTTCTTTTTCCTGT^ 
GCATGGAGAAGAGCTCTCCTTCGGGTGCAJ^CTrTTGGTAAA 
7 5 GGCTGGACGGATACCACACACCACCAGAGCCTGCTTTCACTCAGAAGG 
CCAGGACTCATCTGATGATGTGAGAAGAGTTCA

AGAAGGCCGAC^CCCTGCACCTGGTAAGTGTTCAGATCAATCTTCATCCTCGTG^ 
CCCTTGGACCATAGCTTTGATTCTGCTTGTGTGGGGATGTGtATGTGGGGTGGGT^ 
GGGAAGGGAAGCTGTCGTGAGAGTGCTTTAAGAAGGCTCTGTA^ 
5 AGGCCAAGGCGGGTGGGTCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGG 
AAAAATTAGCCAGGCATGGTGGTGCATGCCTGTAGTCCCAGCra 
TTGCAGTGAGCCGAGATCGCGCCATTGCATTCAAACCT 

AAGGCTCTGTAGCAGGGAAfGGGCCAGGAGGCTGAACAGAGAAATGGGCC^ 
CAGAGCTGTATTGGGAAAGAATTTAGAAGGTATCAAGTGCTTTGG^ 
1 0 GCATCCCTTGGCAGAGACCCAAATGCCCCTGCATTTGA» 

TTATTAAAGTCTTATTTTGAAATTTCTGTCTTTAATGGTATCCCAAGGGGTCTATG 
GATGATCACCATCCTAGTTGAGTTGCCTCTAGTCCCAAATTTCT 

TTG AGCAGTGCTG AACTG CAGTTCCGCCAGTGGCCAAGGCCCTAGATGGTGTTTCTAGGTTCTGG CTGTGCCCCTTA^ 
GCTTCCTCCCCATTGCTCATACCTTACCACCTTACCTCTCTTCACTCT 
1 5 GTTCCTTTCTCTCCATCCCCTCATCCCCTTCCCTAGCAACTCTAC^^ 

GCCTAGCAATTCCTTTCCATGGAAAAAGTGCAATTGGCACCAAAAATCAATAAGCT 

CAGCCAGTTAGAATGGCTGTGTTAACCGTATCAAGAATAAGTAAGTAAGATGTAGTTTTAG 

ACAAGGAAACAGAGCAGACTGCACCAACAGCTCTGATTGGGTGCTTGGGTTTTATTTC 

CACT AACATT AAATT CAC AG CT ATTTG TTAACTTTTTGGAAAGAATG AATGAACAGAAAGT CTGT ACT CT CAACAGT CT 

2 0 GAATGTCAAGTTTTTGGTGGCCCTGCTGCTTGTATGTTCTCCATGATTTCACTGTGTCCCCAC^ 
TGAACATCTATGATCCTTACCTCTTTCTCTATCAGAGTCTGATTCTGATTCAGTATT 
ATTTAAAAGGATAAAATCTTCCTTGTGTACAAGTAGACAATGTCCTCAGACTGGCAAACT 
GTTTTCTTAAGTGGAATGTAAGTATAGTTCGGCAGCTCAGCAGTCAGAACAGGGATTGAATC 
GTGACAGAGACGTTCTTGGCCTCCCTTGGGCTTGGTTTCCTTATCTAGAAAATC 

2 5 ACATTT AAAG AG C C CAACACAATG C CTGTCATT CAT AAATACT CAAAAATTT AAGT AT AAT AAGAGT AAT AAT AATT ATCAT AAG A 

AGCAAGATAATCAGAAATTATACACACATTTTATATATATATGAAAACATATATACATTTTGTGTATATTTAA 

GGGTTGAAAGGTTTCTTAAAGGCTAGAAAGTTTAGACATAAGTTTGATGCTTCAGACTTCTCT 

CATTGTATGCCTGGTCACTTCTAGGGCTGGAGATCITCCT 

CCCCTGTCTCCCTGTAAATTCAACTCACTGGTCTTCATCCAAACCCTTGGGGAACGTATTA 

3 0 GCTGCGGCAAGGCTTTCAGCACTGAATCAGCTCTCTCCCTCTCCAAGACAAATGTCAGAGCCCA^ 

ACAAGTCATGTGACATTAGGGTAATCTCTAGAGCTTGAGGGTCCACAGAGCCCCAAGTCATATAGTCAACATATCC^ 
CCCAGCCTCAGTTTCTAGAATGTTCCAATCTTCCCCACATCTGAACACAGATTCCTCCTC 
ATCGTCTTTACCTTTTGCCCAATGAGTAGGTCGCAGCGGAGTTGATGCAGCAAGCCTGCGGACTGCT 
CTTGGCTTCAGGTAGAAGCTTTTGGCAGCAACGTGTTCAATCTGCCCT 

3 5 CGAGATGACAATCACCTATCTTCTCCATCCATTATCCTCTTAATTATGTCTGAGATTCAGGTGAAAAG 

GTGTCAGGGCTCAGTGTGTGGTTTGCCCTTGACTCCTTTCCATGCTTGGCTCCACATGCTGGGGTTGTTCTCT 
GGGCATAGGCCTCGCCTACCAGTGCTTTACAGATGAGGGTTTGGGAGCTACTGAGAGTACCATATC^^ 

TTAGGGTTCT AGCCTCATAT ATACTGGT ATGTTTT CTT AGTTT CAACTGCTGTCACTATT ATCAACCTTAAAATACTGAAACAATT 
ATCTTGTTCCTCTAGTGAGAGTGTGGGATGCTTACTATTCGACAT^ 

4 0 TACATCGATTGCTTTAAAAATTGACAGAGTAGGGAGAGAGCCTCTTTTTATGATTGTG 

GACAT AAAATG G AT CGGTG ATT CCTTTCGT AATTGTGACATCAACAAAC!AATCT AATTCTGTTT CT C CTTTTGTTG AG CAT 

CTCACACAAGCACCCTGCACACrTAGTTCCTTTTCTCCTGCAGACAA 

AAAGAGAAAGGCATAACTTGACCAGTTAGTTTCTAAAAGATATCAATAGCTAATAAGTGGACCTAAGATAATCAG 
TTTGAAATAATAAGAGAATGCAGTCAGCTTCTAAAATTAAAACAAAATGTAATAACX5 

4 5 AAATTAATGGAAGGAAAGATCCTGTTTATAATACCAATGACAACAGCAAATGG 

GTGT ATC CT AAATG ATTTTT AAAAG ATTCAAAT AATTGGAAAGATAT AC CTT ATGT A TT AG CAA 1' 1 l"l C CTTAAGTT AATTGA 
TAAATTTAATGAGATCCCAATAAAAAACCCAATAGGGTTTTCTGGGATTTGGGCAAGAACT 

G AAAAAAAAT CAT ATG ACAATTGCCAGG AAAATT CT AAAGAAG AAG TAT AATGAAAAAAG ATT AACTG ACCAACAT ATT AAAGTGA 
AACACAAATACAGAAATCAAAACAATGTGGAAAAGACAG 

5 0 GATTTTAAG AC CCAAAT ACAT ATTT CAACTCCT AGG ACACAAGTATAAAT ACAAAATGGG CTGGG CACGGTG G CTCACG C CTGT AA 

TCCTAGCACTTTGGGAGGCCAAGGTGGGCAGATCACTTGAAGTCAGGAGTTCGAG^ 
TTTTTAAAAAAAC^AAGCTAAAGAAACTTGAAAAGACAAATAACCATTA 
AAACATAAAACCCTGAAGCCATAAAGGAAAAGATTGATATACATTACTTTTAT^ 
ATACTGTATGCAAAATCAAAGGCAAATGACAAGCTGAGGAAAAATATCrGTAATACATATG^ 

5 5 ATATGAGAGAGCTCTTATGAATAGGAGATGATTAACAACCCAAAAGAAAAATGAACAATGGGCC 

ATCCCAGCAATTTGGGAGGCCGAGGCXK^CGGATCACGAGGTCAGCAGATCGAGACCATCCTGT 

TGAACCCGGGAGGCGGAGCTTCCAGTGAACCAAGATAGCX3CCACTGCACT 

AAAAAAAAAAAAAGTAAAGAAAAATGAACAATGACTATGAATAGGCAATTTACAGAAAAGAGAA^ 

6 0 AAAAGATGGTAGACCTTATTTAGTTAGGAAAATGCCAATTAAAGCAGCAGTGAGGCTGG 

GATGTTGCCAAGGTAATGGAGAAGCAGGCACCATCACACACAGCTAGTAGTA 

C ACGTCCATT CAAATTAT AGGTGCACATACCCTGCGACTCAG C^^TTTGACIATCCAGGAGTGTGTTCACAAACAAATCCACACATC 
AT ACAATT CTGT ACATGTAAAG ATAATCATTCCTGCATTGTTTCTTAT AGGAAAGACTGAACATAAGCT C AATA 

AGGGATTCAATAAATTATTGTCCATTCTGTGGAATATTG 
S 5 GGATTATCACTTCTGTACTTTAAGTTAAAAAGGCATAAATATAGTCCATAGTTTGCT 
CTATITATTTATTTTTTATTTITAGAGACAGGGTCTCACTATGTTGCCCAGGCT 
CTCCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCAAC^ 
AGTTACATATATGCATAAGTATAAATGCAAACTTTTTCAAATAGATTATAAAAAA 
ACTAAAGATCTAAAGCAAGAGGGAGATACTTGGCATTCTTTA 

7 0 ATAATATAGATATCAAAATTCTCAGCATTGATCATTACCACACATGCATTA 

GAATAATCTTCCATCACGACATGGAAGAGTATTCAATCTT^ 

GAACCAACTGTAAGTTGGGATTTGAGTGACTCTAAGGTCC^CAGTACrATATGATAAAA^ 
CCATCTTGGTTGGTTGGTCATGGCTATCTGGTGCCGTACAGCTCGA^ 

CT'G AGGT CAATT AAG AA5CTTT ATTCATGAAAAC G CT AAAATC^GTG G AGTTGCCGGG AGCAGTGG CAAAT AGC AGTfiTT AG C CCT 
75 G AGCCGAGCTGTTGAC^GTC^TATCTGGCCTGGTGGATGGAGCT^ 
TTTAGGTACTGTCTTAGATGGAAGTGGGCCAGGTGCTTGCCTGGGGCT
AGAGGTTTACAGGCTCCAGGGATGTGCTTTGTCATATATC 

CrrAACACTGTGAGAAATACGGTATTTACAATTGGAGAAAAATl^AGTAATGGAGGAAAAGAT 
GCCTTCTGAGGAGGCAGAGAAGGATCCTTACGGTCCTATTTCACT 
5 GTTTGTTGGCACAGATAAGTCACAGGTCAGGAATGTTCAGGCTAGTTCCACC 
TATGGAAAGAGATGCGACCCTTCTGTCTCCATTGCTATGAATTTCTTTT 
TTAACCACTGTGCCTCAAGTGCTCTCCCTGCAGCATCTCATTTAATCT 
ACTTGATGGAGATGACACAGCTTGTGACTAGGTCTGTCTGACGTC^ 
TTCAAACTCTGATGAAGATGTATTCATCTTCCCTAACCrATATTTTCCTT 
1 0 CAATATCCTCCAAAATCTACACTGTCCCCCAGC^ 

ATCCATTGTACAATTGATATCATATATGTCATATATATCATGATATCAACATATATAGATAGGTACTATATATAGA^ 

TGGTATATCAAATGATACAATATGTCAATTGAAATCTTTTTTGAGAAGTCTGTC 

TACCTTTGACACAACAGAGTTTGAAGAAAAAATAATTTACAAAGATC 

AATG CT AT AAAACTTGACAGGGGG CTT ACAACAGAAG ATGAAG CAT ATT ATG AT ACATT CAT ACAAT AAAAAAATG T ATGCTGGTA 
1 5 CTAAAAATGATTATGTAGATCTGTATTTATCAGCATAGAAAGATC 
GCATGCAGAGTATAACCCTTTCACGTAAAATTTGTGGGTTTTG^ 

C AAAATG CT AAT AG CAGTT ATGAG TGGTTGAGTTTGGGT AGTT CATCTT ATT CTTC4GT ACTTTT CTCT CTTTTTT ATT AT AATG AA 
TGTGTGCCATTTTAAAAATCAGAATTTTAAAAATTAGGCTATTTTC^ 
TAACTGCTTGTCTTACCGATGACGGGTCGGCATG^ 
2 0 TGATGCTGTAAAGAACGACTGTTTGGGAGGCCAAGGTGGGCAGATCATGAGGTCAG 

AAACCCTGTCTCTACTAAAAATGCAAAAAATTAGCCAGGCGTGGTGGCGGGCGCCTGTAGTCC 
GGAGAACGGCATGAACCTCGGAGGCGGAGCTTGCAGTGAGCTGAGATCACACC^ 
CCGTCTCAAAAAAAAAAAAAAAAAAAGAATGACTGTCTGTCTACATAGGATGATGGTCCT 
CAGCAGATGTCTTGATGGAGCTATTCTCAGCAGCAC^ 

2 5 TGCAAGCTTCTCAGAGGTTTTCTTACTCCCCTCTTTTGAA 

GT ATAAAGGCC C CTTCAGATTGTCGGATATCTGTAG AGGGTTCCACT ACGGGTTCACATGATGG CTTGTGGAGGGAATAACAATCA 
TAATAGTCATACTACCTTGTTGATAAAATTrCAGAATTAAGTGGGAACTCAGAGATCTTCT 

AGGAAACTGAGGACGCCCAAGAACCCAAACCTCTTGCCTTCCCAC CAGTCTG CTTTTGTTTCATGC CACTTTGCTGAACAGAGCTT 
TACTAGGCAGGTGTTTACATAAGGCCTATTGCAAAGGTGGGCCtAAGAAGCA 

3 0 TCATCAGCTCCIAGAACATACGGCTATACATGAGACCAGAA^ 

TTGTTTAAAGAGATTTTAAAGTTGTAATCCAAAATTAGAAAGGGC^ 

TGAAGGTTTAATATTTATGCTACAGAAAGAAACTCTTATAAAGCAAAACATC^ 

ACAGAAAGCTTACATAATATTTTTCCTACCAAAATGAAAATACCCTAATTT 

AGGGTTTCTCAAC C C CAG TGGAACTGAT ACTTCG AGTCCG AT AATTCTTTGTTGTTGGGG ATTG T C CTGTGC CTTG T AAAATG T AA 

3 5 AATTTACCCACAAGATGCCAGTAGAACACCCCCTCACC 

CTGGAAAGGCAAAATCAGTCCTAGTTGAGAACGCTACACTAAGGAGAAAGACACATGGAAA^ 
CAACCTGTAACATTAACATGAAATTAAGAAGCCTTCATCAAGCTAAT^ 
GCCAGTGAGGTTGTAGTGAAATGGATAGAAGCATTCGCTAGTGGTGACATTATATACT^ 
TCAGGACTGGGAAGATTCAATGCCTCCTTTGCCTCATTCATTTCATTCCTA 

4 0 AACCCAjGTGTGCATCACGATGTTCATCTCAGCATTATTTTTAATAGTG^ 

CCACTTTAAGTAAATTGTGGATAATCAATTTGATGATGCAGTCATTAAAAATTATGTTC 
AGAAAGCAAAATACAAATTGTATGTGT ACT ATG ATTG AAATTTTGTAAAAATA^ 
CAAGTGGGATGGTTCATGCCTGTAATCCGAATACATACGGAG^ 
GAGATCCTGTCTCTACAAAATATAAAAATAAAAATAAAAATAATTAGCCAGGCG 

4 5 AGGTGGGAAGATCACTTGAGCCCAGGAGGTTGAC^CTGCAGTGAGCCCTGA 

AGACCCTGTGTCAACAAAAAGGATGAGATCTCAATATGTGTAAATGAAAACAGCTGTA 
AACTTTTAAAAAATATCTTTAATCTAGTTATACTGTATTTTTTTATT^ 
AGGAACTCTGGAGGACATCAA^TACAAAGAGACAATATTACTGAGCTCG 
TTATACCCCAAATAGGAATACTTCGAGAGTGGAAGATGGAACTTGTAAATATTAC^ 

5 0 GGTCACCCTAGACAGGGAAACCAAGAAAGATGACATGAAGCTCnTATGAAAACT 

GGACTGGATTCCTATGACCAAAATGTGATTTCCTGC^CTGAGGACCCTGTAACT 
GTCTTATCATTAAGAGAGTAGGATGACACAGAAAArTATAAAGCAAAGGCCATATTATCMTACTAG^ 
GATTCATCTACTCAACAAATATTTATCAGCCAGGAACAGTTCGTTCT 
TCTGATTTCACGAACCTTACAGTCAGTTGGAAGTCAGGCTGGAAAAC 

5 5 AAGAAGATGAAATCATGCAATGTCGAAAGTGTCTAAGGGACGGGATTGGGGC^ 

TTTGAGATCAAAGGTGAATCAAAGAGGAGACTGGAAAGATCTGGGGAGAGGGTG^ 
CX3AGGCCAGGCAAAGGTCAGAAAGGTGCTTGGTAA 
ATGGCAATTGTAGGGCTGAGCACAGTGGCTCACACCTGTAAT^ 
GGAGTTCGAGACCAGCCTGAGCAACACAGTGAAACCCCATCTCTACAAAAAAA 

6 0 GTCCCACCTACTTGGGAGGCTGAGGCGGGAGATTTGCTTAAGTCC 

CTCCAGCCTGGGCAACAGGGCAAGATCCTCTCTTTAAAATAAAATA 
AAATAAAATAAAGGCAATTGTGCTGCTAGGAGCAGGAACTGGATGGG 
GGGATTTCTCTGAACCTAAACTCTGATCATAAGTTGAGAAGCAATAGCAC 
AATGTAAGAAC7UUVGCAATATTGTCATTTTTTTTTTACACT 

6 5 ATCAGATATATTATCCTACATGAAC^CACACGTTCTTCTTTCT^ 

TTGGCCTTTCACAACTAGGCAAGTCTCCTTrGTTGGGTft 
GTTCTTAAAGTtK5TTGTAATTCTGCC!ACTCCCACGCTGTTGGTTCAGTTTC 
AAAATGAATCTCTTGGATTCATTTGGAAGTATC^ 
TCTAGGAATTCTCATGGACATCACAGATTTACTCAGCT 

7 0 TATAAGTTGGCAGTCAAGCAACX3TGATTTGATCGAACACTTACTCT 

AGACAATAATAGCTCCTGCCCTCACAGAGCTCAACAATO 

GAATAC^TCCCTGAGCTCTTGCCTTTCCATTATATGAATGAACTGTGCrGGACT 
AATAGTAGTGTGCTTAAATC^GACATTTTGTTTTAGAAATTT^ 

ATACTATAAATGATAAAATAAAATAC^.TATTTAA 
7 5 GGGGCTTC^CCT^ 
GTTATGTTATATTTTTATTATTTTGAGATAAGGTTTCACTCTTTTCCCAGGCTGG

CCTCCACCTCCCAGGCACAAATGATCCTCCCACCTTGGCCTCC 

CTTTTAAATTTTTTGTAAAGATTGGGTATCACTATC 

GAAGCCTCC CAAAGTG CT AGGATT AC AG CAACTGCAC CTGG CCTTATGTT ATT ATT AAAAGGAATC CACT ATG AACATG ATTTTTT 
5 GCCCATTTGGCTCACTT AGGAAGCATTTTT AAAGTCTTGATG C CTGCAAC C ACT CCAGAACAATT AAATTGGAACCTCTGGGGGT C 

TGGCATCC^GGAATCCAGACTGTTTAAGGCTCTCCGGGTGCT 

GCAGCCTATCAGGTCTGTACCTGAATATGTtSGAGTGAGTAGAATGTGTGGA^ 

GTGTGGGTAAGTACACAGATACTCTGTCIACATACAGTGGCCTGATGG 

CTGTAATCCCAACACTTTGGGAGCCCGAGGCGGGTGGATCACGAGGTCAGG^ 
1 0 TGTCTCTACTAAAAATACAAAAAATTAGCCGGGCATGGTGGCGGGTC 

TGGCATGAACCOSGGAGGCGGAGCTTGCAATGAGCCCAGATCGTGCC^ 

CAAAAAACAAACAAACAAACAAACAAACTATTCAAACCCTTGCTTT^ 

TTGATAATCTTGCTAGTGGTAACCTTTGTAACTCGAAGCATC^^ 

AGCCATCAAATGAGGAATCAACAAATCATGGCCTACTTTTCTC^^ 
1 5 GGAAGAACCAAGCTCCTTCTCTCT AAGCAGCCATCCTCAGTATCAGG CTTTC ATTCTCTAGGCTGTGAGATGGGTGCCGAGCCT CT 

CGGCATCAGTCTATGTTGAAGGAAGGAAGCCTTTGTCCTTATGAGCA^ 

CCTCCCTCCCGCCCATCAGGTACTTCTGCTTACGTATCTTTGGTCAGAATTCT 

AAGCTGGAATTTTTTTTTTITTTTTT^ 

GGGATGGACGCTGAATGAACCAAACTATAGTTTTTTTCACATT^ 
2 0 ATAGCCTTTTCACCCTGTGACTAGACAGGTCCCTGAGGGC^CGGCTGATGTC 
CC!TAGAGCTTCAGCTCTAAGACATGAAAGTGAAGGTTCCTCC!AGAATCACA 
GTGATTCTTCTTCTCCTTTCATAAGGTGAGAGGCAGCAGTTGC^ 
TCCCGCTGCTTGGGAAGAGTCGTACTTTTCAGTTGGAAGCC^^ 
GGCTGGTAACTGATGGAGCCTGAACATATATCTGGGTGTGAGGGATAAGCTTC^ 

2 5 GTCACCCTGCCAATACATGATTACATGTCCTGTTGAGGAGAAAATAC^ 

AAATGCAAGGAGAAGGCCAAGACACTAAACTGGAGCTGGTAGGGAGGAAGCCTCAG 

CZAGGGCCCAGCTAGGAAATGCCAGCCTTGTCCTTGGTCTTTGTCCCTAATGGCT 

CCATCGTTGAAGCGCTTCTGTGAGTGACCCCCrCCTAAGCCCCTCCAGTAGGGGATTTCCC^ 

CTCTTCCTTAGC CACCCCOAGCTGCTTGCGGTTCATGGTAGAGGT AGGTGG CATCAAATGACCCCACAGGTGAGGGATTTGCCTG 

3 0 AGGTGGGAAAAGAGGGAAGATAAAATTGGAAAGCC^CCGGGGCTGAGGAATTATCTCATAGGAGAGAACT 

TTTAAACAGACTTITAAAGGCAAGAAAGGATGGAGGGCACAGACTCTGAAGCCA 
TCCAAGCTGCGTGATCTTGAGCCAGTGACTTAACTTCTCTGTTCCTCTG 
CCACIACAAGTTTGTTGAAAATATCAAATGTAATGATAACCATAAAATACTTAA 
TGTTAGTTATTGGCCAGGCATGGATGATTGACGCCTGAAATCCCAGAACTTTG 

3 5 AAGTT CAAAACCAG C CTGG CCAACATGGTG AAACCCCGTCT CT ACT AG AAAT ACAAAAAGTT AG CCAAGTGTATTGG C GGG CACCT 

GTAACCCCAGCTACCCAGAAGGCTGAGGTGGGGAGAATTGCTTGAACCAGGGAGGCAGAGGT 
TGCACTCCCAGTCTGGGCAACAGAGAGAGACTCTGTCTCAAAAATTAAATAAACAAA 
TTTTTTAAAAAAATTTTGGTTTCTATCCACCTTTTTCGTATAAATTGAG 
CATCAGGATTGCTAAAGCTCCCCAGGGGACCCCAGTGTGTGGATGAGGTTGAGAATCATC 

4 0 AGTGAGGCAACTGGAGCTGAATAGGGAAATCGAGTCACTGAGGATCACCCAGCCAGGACCAGGGCCCAAGTTCCT 

CCGCTGATGGAGCTCACCGTGGTCATGAGTTCAGGACCTGGCTCTCT 
GAATATCTGGCCAG^TTGACCCCCTCTCCATGCAGCCCAGGAGGCAGATGGTCAGCTCT 
GATCTTGTGTTGTGTTGTGTTGTGTGTGGGTCACCAAAACCGAGACCCAGCTTTCT^ 
ATTTTTTTGTTTTGTTTTGTTTTGTTTTTTGAGACAGA 

4 5 ACTGCAACTTCTGC CTCC CAGGTTCAAGTG ATT CTCCTGCCTCAGCCTC CGG AGTTG CTGGGACTACAGG CATGCGCCACCACGCC 

CGGCT AATTTTTGTATTTTTTGT AGAGACAGGG CTTCACCATGCTGGCCAGG CTGGTCTTGAATTCCTGACCTCAAGTGATCTGCC 
CACCT CAGCCTCCCAAAGTGCTGIGGATTACAGGCGTGAGCCACAGCACCTGGCCAACT 
ACTCAAGTCTCCAGTCTCTCGGGGGAAGCAGTCAGTGTGAGAGAAGGAACACAGGG 

tcgcggg caatg agtctttgcatcaaacactcaaagtttcitgaact^ ctg agaaaac^agtgccttgaggtgaatcctat agcaca 

5 0 gctgtcaccagcctgtggtcattttggagggtatgtggcataggtttgtc 

taataac1agctgaccttccctatcccacacagagcgttattgtgaggagtctgatggacc 

tgggtacatggatgcccaaatg caggctcacccccacccctct cagtag cagaagacatg catatgggagaagctgcatttcctcc 

cagcctcactgtgctctaggtctccagcccrcattgttaggggatgtcactgttgtacctct 

cctttgccagttgtcccttcagctgggcgagccctttccagggcatca 

5 5 gggacatggagggacg aggccccag atcagccactt cctcctg caagggaagtcaaaccctttctctcat aagtgtcagg ctggct 

gcacggtccctgggctggcagggcaggtttcrtgggggccctagcaggaagcactct 

tcactggcggtggttttctggttagaggcagcaggcagcggttacaaact 

ggaggcagacgacagagctagagtgagctcagcagagcttgcttagggaggaaggacccgctgtcactt 

6 0 tgttgtcac cag tggaaactct cag aggggg cgaag aaaaggg ctag agg aagg ag c cag gact cc cgg c c cccag tt cccg tctt 

accaacaccccctagtctcatcgtgitgcttgatcttacctttcggtaaagaggggatgccct 
acccaaatggtaaaatgaatgagccggccacaccrtgcagaaggcagggct 

gc cccgttttagaggggcttcctttgt actg accttgg cac^ctttctagctcag agccc cattcaagaaatgc caggttgaggct 
gagcatgcttgtgtctgtccttcac ag atg atgg aacctggcttg atg agcaagg ccatg^ 

6 5 aaagaccccccccaggatgtggggtttaagatgattgacagtataaccgattaacctcattagcctacagctcatc 

attcgttagggcaaattttattttattattttattttatttt^ 
catgatcatggctcactgcagcctccaacccctg<;tgctccatagctaattttcttttat 

AATAATGTTGCACAGGCTGGTCTTAAACTCCIXKK3CTCTCAG 

GCCTGGGAGACAGAGCAAGACTGTCTCAAAAAAAAAAACCCAAAAAACCAAAAAACT^ 

7 0 CCTCCCAAAGTGCTGGGATTATAGGTGTGAGCCACTGC^^ 

GCTCACCACCCACCCTATTTTTGAGTTACCTTCTTCATTCCAGCAGATGATGT 
ATACTATGTGCCAGGCATTGAGCCAGGCATTAACAAATAGGCATT^ 

TTAACTAGTGCTTTCCAACTTGCCTGATCATTAG AATGACCTGTGG CACTAGTTAAAAAT CCAAATTT CACTCCCTAGTAGT CTAG 
TCCCTAGGATTTJL n JL B JL^AAAATTATTATT 
7 5 ACTTTGGGAGACCAAGGTGGGCAGATCCCCT 
AAAATACAAAAACTAGCCAGGCGTGGTGGTGCCTGCC^
GGGAGGCAGAGGTTGCAGTGAGCCAAGATCACACTACTGCACT 
AAAAAAAAAAAATTCAGATCTGCAGACTTCACTCCGCAAATCCCAAAACCACTGAT^ 
AATTCTATTCTTAACAAGCCITCAAATTAATl'CJ 
5 GGTAAGACCTGGTCTCCAAATATCAGGTTGCCTTTTGAACTC 

CCTTTCCTCTGTGGTTACCAATGCTTAGCAGAGAAAGTCTGTTT^ 

G CG ATCTCCTGGG CTTT CTG AGGATGGG ATTGCTTTT CCATAATTG CT AATTACCAGATTG CAATTGTTG ATT AATT AATGATG TG 
ATCAXiAAATCAGCACITCTTAGCAGGCATTTrT 

GAAAAAAGAGAAGCTCACAGGCTTGGGACACCCCCAAAATACCGTCAGAATGCTTTC^ 
1 0 TTCTAGCAGACTTTXSACCAITTAGTATTTATTTATTTCATT^ 

CAGGCTGGAGTGCAGTGGCATGATCTCGGCTCACTGCAGCCTCTX3CCT 
TAGCTCGGATTACAGGTGTGCACCACCACGCCTGGTTAATTTT^ 

ACCGCACCTGGCCCCATTTACTATTTATAAAGACCCTGAGCATTCTAGAATCTG 
1 5 TGTAAGACGAAATGGAGACACCGGGAGCTTTTGGAAAACAC^ 

TCACAAACGACCTTCCATTTCCTTAGGCTCTGGCCTGCCAGCTGCTT^ 
TCACAGAGftCCCCTACATGGATACACGCAGAGGAACACA 

TGCCGGGAGGAGAGATGTAGTCTTGCTGTGTTGCCCAGGCTGGAGTGCAGTGGCTCAATCT 
TGGTTCAAGTAATGCTTCTGTTTCAGCCTCCTGAGTAGCTGGGACTACAGGCGCACT 
2 0 TTTTAGTAGAGACAGGGTTTTGCCACGTTGCC 

AGGGCTAGGATTACAGGGGTAAGCCACCGCGCCCGGCCTGAAATTGCTATTCTTATAGCT 
ACGTGGTTCIAGCCTCATATCTATGTGTATACATGCACACCCACC 

TATACACTTGTGCACACATTCACACGTGCAGGAAAATTCTCTGGTGCCCTTCTCCATGGCT 
CCTCCTCTGTCCCTCCGC^CCCCACCCACCTTGCCTCCTTCCTAGACACTAACCTCCGGTGCT 

2 5 GAAGACCTGGAGAAACAGAACGCGGCTCTACGCAAGGAGATCAAGCAGCTCACAGAGGAACTGAAGTAC 

CAGCCACGAGCCCCTGTGCTCGGTGCTGGCCGCCAGCACGCCCT 

CTCATGTCAGCTCCCCGCGCTTCCAGCCCTGAGCTTCCGATGCGGGGAGAGCAGAGCCT 
GCTGCGCCCATCCCGCTVGAGGCCCCrrGTCCACCTGGAGACCCGGAGACAGAGGCCTGGACAAGGAG 

ACTGG AAGGG CGTGAGGCCT CC CAG CAGTG CCGCAG CGT TT CG AGGGGCGTGTGCTGG AC CCCACCACTGTGGGTTG CAGGCCCAA 

3 0 TGCAGAAGAGTATTAAGAAAGATGCTCAAGTCCCATGGCACAGAGCAAG^ 

TAAAAGAAACCAGTCTGACAAGGCMTTTCTCTCTTTCACCCTC 

TTTTTTGGACTCTAGTACCCTCAAAGAGCTCTAGGGACTCC^ 

CTTTCAATTCCTTCTCTAGGTGTCTCCTTCTCCTTCTCCTrCTCCT 

CTTCTCCTCCTCCTCCTCCTTCTTCTTGACAG AATTTCTCAG ATCTTGGCT CAGGCTGGAGTGCAGTGGTACG ATCTCCG CT CACT 
35 GCAACCTCCGACTCCCAGGTTCAAGAGACCCTCTTGCTTCAGCCTCCCTAGCAGCTGAGACT 
CTAATTTTTTTATTTTTAGTAGAGACAAC^TTTCACCATGTTGGT 
CTG^CCTCCAAAGTGCTGGGATTACAGGCATGAGTCACCATGCCCAGCCTAG^ 
ACGGTGACCTCCCACCTTCCATTCITCACTGCTTTCT^ 
TGCCCTCACCTGGTGAGGTGGTAAGTCAGGTTGACCTTCTCGTCT^ 

4 0 TGACAGCQU^GTCTCCGGATGCTCCAACCAGGCCCT 

GGGGCAACATGCAGCTTG^^TCCACAGGGACTTCTCTG CCAGTGGTGTACGTGTCCTG CTG AAGACTCCACCTC CCTTAGAAGGCG 
GTGATAC^AGAGCACAGGACTTGTCCTGTCCCCGCAGGGCTCCGCAGGCTGCCATTTG 
TCATCTTCAGTTTGGGAGAGCCrGAAACCAGCATGCTCCACCCCCCTGGTCTT 
CCTGACACAGCCTTACCCACACCACTGCCCACGAGCAAAGCCTGCAGCAGGGCCTC 

4 5 AGACAAAAGCAACGGAGATCATTGTGCCAGGCGCCCACCT^ 

C^GTGCCCCCTGAACCCGCTGGCCAACCACTCTGGCCGGTTTGAGGCC^^ 

TGCCCCTCTCAGATGATGCCAGTGTTGGGCTATTGTGAT^ 

GGGGTCACAGCCCCTTGGGAGGAGTCCAACCCAACAGCCT 

5 0 GCCCATGGGAGCCTAGGATGTGGACACTGGGCTGAGCCCCGTAATTCAGAGCCTGGGAAGAGA 

CTGGCATTTCTCCCAG CACAG AATCTTGCATTGTAACCTTTGGTCTCCa^CGAAACGTGG CAAAACAGGCTCCCCTTTCAGTGCA 
GAGCCTCCATGTGGACTGGGCAGATGTGTC^TGAGGAGGAGCTTTCTTAATTTCCTTCA 
TGTCTAGCTCTTTCCCACCCCTCTGTTAACGAGAAGAGCAAAGAAGCCGATCT^ 
CAl'1'1 1TCTTT CAT AATT ATT ATTTTTTCTTTTTTTGAGACACGGTCTCA 

5 5 TGGCTCACTGCAGCCTTGACTACCC^C^CTCAAGC^ 

TGC^CCTGGCCTi'L'riTCACGATCTAAAGACGTAGAACTTTGAGTCCAGCTTTCT 
TGTGGGCAGGCTTTCITTGAATCAGAGATTTTCATCTC 

CACATGACTTCTG CCGGCATCTTCTGAATG ACATTCAGGG CCAGAACTAGAAAAGCAG AGTTCACGTGCACGGTTCCCATCTCTCA 
GCAAGGACTTCTCTTCCCCTTTCATTCTCTACrTTCTACTGACCCCAGG 

6 0 TAGCAGGTAGCTGAGCCTGGAGCTGGTGCCAAGGCTGACTTC^ 

TGAACATCCGTAGCnTTAACCTTGGATGTTTGACCAACCAGGTTCTGTCCCAGCCTTCT 
CTGGTGGGCCACTTGACCCACACTCAAACGCCTTCTACCCAGGCAGGCACT 



GGCCTGGGCTTGC(3ATGTCTTCCAGTTTGTCAAGAAAAGCCAAAATGAGAATTTC^ 

6 5 CCTTCCCTTTCTTCCTCCCrCCCTCCCTCCTTCCTTCCCTTCCTCCCTCCCTCCTTCCCrC 

TTTCTTTCTTCCTTCCTT CCTTTTTTT CCTTTCTT ATTCCTTTCTTCCTTTCTTTCCT^ 
CTTTCTTTCTTTCTTTCTTTCTCTCTTTCTCTCTTCT 
AGGCTGGAGTGCAGAGCTGCTGTCATAGCTCACTGCAGCCTCGACCrCCTG 
AGCTGGGACTACAGGGGCATGCCACCACIACCCAGCTATTTTTTTATT^ 

7 0 ATATTGCCGAGACTGATCTTGAACTCTTGGCCTCAAGCCATCCT 

CACCACATCCAGCCCATGTGTCTAAATAAAATACTTAAAACATT^ 
TTCTTCCTTGACAAGGATACTITCATCACAACCTAAAAGGCCAAG 
GCACAAGGTAGTGAGAAAGAAGCAGTACCCGGACTGCAGCTTGCC^ 
CTCCAGTTCATCCCGGCGGACATGCG^ 
7 5 AGATGCACATACCAGTGGTCTGGTCCAC^CTCGGGAAGAAGTATCCAGGGAAGAGGC^ 
TAAGGACTTTTTGCCCCAACAAGJ^GAGCGTAGCAGG

GTCTTTTGTCCAGGTCTGAAGGTGGCACTGCAGAGACCTGCCCATTTGCAACT 

CCAGG CTACAGGAAGGCCCTGTGTAGCAAGAAATGGACACAAGACAGCGT CGTCAACATCTGT CACATTGAAATCCGGGTTCCTGG 
AGAATGTTAGTTTCCCCACCTGCCATCCTCTGCATTTATCTCTTACT 
GAGAAAGCCAGATCTTCAGTCCTCAGGATGAGAAGTGTGGGTGGGGAGAGAAAGGGAG^ 
GCCGTCTGCTGGACCAAAACTTACCAGCAGTTTATTA^ 

ACTACTGGATGCCCC^CTTCTCTTGGGGACAGATAGGCTATCTCTATTTCTCTTC 

TGCTTGGGAATTCCAAGGGGGAGAAAGGGCTAGTGAACTAAGATCAACACCAACGTACT 

GAGGAGACTGTCATTTTAAGAGACACCTAGGAGCTCAAGGCTATAGTGAGCT 

AAGCXaAGATGCAAAACCCTGTCTCTAAAACAAAAAAG^ 

GGTAGCATTTTAGGCTTTGTAGCAAGTGAGCAGAGG^ 

CTCATGTAGATACCAGAGGAGTGGGAGATGGGGATGGGAGGGAAAGAGAGATGTAGCCCTTCTAAA^ 
GTGTGTGTGTGTGTGTTTTTTTTTTGACTGAGTCCCACT 

CCTCCACCGCCCGGATTCAAGCAATTCTCCTGTCTCAGCCTCCTGAGTTGCTGG^ 

TTTTTTTTTI 1 1 T T Trrrr Tl w l " l M r r i"lTrTGTGGAGATGGTGTTTCACCAT ATTGGTTAGGCTGGTCTTGAAAAATCrTGTTTTr T 

AACCACCTGATATGTGATTTAATTTAATCACTTATAGGAAGATAAAATAAAAACCTGA^ 

AGAGATGAAGAGCCCCCTTTTAGAGGACTTCAATAGGAAAATTTGCACAAA 

CTGGAAGATTCCTGTAGACAGATTGCTGAGTTTGTCATTACACCTAGAAACCAAGGCTAGG^ 

CCTCCCAAAGCTATTCTTCCATTTTTCCTTACTACGGAACrCI^ 

ATCCCAGATTCCTTTGATGTTTGGGATGGCCATA 

ATGGGAAAGCTCTT CAAAAGGTGGGGCTGTCTCAGTTG<3CACATGCCTTTCTCCT CCTTTCTGTCTGG AAGGCAGAT ATGGTGTTA 

GAGGTGAAGCAACCGTGTACACAAAGGAGGAAGAGATTCTGCGTCCCTGGTAATACTGCAGAAGTGTGAAG 

GCTGGCCCACCTCTAGACGGCTTCTAAGTGAAAGTGGAAAAGAGTTTATTTGG^ 

GCTTAACTCAATTCTGTCTAATGCAGGAGCCAATGGAACAACCAAATAAGGAATT^ 

CTGGGGGTGATCCTCCCCATTCCCCAAGATATTTGACAATGTCTGGAGATATTTTTAGT^ 

T AACATCT AGTGGGAAGAGGCCAGAAATGTTGTTGCrAAGCATG CTG CAATGCACAGGGCACACCCCACAAACAATAATCCAGCCC 

AAAACATCAGT AGT ACAGAGGTGGAGCACCCCTGCAGCTGGCCCT AGTG CTTCTAGGGCACCAGCC CATCAGGCCACCAACCTACC 

CTG ACACGCTTGTC CATCCCTCrrCTCAGAAACAAACCCTGTGTATTGACCAGG CTG AAAAAGCCACGTCCTTCACCGTATCTGGCA 

CTTTCCTCTCTCT AAG CCTTTGCTCATGCATCATT^ 

TCATTTTTTTTTTTTTTTTTTTTGAGAT^ 

CACTGCAACCTACACTTCACAGGTTCAAGTAATTCTCCTGCCT 

CCAGCTTTGCTTTGTGTTGGGAACATTCAAAATCCTCTCTTCTAGCTATTTGAA 

CCCATCAGTGCTAAG ACIACTAGGACrTATTCCTCTCATTT AG CTGT AATTTGGT ATTCTTTAAC CATCCTCTCATTATTTCGCCCT 
CATCTCTGCCCTGCCTTTCCAGCCTCTAGTAACCACTATTCC^ 

TGACAGGATTTCATTCTTTTTTATGGCTAAATAATATCGCATTG 

CTTGGGTTGATTCCATATAGTACTTTGCATTGTTGCTTTTTAATGTGCATAGAAGATAAGGAATA 
GCAAAAACTCAAACTGTGTTTTTCCTCTGCTCTCACACCACTCTGCAATCAACACAGA 

TTCCCCATACACCXIAGCAAGCAATGAGT^ , 
GATAGCATCAGCTCCTACAGGTTGAGAGCTCAATCCCACAAGACTCCAACCAACTTCAGA 

AACTTCTGGCCACTGG CTTCAAGTTAGGTTCCCACGAC CCTCTCTTTGG ATTGGACAAATGTGCT AG AATGGCTCAAAGAACTCAG 

GGAAACACTTACATTTACTGGCTTATTATAAAGGATATTGCAAGGGATACGGATGAGATGCATAGGGTGAGG 

GCAGAGCITCCATTCCCTCCTTGGGTGTGTCATCCTCCAGG 

TCTTGAGCTTTTTATGGAGACTTCATTGGAGAGCCATGATTGACAATCATGTAGAA^ 



TCTCTGTGTAGCATTCCTTCCTTCAGGGTATG CAGCAGGACTG CTTCTGAAGTGAGGGTCTT ATGACCACAATCAGAAAAGGGGAG 
GAAGATTAGGGTCCTGCTTTGCACTAGGGAAAGAAGGGCAGAGAAAAGTCAGAGATAAATTTT 
GACTGAAGTGCC CAACATT ATTAAAAAAAAAAAAAAAGACTGT AACAAGGTGTTATATCACAG AG ATAAAT AAG^ 
TCCAAACCCAAGATTGGCTGTACTCTTCTTTTGCTTAAAA 

ACACTGAAGTTTGGAGGCCTACAAGCATTCATGAAATTTCATCTTTTCATTTTCTCAATA 
TGCTATAAGAAAATACCTGAGATTGGGOVATTTATACATAAAAAAGATTTA 

GGCACCCGGTGAGGGCCTCAGGGTGCTTCCACTTGTGGTGGAGGTGAAGGGGAGCATGTGTGTGCAGAGATCACACAGTGAGA 
GAAGGAAGAAAGAGCAAGATGGAAGGGGCTGGGCTCTTTTT 

CCCCCAC CCCAGGGAG AACATTAATCTATT CATGAGGGGTCTACCCCATG ATCAAAACTCCTCCCATT AGCATCCCACCTCTGAGT 
TTGGGATCAAATTTCAACATGAGGTTTGGTGGGGTAAACATCGAAGCTATAGCAGCATCCATGTG 



AGCAATCCTACCACCTCAACCTCCTGAATAGCTGGGACTACCA^ 

TGTGTGTGTGTGTGTGTGTTTTCAATAGAGATAGGGTTTCACCATGTTGCTCAGTCTTGTCTTGAACTCCTGG 
ACCTGCCTTGGCCTCCCAAAGTGCTGGGATTGCAGGTGTGAGTCACTG 
GTGGCATGTGAAGATACTGATACTAAACTTTGCCAATCATCTTTCAAC^ 
GCTCTTCATATCTGGGCCATGGGAGCTGTGAAATTAGATTGCGAGTCACATTAGCACCACGT 

TT AG CACAT CTTCTCT AAGTG AAG T CAGCATT ATTCACAT AAAATGTT ATCCATTG ACT ACTGG T G CAC AG TGTG ATTGGGG TTT A 
TG AGTT ACT CGGGGTG ATT ATATG ATG AAG ACTCAATTCTT AGTGGGGGCG CTGG AG ACCTTG CAC CCT CTGTCCC CCAG CT CACT 
TCATATCACCCTCTTTTCCTCTGCACTGGCATCT 

TCTGCTCAGCCCTAACTGAAATGCTCrTACCCTTCTAGCTCATTCATTCCT 
TTGATTCCCAGGCTAGATCAGAACTGCAATTTCAAATAGTTTCATAGC^ 
AAAATTTACATTTATCTATGTGGTTATTTGGTTAAC^TTTGTCTCTGTGACT 
TGTTTTGCTC^CCATTGTATTCTCAGCTCCTTC 

TGAATGAATAAATGGATGGATGGATGGAACACAGTGCCK3CCTTCTAGGAATGTAAAATCC^ 

AATACAAATGACAAAATAAATATTCAGTTAACATATAAAAAGTAAGAATCJ^ 

ATGTATTCTTGTGAAGATG 

HUMAN SEQUENCE - mRNA

CAAGAGAGAGAGAGAGGTGCAAGCCCCAAAGCGAGCGACATGTCCCTTTGGGGAGCAGTCCCTCT 




iTTCTTTTTGGCC 



'TTTTGAGA( 
GCAGGGGTCAG AGGTGGCTACAGGGCAGGCAG AGG AGGCACCTGTAGGGGGTGGTGGG CTGGTGG CCCAGGAGAAGTCAGGAAGGG
AGCCC^CTGGTGACAAGAGAGCCCAGAGGTGCCTGGGGCTGAGTGTGAGAGCCC^ 
CAGCAGTGACTCCAGCTTCMCCGCTCTCCTCCCCCTGGCAA^CAGGACT 
AAAATCGTATTGCC^GCCCAGAAGAGCCGACAGAGGGAGACAC^ 
5 CAGAACGCGGCTCTACGCAAGGAGATCAAGCAGCTCACAGAGGAACTCA 

GTGCTCGGTGCTGGCCGCCAGCACGCCCTCGCCCCCCGAGGTGGTGTACAGCGCCCACGC7V 
CGCGCTTCCAGCCCTGAGCTTCCGATGCGGGGAGAGCAGAGCCTCGGGAGGGGC^ 
C^AGGCCCCTGTCCACCrGGAGACCCGGAGACAGAGGCCTGGACAAGGAG 
GGCCTCCCAGC^GTGCCGCAGCGTITCGAGGGGCGTGTGCTGGACCCCACCACT 
1 0 AGAAAGATGCTCAAGTCCCATGGCACAGAGCAAGGCGGCAGG 

HUMAN SEQUENCE - CODING 

ATGCCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCCGCTCT 
AGTTCAGAGGAGGGAGAAAAATCGTATTGCCGCCCAGAAGAGCCGACAGAGG 
1 5 GCGAAGACCrGGAGAAACAGAACGCGGCTCTAOSCAAGGAGATCAAGCAGCT 
AACAGCCACGAGCCCCTGTGCTCGGTGCTGGCCGCCAGCACGCC^ 
ACCTCATGTCAGCTCCCCGCGCTTCCAGCCCTGA 
Table 11

MOUSE NOMENCLATURE 
ICSGNM Irf4 
Celera mCG4922 


HUMAN NOMENCLATURE 
HGNC IRF4 
Celera hCG20902 

MOUSE SEQUENCE - GENOMIC

ATGCTCATCATCCTTGAGGGCACACCTCCCTTGCTTTTGCCCT 

ATATTTTACACCTCTATCCAGGCTGX3TTITTGCA 

ACTAATATATATGCCITGCCTGTTCTCTTTCACTCATC^ 

AGCTCCOUVACTCTACAGCCTGAATCCCAGAGCCTGAGCT^ 
1 5 CTGTCaVAATCAAT(^CCTAAACGGTTTAGAATTTCTCTGTGGTCTT 

GITTTTTCTTCn'ACTAGCATCCCCTGTCATACrCTTAAGATTTCr 

GTCTTATATTGAAATTCTGTGACTCrGAGGTTCAGAAC^ 

tcctrccctactrggtactttcccagagaatgataccttttgggcagtgagtgatatatacc 
gcatactcttttaggcaaccctgggtctrggccatgcctcattgccttttcctgtgcct 
2 0 titggagaacgtcacagtggtccatgcagttgaccctix;tagcttgag 
gacaaagatgtggagatgtggagtcacagcaacagtgtcaccaa 
ttgaatacgagacaacatatcattggcctttggaggaatgctcaacac 
gaagcagagaggtgttcgctggacttgggaaggatgcctcgtagcacaaaagag 
ttccaacacaagagtaggggcaccagcttacttgtct^ 

2 5 ttcttcttaggttatattgtggcatgctgggatgctttctcctgctgggcag 

tttaagtttcatctgatgttgaattaccccccaccccccaggatc 

attaaaagtcaacacattaagtttccaacgttctgtgtagctcattccctctaagcatgcctatct 
gttttgcatctgaacacactatcagggactaciagtgcaccatga 

TAAGAATACTCGCTGCTGGCCCTCAGGGGCTTTTCTTCTGTGTGGACTGGTrrC 

3 0 CAATATATTCTGTAATCACAAGTAGTCAAGCCTGTGTACTTATG 

GGGAGAGGCACAGTGTTTAGATCAGAAGGTGATCTCCAGATGACGTTTTACATA 
GTTTCCTTCMTCGTGCTGAAGAC1TCAAAGGCTGGGCAAAGAGACCTTTA 

GTACAAGGAGACCCGGAAGGTAGAGAAAGAGGAAAGGAGAACAGACTGGCTCACATGCTCAGAAGTGA 

3 5 ATAAAATAATCTAAATCACATGTTCCTTTAGAGATTGCCAAAATGTTTTGTATACTGTCGAAAA 

ATGACCTATGTTCATGCCTACAACTCCATTAGTACTT^ 

TGAATGCCCCCAACATGAGCTCTG CCCGTGCCCAT AGCACCATTAACACTG CCACTCTGTCTTTCCAGAATTCCAAA^ 

AGAATGCCTGAGGAACATCAAAATGCAAAGCTTGATGAGTACCTGGCCTTCCTCT 

CCTTTGATGGACTCCATAATTGTTTTACTTACTCTTGGACCATGGA 

4 0 GGGCCTGAAACCTCCTCTGAGAAGGGACAAGTGGAAAAGGAGACACCAGATGCTCGAATATCA 

GTGGTCAATGTGGG1TGCAGGTAGGGCAAGGGCATTGGTAATTGTGAACCATGAGGTATAA 

GCCTGACTTCGTCTACTTGGCTGGGCTTCTCAAGGACCAAGATGAGTCTGAGGAGAGGAACAAA 

ATCCTGGGGGGTCAACCAAGGGAGGTTACATCAGCAGGCCAAAGC 

CCTCAATC CCAACTG5GTTTCATTGCCATCTGGGCAAGTCCAGGTG AC CTCCCTGATGCCACCATATCCTTTTCTGAAATAAGACCA 

4 5 TCGTCCTGCCTCACTTCCCTACCTGTTCATAATAAGAAGTGATTGAG 

AGCAGACATGAAGCCCATTGTCTGCAGGCTGCCACTGAG CTG AGGGCTGGG CT AACCTGGGATTTTCCTCTACCGTGCCCAGCCTG 
GAGGAGTTCAGTACITCTAACAGAGTTCTACTCATAGGTGGTTTCTCATATGGTAGTTCCTTTGC 
CTGTGTCTACTGTGGAGAGTATCTGCCAAAATCTCAGGGAGGTGCAATC 
TGGCTTATTGCATTATGGATAAAAGCACACGAGACACGCATGCATCIAAAACATTGTA 

5 0 TGTGTGGGGCTTTTTTAATCACCrrAAAAGATAAGTTTAAAATGAAAAAC 

TGAATGAOVTGAGAGCrAAGACTGCTTTGGATCAGAAACC^ 

AGAGACCCGITTCTGTGTGCTCACGGTCATCAGATACAAGCAACGAAAGTTCTAATGCAACTC 
TGTTGTAGGATAATCTGCATTAACAAACTGTCAAGATGTTGAAAAGGAGATGTGGGGGAGAAG^ 
GG AAGCAAGCAGGTGAACTCTACACAGAAAATAAGAAAGGTAGAAATTCCCTGCACT 
55 ' CATGGATGCAGGAGAAACAGCCCAGGTGGCCCTGGGGATGAAGGGCTCTTCCTTTGCCT 

CCTCAGCTCITTCTATGATG AAAAGGATAGTAAGGCTTTCCTTCTGCAAGTC^ CTCAGATTATCTGCACCTCT CCCATCC 

CATCCTCTGGCTTGTCCAGAGAGCAACAGGCAGGGTCAAGGGACT 

TACAAAAAGGACTAACAGTACAGATGTCTCCAAATTACCTTAGTTAGACTTTATATCT^ 
GATAGACATCCATTAAAATTATCCTTCAAATTCTGAAGACAGGCAGT^ 

6 0 GGAGACCCTTAGTTCTTGTTTGGCAGTGTTGATGGGCAACTCTA 

CAATTCAGGTTCATCATGGCAAAAAGCTTGTTTTGTGATGTCCT^ 

TGGTCTCCTTAAATGCCTGAAACAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGAGAGAG 
GGAGAGAAAGAGGGAGGAGAGAGGGGGGAGAGAGAGAGAAAACACAAATATTTATGTTCTATGAAGCT 

ATGTATATCTGGGTGCTOTGATTTGAATATCATATAAATGTATATATGATATTCAAATCATATATAC^TATATATTTATATATGTA 
65 TATACGTATATGCATGTATGCATATATGTGTGTATATATGTGTATATGTATGTATGTGTATATGTGTACATGTGTACAGTGTGCAT 
GTGTGCATGTGTGCATGTGTGCATGTGTGCATGTGTACATGTGTATATATGTATATATGTATATGGGCTGGGAGCA 
AACTACTGAAGAGGTTGTTTTGCTGAATCATAGCATGACAC^ 

TCTTTTTGATAATGCCTCTTCrACTAAGAAGCTTGCTTAGGACrCCAGCCAATTCTA 
ACTGAGGAATTAAATTGCTAACCC!ACAACTCTGAGGGAGGAAA 

AAGAGTTTCACATATTACTTGGAGCATACTTAAACTGTA

ATTACACGCTTTTATGTACCACAACTGGCAAACCTGCTCCAG^ 

TGT AACAG AAG TGGAT CAAAG CAGAACAAGG CACAGACTTTGG AGG ACACT ATCT CATGAT AG C CAGATTTGTTT CG AGAGAGTGA 
TAGCTCAAJSCCTTTTGOATATAA^ 

AGTAGATTCCCCACGGTACCATTTCCTAATTTCTGCACCTGGATCTTCCCACA 






GACA.CGGCCAAC CACAGCCAG AT AATTCCTCTTTGG CTGGGGCTTT CAT CT C CT AACAG TGG ATGTG TGT C CTGG CG CTGTG AAT A 
ACTGATAATTGCCTGCAC^VATAAGCCAGTGTATGTTTCTTCCT 
CCGCTCCTGAGTAGCCCATAGGAACAAGAAGGCACTCAC^ 
5 ATAATCTCTACCGCCCCGTGTGCCCGAGCATCACTCCTGTGTTTTAC^ 

AATAATCCAGAGCCAAGCCACGTTGCCAATGGGTGAAATGAGACAGTGCC^^ 
TTGTCCCCAGTGATTACAGAGTATTTGCATTTTTGTTC 

ACGT CACTATTTACG T AG CAAAC ATTACAAG TCAGAG AACGG AG CATT ACAGGGAAACTTCTT AG AATG CT CACAATCAAAGGAGA 

CAAAAAGGGAACAGTTGGTTGTTTATAAACATX1A.CGAAGGAAGAAAGG 
TATTACATGATTGATGAAAATGACAAGTCTCATCTTCTATAGGATTGAA

TATTAAATGTGTTCCCAACAACTTCTCCTCAATGGCrcATGATCT 

TTAGAGGATTTTTGCAATTTCAGTTAAGGTATATTCGGGACCTTAT^ 

TTTACTAATAATGTTCAGTTTAGCrCTTC<GGATCCATC 

CTTGCAGAGTAAGCACTCTTACCAACTAAGCCATCTCCCCATACCCCT^ 
GGTAAGACAGGTTCACAAAAGCTTACTTCTTGGGCAAGTCACAGTGATGGCT

AATTGGACTAAAATGCGTTGCCCTGAACACCATACAGGTACAT^ 

TAATGTAGGATATACGTAATCATAGTCTAAGTGGGTGAGCACTTGCGATGCACATAAT^ 

GGTCACCGCCACTCTTGAACGTGGGAAAGTGAACTCACC^CCCACGCCTTCACCCCT 

GTATCATGAAGTGGTTGTCAGGGCTCAGAGTGGGCGTTAGTCCATGCACCT 
2 0 AAT CAG AGGCAATGGATT AAAAC AACAAAGTriTTIT CT TTTTAT AAAAT AATTTTTC CCAAAGA 

TAGAATTTTACTTAAAAACAAAACAAAACAAAACAAAAAC^ 

TT ATCCAAAACAAGGTCTCCGTCT CTTGGAAGAGAGAGT ACTGGGAATTGAACCCAGTGCCTT CCACACACAAGACAAGCATTCTA 
CTGCCACGGTACACCCT AGG CTG AATAATGT CTTTCTGAGTAT CT AACACTGACTGGCTCT AT AAACCACTTGTT CTTACTTGTGA 
ATGTATAATTTTAAAAGGTCAATTCTCAAAATACTGAGCTGCAGTCTAG 

2 5 AAGTTTGCCCCATTCAGACCTCGATCACTGCTCAGTGGCTCCTTAGCCCIAATAACA 

TGCCTTTGGGACGGATG CTCAATT CACT AATTTCCTGTG ACATTTTTCAAACAACAGTCCGCCATCTCACT 

TTT CTTAAGTGTCAGTG CGAAAG AGGATCGGAAATTGCCTACTGGC AAAATTGTCC CCATTCTTTCAAGTCTG ACTGTG ATTTAAG 
AAGTTCTTAGTGGTCAG CTAG AAGGGC CCACCTGGCCATTCCrTCCTCTCCACCAG CAACAATGGGAGCATGTGATTCACAAGGGA 
ATCACATTCAACTAAAAAGAGAAACCGGGGTATGCTGTTTGCAAGGAACGGTTGAAACTGGA^ 

3 0 ATAAATGTGTGTCTCACATGTTTTCCCATGCTGGGGGCA^ 

GGGCTGGAGGGACGAGTGGTAAGAAATGGCGACAGAGGAGATGAAGGTAATGTCATAATGAAACCCATCACTGCTGT 
ATAGATGCTAATAAAATAGGAAGTTTTAATGATTTAGGTAGCTTATTGCT^ 
GATCCACACAACGAGCGAGCGAGTAAACCAGAAGATGGCGTTGAAGATTAGTAATC^ 
TCTCAAAAGGTTTCTTGTAATGACTGTTGTTTAAACTTCTGA 

3 5 AAGTCAAGGTAGAAGACACGTGTGTCTAAGTGAAAAAAAGAAAAAAGAAAAAAAAAAAAAAC 

TGTCCTTAGTCTGTGCACGCTTTGAAGAAATGTAATTCCTCAGC^^ 

CGAGGTGTGTCTCTGK5TGACCGGGATAGTTCCCGACAGACAGAAGGTGTTCAAAG 

AGAAGAGGGGAAAATGGGTGTCACCAAAATTTTCTTTGAACGAAACTCTGTTGT^ 

TTGGGGTG AAAG AACG ACATGG CATCCTGTTAATTTCTG AG AAAGCCTGTTG ATGTTAGGAAAAAAAAACA CATCT 

4 0 CTGCACCAGTTTTCCTGTGGCCAAAATCAGATGTTTCTCCTAAA^ 

CATGTGAAATGAAAAAGTTGTCAAATCTTTACAAACGCTCCAAGTTGACCTGTG^ 
TAGTCTGCTTGCITACCTGAAAACACCAAGATAACATATAAGCTCCAGGCATC 
TCATGGAGTTTACATATGAAGGTCAAAGAAAACACGAAAATAAAGTAAAACCTTCACTCA^ 
GTACCTCACCGGCAACTGCCTCC(ZACGAGGCTGAGGTTAAAATTATCATTTTA^ 

CTGAGTACTCAGGGATCCCCCATCTCTTTTATGTTGCCATGA

CGGTTTCATTCACCCAACATGTATAAGCGTTTCAAACACAGT 

AGTGAGCGTGATATTCTCTAAACGCTCAGAGAGACAAGACTCCGCTTTGCCCAAGGATGCT 

CTGTCACATGCTGTGAGAACCAAGTTCCCTGTGCTCCAGACTCT 

GCTATGCTCAGTGCAGATTCCAGTATCCCCTTTGCATGCCTCC^ 

5 0 AGGATTTCTGAGTACATCZAGAGTCCCCAGCCCCCTCC^ 

TGACCCGGTGGGCTCTAGTCCGAAGCTCCTGTTCCTGCGGGACACCCAGGCACAAGGCAGA^ 
CAAGCCACGCCCTCTCAAGGCGGGGCCGAAGCGCCCGCCCTGCACTC03CCTCCGGCTCT 
TTCCTAGTTTCACCACTTGAACTTGGGACCCTTTGCrGCCCT(^ 
GAGGAGTGTGAGGCTGGTGGCAGAGGAAGCCTGGCITCCATCTCTGAGCCT 

5 5 CTCATCTACGCT^CCCTAGGACCTGTGCACTTCGGGTTTTGTATGAA^ 

GGG CGGGAOG AG CT AATG GG ACTGTGG TGTCATCAAAGGATCGCACTGG C CACAGCTT GTCCT CAG AGGGACAG C CT CTG ACT CT C 

TCTGCTCCAGTGGAAAGCrrCCTTTCCAGCCCTGGTTCCTAAAGGACCCAAACTCATCT 

GGCAGCC!AAGAAGAGCTGAGAGCTCCAAACTTAGGGTGCTCAG^ 

GTGCTAGTGTTGCTGTCCGCAGCACCCCTTATTCCCAGCCTCGGCTTCATTCCTGCCAGGGTTCACGCTGA 

6 0 AATCTCCTGTTTCITGGCTGCGCTGCTTGCCCCATAACCAGACTTCCACT 

GGTAGGCCTGGGGTTATTCCGAGGACAAAGTAAGGGTGTCATAGAAGAAAGTCAAGAGAGTAAGCT 
GCAGGGACACZAGGACCTGGACAAGGGCTAGTCCATGTGCCAGGTCCTTTTCGCCT 

GGG CAAGTGT A^IAAACAGTG AGGGAAAAGTCTCG ATGAAAGCT ACTTGGATCCAGCA TGAGCGCCC 
CAGCGTGGCGCAAGACTTGGGATCTGCAGAGAAGCTGTGTAGCTAGGAGCTTTCAACGGAGCGTGTTA^ 

6 5 AAATTACCrTAATTTTTTTAATAAAAGAAAGAACAGACAC^CAAAA 

GCCAAGGGATGCTCTCTATACCTTCGTCAAAGTACCrTCTCTTGGGGGACTTCGGA 
CAGCCTCAGAGACTCGGGGCCTCGTGGGCACTGCAAGAGTTTGGGACGGGGCTTCC 

GC^GGG AATGTGTGTCTCTCCTCAGCGCACAAGCCCAGGAGG AGGTCCG CACGCGTC^ CAG CCGGGGCTC 

AGAGTTCX3GCATGAGCGCAGTGAGCTGCGGCAATGGGAAACTTCCGACAGTGGTTGATC 

7 0 GGCTGGTGTGGGAGAACGAGGAGAAGAGCGTCTTCCGCATCCCGTGGAAACACGCGGGCAAGCAGGACT 

GCTGCCCTCTITCAAGGTTAGCAGCATTC^GGGATCCCTGGGCZAGGGGTG 
GTGGCTCCCGGGCAAGGGACTAAGAGGTGGGCTCCTGCAAGGAGGAGGCCAGA^ 
CCCAGGAGAAGGGAAAGAGGTTGCAAACTCTCCGGGGATTGCATACACAAGAAACC^ 
AACTTCCAGCTTCACCCCCATCTCTGGGGGACCGAGGTTCC^ 
7 5 TAGG^CCACTCTCTGCT^CCCTGTCTCTCTGT 
CCTCTACTGTCTGGTTCTAGTTCTAGAAGCTGCGATCTCTGATTTTCTTTCTTTGA^

GTATCCCCCTCCGACCCCGTGTGAGTCCCTTAGGACTGATGTCCCCAGAGAACTGGCTCACTGAACTGTGAAGCCCC 
CCTGCCAGCAGGCCGAGGAAGGGGACTTCCTGCGGGAATTTGTTCAAAGTACCTC^ 
GCCCCCTCCACAGCTCTGTCCCCAGTCITGCCCACACTTC 
CCACTAACATCATGTTCCACATTTCAGGCAACAGCAAATTTGAAACA
TGACGCTCTGGGCTTGACTTTTCTTCTCATCGTTCTTAGGCT^ 
AGATCCTCCTACTTGGAAGACAAGATTACGATGTGCTCTGAACAAGAGCA^ 

AT ATCTCTGAC CCAT ACAAGGTGT ACJW^ ATTGTTCCAG AGGGAGCCAAAAAAGGT AAGGGGTTTT C C CAG CCCAGGTGG CAGGAT 

AAAGGCATTATGGCACTCAGAGAGCCCTTCTTCCTAGAGACAGT 
1 0 GCCCATGTCCTCTCTGTTATAAGTGACAACCCTGTGGTGTTAGTATAGGATGACCTGGCAG^ 

TT ATG CACTTG AAGG CATT ATTTT CAGTT ACTC CATTCAATT AGGATCTGG AT CAAATTT C CAAACAAAATCTGG AAAATC CATT A 

AATGTTTACTTACCTAATATCCTCTAGTAAGCATTTTCAAGAGGAGAAAGCACATC 

AATAAAACTGCTAGAGTTTCIX5GTTTAACATGGCCTGCTAG 

GTCTACCTCAAGCAGTGCTGAGGAATCTGCATTTGGT^ 
1 5 CACTT GGTGGATCACAGGGATCCTGGGTGGTGTTTCCT^ 

CAAAGACAAATATCTCTCTAAGAAATTGCTTGAAAAACAAACAAAC^ 

AGGGCTATGCATTCTAAGAATTAAGTGCATAGGTACTITTATAAGATTTAAGTCAGT^ 

TTTTC CCCAAACACACAGGAGCAAAGCAG CTCACTTTGX3 

TATGGCTCTCTGCCAGCCCAGGTATGTGGTAGACTCTTGGTCTTGTGGAAGGCTGGC 
2 0 GAGGCAAACACAAATGAAAAGTGTAGGGCTGACTTCTTATTTGCTATGGCTAGTACA 
GATGTTTCAGTTCCAGTGTGGTGTCACTGTCCCTGACGCCACAGTrTTGTTGGGGAG 

TTC CACTG ATGGTCAG ATCTT CTGGGAAT CAG ACCTTTTGTGGAAG TCAAAGG TTTTGGAAGT AGT ACTTT AT CATGTG AAACCG C 
AGAGCAGCTGACTTCTCTAGGCGTCCCTGATGTGAATTACAGTACTGTTTTATTCACTT^ 
ACCCTCTATGGCTGTTTTTTACCTAGTGTGATAGACAGTCCATGTCACTTTTTGGGCAAAATGAC
GGGTTCCCTGAGGAGTTTGGGTGATATGAAATGACTCCGACACCCACTGATGTCT^ 
GGTTCATAACTACATGATGCCACCCCATGACAGGAGCTGGAGGGATTATGCCCCTGACCAGTCA 

GTC CT GTG ACGTTTGGCCCACGAGG CCAC CACTGGC AAGG C CCATCTTGTG AAAATGGT AAGG ATTG TGC CAGGG CAGCAGACAGA 
AGAACAACCTGAGCTCGGGGTGTGGACAGCACCACAGGGCTTTTCCCTACCATTGAGAT 



GTGGTTGTGGGGTGTTGGGGTATGTTGCCTTGTCCTGTGAAATGTTGAAGTCC 

ACTTAACTAGAAGACCCCAGAGATCATTCCAAGTAGCTGAAAAGTGCCCCATT^^ 

AATTCTCAATGACAGTAATGTCAATTATAAAACTCGTCTATATTTTGTTTTAACT 



3 5 CAGCAGGAAATCATTGCTAGTTCATGTCTCTTGAATAATAATGAGTCATCTATAGCrrGTTCTT 

AGCGAAAGTAATCCACAAAGTGTCTCTACAAATGAGTGCCCTGCCCGAAAGACCCACAGGGGTC 
ACAGAAGAAACGCCCACTAACCAGACACAAAAAAATTTCACAAACTATGTTCAGTGAGACTTGG^ 
GAGTCCACCAAGCTCCACCTCGGGTCCTTTTTTGGCTGTGTATTTTAAGGTAGAGTCTTGCTAAATTACCAA 
TGCCTTCAACTCTTGAGTAGCTGGGACTACAATCTTGTTCTAACGGGCTGAACATAAAACAA^ 

GTTTAAATATAAGTCTTGACATGGGTCGCCGTGCGAGTAGTTCTTTT

AACACCCTCACTCAGTCTGTGGATGCTTACACTAATCTTGCTTTAGTC 

CCCTCCCCCTAGCTAACAAACAGTGAGCAGAATAAATGAGGGTTGAATAATTAATTCATCTTTGAACTAGTCT 
ACTCTGACCCTGCTGGTAACTTGCTATGTGGGCTGGTGCAAGTCCCTCTCCTTCTGGGCCTCAG 

GCCCCAGGTTTCCAT CCAGAGCTGTACTGTGGCTCCTT CCTT CATCACCCTAATTTTT ATCACTGGATGTGG ACTTTGGACTTTGT 

4 5 CCCATAATCACACGTTATGCTGCTAGCAGGTGCTTAGAGGCTGTCAGGCTTGGGTTGGA^ 

CTCCCCGCACTCAGACTCGATACTTAGACATCATCTGATTTTT^ 

TGTGCCCCACCTGAGTCCCAGGCTCCTGGAATCCCCATTGAGCCAAGCATAAGGTCTGCTGAAGCCTTGGCGCTCTCAGGTGAGTG 

TGGCGCTTCCTGTAAAGCTCCGAGGGAGGGGGCATCTCTCCTCTACTGAGG 

CGGGGTCTGGAGTAGGCATGGTCCAGGCTATGTGGACATCACGCTGAGTCAAATACAC^ 

5 0 TAGCCAAATACTAITTGGACGATGTCTTTAACCrTCTACATCATTACCTGCC 

CCAGCCGACATTCTCCATTCTCGCAGTGTGGAAAGGCTTTATAGGCACAAAA 
CCACAAACAGGTGGTCTGAGTTGCTTCCAAGGAACACTGGTTCTGAACCCTGGTCT 

AT AT AGG TCTG ACT C CCTCACAACTTTGACAG AATT G CTGAGCATGTGTGGATGTG AT CTG ATTTT AAAGTT CTGTTACTAAGGAA 
GCCTCCACTTGGAGATACTGACCAGCATTTTAAAAGCCCACACTCCGTGGAAGCA 

5 5 TTTTTTTGG ATG TTTT CAAATGG AATTATT AG AATTCT CATCATG C C CT CXXXTT AC CTT AAAAG CCTCTG ACTGAAAACAT CAACT 

GCATTTTGACAATTTTAGACACTTCCCTTGTTCTCGAGGGAGGAAGAAGTTTTAAAAT 
GAGACTTTGTGAGCCACTCAAGAACAGCCGAG^AGCACATCTGGGCATCAGGGTTTGTCACAGACAC^ 

ACATTAAATGAAATGAAACCTGCCCTACCACTCACCTCAGATGGCCAACGCCCCCTCTCIT 

6 0 AGCTGCAGCTATCGAAGAC^CCATGATGTGGGCTGTCAGAACTTGC 

TTTGCATAATTAATTATCAACCCTGAGTGCACTTTTCAGAAAGCT 
GTATCTATCTCTCCCTGGGGAGCCCTTTGCACACCCAGCTTG 

AACAAAATCATGTGT CAATAAATTCCTGTCTTAAAGCTGTAGAAAAC CTAGTTGTTGGGTTCTTTTC1AGAGTTG AACACGAAGCTT 
AGAGGGATTTCAGGGGGTTTTACATTAACCACTGGCITTTAGAGCAGTTCT 

6 5 TAAAAATAAGAAAATAAAGGTATCATTTTCCAGAGCCCAGAAATTGTTATTTTAGTGCCTGT 

GTTGTTGTTTGGTTTTACTTAATGACATCATGGTAACACC 

ATGTTTCTGTATTTCTTTTACTAATAGAAATCTGTCCCAGAGGTCAACTCCAAAA 

ATTCGGTAGTATTATTTCATATTGTAGACAAAATGCCACCACT 

TTCTAATAAAATTTGGCTGAAAATTCAAGAAATATATACCTCTCC^ 

GTCTTAGAGAGATCAGTTTCTTTCCACAAGCTTCTCTTAGTA

AACATCTCCAAGACTTCCTTGGTCCTAGGATCTTTCACTTGTT 

GTATCAGTCTAGTCACCGAGTTTTTGGGGCTCTGAGCAAGGTGGCACCT 

CACAGACTGCCGGCTGCATATCTGCCTGTATTACCX3GGACATCCTCGTGA 
G ATCT ACTGGGATGGG CC C CTGGC ACTGTG CAG CG AT CGG CCC AACAAG CT AG AAAG AGAC CAGACTTG CAAG CTCTTTGACACAC
AGCAGTTTCTATCAGGTAACACACCTCACAGTCTGTTAGAATGGAGGTGGTOT 
GTCTGCCTACCCCAGACAGAGGTCTTCCTCCTGAGATCTGTGAGCTCATGCAGAAATAGAATTCCT^ 
TTGTCTGTTGTGTACTCCCCTGATTAGCAGAGGGCCAGAAAGAG 
5 TCTCTTAATTT ATCATCAT1 C 1 1 1 1C^CTCTAGGCTTTTG'1U"1TGTTTGTTTTGTCAC^^ 

CTCCTGCCTCC^GGATTTTATCTGTACACCAGCACAT^ 
AGAGAGGTTTCACTGTGTTGGCTCTCTGCTCT 
GAAGGTTAGAAACACCATCTTAACATTTITGGTT^^ 

CTGAGAATGTAAGGAGC^GATTCAGCGAGGGGGAAATTCGGAGAAGAATTTGGTCTTAT 
1 0 ATTTTCCAGAACCTGTAC1 T C1TATGGTATGATT 

GTCTCGCCACTGGTGGGTACCACCCAGAGGCCTCCCCT 

TGCTAATTGTCATTCCCrrCACCCCATCATATCCn'AAGCrTGCTTTCTTCCCAATG 
CCrGTGCTTTCCTTT TCTTG AACCGGGACrro 

GGATCTGGCTAAATCCACTAATTAGCATGGATATCACCTTGCCTGTCTATCTTTTT^ 
15 ACTCTCGATATTTTTTTAAGCACATGGTCCATTCATTATTAGTC^ 

GTCTTAGTGACASTTTAT A TrC r rrGACCCACAGCTTCCTAGCCACTTCC 

AGCTCAACTATTTTGGAGTTCGCAATGCAATGX3AGCGTCACG^ 

TCTCCAGGCTCATTTGTGTCATTACTAATGACAGGCCTTCC1TCTTTATAAAGGCT 

GTCACTTTCTCTTTATCTGTGTATCACTGTGGGTACTTGGGATGATGCCTTGTCT 
2 0 GAGTATAGCTGTCTCTCCAAAACCAATTTGCrCTGCCTTGGTrACG 

TCTGCACACAGAAGGGGACTGCTGAGTCCTCAATGTGCTGCITCCTTCT 

TGGTACTAGTGGTGTTTCCTCAGCCTTTTGAGCTCCAGTGGCCAATATTTGT^ 

ACAAACATGCTCCTTGCTAAATATCCTCCCCTGAGAGGTGGGACACAGAAAAGAGAATCATCTCAGG 

CCAAGGCGTCTCTGGCTCCAAGG ACTGATG AAGATCCTTGGAG CTGGGTGTGATCT AAG CCT^GGTCAG ACAGCCTTTGCCTCTCT 

TAGAGTGAGTTCTAGTGTTTGTGGTCCAAGGCAAAGAAAAATAACTTCT

GGGTCTGGAAGTCCTATGTGCCCTCCAGAAGGCTACCACTCACTCCCTGCTTGATTAAGGTAGCT 
CGACAGGGCTTTGAGGTACACTAAGTCTTGGTGTGCCA 
CTTGGACTAGJUATTTGCTTCCTAGGACTTATTAGTTATTAATGAGTAAG 
CCTGTTGTTTTTGCTGTCCACAGTCCCCAAGCCCCACAGAGGCTCAGGGAAGATC 
30 AACCTAAGGATGCCCACCCAATGGTCCGGGAAGAGCAGGCTGAGGTGGGCTTTCTCAGAGAGTCGCATC 
AGCCCCTCGTAGGTGTCTTCCCGGCCACAGTGCTATGCACTTTATACAGTGTGCCTAAAC 
GAAGAAAGAGAAAGAAATGCAGGAAGTCAGTTCTGTGAGCTTCTAGAAGCAGTTCC^OT 
CTACCCATTCTGGATTTGCACCCCACTAGGGACAGTCCCCATAG^ 
GCCTGCTCTGTGGAGGCAGCTTTTTAATGTCAGTTCITTGAT^ 

3 5 TTAATGCCAGTGAGTGGATGTGGCTTTCTGCTTGTGCTGGGGGAGGAAGCCAAGGCCT^ 

AAGTGGCGATGCCCCCAGCCATGGGCAGGTTTCTTTTCAGCAATCT^ 
GACATCCTTGTCAGAGGGGCTTCATTGGTCCCCTTCrCCCTGTATCATCCTGTCCCCA 
CCAGTTITGGAGTTTTAGACCTAGTGCCAAGTAGGACACAGCTGCCAACAGCT 
ACCACGTG^CCTCCATGGTTAGCTAGTGAGGTTAAAAAAATAACCCTG 

4 0 TCCTTGGCCTGTGCTGATTGCTTGACTCAGCATGGCTACTTTTCTTTTTCT^ 

GTGAGTGTAGTGCTGCAGACCCAAGTTCITAAGGTTGGGTCATGTTCCTTAAGAGT 
ATCATATGTGAACTTGACCATGTGATTTTGTGTCTAGGGTCTCCTCTAAGGGCTG 
AAGCAGACTTCCACGTGACTCTGGCTTTCCGTTCACCCGCT^ 
TGTGGGTTTTCAGAGCTGCAAGTGTTTGCTCACCATGGCCGGCCAGCACCGAG 

4 5 TCCAGACCCTCAGAGACAGAGGAAGCTCATCACAGCTCATGTGAGTACCTGGTTACA 

TCCCTTTTAGAGAAGTGGCAAGTGACGAGTAAATGTCAGCTCACCTGGGAAAATAGATGTAGA 

CAGGCTCCAGTGAACACCACAGCTCAGGGAGGCACCCGCAACCTACTTCC^^ 

TGACTTGGACCCAAGGCTCTGTTTGTTCTGTATTCITTAT 

GTTGGGACCAAGTACTGCCCTTCGACATCTCrcTTTCTTGTGTGGTTTTAAAGATG 

5 0 CCTGGGAGGCTGCCTGTTATGTATTATGCTTCATCGTGCCTCCTAACTTCAGATAGTTCCCA 

TAGGGAGTTATTTAACAGTTGGACAAGGGAGGTGGAGGAGGCTGAGTCTTCCCAGGAA 
AGGGTTTATTCGGATAATGTTCATCACTCACTTAATAATTAAAAGGTAATTCTGAATACATG 
CTTAATTACATATCTTGAAAAGTATGCAGTGTGGAGTAAAGGTTGTGTCCCAGATAGC 
AGCTCTGATGATATCTCTTAGTGGAGACTGAAGACTAGGCATACAG 

5 5 GTTCTAAACCCCGCCTAGGATGGGGGTTGGGCACTGATGTTGAACATCC^GCCT 

AGCAACAAACGGGCTGGAGAGATGGCTCAGTTGTTAAGAG CACAGGCTGTTCTTCCAGAGGTCCTGAGTTTAATTCCT AG AAAC CA 
CATGTGCCITACAACCATCTGCAGTGAGCTCTAATGCCATCT^ 

AATCTTAAAAAAATAAAAGGCAATGAAACTATGATCCTGGCCTTGAGCCTTTTCTCAGTTCTAACTG^ 
ACTGCAGATGTGTGGATGAATCTAGCATAGATAAGCAGTATTTTl'TTTTTTAAGGTAGTGAGTAAATTCTAGCA 

6 0 TAAGGACTTTGGGTGCAGTGGGGCTCCGCAAAAAGGGAG CAACAAT AGTCAT AT AGGCAAAGGG CCTCAAAATGCTGCCCCGTGGT 

CCACAGATGGAAAACATACATGGTCACCCATGAACTCTGCTGGTCTCCTTATTACAGACT^ 
ATCCTACCAGACATCACATATOUUVTAACAAAGAGGCTTGATTTATTGA^ 

AGG CTGG CTTTG ACTGGGGATG CAATCG ATG CTT AT AAACAAACT AGGT CCATCAG AGCCAG CG AG CTG CTGT CTTGTGG CTGG CC 
AGCTCTGTCTTCTACTTGTGGTTCAGAGTTCTGTCTATTTCAGAGTCAT 

6 5 AGC CTCACTT AC C CAG CATGTT ACTT AGCCTTTT AATTTGGT CATCT CATTC AAT AATGTC CAGTT AACTCATTCGCT AAAT AT CA 

AATCCAAGAGGTGATTGGTTTCAAAATGCCATATrrATCTTCT 
GTATTATAAC^GCAAATGAAACTGGTTTTTCAAATGGCTCTGAATGTGAA 
AGGTTTTTACTTAGAGTCCTAGGGTCATTTCATGATGTCCCAATTC 
TTAAATGTCTTATTAAAGGTGGAACCTCTGCTAGCCAGACAACTGTATTACTTTGCTC^ 

7 0 CT ACGAGTT AC CTG AACACGTT AC CACT C CAG ATT AC CACCG CTCC CTC CG TCATTCTTCCAT CCAAG AGTGAGAAGA^TACT CT 

GACAGGGCAGCCGGCTGCTGCCCTTTCTCTTTGGAAGAGCTAAGAAGTC 

GAC^ AAAAACAG CTGT ATCrrC^TCAACAGAGG AG CTT C CCCCAGAAG AGTG C CTGT CATCCAGGTCTTGACAAGTGC C AG GACTTG 

TCGACTTTTGCTTfiAAGACTGAAAACCTGTC 
7 5 GAAGTAAAGTCTAGACTAGAGACTACGCTCTCCACTCACAO 
GTCCACTGAGTGAAGGGACTTGGTTCTTAGGTCCAGTCTAGGCTATCATGTCCTGATCCTC

CCTGTTTGCTCCCTCTTCTTGCCCTTCCCTAAAAGCCAGCTGGTGGACT 

ACATATAAGATGAGACATGTTTGCATCTCCTCTGATCT 

TGCCT CT CATTCTTT AG AATTGTTG CHTl^rrGT AAAGAAAGG AA.G AG AGAG AGAC CG AATAT ACT AGAG ATAGATTTTT AAAA^ 
5 CTGTGCCATTTCCCCTTCTATTTGTTTO 

GGAACTGTCCCGGACACCAGTGGTTCAGTGTACAGCTGTAGACTCAGTGAGe^u 

TGTGC CAAT AAT AGTTTGACAGAGT CTCAAAG CTACCGGG LIT CT T 1 T CCAACAACACATTCTTT C ACAG AATCCCAAAG AGT CT C 
ACTTGCCTTCACAACCGTCTGATCCTTTATTGTTACT 

ACAT CATT C CTGACTCTTTCGGGCT CAGAAAG AGTTAAG CG CT CTTGCACACTTGGCACTT ACTT CTTTTT AATTTTTTTTTT AAC 
1 0 ATGCCACTCTTAATATTTCTTCTTAAGTCTTTCAGCAGC^TA 

T ATTTTT AAAAAGG AAC CTTTGACAAAT AATACAC^TTACAGAAAATGGAAG CATTTTT CAT AG AGAAGGT AAG AACTG AAATG CA 
TTTTATGTTCX5AATCCTCCTTGCTAGTATTAGCCCCAAGAGTT 

TGAAGTTG C C CTTT CTT AG AGAG AGAG AGAGAGATATGTGATG ACACT CTT ATGT ACCCAACTAT ACAG ATTT ACCTCAT CTTTTG 
AGTGGCATGTGGACTTCCACTGCTATGTGCCrAGGTGCAGTTTAGAC^ 
GTGAACTAAAGTCACTTATACATCTTATAGATGCAAA

GCACTCTTCCTGAGGCCCCAAGCCACAGCTGTCCTCCTGGAGTATC^ 

CCTGTATGACCATGACTCTTCAGTCCATAAGCAGATGC^ 

TAATGTTAAGCTGTTTTAACAGTGAAGCTGAATTTTCTGGAA 

CCATGCATGCTGATGCCTTCCATCTTGACAAATGGAATGATGGGTAAACACTCCTGAC^ 
GCTGAAACAAAGTGCAATGCACACACTAGAATTTAGTTTGCTTGCACT

ATCCTAGGGAGAAAATATGCAAGTCTCACGGGTGCTITTCTGTTGGCrTGTTTCT 

AACTAGAAGCCCCAAAGCCCTCAGTCGTTGTCCTGGACAGAGTTGGTAC^CCCCAC 

GAGCACAGAGAGGCTTGCTCTATCGTGAGAATCCCCAGCCT 

GGAACGCO^GCAGGCTGAGTGTGGTGACTGATATGTGAGGTGCCCTTGCCCTGCAGACAGGCT 

GATTCTGAGAGTCAGGGTGGTGTCATGTCCTACACAGGAAGAGCCT

TTTTGATAAAAACCCACACAAGGTTTAATTACATGCCCCCTCATCCCTCTCCACACAGTCTTGCCCATCCCAC^ 
TTGTTCTTGTTGTTGTTTTTTGTTATTGGTGAAAATTGTGAAATTGTTAAGGT 
TGGCTGCTGCATAC^rrcCAATGTTGCCCTACCACATCTATGTTTTTG 
AAGTGAAACTCACITAGTCCTTGCTAATTGCTGCTCACTGTC^ 

3 0 GCTTTGAGGAACATC^ACTCCAGAGTATATTGAAGTAGAAGA 

ACAATCTTCAAGGTGGACCCACCTCATGTGGAAGGGCATCATTGCCCCT 
GACCTGCTGGGGACCAGGACTCAGAGATCTCTGGCTTCGAGCTC^ 
TGGCACCATCCTGCCTGCC^CGTTGGATGGATAGGCCATGGATTTTGCTGTAAACT 
ATATTTGATAAATAAAGGATTTCTAGAAAGATTGACTGAGGAGCATATTTAATACCATCCT 

3 5 TCTGCTGGGTTCTTGGTGAGGCTGTCAGTGGCCTACTTCCTTCCTGGCTGCCTC 

AGCAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGAGGTTCTAACTCATC 
AGCATACTTTTCTCAGATCACTATCCCCCACTGTTCTGGCTG 
GTATGAAATCGTTTCTCAGTGGCACAGAGATGCCACTCAAAGGTGAAGTGCCT 
AGCTGGCCIAGGTGAGCAGGTGGGCAGAAGTCTTCCAGGGAAGAGCAGGGTACCT 

4 0 CTATGTTCTGAAAGGCCrGGTAGACTTGGTGlT'm 

ATATCTAGTGTGTGAGTTTACCTACTTGTAGGTACATGCACATGCAGTAC^ 

TGAGCCATCTCTCCAATTAGGGAAAATATl'riTTAACTTGTTTTAATGGTTTGTTTTATACTT 
ATGAGGGTAGCATATAGTGAGCCTGCACTCTGATAGGAGAGATTCCGGGG^ 

CTAGGAGAACCTTGGCTGAAATTCCTCTCTGAGAAGCCAAGTCAGAGAAT

GGCTGTGAACTT CCGTATCTTCCAGTGTCATAGGAAGACTGTCAG CC CAGGAT AGGACTGTGGCCTGTGAAAAGCTCAGAG ATTCA 
GAG AT CTG AG AAAAG CATTT AG CAAGG AT CT AAACAACATATTGAATTT ACTAAAT AAAAACCAATTG CT CTACT ATT CCT ACTTT 
GGG AG CTCTGTGTGT CC CAT AAGT AT AGATGGGCT CAGGCCAG CATG CACAG CCAGG ATTT CT C CAGAAAGTT CT ACT ACTTGAAC 
CTAAGCTTGlTTGAAGAGATTT/rGCTGACTTTTGAGTTCTT^ 

CTTCCAGCTACAACTCTGGTTGTTGGATTTGGTTTTCTATAGTTGCT

CTCATCCATCCCAGAAACAAACAAACAAACAAACAAA^ 

GGTTGG TTGT AAT AT CTCATGTTTTCT AGCAGAAATGT CTTTT AATAAATGT CT AG TT AGGAATTAT AGGGCACCCAAGG ATG TTT 

CTTCTACAGCCTAAGCCAGGGGCCAGAGGATGTCTACACAGCTACCCTGCTCT^ 

GCCAGCTCTCTCTCATCATGATAGAACATCCTTTCTTTCTAAGCA 

TTCATACTTTCTCTTGCTTTGCCTCCTCTCCTTTGGGACT

GTATGCTC-GAAGGAATGGTGAGAGCCATGAGTTAGAAAGAACCATAGCTGCAAGATGAGAGAACAACT 
ATCAACACCTGAAAATGATGGAGATGCTGAGATGGGTCTGATCAGAGCTGACTC 
CACACACACACACACACGTGCGAGAACCCCTCACACACACAGGATGGTGGGGCAGAGCTTAGGTACT 
CATCTTCAATAATAATCCTCAATTATCTAACTTTTCTCACACTAC^ 

6 0 CCACTAGTGCTCATCAGCCACAAAAGCTATTAACACTGGTATTTACTGTTTGTGTCT 

CTCTTTTAACTTTTTATTGATTGTTTATGAATCTCACATCATGTACCT 

CACCCTTGTCACCT ACCCCCCTCCGAAAAAAAATTTTTGGAAGCTG CAGTGTGT CACAGTGTGTCCCACAGTATACCCTTnxSTCC 
ACACTTCCTTATATGCAAATGCTCATTGCAATGAGTCACTGGTCTGGTT CAAGG CCT 

TC CTCACCAGGGCT CTTCTCATATAG AATCTCCTCCTGTTGTCCT C 

6 5 TCCTTCACATATACCAGCAGTTCATAGATGGCATAGAGCTAACTCAAAACCCTGGATCTGGGCCTGGGTGGT 

GCATGCCAGCTCTCCTGCACCCACACCACCAGGGCAAGCTCTCCAGGAGTACCC^ 
AGCCAGAGCAAGCTTTCCAGCTCTGCCGTTGAAGCTCACCCACACCA^ 
GGACAGGACTCACTTTCTTTAGTGTTATAGCTAGTGAGGAGCAC^GCCAGCTCT 
CTCTACCTTAGGTGGAGAGC<XSTAAGGTGGAAGAGGAGGGCATCTCCTTCCT 

7 0 ccagctctcccatgctcatacccacac^ctc^ctcacttgc^cctctg 

tgcac^cccacttttccaagtcctacagccagtaattct 

AACCACTGTGACATQ^CAGGATACAACCTAGAGTCGTTAGATAGAGGAAACCT 
GAAAGCAAGTCTGTAGTGCATTTTCTTAATTAGTGACTGAAGTGGG^ 
CAGGCTAATCAAGCC^TGGAGAGAAC^GGC^^ 
CTTGAGTTCCTGCCCTCACTTCCCTGGAAAATGAACT
GTCATGTTTTATCACAGCAATAGAAACCCTAACTAAG^

GGAAGCTAAAACAAAACTGTTAGAACCATCTTCATACCTGATAAAGTTCAG 

AAATCAGATCACTGTAACGTTACTGAAAGGACAATTCTTCAGGAGCAAAT 

AACATTTTT AG CAAAAT AACTGTAT ACAAAAG CAACATG CAAAAACCAAAAAACAAAAACG CCAT AG CITTTT AAAC^ 
GAAGAGATGACCTAAGAAAGAAGAAAATAGGATAACTACACAAA

AAAAACCGATCCAGGTACAACAATAGTGGAAGATGTCAATAATTCCCACTC^ 

TG CCAG AT AGGGTTTCACCATGAAG TCTTG CCGGACT AT AACTT^ T CCACAG 

GTTAGGGTTAGGGTTAAGTTCTCCTGCCTCTGCCTCTGGATGCTGAGATTAAAGA 

TAGGTCTCTGGGATGGTCAGCTTTAATTC^ 
1 0 TGTCAACATGGAG TGT CAACAACTTTGGGCAAGC CTGT AGAGA^ TT AATTG ATG T AGGAAGAC C CAGCC CACT 

GTAAGTGGTACCATCCCCTTGGCAGGGTTCCTGATCTGTTTAAGAGTGGG 

ATTTTTCTCTCCCCTCAACTGTAGTTTTAATTTGACTTG 

TAGAATTTTAAGCTGAAATACAAACCCTT1TCTCTCTAAGTGGTTGTGTGTGA 

ACACTCAGAGAGGGGGGAGAGAGGGAGAGGGGGGGAGGGAGAGAGGGAGAAAGAGGGGT A ' r 
1 5 G AAACTAGG AGTTT CCAAACTAAAAGT AAG CAAAAAAACTT CACAAT^ CAACTGGACTTAAGAGGT ATCA 

AACAGACTGGTCCATCTAAAAGATACAAAGTATACATTATTCTCCTTTAGCCATAGAATCT^ 

GCCTACCTGTGGATCCTGTTCTCCTAACTGAGCTGCCTTGTCTTGCCTCAG 

TGGGGTGATATCCAGAGCGGCTCTCCCTAITCAGAGGAGGAATAGGGAAGGGGCT 

TGTGGTCAGAATGTAAAGTGAATGAATAAATTAATTAATGAAAAATAAAACAAAACAAGACAACGC^ 
2 0 AT C CTTATC CAT CAAAGGAACATTTGAAAACG CTTG CACTG CATT AT AC CAT AG T AGAATAAAACT AAAATT AAAAAGT AACAG CA 

AO^AAACCAACACACATTCTAGGAAAGTGAGCAATGAGTCATTGAAGAAG 

GAAAAGAACTGCTCAACTTACTAGAACrTCTGTGGCACATCTA 

GCAAAAAGTATCAGGTAGACCTCAAATACATAATTAAATAATGGGGGGGGGGAGAGGGAGA 
CAGAAGATTGCAAGAAAAATAAAGAGAAGGTCAGAGATCAATTAAATGGAGACTA^ 

2 5 TGGTATTTGAAGAGATAAACAAGTTTGATAATCCTTGCAGAGCAAAAA^ 

TGGGATCACTAGGGAATACTTTGAAAATTAATTTCAAAACCTCAGA^ 

TG CT AG AGTTAAATCAGG AAAAT AC1AAGCAGCAT AACTGGACT CAAAAT AAC CAAAAT AACAAC CAGTTGTTG AAGC AG T AACT AA 

AAATCTCCACACAAAGAAAAGTCTCAAACTAGATAGATTCACTGCTT^ 

CTTAAATAGATTCACCAAATAAAAAGAGAAGGAACAAGACCAAAGTCATTACA^ 

3 0 GAAGGATACAACAACAAAAACTGAAAACTGTAGACCAACTTCCTTTA^ 

T AAAGCTCT AAAACAAAACAAAAAAAAAGCATT CACAACAAAAT ACTTG CAAACCAAATT C AGAAT CCAAT AG ACAATG CATCTTA 
ATCGAGTTACTTTCATCTCAGGAAAGCATGATTAGT CT AACATATGCAAATCAATACATGT ATAAGGACAGAAAT CAGAAGATTAT 
CTCAAGAGATGCAGAAAAGGCATTTGAAAATTCTCAATATCACTAGACATCCCGAA 

3 5 ACCTTGTAAAAGCTTACAGCTGACTAAATGGGGAAAATCCAAGAGGATTTCTTCTAAACT 

GTCTATTATTATTCAATATACTGCTTAAAGTCrrrAGCTAGAGCGACAAAGGAGGTGG 

AGTCX^GGTATCCTT ATTTGCATATAGAGCTATCCCTATTTGG AGAC AG CATG CAT ACATTTAAACTCT ACTCAAGACTCCACCAA 

AAACCTCTTAGATATGATTAAATAGTTTCTGCAAAGTAACAGGATACAAA^ 

TGAAACCACCTAAGAAAGATAAGTTAAAAAAAAAAACTATTCACAAGA^ 

4 0 TGGATGCCTCTAAAGTAAGGCCTTTAAGACACTGAAGAAAGAAATTAAAATTGATATTAG 

GGAATGGCATAAGCTAATATTGCAAAACTAACTGGACCACTGAA^ 
TTTTGACAGAAATTCTAAAGTTCGTATTGAATCATGAAAGT 

GATCGAACTATCTGACTTCAAATTGTACCACA.GCACCACAGTGACAAAGACGTTGG 

TAGAGTAAC CAGAAATTAG AG CCCACACAACTGTAGCCAAATAGTTTTTGATAAAGGGGTCAAAAACATACA^ 

4 5 AAAAAGATAGCTTCTTTAAATAATACTAGGAGACCCAGATAGC^ 

ACAAAAATAAATTTGTAATTGATCAAAATCCrrTAATATAAGGTCTGA 
GNNNNNNNNKWNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

TAAAACTGTCCCCAGACTCAGAAAACAATCACCAACAAAGCCTGGCAAG 

TAGGGCAAGTGTGAACCCATATGGCCTCTATGGAGATCAGTATGGAGGTTTCTCAAAAACACTAAA 
AGTCTTITAAATCCTGGAATAGACCAAAGGATCCTAAGTGCAATCAAGAGATG 

5 5 CAAAGGGATACCCTGAGTGAGACAAAGAGGAGACTCAAAGGAATAAAGAAGAGGTAACTACAT^ 

GGGTTGAACAAGTTATGAGTGGTCAGGGAATGGGCCAAA.GTTCATGGACAAGGCATTAATAA 

TTTCTGAGAG CTGGGACTAGG AAGGAAAACCAGAITTTTTT TTTTT AACAAGGAGAATCTCAG AGCTCC CTTCTCTACAGGTCTAA 

TCTCAGAGATTTGTCCTAGGGATGGACCAC^GCGGTtXSCTC 

CGTTTTATGTAACACTGTGACCAAGGAGATGTCCAGA^ 

6 0 AACTTTGGAGACCTGGGGAAGGTTITAGAGCTCAAGTCTTTAGTCCA^ 

ATAGATGGCTACCCTTTTGTTCTTACATTGTTCCTGAGATTC^ 
GGGAGCCCCCATCATTTGATATTTTCCAGATTTGCTGTATGCCACCAAGAGAGCAC^ 
GCACCGGCTTTGGAAAGTGCGTGATGGGAATGGATGGAAAACATTGCTGTGGGTGAACTTTGTTGTTCCC^ 
ATGCTGTTGTCTTTGTTCAGACATCTTAAACTCAGTTGTTATG 

6 5 ATAGGGCTTGCCTGTAAAGTGCACTGACAGAGGCAGCGTTCTATATTTAGTGAGGCAGATGCT 

TTGCATCAAGGTCCCTGGTTGGTGAAATAAACCACAATAAAGAAATGAGGAGC^ 
ACCAAAGTTGCTTGACTCTGACTGGTAATGATTTCTTGGTCCCCJVAATGCCCT 
GGGGTTAGAGGAAGAATAATCA.CATTGAACTATTTCCTCTACACCCn3 
TAAACCTCAAACCACCAC^GCAACTGTGTGATTTTAGGAAGGACGAGAAA 

7 0 GCTATCAGAGGGCTTCCTGACACTTTTAAAAGAAAAGAAAGCGTAACTACAAG^ 

CCGAGGGTTCAAGTTTGTAGTGACCGTGCCATTTTCTTCCCATCCTGAATAC^ 

ACAATTCAGGTCACCAGAGTAAACTG CTTCTGT AATGGT AAATTT CCCAG CAGGTCTTAAAATAAATAG ATAAATAAATAAATAAA 
CAAACAAACAAACAAATACACA(^TACATACATAAC!ATACACACATACATACACATATAT^ 

TTATTTTTCAG CTC^.CATTTATTTAATT ATT AGGAATCCAGGTT AC CTT AATTTTTTTGTG CTTGCTACACACTTTCATGCAT ACC 
AGGT






MOUSE SEQUENCE - mRNA 

GGGACCCTTTGCTGCCCTCAGCTAAGAGTGCGG 

GCAGCCGGGGCTCAGAGTTCGGCATGAGCGCAGTGAGCTGCGGCA^ 

5 GGCAAGTACCCCGGGCTGGTGTGGGAGAACGAGGAGAAGAGCGTCrrCCGCATCCCGTG 
TCGTGAGGAGGACGCTGCCCTCTTCAAGGCTTGGGCATTC 

CTTGG AAGACAAGATT ACGATG TGCTCTG AACAAGAG CAATG ACTTTGAGGAATTGGT'CGAG AGGAG CGAGCTGGATAT CT CTG AC 
CCATACAAGGTGTACAGGATTGTTCCAGAGGGAGCCAAAAAAGGAGCAA^ 
CC^CCCCTACCCCATGACAGCACCTTATGGCTCTCTC^ 
GGAGGGATTATGCCCCTGACCAGTCACACCCAGAAATCCCATATCAATC

GG C CCATCTTGTGAAAATGGTTGCCAGGTGACAGGAACCTTTTATGCITGT^ 

TGAGCCAAGCATAAGGTCTGCTGAAGCCTTGGCXXTCT 

AAGAGCTGACCACGACGAGCCCTGAAGGCTGCCGGATCTCC 

CCCTACCCGGACGACAATGGACAGAGGAAGAACATTGAGAAGTTGCTGAGCCACCT 
1 5 AGATGGGCITTATGCCAAAAGACTCTGCC^ 

TAGAAAGAGACCIAGACrTGCAAGCTCTTTGACACACAGCAGTTTCTATCA^ 
CCGAGATTCCAGGTGACTCTGTGCTTTGGTGAGGAGTTTCCAGACCCTCAGAGACA 

T CTGCT AG CCAGACAACTGT ATT ACTTTGCT CAACAAAACAC^^ CCTG AGGGG CT ACGAGTT AC CTGAACACGTT AC CA 
CTCCAGATTACCACCGCTCCCTCCGTCATTCrTCCATCCAAGAGTGAGAAGAAA 
2 0 CTCTTTGGAAGAGCTAAGAAGTGAGTGGGTTTCCACTTGAA^ 

CAGAGGAGCTTCCCCCAGAAGAGTGCCTGTCATCCAGGTCTTGACAAGTGCCAGG^ 
TGAAACTTGATCAGTGTTGTGTTTACATGTACTTGAATGCTC 

CTGTGCCAGCATGAATCCCTGACAAGAGAAGACATACGTATTATTGGTCCATTTCTCAGC^ 

gctotccactcacagagaaacgcattcctggatagaggagctgtaggttgcccagatcatgtccacrrgagt 

2 5 ttaggtccagtctaggctatcatgtcctgatcctgcactctctctgagtajgaccct 

CCCTAAAAGC CAGCTGGTGG ACTCTTGTTG CCACATACITG AGCCAAAA 
CTCCTCTGATCTAACAACGGCTACAAAGTTTTTTGTTGTTGTTGTCTCCCC 
l"rriTTGTAAAGAAAGGAAGAGAGAGAGACCGAATATACTAGAGATAGATTTTTAAAATTC^ 
TTGTTGGCCATCAGTGGTCCCCATGGCAACACTGGAAGGGCGGA 

3 0 AGTGTACAGCTGTAGACTCAGTGAGCTGTTTCTGCTTATTTTGTATATTAGATGCT 

TCAAAGCTACCXSGGCTTCTTTTCCAACAACACAT^ 
TTATTGTTACTGTTrTGTTTAGGTGGCT 
AGAAAGAGTTAAGCGCTCTIXSCACACrrTGGCACTTACTTCT^ 
GTCTTTCAGCAGCATACATTCJATAAGCAATGGACCATTGGGTGTCT 

3 5 AT AATACACATTACAGAAAATGGAAG CATTTTT CAT AGAG AAGG TAAGAACTGAAATGCATTTT ATGTT CGAATCCTCCTTGCT AG 

TATTAGCCCCAAGAGTTCAGTCTGTCCTTTAAGCATG 
GAGAGAGATATXJTGATGACACTCTTATGTACCCAACrrATACAGATTTACCT 
GTGCCTAGGTGCAGTTTAGACATCTTCTTAGCTACCCATCCCAGCAGCCCA 
ATAGATGCAAAACATTCTGGGGTATATTATCCQ^AGGGAAGATGAAGGCA 

4 0 AGCTGTCCTCCTGGAGTATCTTAACCATATCTGGCACTTCACCAAGGTCCCTTGCCTGGTCCT 

ATAAGCAC^TGCTTTCTCTCAATTGCCATGTGAAGTCTTTTC 
G^TGAATTTTCTGGAAAATGCTGTATGGTTCGAGCCCCCAGTTC^ 

GACAAATGGAATG ATGGGTAAACACT CCTGACACAAAC^GG AA CTT CCT CGTC CCTTG CTG AAACAAAGTG CAATG CACACAG 
T AGAATTTAG TTTG CTTG CACTGG AAG ATGCCG TTGAAG AGG T AGG CTG AGG AACACTGT ATCCT AGGG AGAAAAT ATG CAAGTCT 
4 5 CACGGGTGCTTTCTGTTGGCTTGTTTCTGGTCCTTGAGCAC^ 

GTTGTCCTGGACAGAGTTGGTAGACCCCACCTGTAGAAATCCAGATGCACTCT 

GAG AATGCCCAGCCTGCAGAGGCC CAGGTGCGGTGCCTGAGTGGCTGTATG CCAGACACAGGAACGCCAGCAGG CTG AGTGTGGTG 

ACTGATATGTGAGGTGCCCTTGCCCTGCAGACAGGCTGAATGTGAGCCCCAGCAGCAGAGGAT^ 

GTCCTACACAGGAAGAGCCTTCAATATTACTTTTTAAACGAACTTTCCCATT 

GGTGAAAATTGTGAAATTGTTAAGGTCTCTGTTTTTGTTATTTATCTCCCTA 

CCTACCACATGTATGTTTTTGCAAG'lTGC'r'rL' I'l CACTCATTGAAGAAGCCTGCACCGTAAGTGAAACTCACTTAGTCCTTGCTAA 
TTGCTGCTCACTGTCACGTGAACGCyiGAGAAGGCTGATTTT 

ATTGAAGTAGAAGACAGTGTTTGGCAAGCCAGTTCTTATTAGACACTGCTACAGGCT 
55 GTGGAAGGGCATCATTGCCCCTCTGCTCCCATGCGCTGTGCACCCCCTTCACAGTGAAAGACCT 
TCTCTGGCTTCGAGCTCATATCTTGGCTTCCTGTCATGTCTAAGCCCGACAGGA 
TGGATAGGCCATGGATTTTGCTGTAAACTTGCTTCACAA 

MOUSE SEQUENCE - CODING 

6 0 ATGAACTTXK3AGACGGGCAGCCGGGGCTCAGAGTTCX3GCATGAGCGCIAGTG 

CGACCAGATCGACAGCC^CAAGTACCCCGGGCTGGTGTGGGAGAACGAGGAGAAGAGCGTCTTCCGCA 
GCAAGCAC^ACTACAATCGTGAGGAGGACGCTGCCCTCTTCAAC^CT 
AAGCCAGATCCTCCTACTTGGAAGACAAGATTACGATGTGCTCTGAACAAGAGCAATC 
GCTGGATATCTCTGACCCATACAAGGTGTACAGGATTGTTCCAGAGGGAGCCAAA^ 
65 CACAGATCGCCATGGGCCACCCCTACCCCATGACAGCACCTTATG^ 

CCCCATGACAGGAGCTGGAGGGATTATGCCCCTGACCAGTCACACCCA 
AGGCCACCACTGGO^AGGCCCATCTTGTGAAAATGGTTGCCAGGTGAa^ 
CTCCTGGAATCCCCATTGAGCCAAGCATAAGGTCTGCTGAAGCCTTGGCGCTCTCAGACT 
CGGGACATCCTCGTGAAAGAGCTGACCACGACGAGCCCTGAAGGCTGCCGGATCT 

7 0 GGACCAGGTCCTGTTTCCCTACCCGGACGACAATGGACAGAGGAAGAACATTGAGAAGTTG 

TCCTCTGGATGGCTCCAGATGGGCTTTATCCCAAAAGACTCTGCCAGAGTAGGATCTACT 

GATCC^CCCAACAAGCTAGAAAGAGACCAGACITGCAAGCrrCTTTGACAC^ 

CCATGGCCGGCCAGCACCGAGATTCCAGGTGACTCTGTGCTTTGGTGAGGAGTTT 

CAG CT CATGTGGAAC CTCTG CT AG C C n JGACAACTGTATTACTTTGCT C A A.C AAAAC ACTGG ACATTT CCT G AGGGG CT ACG AGTT A 
7 5 CCTGAACACGTTACCACXCCAGATTACCACCGCTCCCTCCGTCA 
HUMAN SEQUENCE - GENOMIC
TCCTGAGATCTTATCITTGATATAAGTTAGCT 
CAGAGAAAACAGCCTAGCCCATGGATTTGTTCTCAATGCACT^ 
5 ACAACATTCAGAATAGAAAGGGAATTTCTGAAGGGAGTTTCATAACTGAAATT^ 

TCAGTGACCACTGAGGACCTCCGGCAGGCTTCGGGGTGTACAAAGACAGCCAGGAT^ 
ACAAAGGGACCAAGAAGGGATAGCAAGAGGACAGCCAGACTGCCC^ 
TCGTGATGGTTCTGGTAGAATCGTCCTCITGAAACCTTCTGCTGCCATTAT^ 
ACAATAGCATACACCTTTCCAGAATGCCGGCACGCGCTGTGCAAT^ 

AAATGACTGAGCCACACCTAGTGCAGGAATGAGCACCAGGGCCTCTCT 

GGCACCCCCAGTTATTCCCTGTTTCGCTCATTGGACCACCCTGATAGGCTTCAG 

AGGGGTGGATGGGGCAGCCAACAGGTGCATCTCATGCCTCCGTGTTCTCCATAGGGAGG^ 

GAGGCCTGGGCCAGCGTGAAGTGTCAC^AGTAAGGGC^AGAAGACTGGAG^ 
1 5 GCAAATAC^GCCCGGTTACGTCTGTGTAGCrTAGCTTCCCCAGCCC!ACCTGAGC 

ATTGACATCCAGCAACTCCAAATGACCTCTCCCATATCAATGG^CTCATCTCC^ 

ACCAGGCTATAGTGGGAAATCATTGAGGAAACTATGGTTGAATGCAT^ 

AGCAGGCTGCCCCTG<3TGGGCAGGTGCCGCATGGAGGTACCACGCCT 

GGGAGTGGGCCAGGCTATCACATCAGATGGGCCAGGCCTCGCATCGCTCTCT 
2 0 GTTTGTGGTACTGCAATGCCACAGTACCTTGGGGAGCCCCCAGGCAAGCCAAGA 

TTCAAAGGTTCAAAGCAGGGATTCTGTGTCACACTTGCGAAGGTGCCCACCC 

GGTAGACCTCTAAGGCCTTGTCCCAGTGGAAATGCCCTCCCACTTGGC 

ACTG GACTC CTGAATGGG AGCCTCAG AGCATT AGAAAAG ATAATGTTCT CT AAACGT ATATGGAAGG AAG CTGA 
AAG AGGACTTTTGATG AGT CAC CTAATGTTGATGTTGAAAT AAAC CT AATG ACCCT CTGGCTAAAATCTG TTCTCATTTGGGAAG C 

ACCTGAACCCTCATTCTCCCGCCACTGCTCTGACCTCTGGAA

TCTCCCGAGTCCTCCCAAGCAGCCAGAGGACGACTCCACACCCAGAACCCAGAGTGGCCTGGGAACCA 
CATTTGTCATCGCCC^VGGGTCCrrCCTGCCAGGAATCCCCCCACATCCTGCCCTGCCTAGAGAAGCCCAG 

TG CTCAAATGTCACTGCATGTTACGTCTTCCTCACACACTCCCACTAG CCTTACTGCATCATGCTTG<5TTTGCTG AGCTATGTGTG 
GGGTCTTTCCTTGCCAACCCTTCTTTCCCTGAGAGGTGCAATGACCTCATAT^ 

3 0 CACCAAGCCTGGGAGGCACTGGCTTAATGCATCATGGTTAAAAGAAATGGATGGTTG 

GAGATGTGCCTTGCACAAGATCCCTGCAGCACATGCTGGGAGGCTCTGCCCXKMTAGC^ 
GCTCCTGGGAGCACTGTGGGAACTCTGGGATCCCCCTCCTCCTAAGGGCTGTGCT 
CTCCTGCAAGAAAACATGCCCCCTGCCCACCCAGGATCCACCTGGCTGGGGAATCT 
GGAGGAATGGGGCTGTGTCTGAAAGAAAGAGATGCCAGCAGGATCCAGGAGAGCAGTCACGGTCCT 

3 5 CAAGGAGGTGTGGGGAGCIATAGATTCATATAAGAGAGAGTGGTTCAGTCTGTGGCCATGCT 

GCAAGGCCCGAAGAGATGCTGAAACAGA.CAGCTAGGATGGCT 
GTGCTGCAGCCAGGGCGTGCTGTGGGAGAAGGGCTGGGAGGATGCAGCTC^ 
TGTCCTCAGTACGAGCCAAGGTGGGGGTGAGGAACGGCCCACTTACCAAAACT 
TTTCTGCAAGTTTCGAGGAGAGACAGAAAGGGTGCCACTAG^ 

4 0 TAGCAGCAAGAAGGCATGTTGACCTTGGTCATCAGGAGGTGTGGAGCTGCTGCTATACAAT 

GGGCACCTTCTTGGCTCTCCCTGTGTGTATTGATGGAAAATGGTATGTGGAGGTGCAGCA 

CAGTTCAAAGCCCTCAAGGATGAAGGCrTGGGTCrGGCCACTC CATCTAGAAAGGACAGAG CAG 

GAGGCGCTTCACGAGTCTTCACTGGCGTTCCAGCTGCAGGGGCTGAGGCTGGAATCTGTT 

TC CAGGAT AAAG TTG CTATTTGAATTTCAG AT AAACAACAAGT AGTTT CTTT AG TGTAAG T ACGT CCCAAATATTG CATGG AAT AC 

4 5 AC CT AT ACT AAAACAAGTT ATTCAATG TTT AT CAG AAATGCCAATTTAACTGGGAGTTTTG CATTTTT ATTTGCTAAAACTGG CAA 

CCCTCCTCCAGGGATGGAGG CCCACTGG AACCAG<»AGGAGCTGCCCTCAG^ CAGAACGAGG 
TGT AG ACTTGGG TGG ATGTTG CTGTGCACTG CITGGATTCC CACTTGGG AGTG AG AGGTT CATTCC CT CGGGT ACTGAGACTG CT C 
CTGGGAGGTGGTTCTCAGGAGGCAGCCTTCTGTGGCGATGGCTCCAGCT 
GGCAGCTGGCACCCAATGCTAGCTCAGTC^GGGCTGCGGGGGCCCTATCCCTTCACT 

5 0 AGAGCTTTCTGCAGCATTGACTGGAGTCCATTTGGCTTGGCGTTGCATCAC 

TGACTCCAACCAAGCATGCTCCCTAATCATACATGCTG^CCTCCATCTC^ 
ACTAGGATCACTGTCAA(^AAAGCATGACCCAAGAGGCAAGACACAGT CATGTTTA^ 
CTAGATGTCACGGTTCAAAGCTGATTGAAAGAAACTCTGTGTTTCCCAATGCCT 
CAGGATCTCTAACTTCGCCTGTCTCCCCGGTAATCCTGCT 

5 5 CCTTATTCTCAGGGGATAGATATGTTCCAAAACCCCTAGCAGATGCCTGAA^ 

TTTTTTCTGTACGTACATAGCTAGGATAAAGTGTAACTTATAAATTAGGCACA 
GCAATGATGACCATACGCTGTAATGAAAGCTGTGGATGTTGTTGCTCTCTCCAACATCTC^ 
CCTGATGGCIX3CCACATGGCTGATGGGCCAGCAGCATATACAGTGGGGTCTGCTG 
TGAAGCGGGACCCTGCGCGATGTCACCA.CCCTGCAAAGAATGGCATGCAAC 

TTTCTTATTTTCAGACTGAGGTTGCCCTCAGGTAACTGAAACTGTC

CCCATCCTC^AAACCCTCCAGCAACCCCTGACTCCCCGACCGCCCO^CCCCCTGCCGAGCACGTCT 
GTCCCTCCCTCCTCTGCCACCCATGCACCTCCCCGTCTGAGAGCTCTCTACCTCACCCCGAGGCCT 
CTCTCTCTCGGGCGGTACCCACACTATGGCCAGATAATTCCTCCTTTACT 

AGCCATGTAACTCGCTGCCGGGCAGGAACTAACAGAGCCAGCACATGCTTCTCCCGTACCCTTCT 

6 5 GTTCCGGATCCCCGTGGCTTTGCCATGGATATCTCATT^ 

ACAGCATAACCTAACCTTTCTAGAAAGATACACTCCTCTATCCAGCATGCCT 
TTCACAAAATTrCGCTGAAAGAGCCGTTGATGAACTTTAGAATAACCCA^ 

AGCCGTTTG CCTTTCACCTCTTGCTTCAGTTTTCAGAAAATTG CCCAAAGGTATTG CAAACCATTTGCGTTTTCATTGGCTTGGTT 
AAGGACTTGGCAGTGTCAACTGGCCCTCTGTAAAGCCACCTCACTGTTTCTGTAGAGAGCATGACACGTCAGAGAGGTG 

7 0 TTGCCAGGACTCTTCTTAGAATGCTGGCAAGAAGAGCAGGCGAAAAGGGAA^ 

CGTATCCCTGGCTTCTCATTACGTCATCGATTACAATCACAA^ 

AGGAAAAAG CCCCGCTTACTAACTCTGCCCTCAAAGAGATCACAAG CTGGTCCTG AACGCCCTGTGCCCTGGGAATGCTTACTGTC 
GAAGCTCTGCAAATTTTCTCATTGAAGGCACTCAGGG 
TCCAGCTTTATGCTCTGTATTCTATTACCAAAGAATTTA 
GAGCATTTATAATATAATCTCATTTCCAGAGTGTTCCATCCCAACTCCTCTCT
TTCCy\GAGGATTAAACATTC(^GAAGAGTGG
AGAAGAATAAAGTGAAGGTCAATGGTTAGATCCATGGCreAAA 
AAGGCCACGAGGCAGCCATTAATGATCCTAAGTTGCAAGAGAAAGAAAAAAAGA.CCT 
ACTTGCCGCCGGCCTCCACTTCACAGGGGCGGAGGATGAGGCCAGGGAAGACTTTCTT 
CTCCCCCTCCTATGACAGCCTGCACTCAGAGGCTGATTCTGAAATAGTCATGCCAAATGC^

AAATAGTTCCTCAGAGCTGTGTCCTTGGGCCAGCCATCTGGACCTCTCACAATCTCTTTCGTTCTC 
CGCCTTCCTTGCCTGCCAGGTTTGGTTTGGGGGC 

TGAT AAT AGCCCTG ATTTGTTGAACATGATGATG TG CCAGG CATTGCG CAAAG CAT CATTTCAT CATCACAACAAACACTGCTGGG 

AGGCAGGGACACCCCCACCCTCCTTGTACACCCAGAAGTAGGACTTTGGTTAA^ 
TCAAGCACTTATTTAATCCCCAATGATACTACCTTTTGGCCATGCAATGTGT

CCAAAGAGGATTAGAAGAGACATATATTCTATATAAATTACATGAAA^ 

AAAAATGATTGCCCTTTATAl'r rC l' T TATGGATACAGTTGCATTGTTTTTAG 

GTGGGAAAATAGTGTTATCCAAGATTTTAAGAATCTAGTGTGTGTGTTTGTGTGTG 

GCATGTGAATGTATAATTTTAAAGATTGGCTTTTAAAATACTGAATTGAGGT 
ATGTGAAATTGAAAATGGAAATGCTCCATTCAGATGGTGTGCCGACTACAACCTCAACATACGGATA^

TCCTCGTGG AAT AT CACGGTCAG C CTTTGTTTTGGGT AC ACATT CAATGAACTGGTTTCCT ATT AG TTT CTGCTTG TCAACAAGTT 

GTGTCATTTACTGATAAATATCCAGCCTTTTACTAAGGAATACCGATTTTCTCA^ 

TGGGGAAGCTTTTCAATTTGAGAACCTTTGCCGTTTTTTGTT^ 

CTGCTACCTGTTCACTGCCTCCCTGACGGGTAATGAGGCCATCCGGAGTTTTA^ 
CATCATTAAGAAAAATAACTTCTAGTGGATCAATATGGCAAATGAGTAAACCAGA^

GAGGGTATGTTGAAAATGACTAAACCCGGTGCAGAGGACACGTATACGAAAGTGCAAAAAGCTCT 

AGCAAACTCCTCTCTGGTATGTTTCTGAACGTTTGAGGGAAAGGTAACACrGA 

TTGCTTTGTTCACrGATGTGTTCCAAGTGCCTAGGACGGTTCCTGACAAG 

ATCH^GAAAGGAAAGGAAGAAAAACAGGTTGTGACCAGAACTTTTTCT

ATGGAAAACTTACTTGGGTGAAATAAAAAATGGCAT 
CATACCTCACTACCTGTTITGCTCTAGCCTAAACATAGATTCTT 
AAAACTGAGAAATAAGTCAAATCAAAAGCTTTTCAAAAGT^ 
AGAGATCTGATGGTTTACCTGAAAATGCGAAGTAAATGCACATACTCAGCATC 

AGGGGGTTAATTTTGCGATTAAGGACATCTTGGAAAAGTATGTAAAATC

TGGCTCGTCCTCTCAAGTAGCTGGGGCCACGCCCGGCTAATTTTTGTATTTT^ 

TGGTCTCGAACTCCTG ACCTCAGGCG ATC CAC CG C CT CGG CCTC CC AAAGTGCTGGGATTACAGGCTTG AG CCACAGCGC CTGGC C 
TATTTTGGGCTTTTATACCCC^CTGGTAAACTGCTTTCCTCCAGGTTGA 

GGAAGTGATTAAGCACTTGG ATCCTTAGGGAG CCTCTCCCCGCCCC CATCTCTTTCATG CTAAGATAATTAAAACTTCGGGGCCGG

GGCATTGTCTGAGTCACTTCAATTCLACCAGCCTAATAGA 

CAAGCITAATAGCTCAAGGGAGCTGGGCCATTTCCTATTTTCTTTTTAGTGAGTGCGATGTTCT 

GTTCCCGGTGATGGCCTTGCCGAGGGTGCTCCCGCAACCTCCACCTCCAGTTCTCTTTGGACCAT^ 

CTCTGCAAAGOSAAGTCCCCTTCGCACC^GATTCCCGCTACTACACGCCCCCC^TTTCCCGC 

TGATTGACTGGCCTCCIXIAGGTCCTGGCGCAAAGGCGAGAT^

GTCCCCGAAGCGGCGCGCGCCCGGCTGGAGGTGCGCTCTCCGGGCGCGGCGCGCGGAGGG 

CGGCCGCGGCAGCCCCCAGCCITCACGCCGGCCCTGAGGCTCGCCCGCCCGGCCGGCCCCGGCTCTC^ 

CCCAGTCC^CCCCCGGCCCCCACAGGCCTOX^GCCCCG 

CCCACCTCGCACTCTCAGTTTCACCGCTCGATCTTGGGACCCACCGCTGCCCTCAG 
GTCGGGCZAGGAGGAGGGGTGTGAGGCTGATACC^GAGAGGACCCGGAGCGCGAACCAGAGGTTCGACCTCC^
ACCCCGGCTTCGGAGCGGGAAGGGAGCGCGCCCCGTCCTGGAGCrCCGACT 
TGGGCTCOtfVGGCCCGCCTCCTTGGCTCTGrc 

CGGGATGAGCTAACCGGACTGTCGGGGCCCCAGGAGTG<3CTGAGGCGGGGCCGTCCAAGGCA 
TGCGAGAAACAGGCCCGGCCCTGTGGACCCCAATCCTAGGCTCCTTCCCCT^ 

CCTCCGGCGCGGTGCTCACCCGCTGGCGCAGGAGGAGGAGGAGCT

NNNNNNNNNNNNNNNNCTCCGGGGT CCX3GCGGACG CTCTGCGCX3 CX3G AATCCCC CGTACTGGGGCTGCAGCCCCCGCGTCTGCGCC 
ACTTGTCGTTTGCAGAGCCCACTTAGTGCGCGCTAGCTGGGCAGGGATAGGGGTCCT 
GAGAAAGCGGAGGGTGGAGGAACCCGGGGCTGO>CCCCTGGAACGCCCGGCCGCAGGCGA^ 
GGGAGTGGAAACTGACAGAGTCGCGGGAAGGGGCGAGAAGCGGGTTGGGAGTGAGC 
GAAGGAGGGCCAGGAGGGGTGGCGGCTGGGTGGGGAGAGAGGGTGCAAGACGAGCGGCGCOT
GCGTTACAGGAGAGCAGGCGGGTAGGAGCCITCGCGGGGGCCGAGCTCGGAAG^ 
GGGCCGGCCGCGAAGGTGCCTTCTTCCGGGGGCCCGGACGACCCTGACACGGC!^ 
GCCTCGTGGTCACTGGCGCAGGGGATCGGGGCGGGGTGCCCGGAGTGCGGTGCCT 

TG CAGAGCAGAGCGGGCGG AGGACC CCGGGCG CGGGCGCGGACGGCACG CGGGGCATGAACCTGGAGGGCGGCGGCCGAGGCGGAG 

AGTTCGGCATGAGCGCGGTGAGCTG CGG CAACGGGAAGCTCCGCCAGTGGCTGATCG ACCAGATCG ACAGCGGCAAGTACCCCGGG

CTGGTGTGGGAGAACGAGGAGAAG AGCATCTTCCG CATCCCCTGG AAGCACG CGGGCAAGCAGG ACTACAACCGCGAGGAGGACGC 
CGCGCTCTTCAAGGTCTCCXIiGCCTCGGGAGCCGGCGGGGGCGCGCCGGG^ 
CCTCCGAGGCGAGCCCAGGGGACCX3CGCGGGGCGGACGGGCGGGCGGCGGAGGCATCAGGTC 
GAAAGGAGGCCTCGGCTCTCAGCGGGACCGCX5GGGGCCGGGAGCCGGGTCCTGGGCGC^ 

CCCGGCCGGGCCCGGGGAAGGGCGGCCAAAG^CTTGAC^G

GCCCrTCCrCCXKKSCTCCCGTCTGCCXSCCTCCGTCCGTGGGTCCCCCTCGCCCTCrCCGTGCG 

TTTCGTCTCTCACCGCGTCrCTGTTTCTCTTTTCGCTGCTTTTCTCTCTGAGT 

TCTCTTCTCXSCrCCTGCrAGCTCrCTGCGGGTACTCCCACCTCTGT 

TCCCCTAAGGCAGITTACCCAGAGAACrACGTGTCTGGGCCCAGCCCCCACCT

TCCTTAGTCCTAGGCAAGCTCACTCIAGACACTGGGTGGGTAGAGCCCCCTGAAG 
TACCCAGAAAATGTGTCTTCAACTTTGGCAGTGATAGGGTCCAAGAT 
TTACGTTCTGAGCaj^CGGTGTAAATCTGAAGGACCTATGCG 
ATCTXjrATTTTGGGTTTAGCATGGTGGCTC^
AGTGAGACCCTGCCTCTGCAAAAAATAATAATTAAAAAAATTAGCCAGGCATGGTGGTGAAC^
CTCTAGAGGCTGAGATAGGAGGACCCTAGAACTCAGGAGTTCGATGCTGCGGTGA 
GCAGCAGAGCAGGACTCTGATTCCAAAAAGAAAGCTAATAAACCAGACATGTATTTTG 
CAGGCTTGGGCACTGTTTAAAGGAAAGTTCCXaAGAAGGCATCGACAAG 
TTTGAACAAGAGCAATGACTTTGAGGAACTGGTTGAGCGG^

CTGAGGGAGCCAAAAAAGGTAGGGGCTCTCCTGAATTTGGGTCA^ 
TCTGAGCATCACTTTCTAGCTTTCCTTTTGTATTGCCTGCCT 
TTATGTGGGTCACAAGTTGAAATTCCTGTGGGGTTCAGCATAGGAGAATTA^ 
TGCATTGAATATGTGTTTTCCAGTTAGTCTTTCAAAAAAAAAAAGAAAAA^ 
AT AGT CTTT AG TGAGTGACT ATTTTG CAGAAAAGGAGT AATGAGAGG AAT ATCTCAT CACT ATGCATGGTAAT AAAACTGT CAACT
TTTAAAGTGTAACATGGTCAGCTGAGGTGTTTATTAAAAGTCAGATTCC^ 
GAAGAGAGCTCAGGAATCTGCATCTGGAACAAATCCTTCAGGTGACCT 
ACAAGGCACATTCGCATGCCATCTGTCATGTArrrCI'lTACCTGTCCTGTACAACT 

GCATG AAAAAAT AACTT CCCAAGGGAGTTG CTGAAAAAT ACCTT ATTTGT CAACAC CGTGTTATGCATTCT AAG AATT AAATG CTC 
AGGTA 1TT1T ACAAGATTTGACATTTAGTTAGGTCA

CCAAGCAGCTCACCCTGGAGGACCCX3CAGATGTCCATGA^ 
GTATGGTGGAGGGCACltKXSCTCCCTGAGGGCGAGGCTGTGTGGGCCAGCTGC 
GCAGAACTTGCCATTTGCTATGGCTGCrCCAACAGCCCAGAA 
TGCCATTGGTGTGGATTTTAAGTTGGGGAGGGTCGGGCGTGTCCGCCTGT^^ 

TTTTGTGGAAGTGGAAGATTTTGGAAGTAGTGC

ATGACAGCrrTGTTTCATCCACTTTGGTGGGTAAAAGAAGGCAAATTCCCCTGTGGTACT 

G AAG CTTTACAT AAAACAGTACZAAGTATCTCCATTGTCCTTTATGATCCT CCATGAGTGTTTTCACITAGTCTGATGAAGGGTTCA 
CTCCAGTCTTTTCGGATGATAAAATGCTTCGGCTGTCAGTCTAATA^ 
CCGTCTCAATGCCAGTGCrrCTTATCTCAGCCTCTCCTGCACTCCTTTAC 
TCCTGCACTCCTTTACCCCCGTCTCAATGCCAGTGCTTCCTATCTCAGCCTCTCCTGCA

GCTTCTTATCTCAjGCCTCTCCrrGCACTCCTTTACCCCCGTCTCAATGCCA CCTCT CCTGCACTCCTTTAC 

CCCCGTCTCAATCCCAGTGCTTCCTATCTCAGCCTCTCCTGCACT 
TCTCCTGCACTCCTTTACCCCCGTCTCGATGCCAGTGCTTCTTATCTCAGCCTCT 
GTGCTTCCTATCTCAGCCrCCCCGTCTCAATGCCAGTGCTTCrTA 

ATGATG CCACC CCTCGACCG AAGCTGGAGGGACTACGTCCCGGATCAGCCAC

TGGACCCCGCGGCCACCACTGGCAAGGCCCAGCTTGTGAAAATGGTAAGGAGGATA 

AATGTGGCCATGGGCCATGGCGAATCCTGGTCTGTCCTGGGCAGCACCAAAGCTCTTTCCCC 

CGGGGAGAGGGGGGTTTTCTCACTCCTGTGGGATGGTGGCATCCCACAGCCAAGT^ 

TTCGTGGG ATCTT ATTTAAACACCACAGAG AT C ACT C CAGG CAGTGG AAAACAG C CC CC CACTTTTT AT AT AGAGAGG AG ATTGTG 

GAAG CTCAT AAATAATAAGTAGGG ATGCTGGG AG ATT ATTG AGCTGGTAT ATTTCTCT CG AAT AGTGGTGTGTTCATTGTTGCTAT

TGTTGGAATTAGAGGTGATACTAAAAAAGGAACGGAAAAATACCATGG 

CTGGGTCCCAGAAAGACTTG CTGATTGTGG ATT AGATGTGGTCCCCTGCCCT CGGGAGTTCCCCAGACATCAAAATGAGCT 

AGCAGAAGCTCTCATTAGCTGGGTGCAGGGAGAATCTAGCAATGCGGTAACT 

TGGAAGTCCTGAGACAGGTCAACACACTGAACCCTTTGGATGTTTTAGGAATATT^ 

AGTCATGTTCAGGTTATCCTTGGCTGCAGGTAGTGAAGAGGTGTACCACTCTTTAATGTTTGTACTGAG

AGCTCAGAATTCAGTTGCATTGGCITTTCAAGAGGGATTTGCAGCAAa 

ATAAAACTAGCTCCTGAAATTC7VGACTTGCGTTTACTGCTCTGGCTCTATGGAATTAGTAACCATG 

ATTGTCTAAAGTGATGGAGTAGGTAGAGACCTGTCTTGAAGCATCGGTAACAGTA 

GCCACATTCTTTTTCCATTCATGTTTC^TACAACATTGTOT 

AAATAAAAATGGACTAAATAATTTGCTAGTTTTTAATTGGGCTCTTTGGC1A

TGAACCCGCTTTGTGAACAAGGACAAAGCATGTCATCTGTCTCTGAGCCTCAGTCTCCCT 

AGAGCGTGCCCTGCTAGTTGCTGGTGCTGGGAGTGGGTGGGAAGGTGATTGGGCGCCAGCCC 

CACC C CAGGAAG C CC CG CGGTG CG T CGG ACTCTCTGT CT AGACATCAT CTG ATTTTT ATTTG CAAATGCAGGTTGCCAGGTGACAG 

GAACCTTTTATGCTTGTGCCCCACCrGAGTCCCAGGCTCCCGGAGTCCCCACAGAGCCA

TTCTCAGGTGAGTGCAGGGTTTGCTCCTGGAGGCACCGCAGGA 
ATGGGACrTTAGACACTTGCTTTGCTCCCTCTGGGGTCTGGA 

AGGAGCACCATTAGAAAACCrGACATCACTGCTTGTGGTTCTGCTGACCGTTTCAGCC^ 
CTTCACT<K3CACCrCTCTGAATTTCTAGGAATGTGCCTlTACCTTTAC 

ATAATTGCTTGGAAGTGTAGAAGGGCTTCTCATTTTGAGAAGCTGATCATC

TACTCCCTGGGGACATTAGTTCTGGTCCCTCATCTCTAAAACATT^ 
CTTTAATATAGCTTGCAAACAAATATGGATTCAATCTGATTTTTAAAG 

TGGAG ATCTT AC CT ACT AT AAAGAAGGCAC CT CT AGG CTTGG CAAG CACACGTGCTAT ATG TATATTT ATTTTT CAG AT AAT ATTT 
TGGATTGTTTTTAAATGGGATTTCTTTTTATATTAAAACCAAATAGCTTAAG^ 

AAAAAT CAGTGATG AAAAAT ACT AACTAAATT CTG AAGGTTT CAGGG CT G CTCAGAGGGG AG C

TGGAGTTTGACCTACCAGCTCTTTCTAGTTGTGAATGAGGCTTGCACC 
TTTGGGAGGAGGTGCTTCTGAGCAGGAAGGCTTGTGTTTGTTTT^ 

TGGGTGCCTGAAGAGTAGGAAAT AAACAG CTATTT ATAT CTCGG CAAC CTCGTG ATTTCTGATGACATTAAATGAAATG AAAC CTG 
CCCCGAGAATCACCTCGAATGGCCAACACCCACTCTCTTTGGGGCGCACTGTCT 

AATTGAGACAGTGATGTGGGTTAAGTCTCAAAACCTGCTGTTGAGGA^

ATACAGATTAATTATCAGCCATGAGCACATTTATTAGATAACT 

AGCTGCCTCCTGTCTGTGCCACTCCCCTGCGGTCKWCTGAGGAGCTTGTGCAAACCC^ 

ATTTTAAAGAGGCTTCTGAAATCAGCAAAACGTTTGTAAATGGATTCATCTTGTTTA^ 

TTTTCAAACTTAAACACAAAGCCCAGTGGGAATTTCAGCAGACTTTACATG^ 

TTTCACAAGAATT AATTTTGTGAT AAAAACAC CACAT AT C CAGTGTGGAAAACTAATTT AAAAAT CAGAAAAACAAAAATGG AAAA

TAAAAACATCACATTCTACCACCCAGAAATCACCATTAAGGCCTCAGTGCT 

AT AAATT AT AT AG ACAT AAAT AATT AAG CACCT AATT CT AT AT ATAATTT AT ATATGT ACACACACACACACACACACACAT^ C 
TTTTTACCTAAATGATGTCATGTTATTATTTTGGGGArc 
TTTTTT,CTAGTATAARTTAAACTTAGGGTTCATTTCCTCCAAA^ 
CATGACITAAATGACATCACCACTATCATCATCATAATTCACAAATAAGACTTAGC^






AATTC^CTTACATTTCCAGAAAATTAATGA 

CTGGTTTAGACAGATTAAATGGGTTTCTTTCC^CGGGACTTCTCCAAGTGTTGACCGAG 

AGACAAGGGCACGCTGTGTTTCCCAGGCCTCCTTGACGCGGGAACCTTTTGTTCCCCCACGGGACATGTGGGACACT 
CCACAGGTCCTTGGCTCTGTGGAGTCGTTGGCCTCGAGGTGGTG 
ACTGCCGGCTGCACATCTGCCTGTACTACCGGGAAATCCTCGTGAAGGA
CATGGACATACGTATGACGCC^GCAACCTGGACCAGGTCCTG 

GCTGCTGAGCCACCTGGAGAGGGGCGTGGTCCTCTGGATGGCCCCCGACGGGCTCTATGCGAAAAGACTGTC 

ACTGGGACGGGCCCCTGGCGCTGTGCAACGACCGGCCCAACAAACT 

TTCTTGTCAGGTAAGGCACCTCGTTATCTGTTAGAATGGAGGT^ 

GCTCTGTTGTGGGCAGAGAGTCCGGATCAGATGGTCCAGGTGGACCATGGCTCAGGOT 

TCGCCTCTTAAAACAGTTCCATTTCATTGCACTATGATCTCGACAAATACGTCTGCTGATGTGG 

AAAATCATAAAGGTTGGAAGCATATTGTTAACATTCTTGGTTGTCAGGTGTGTCACCCTCATGCAGATG 

CGGAGGGTGG AGTGGACTGTACTGGTTTTGCACAGGAGAAGTTAT AGGAGAGTG CTATAAGTTGGCT CTGTGATATGAACATTTCT 
AAGCATATTTTTCAGAACCAGTGAGTTTTGTTTTTCTACC

GCCCCTCGCCCTGTGATGCTGGGCTTCCTCACTGGCTTTGTGGGGCTGTCCAGGTGGAGGGAGCCTCC 

TCTACCCGCCACTCGCCACCCCACCGCCAACTCTTCCTCTGCCTTTTCCCAGTCCTCT 

GTGTCTTTGATTTCTCATGTGCCTCCCCTTTTCTTGA^ 

GTGCTCTGGCACTGTTGACATGTGCTCGTACTTCTGGGTGTAAGTGACCCAAGTTT 
CCCAAAGCAAATATGTCCCCAGATAACTAAGCTTCAGGC

GGTGGGCAGATCACTTGAGGCCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACCCCGTCT 
GCTGGGCGTGATGGTGGGCGCCTGTAATCCCAGCTACTCAGGAGGATGAGG 

TGCAGTGAGCTGAGATCGCACCACGGTGCTGCAGTTTGCAGGACAGAGCGAGACTCCATTGCCCCCC^ 
AAGCTTCACACAGAGGCTGACCATAGGAAGAGGGAGAACTGTGCTGGTTCCCCGAGGCGGCGGCAAGTGAGGAA 

GGACACCTCTCATTCTAAGGGCAGAACCATCCCCAAGCCCTGGCCAGGGCG^

GGGGGACCAAGGAAATTGAAGGGCAACTGGGGGGACCCCCCCTCCCCTC 
CTCACATGGGCCAGGAAGGCTAAGGCCCACTGGGCCTGGATTTCTGAAGACTCTC 
ACAAGGTGAGGGCTTTCTAATGAAAGTGTCACAAGGAACTGGCACTGGCTTTTCTGAG 
TAGAGCCCCCGTGGTGACTAAGCTGGAAGGTGCACATTGAGTCACAGGTGCACTGCGTGAGAG^ 

CCTGTGAGTGTCCTGGGGCTGTAGACTTGCACGACTAGAACITACTATTAATCT

TCCTCTGTAACCAAGGGCTGTGCTCTTTGCCCACTGC 
CGTAAAAACTGCAGCTGTGCCAACC^GCTTCTGCATAATTAAG 
GCTTTCTCAGAGAATTGTGCCAACTCGCCATTCrrcATTAGCCT 
CAGATTCTGACTAAGTCATTTGGGTTTGATTTGAATTCTGGAAAAAAAAAA 

AAACTTCTGTTCAGTTGAAGAGAACAGTGGAGATCTTTGATATCTTCCT

CCCGGCCCGAGGGTGTGTATCTGATAAGGACGCGTCTGTTCrGC^ 

GCGTGACATTTTATTGTA^CTACAGGCAGAAGATTTGTCTTC 

TGTCTTTCCAACAATGTTCCAGTCCGTTTTGTATCriTTGGCT 

GGAGGCCAGGGGGGCTGCCCCACTCCTGTTCTCAAGTCAAGTTACTGTTCC^TCCCTGGAGACAGGCAAG 

ACACAGTTCAGGCATAGTGACAGTGGGACTCAGCTGCCGATGGCTGGTGCTCAGTCACCACGTTGACGT^

GCCTGAGTTACGTGGATGTCCGCAGTAGCACAGATACTGGATTATGTGGGCTCTGTAGTGAGGGGATGAGGTTTTAAAATGA 
GGGATGTCGAAACATTCTGATTriTTAAATGAAAACTTGTCCATGGCATAAATTGGTCTTTTGCCATTGCGTGA 
TTCTTTATCTC CCGACTTCACTGTGGT CTACT ACCTTTTGAATCTTCGTGGGTCCAATG CTGCAAAGCAGTGTT CTTTAGCTGTCG 
ACTAGTTCCTCTTGAAGAATCGAGGGAGACCGAGGGCCCTGGGGGGAAAGCACCCAAAGGAATC 

AGATGTTCACATCAAGAGCCCCACTCAGCGGAGTAAGAGTGCTCATTCCTCTTGCAGTGT^

ACCACGGCCGCTCCCTGCCAAGATTCCAGGTGACTCTATGCTTTGGAGAGGAGTTTCCAGACCCTCAGAGG 
AC^GCTCACGTGAGTCCTCAGTTACACTCCTACCATAGTGGCTTCCTGTTCTTTGTAAAGGCCAGAGTT 

AAATGAAAAGT AAATGTCAAATGACCTGGAAAAATAAGCGTAACC CTTAAACTAGTGAGGGAGGAAGCATGGTCCAACGAGCAG CA 

CAGTCTGAGGACTCACGTGCTCCTCCCAGACTTGAGATCTGCTCAT

TTCCCCATCTTTTTTATTTTTCAGGAAAGTTGTCGTATTTCCATTTTATAGGGATTGAAA 
AAAATTCTCGAGGTTAAGATTGCrrcTAAGAATGCTATCTAGCTAGTGGATCTTCA^ 
CTATCCATTCCCTGGAGGCATTTTGTAGGTCTCTGCAGCTGTGTCCTAGCACCTCTTTTATTTC 
TTATTGGGCATATGATATTTGAAAGAGGCTGAATCTTTCAAGGAATTCAAGCAAATCAGAC 

CTCAAGATGGGTTCAACTGTGGCAAGTAACACTAAAAGGGTAGGGTTAGGATTAGGGTTAGAGTTTTCTTTTTTA

TTCCATAGATAACATGAAGCATTAATTTGCCATGGGAAACAGAATCTTGA^ 

CTACATACTTGCAAGCATTATTAACGGGCTTGCATGCTGAGCAATACCAGGTAAAAGTTAC^ 

ATGTTCTAGCTCTGATGCTATTTCTCATTGAGAACTGAAATGAAACATG^ 

AAAAACAAAGACCAGCCAACCAACCAGTATATAATCCCATAGACTCAGGAGTTTTCTCATGAGTTC 

AATTGATATGATTGTTGAAATAATAACTGATACATTGTC

AGCATTTAAGAAGTTGATGATCGGCCGGGCACAGTGGCTCACGCCTGTAATCCCGGCACTTTGGGAGGCT 
GTGGTCAGGAGTTCGAGACCAGCCTGACCAACATGGTGAAACCCCCTCTCTACTAAAAATACAAAAA 

CG CACCTGTAATCCCAG CTACT CGGG AGGCTGAAGCAAGGG AGTCACTTGAACCCAGGAiSTCAGAGCTTGCAGTGAGCCGAGATTG 
CACCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCTGTCrCAAAAAAAAAACAAAAAAAGA^ 

CACATCAATCCAGTTTTAAAGTTGACATGGATGCAAGTCACT

TGCTGAAGACGAGGCTGCGTGGTCCATAAAATGAATAAACACACGTGTACACCTATGAGTTCCACTTTTAAAAA 
TTCATTTCAGGTACTTAGGAGTTTAGCTTAAGTCGTCATATATAAAATACAGTCTCTAATATC^TGATTGATGG 
TTGCCTTAGATGCTGTAAATTCAGGAAAAGCTGAACATTAAGGCGTTTTTGl'l" 
CAGTATAAATTTGAGGCCAGTTAGCTTCTCTTTTTCCACAATAGTAGTTTTGTTGTTGTTTAAC 

CAAAATCATTGGCCTAAATTACCCTGCAGATT^ 

TAAATGTACATTTTAAGTGGCAGTCGATTTGAAAGTAACATCTTTAACC 

ATCTATGGAAGGACTAACCAAAAAATTGCTTCTTTAGATGGCTCCAAATATGAAATGTTTTGATGTGTTCT 
CCTTTACCTTACTGTCTCCTTAGAATCTGAGTC 

TCTTATTAAAGGTAGAACCTCTGCTAGCCAGACAACTA^
TTACCAGAACACATCAG CAATCCAGAAGATT ACCACAG ATCTATCCGCCATTCCTCTATTC AAGAATG AAAAATGT CAAGATG AGT

g g -itttcitttt c crm "i t t t i 1 1 rrr itt ttgatacggggat acggggtcttgctctgtctcccagg ctggagtgcagtgacac 

AATCTCAGCTCACTGTGACCTCCGCCTCCnKSGTTOVAGAGACTCT 
GCCACTGCACCC^CCCAAGACAAGTGATTTTCATTGTAAATATTTGACT^ 
TTGAGGAACTCAGAAGTGGAGATTTCAGTTCAGCGGTTGAGGAGAATTC

GG CAGATG AG CTT ATTTCAAAAGGAAGGGTGGCTTTG CATTTCTTGTGTTCT AT AG ACTG CCAT CATTGATG ATCACTG TGAAAAT 
TGACCAAGT<^Tt;TGTITACATTTACIX3AAATGCGCT 
CITTCAGTATTGACACTGACTAGAGTGATGACTGCTTGT 
AGCOTCACACGTAAAAGAAATGTATTAACGTATCTAGGAGCTGCAGT^ 
TTGACTGAAGCCGTGCCTGTAGCCITGGGGAGGCCCATCCCCCACCTG^
CCTTCCCATTGGCTTTCTCTCCITGGCCTTTCCTGGA 
ACTCITGGATGGGACATTTTTGTCTGTCCTACAATCTAGTAA 

CCCTCATT CTTT AGAGATG CT AAAATTCTT CGCAT AAAGAAGAAG AAATTAA TTGC C CT 

TCTGTCCAAGTACTTAACTATCTGTTCCCTTCCTCTGTGCCACGCTCCTCTGTTTC 
ACTAAAGGAGGAGGAGCCGGGGACTCCCAGGCTGGAGAGCACT

CTGCACACTCAGTGGGCTGTTTCTGCTTATTTCATCTGTTCTATG 

CTTGGCTTTTTCCAAATGCTTCTArrTATAGAATCCCAA^ 

TTGAGAAAGTACAGCAGTAGACTGGGGCGTCACCTCCAGGCCGTTTCT 

AGAAGATTGCGT AG CTCTCAAATGTGTGTTCCTGCTTTTCTAATGGATATTTTAAATT CATTCAACAAGCAC CTAGTAAGTGCCTG 
CTGTATCCCTACATTACACAGTTCAGCCTTTATCAAG

AAAAGGATCTCCTTTTTTCCCAGCCCAAATTCTCCTCTCTAAAAGTC 

TTTTCTGTAAATATACATAAACTrAAAAAGAAAACOTCATGGAGTCATCTTG 

AGTGAAGATTTACCrCGTTCTGCTCAGAGGCCTTGCTGTGGAGCTCCACTGCCATC 

ATGCAACATGGATATGTATTGGGCAGCAGACTCTGTTTCXaTGAACTGC^

G T AT ATT ATC CTAAGGGAAG AT AAAGATG AT ATT AAG AACTG CTGTTTCACGGG GC C CTT AC CT CCT CTTTG CTG AAG AAT 

ATTTAACC CCACACAGCACTTCAAAGAAGCTGTCrTGGAAGTCTGTCTCA^ CACCCTGTCTTCTT AATTCTCCAAG CGGATGC 
TCCATTTCAATTGCTTTCTGACTTCTTCTTCT 
TGCTTCTTGGCTGGGGCCACTACCTCCTTTCCTATCTTTACATCT 

ATGACCTAGGGAATGAACTAGCTATGAAATACTCAGGGTTAGGAATCCTAGCACTTGT

TCATTCCTTGTCTTGATAAAGTGGAArTGGCAAACTAGAATTTAGTTTG 

GTT AAAG AGCACTGGGTCATATGG AAAAAATGTATGTGTCTC CCAGGTG CATTT CTTG^TTTATGTCTTGTTCTTGAGATTTTGTA 
TATTTAGGAAAACCTCAAGCAGTAATTAATATCTCCTGGAA 

GGAAGCCCCTGAGTCCTGAGCGAAAACAGGAGAGTTAGTCGCCCTACAGAAAACCCAGCTAGACTATTGG^ 

GACTCTGCCATGGTGAGAAAAATGTAAAATCCTACAGTGAAATGAGCAGCCCTTACAGTAT^

TTAGTGTTTGAAAAAGCTGGTCTTTGAGCGAGGGCATAAATACAGCTAGCCCCA 
CGCACCTCTTGGCTTTGTTGATGCTCCGCCAGGAA 

GTGTAAACTTCCGTTCATTGCTCTCCAGTCACATGCCCCCACTTCCCCACAGGTGAAAGTTT^ 
AGGTCTTTATTTGTATTACGTATCTCCCX3AAGTCCTCTGTGGCCAGCTGCATCTGTCTGA 

ACACACCATTTTGTAAGTTATGTTTTACATGCCCCGTTTTTGAGACT

CTGTTACAGGAATGAAGTAAAGGTCAGTTTTTTTTTC 

TAGAGGGGACGTGAGAGAAGGGCCAGGCCGGCAGGCCAACCCTCCTCCAATGGAAATTCCCGTGTC 

GGACTTAACAGGCAATGGGGTCCACTTCCCCCTCTTCAGCATCCCCCGTACCCCACTT^ 

ACCCCAGAGGCCCCCAAATGAAAGCTTGAATTTCCCCTACTGGCTCTGCGTTTTGCTGAGAT 

CTXSAGGTAGATAATGCTATGCTGTCGTTGGTATACATCATGAATT^

AAATTT AT GTTTTT AGG T AAAT AAAAACTTIT AAAAATTTGTT ATGGAATGTGTGTG TGTGTT ATT AAACATG CAT C CTGAT AG CT 
TTTAAAGCACTTTCTCCTGCGGTTACGTTCCTTCAGA 

TTAAAGAGACAAACGCTGGCCAAGTGCCGGGCTGCTCCTCCCACACAGAGGAGTGTCTGTC 

TAG CATCTGC CTC CT CTTCCTTT ATCTGTACTTT AAAAGGTTT AGGG CTTT CAAGG AAGTTGTTTTT AATG TTTTCTTTTGTTT AT 

ATTCGAAGCTCCTCTGATTTTGGTCATTGCTAAAAAGAAGGGATATAGGTTAACTAAC^

CTGGATAAAGAGAACAGGATTTAGATGGAACCCCTGGGGATCCCT 
GAGCAGCCAGGCAAGGGAGTCTCCTTTCTCAGCTTCATCGCTTCTACTGATTACA 

AGTCCAGC AG AAAAACTCAAAGAAC CAGG CTCCAGCTGCTX^TGCTAAAATCCTTGCAGCTGAAGGTTACATAGAGCT 
AGTTTGTAGTTTGGATCTAGAACCCCCCCTAGAATAGTAGATTAGCATAATT^ 

ACCCTCACAAGTTCAGTTGAGAAACTTTCCTGAAAACAAAGATCAATAAATC

CCAGTGTTTTAAAAGGGGACAAAAATGAGAGAAGGTAGTAAAATCTGTTCCCAGAT^ 
GAAGGATTTATGTCCTGATCTCAGGAGACAGGCTTCAGCAGAGTG 

TATTTTGG GGAG AAAT ATTITGGTTT CCTTCAGTCCCC CCTTG AAGCTTT ACTTT C^ TTTCACATG TT AAAAG CTAAGTTG 

GTAGTTTTGGAGAGATTTGGGTTAGAAGTTTTAAGATAAGAGATTTO 

GGAAC CAATTTGGG T ACATTGTT CCATCT CTT ATTGT ATCAGTCT^ CATTG CAT

CCCATTTCAGAGGTGGTGTAACTGGGCTTCCATAAGAGTTACGTCTTCCATATGGTGTAGGC^ 

TTGT ATGGAAAT AAAAAGAAAACAAAGGTCTATGTTTCAAG CAGTCTAT AAGCT AGTTT CTGGAGAGCAACCAG CTGAGATTTCTA 

GATTTGGGTTCAAATAATTTTAAGTTGGAGTGGACAGGCAGTGGCAATCTGACA 

TCCAGTGAAGTTTTTGTCCATAAATAAATTGTX1ATGACTTCTCCAAA 

AGGAAAGGCACGGTTTTAATTCCTAGTGATTTCAAGTCAGAA

ATATTAGAGGAAACTAAAACTTCAGGATTCAGTCCAGATAATTTTTAAAAACT 
AGCTATGCTATAGCTGTTCTGAAACATTTTTTCTCTCTACAGTCACCTC^ 

CTTACAAAACAAGTTTAGTGTCATTATATTTAACCGGATCATTTACATAAGTACAGCAAGAAC^^ 

TTTAAGTCAGCIGTTGCTGGAACTTrTCGTAAAGAATTTTATTAG CCGTG CCA 

CAGACTTACCAACAGATTTTATCTGAGATACCTGGGTAAATTTTTTTCT^

CAGAAAATGACATTATTTACTTACTGCAAGGCTA 
TTTCAAGGGTTTTATTGGCTCCTTAAAGTTAACGTTCA 

AGCCTTGATGATATAACCTGTGTTTCCAATTTTTCCAGTCCGAAGAGACCAGATTCCT 

CATAAAAATAGAATTT CT CAC^.T ACAGTTTCT AAATC CTGG AGGG CT C AGGT AAAG AAAAAG AT AAATGTTTCAAT ATTTGTG CAC 
AAAAATATACTTTACAAAATTACTGTAAAGCTGTAGATTACTTAAGAG
AAAGAATCAGCAATGCTTTAAACAAAAAATGTTC
GCTTGATTTGGGGTTACTAATTTTATGAGTCCAGTTTTTT^ 
TGAAAGTTGTCAGGAACCTGTATTCAAGAGTACTTCTTTCAGTCCTTTCT 
CAAAGAGCTTTTAGAAAAAGTATCATAATAAAGCAATTAATAGCAGA^ 
ACATGGAAATTTGGTTACTTTCTGTGGCTTACAACAATTTAACATAAT

AGATTTTTAGGAATCTCATACAATTTTAGAACACATATTAATAACATATCCATAT 
TCTCATGTTGAAATGTAATCCCCAATGCCAGAGGAGAGGC 

TTAGOlCCATCCrCTCAGTACTGCTCTCATGATGGTGAGTGAGTGCn'CTCAAGATCT 

CACTTCCTCCTTCTGCTCCTGCCATATAAGATATGCTTGGTTCCCCT^ 
CAGAAGCCAAGCAGATGCTCCCATGCTTCCTGTACAGCTGGCAGAACrc

GTCTCAGGTATTTATTTATGAAGTGCAAGAATGGAGTAATATAGAAAATTGGTACCAGTAGT 

AAAATGTGGAAGTGGGTTTGGAACTGGATAACAGACAGAGATTGGAAGAGTGTGGAGGCT 

AATTTGGAACTCCCGAAGGACrrGCrGAATGATTGTGACCAAAATGCTC 

CTCAGATGGAAATGAGGAACTTATTGGGAACTGGAGTAATGGTCGCTTTCGTTACG 
TGTGGAACTTTGAACTTGGG AG CAATGATTTAGGGTATTTGG C AGAAGAAACTTCTAAGCAGTGAAGTGTTTAAGATGTGACCTGG

CTGCTTCTAACAGCTTATGCTTATATGTGTGAACAAAGA 

AAGTTTGAAAAATTTGCAGCCTGGTCATGTGATAAAGAAAGGAAAATTTCACTAGC^ 
GACAAAGTACACAAAGGCCTTACAACCCCATGTGTCCATAGATAGAAGAATAAAAAAAGACAGTCT 
CTGTTCATGGTAGTAGAAAGGATTACCAATGGGTACCACAAAAAGTCAAGACTCATATAAA 
AATTTGTATAAATGTTTCTCrcCCAAGCTAAGGTATTTACTAAGGAAATCA^
TGTGTCAAATCCAATCTCTGCTTGAATGCTGCTTAATTAATTCCCTGAA 
TGGGCTGAGAAAGGCTTAGAGAATTAGGCTGAAGAGTGTTTTGTCAATCTTGCAATGAC^ 
CTAAAAGAAATTTATTTTAAATATAGCCAOTCTAATTTATTTCAAAGTGATGT 

ACT AT AT CCAAGAG C CAATTT ATACAAAT AATTTT CTCCTATTTAAATTTGG AAATG AAGGAAAAAAG AA C 

CCTCTCAACTGAGCACTAAAAATAGAGATGTGAAGAGCTGACGTTGGTAAGAAACCTT

TCTCAGAGCTCCATCCATTGGCCCTAGTGTGAGCAAAGTGTGCCTTTGTGGTTGTCAAATTG 
TCTCAATCCTCATATTTTCTTACCTGAGACACTTTCCTGA 
AATATTGTACCCATCACCTAGAAGAAGGCCTCAGTTCAAAT^ 
AGTATTTTAACAAAAAACAATAAATCTTAGCATAGAGACAAAGC^ 

GTCCTGACATCAGGCAAATAGAGGTGGGGCAGAGTGTTTCCTTGCATTTTCAG 
AAGAAGTATTTTGGTTTCCTTCATAACGAAATAGAAATAATGGACAAA 

ACAAAT ACAGGTGG ACT CAAGT CAG T AT ACATGCAT AGG CAAGTTTTT CTTCAAAT AAT AT ATCAACTGGTT AT ATCTAAAAGGGT 
ATTCCAAAGGGTTTTGAGCCCTTGAGATTGCTCTGAAGTAATTTTATGTATCTCT 

CTGT AACTG CAG CACAAGTTTGGCTG T ATTGG ATTTGATTT CCT AAAAAC AATGATCIAC CTTGACAAACAC GCTTTC CAGTG TG C

CACCAGT ACTCC CCT AGGC CAGAAAG ACACGTTGGGTC CTGTCTCAT CTCT CTG CAAAGGAGCTCTGGACAGG CT AT AC CTAAATT 
AACACAGGCAG ACAT AT ATT CTCTG TTTT AGTGT AG CATT ATG AATT AT ATGGTGTTTG AAGG OVACC CGGC CTTTTCAGTG AC CC 
CTCCTCCTGCCACACTGGTAGTGTGTTGGCAAAATGTCTGTCCTTAGGGAGTCCT 
TCTGCCTCTGCTCCCCTGGCTAGTCTGrcCCCACAGGTCTTGGCTGTGGCACACT 

ACCCTCTGTTTGGGTCTCCAAAGTGTGATATTTGTCTTGAGAAACACTTTGC

GCAGCrGGGTCACTGGTGCTCAGGACTCAAACTGGCTGGGAATGT^ 
AAAGCAAAATGCTGAGGGTTGAGGAAGGGAGAGCGAGTGCTCGCTGAGAAAGGAAGAAAT 
GATAAAGGGGAACGATCTTCTGCCTCCTGGGAAAAACTTTGATGAGACAGCAAACT 
AACCGCAGGTTACGAAGGGGCCAGGTGGGCAGGACTGAAGGATAACTCCATGGTA^ 

GAGTGGGCATGCCTGGGTGTGTCTGAGC^GAAACrrGCATTGAGGACAAGATC

AGCTATGCCAAACTTAGCCCCAGTTCAAGGGAAGGAGAGGGAGGA^ 
TTGAATTTGCATCACCCTAGTGGAATATGAGCTTAAAATTAACAATTATAT^ 
TTTGGCTCTGGACTTAGATTTCTACGCACCATGTCCGCAGCATAAC^ 
TCTTCTGTACTATAAATTCCTTGAGGAAGGGAACTGTG<rTTTCCAGTAAACACCCAG 

CACCATCGAATCCrCAGGAAATGACACAGTTGGOSAACCT^

GATGAGC CACCT ATGG CATGAAATG CT AT AGAAC CACTTACAAATTGT G CAG T ATACAAG T ATTTCATGGCATGGGGAAATGGTTG 
TTACTTTGTAAAGTAAAAAGAAACAGGTTTCAAAACAATGTGATATCTT^ 
AAAAGAACATTCAACAGTGTAAATTATACCCCTGAGATGTGGGGTCATGGAAATTT^ 
TCAAAGTTTCTCCTGAGACCAC^TATTTTGTTACTAG 

ATATTTTTTGGTGGGCTTCCAGTTTCTCAAGCTGCCTTGTGGGGTCCCCAGAGCT

ACCAGGGGCTCAGGGTCTCAGGGCCCATCTCCCAGTCCTGCCTCCATTAATTTCT 
CCCACCTCATCTTTCCCAAGATGCTTCTTTGTCCTCTGAGCTATGACACTGAGGATGAACGCTCT 
GCCTCCATGCAGGGACCCCCAGCCTGTGAACTTCCCTCTGGCACCCCCACTCCCCCATGCGGAAACT 
ATCCCCAACCCAAGCGGGACCTCCCTCCCCCACCAGCACCCCCATC 
NNNNNNNKNNNNNNNNNNNNNNNNNNN^^
NNNNNNNNNNNNNNNNNNNNNNl^^ 

GCACCCCCATTGTGGGATCTCCTTTCCTCACCTCCAACCCCCATGTGAGAACCTCCCTCTCACTCTCACCC 

CCTGAGTCTCTGCCCTTCCTCAGTCATCACAGTCTCTGTACAGCTA 

CGCTAATGTCAGCATTTGGGGTTGCCTGGAAACAGGA^ 

CAT AAAATTTACCATCTTAACCATGTTTAGGTATA CTGTGCAGCCATATCCAG CAT

CTGTCTGCAGGACTTTCTCATCTTCCCAAATTGAAACCCTGTCT 

AGCCCCCACCATTCTACTTTCTATCTCTATGAAACTGACTACTCTGGATACCTTGTATGCGTGGAATCCTC 
TGTGTCTGGCTTATTTCIACTGAGCTTGATGTCCTCCAGGTTCGTTCAT^ 

T AAAT CAT ATT C CATTG CATGGGTGG AC CATATTTTG TTCATTT ATCTG T CCTG AACACTTGAGTTG CTT CCGTGTTTT AGCT ACT 

GTGAATAACGCTGTTGTGGACATGGGTGGACAAATGGGTCTTCAAGACCCTC

AACrcCTGGAGCGATGAAAATTCTATAATTAATATTTTAAAGGTACCACTGTACT 
CCCACCAGCAATGCACAAGGATTCCAGTTTCCCCACATCCTCACCAA 
GGATTCATTCCACAAACATCTTCTGAAGACTCCCCATGrrCCTGACCCTCCCT 
r^CTGTGTG^TnrJVGGAGGCCAGCAAGCACTGAGACAG 
ATTTTT AAGGTG AGAAACTGAGG C CCAG G AGTGT AAAGCCAGTACCCT AAT AT ACAACT ACT AAAAG AGG AAG CC!AGGATTCAAAT
CCAGGCCTTTGTGGTTTCATAGCCTTATCTCTTAACT

G AGGTGG C ACGG TGT ACCTGTAT CCACAC CACTGGTG AGCAG CCCT CCAGCATCACACC CAGACTTTCTAAGATGCTTGGT AT CC C 
ACAGATGGGAGCCAATTTCTTTAGGTGGGGAAAGAGAATGGGAG 
GAGTTTGGGATTAAGCATAAAGGTCTAATTAACATTTATTAGGTGCTAAGTGCA 
TAATCCTC^AGACAGGCATTTCATTTCTCACCITACAGAT^

GCAAGTGTTGGTGAAAGGATATCATCCTGGeTGGTCCTACCCAGGTGCTCCATCrrTCATC 
CTTCTATGAAACAGGACACCTATTTCATAGGGTAGTTGTCGGGATTAGTGATAAAGTGCT 
TCCTATGTAAATATTGGCGAATTGTCTTAGTCTGTTGCCTGTTGCTATAATGGAATACCATAGAG 
TATGTATTATTATGATTCTGGAGGCTGGGAAGTCCAAGAGCATGGCACTGGTACCTG 


HUMAN SEQUENCE - mRNA 

CGCTCGATCTTGGGACCCACCGCTGCCCTCAGCTCCGAGTCCAGGGCGAGTGCAGAGCACA^ 

CGCGGACGGCACGCGGGGCATGAACCTGGAGGGCGGCGGCCGAGGCGGAGAGTTCGGCATC 

AGCTCCGCCAGTGGCTGATCGACCAGATCGACAGCGGCAAGTACCCCGGGCTGGTGTGGGAGAACGA 
ATCCCCTGGAAGCACGCGCK3CAAGCAGGACTACAACCGCGAG

GTTCCGAGAAGGCATCGACAAGCCGGACCCTCCCACCTGGAAGACGCGCCTGC^ 

AACTGGTTGAGCGGAGCCAGCrGGACATCTCAG ACCOST AC AAAGTC 

CAGCTCACCCTGGAGGACCCGCAGATGTCCATGAGCCACCCCTACACC^TGAC^ 

TCACAACTACATGATGCCACCCCTCGJVCCGAAGCTGGAGGGACTArc 
CCATGACGTTTGGACCCCGCGGCCACCACTGGCAAGGCCCAGCTTGTG^

GCCCCACCTGAGTCCCAGGCTCCCGGAGTCCCCACAGAGCCAAGCATAAGGTCTGCC^ 

GCACATCTGCCTGTACTACCGGGAAATCCTCGTGAAGGAGCT 

CGTATGACGCCAGCAACCTGGACCAGGTCCTGTTCCCCTACCCAGAGGACAATGGCC^ 
CACCTG^iAGAGGGGCGTGGTCCTCTGGATGGCCCCCGACGGGCTCTATGCGAAAAGACT 

GCCCCTGGCGCTGTGCAACGACCGGCCCAACAAACTGGAGAGAGACCAGACCTGCAAG

AGCTGCAAGCGTTTGCTCACCACGGCCGCTCCCTGCCAAG 

AGGCAAAGAAAGCTCATCACAGCTCACGTAGAACCTCTGCTAGCCAGACAACTATATTAT^ 

CCTGAGGGGCTACGATTTACCAGAACACATCAGCAATCCAGAAGATTACCACAGATCTATCCGCCATTCCTCT 

AAATGTCAAGATGAGTGG TriTC T lT TICCTTTTl Tl T T ^ ^ 

CTGGAGTGCAGTGACACAATCTCAGCTCACTGTGACCTCCGC CTCCTGGGTTCAAGAGACTCTCCTG CCTCAGCCTCCCTGGTAGC

TGGGATTACAC^TGTGAGCCACTGCACCCACCCAAGACAAGTGATTTTCATT^ 
ACTGCCCTCTTACTGTTTTGAGGAACTCAGAAGTGGAGATTTCAGTTCAGCG^ 
ATCAGTGACATCTGATTGGCAGATGAGCTTATTTCAAAAGGAAGGGTGG 

GATGATCACTGTGAAAATTGACCAAGTGATGTGTTTACATTTACTGAAATGCGCTCTTTAATT^ 

AAGACAGAGAAAACTTGCCTTTCAGTATTGACACTGACT

TAAGATGTAAATTGAAGAAGCCTCACACGTAAAAGAAATCTATTAATGTATGTAGG 
AGTGAAGGAAATGAATCTTTGACTGAAGCCGTGCCTGTAGCCTTGGGGAGGCCCATCCCCCACCT 
GTCCCTCTGCCCCACCCTCCTTCCCATTGGCTTTCTCTCCTTC^CCTTTCCT 
TCAAAAAACATGAGCGCTACTCTTGGATGGGACATrTTTGTCTGTCCTACAATCTAGTAATGTCT 

GTTTCTGCATCTTTTTGACCCTCATTCTTTAGAGATGCTAAAATC

TACTTGAACTCTTGCCCTTCTGTCCAAGTACTTAACrATCTGTTCCCTTCCT 

GCGATCAGCCATGGCGACACTAAAGGAGGAGGAGCCGGGGACTCCCAGGCTGGAGAGCACTGCCAG^ 

ATGGAGCTGACTACGGAACTGCACACTCAGTCXKiCTGTTTCTGCTTATTTCATCTGT^ 

GACAGGGCCTTAAAATTACTTGGCTTTTTCCAAATGCTTCTATTTAT^ 

CACTTTACATTTTTGTGGTTTTGAGAAAGTACAGCAGTAGACT

TATTACTCCCAGGATTCAGCAGAAGATTGCX5TTAGCTCTCAAATGTGTGTTCCT 
CAAGCACCTAGTAAGTGCCTGCTGTATCCCTACATTACACAGTTCAGCCTT^ 
ATTTTTTAATGTTTAAAAAGTTTCTAATATTAAAGTCAGAATATTAATAC^ 
GTAGAGAACAGCAAAAAAATAAAAAGGATCTCCTTTTTTCCCAGCCCAAAT^ 

TATTCTTCCAACACATTTCACTTTTCTGTAAATATACATAAAC^^

TGCAGTGCTCTTTGTAGCTAAACAGTGAAGATTTACCTCGTTCTGCTCAGA 
TAGGGTTTGACATTTCATTAGCCATGa^CATGG 

TTATAGATGCIAAAGTATTTTGGGGTATATTATCCTAAGGGAAGATAAAGATGATATTAAGA^ 
TGTGACCCTCTTTGCTGAAGAATATTTAACCCCACACAGCACTTCAAAGAAGCrrG 

TTCTTAATTCTCCAAGCGGATGCTCCATTTCA^

GTGGAGCTGAATTTTCTGGAAAATGCTTCTTGGCTGGGGCCACTACCTCCT^ 
TAAAATTCTGAGTGATCCAGGGTATGACCTAGGGAATGAACTAGCTATGGAAATAACTCAGGGTTAG 
AGGACTCTGAAAAGGAACGGCTTCCTCATTCCTTGTCTTGATAAAGTGGAATTGGOkAACT 
AGTGCTGTTGAAGATTTGAGGACTTGTTAAAGAGCACTGGGTC^TATGGAAAAAATGTATGTGTCTCCCCAGG 

TTTATGTCTTGTTCTTGAGATTTTGTATATTTAGGAAAACCTCAAGC^

GACCX3ACTCATTTACAACTGAAACCTAGGAAGCCCCTGAGTCCTGAGCGAAAACAGGA 

TAG ACT ATTGGGT ATG AACT AAAAAGAGACTGTGC CATGGTGAG AAAAATGTAAAAT CCTAC^ CAG CC CTT ACAGT 

GTTGTTACCACCAAGGGCAGGTAGGTATTAGTGTTTGAAAAAGCTGGTCTTTGAGCGAG 
GGAACAACTGTGGGAGTCTTGGGTACTCGCACCTCTTGC^TTTGTTGATGCTCCXSCCAGGAA 
TACTTTTTTAGTAAC1AATTCAGATCCAGTGTAAACTTCCG1TCATTGCTCT

TTTTTCTGAAGTGTTGGGATTGGTTAAGGTCTTTATTTGTATTACGTATCTCCCCAAGTCCT 

ATGGTGCGTGAAGGCTCrCAGACCTTACACACCATTTTGTAAGTTATGTTTTACATGCCC 

GTGGATCTCCTTGAGATCCTGATAGCCTGTTACAGGAATGAAGTAA 

ACATG CAT AAG AAATG TAG CTG AAG TAG AGGGGACGTG AG AG AAGGGCCAG G CCGG CAGG C CAACC CTCCTCCAATGG AAATTCCC 

GTGTTGCTTCAAACTGAGACAGATGGGACTTAACAGGCAATGGGGTCCACTT

HUMAN SEQUENCE - CODING 

ATGAACCTGGAGGGCGGCGGCCGAGGCGGAGAGTTCGX3CATGAGCGC 
Ca a .CC»GATCa»CAGCG«CAAGTACCCCGG«CTGGTGTGGGAGAACGAGGAGAAGA 
G CAAG C^GG ACT ACAACCG CGAGGAGGACG CCG CG CT CTT CAAGG CTTGGGCACTGTTT AAAGG AAAGTT CCG AGAAGG CATCG AC
AAGCCGGACCCTCC CACCTGGAAG ACGCG CCTGCGGTGCGCTTTGAACAAG AG CAATGACTTTGAGGAACTGGTTG AG CGGAGCCA

GCIX3GACATCTCAGACCCGTACAAAGTGTACAGGATTGTTC 

CGCAGATGTCCATGAGCCACCCCTACACCATGACAACGCCTTACCCTTCGCT 

CCCCT CGACCGAAG CTGGAGGGACT ACGTCCCGGATCAGCCACACCCGGAAATC C CGT ACCAATGTCCCATGACGTTTGG ACCCCG 
CGGCCACOVCTGGCAAGGCCCAGCTTGTGAAAATGGTTC

CTCCCGGAGTCCCCACAGAG CCAAGCATAAGGTCTGCCGAAG CCTTGGCGTTCTCAG ACTG CCGGCTGCACATCTG CCTGTACTAC 
CGGGAAATCCTCGTGAAGGAGCTGACCACGTC<^CCCCGAG^ 
GGACCAGGTCCTGTTCCCCTACCCAGAGGACAATGGCCACAGGAAAAACATTGAGAA 
TCCTCTGGATGGCCCCCGACGGGCTCTATGCGAAAAGACT^ 
GACCGGCCCAACAAACTGGAGAGAGACCAGACCTGCAAGCTCTTTC

COVCGGCCGCTCCCTGCCAAGATTCCAGGTGACTCTATGCTTTGGAGA 

CAG CTCACGTAG AACCTCTG CTAGCCAGACAACT ATATTATTTTGCTCAACAAAACAGTGG 

CCAGAACACATCAGCAATCCAGAAGATTACCACAGATCTATCCGCCATTCCTCTATTC^ 
Table 12

MOUSE NOMENCLATURE 
ICSGNM Notch1
Celera mCG18747 


HUMAN NOMENCLATURE 
HGNC NOTCH1

Celera hCG1780817 



MOUSE SEQUENCE - GENOMIC

GTCTGGAGGTTTTATTTGTCCCTGAGGGGGTGCCATTT^ 

AGAGAGTGGGCAGC^CCAGCATCCTGGCATAGTCCCCTTC 

GTCTGCTCTGCTTCCCGCGTCCACTGTGCTATTGATGTTTGAAAGTGGAAGGAA 

ATTCATGACAGAAGCTGGGAGAGAAAAGCAGACCAGCCTCCTAAATC 
CTGGGGGGAGGGGGCTCTGCTGCGCCTGTAGGCTC^CTTCG^

GGTCTTGTCTCTAGCCTGGCCCCAATTAACAGTCATTATTATTTCCT 

CCCTTTAAAGTGAGCGCCTCCATGGCATGCTCCAGAGCTTACGT 

ATTCTAGAGCCTGATTCCAGTTGCACCCACTGCCCAAGGTCGGGTAGCCTGTGGCCAT^ 
CTACATGTCTGTCAACAGCTGCAGCGGAGCCTTTGTGGGCCAACGATGC 
ATGCTGGAACGTGCCACGTTGTGGACCATGGTGGCACTGTGGATTATGCCTGCAGCTGTCCC

CTGJlCACCTCTGGACAACGCCTGCCTGGCGAACCCCrc CTCACT CTCACAGAGTACAAGTG 

CCGCTGCCCACCAGGGTGGTCAGGTAAGGGCTATAGGAGGTTGTGAACTT 
TATCTTTGTTGACTTGACATACTCTGGTACACTAAACITTAT 
ATCTTGGGGATTCCTTGGGAACTCTGGTCCTTTTCAACCCCAAACTATCT 

CACTGTTTCTGTGCGTAAAGAATGAGTCCTCATGTAGCCCGGGTCTTACCATCTCC^

CTCTATGTGGGGTCCAGGAGACAGCTAGGTACAAATTTTAGACTGCC!ATTCCT 

ATG ACTT ACAGT C CTCTGT ATG CAG AAAG ATGGCCACATT CTGCAACAGAT ACTGAG CATCATGAAG AT AACACTTTAGAT CTAT C 
CCTGGTTGGTACCTACATGATGTCCAAATCTGCCCTGGGTTCACAG^ 

GG AAAATCATGTCAG CAGGCTGACCCCTGTGCCTCC AACC CCTGTGC CAATGGTGGCCAGTGCCTGCCCTTTG AGTCTTCATACAT 

CTGTCGCTGCCCGCCTGGCTTCCATGGCCCCACCT^

GAGGCACCTGCCACAATGAGATCGGCTCCTATCGCTGTGCCTGCCGTGCCACCCATACTGGTCCCCAOT 

CCCTGCAGCCCCTCACCCTGCCAGAATGG AGGCACCTGCCGT CCT ACAGGGGACACCACCCACGAGTGTGCCTGCTTGC CAGGTAG 
GTTCCATCTCAGTATGCTGGCATCTAGCCTGGTTGTCCATGTCCA^ 

AATACCCTTTGAAGGGAGCCCAAAAAGAAGTTTAGAGAAGGGCAGCAGCCATGGAGCCTTAGGTGTGGTA 

TAGGGATACCAGAAGTGAGGGGACACTCTCAGTGACACCACACACGTCTATTTCTGT

TGGTGCATAACCrCGAAGCCTTCrrTTCTGCTGATAA 

GCCAACTAATACCCACCATGTCAATACCCTGAGCCTCTTAGGGCAGGGAGCCAGCCTC 

AATCCACATGTGACT CTCTGTATGTTACGGGAG CACCCTGTCATGGG CAGTCCCTGGGGTTTCCCCTTTCTCTGGGG ATGGTAATT 
GAATAACAGGGTCTTGAGGGCCTTAGGGGCCTCACGTTTTTCATTACTTCCTCTGCAGTGTT 

GCAGGGGTGGGGCCTGCAGGCACAGCCCTAGTCCTTTCTGTTGCTGCTCTAACGATGG

CTGCCGCCAACCC!ACTTTTGGCAAGGGTGGGGTTGGTCAACATACCAGGCCGTTGA 
AATAGATTTGGTTGCTGGGGGTGACCTCGGACCACTCCCCTTGCAACCCCGGGGCC^ 
TCTGCCCCAAGCTTGCAGAGCCAGCCTCAAGACTGCGTGCAGACA.CTGGCTTATACA 
GGTAGCCGACAGCTGCCTACACGCTTCCCAGCCTCCCrGGGGGGGCT 
GTTCAGCTCTGTCTTGGTGGCCACCTAAGACCTCAGCrGTTAGAACCTGTGTTGAGAG
GAGAAAGCCTTGTTCAAGGAGCITGTGAATGGAGCTGCCACCAATCCACC*^ 
AAGATGGGCAGGAGGGAAGCAAGGACTGGGTGGGGTCCCTTTACCrTAGCCGGTGGGTCATCCCT 
CTGTGCTGTGTGGAGACATGCTGTTTCAGAGTAGGGAGAAGTGAAAATTGTCCACCCAAATTC 

G CACAGTTATACCCACCATTTG^MCTCCCAGGT AGCTATTTACCAGGTTT CTGAAGAATGTATGAACTAGGGTGGACAT AAAG AAC 

TAAGATGGGAACAGAACTTAAGCCTGAGAGCTGACTGGGAATTAAAGCGACTCT

CAAGCCCGGTTTCCCTGGAGGAGGCTGTATAGGCAGAGAAGGCTT^ 
CCCCAGTGAAATCCTTCTACACCTGCTCAGGGCCCCTCCAGX3CT 

G CACAGG C AGGGAGACAAG CT ATTTGGT CTTGCTG AC C CCTGGG CGGG CACT AGG CT CCAAGTTGGTTCCTCC CTCAC C CAGAT AA 
GCCTTTTCCAAGCCTCCAGCTTCTTGCTGTGGTGCAAGATGGTGGG^ 

CTCC^AGGGCAGGAGGGGTTCAAAGGCATCCCCAGAGATGGTGTTAATT

CTTTCTGGGTTGATCATTGACCCGGCAGGAGTTGAGGAGATTAATAGCTAACT 

CAGGCCACACCTTGGGGCCCAGCCCCAGGCTAGATGGGCTTGTAGAAGAGAGATGGTTGCCCTC^ 

GGTTCTGACTGAGCGGGCCTGCCTGGATCCCACCACTCTGGTTTCT 

AACCTCCAAGGACACTATGTTCCTAGATCCCTGTCTGTTGAGTCAGCTCTTTACATAGCGACTTTCT

TGGAC!ACGGGAGTAGGGATGCCACGGAAGGACAGACAGATGCCTTGTCCAGAGCAGAACATTGC^ 
TACCGGAGCCCACCCGAGGCTCCTCAGCAGACTTCTGAGGTCAGGCTAACCT 

CAG AACTG TG AAG AAAATGTGG ATGACTGTC CAGGAAACAACTG CAAG AATGGGGGTG C CTGTGTGGACGG CG TG AAT ACCT ACAA 



GGTAGGAAGGTCTGAACTCTGTTCCACGCT ATGAAAGCCCACACT CCAGGGTGGCTGGTAGCATGGG AGCAGCAG CTTC

TGTGCITCCAGTTGGAGTCAGCAC^GCTC^GCATC^GATGGTTCTTCT 
CACTGAGCCATACAGGAGCTAGCATTCTGGGCAAGCTACCAGGGATGAATAC^ 
AG CTCATCTTGTAG CTGAGGTTC ATGTTCCTGTGACCCCTCIATAGGTCAGTA 
CCCAATGCCTGCCAGAATGGCGGAACCTGCCACAACACACACGGCGGCTACAACT 

CTGC^GTGAGAACATTGATGACTGTGCCAGTGCCGCCTGTTTCCAGGGTGCC^



CACCTGGGGAGCCATGTATGAGATATATATACAGATATATGTGTGTATATACATATATATATATATATATATATATATATATATAT 

ATATATATCAGATATAGATACAGCTCAGATGTAGATACAGGGCTGGAAATGCAGCAC^ 

.*/wi/«i«^«n^p«iPit * ii/^^^i~r^rn* r^rv^r , n/^/T!^nr'r^rrr^iynr:r"Pj&Iir^ 

AUVjA 1 rALTA'IAxM. J. A_rt/4 A V-V-V. * ww~ *#w»™_. *. w * WMW * «~ - ~~ 

CAAAGGTT AAGGCCAT CTTT AGCT ATAT AG CAAACTTG AGTT CAG CCTCGAAT ATGTGAG ACCCTG TTTT AGGAAAGGG ATG GGGA
GAGAGAAAGCTCTGTCTCTTTGGCACCTCCAGCAATGACCCAGTAAGCCTTGCTGGACAGAGGC^
GGGATGCTTGAGTGAGTCAGAGAGATTCATTCOVGGGTGTra 

GG AAAAGCAGCACCCACTCACCGCAGCTG CCTTGTCTAGGTCTG CTGTGCCACCTCAACGATG CGTG CATCAGCAACCCCTGCAAC 
G AGG GCTCCAACTGTGACAC CAACC CTGT CAACGGCAAAG CCAT CTG CACCTG C C CCTCGGGGT AC ACAGG G C CAG CCTGCAGC CA 
GGACGTGGATGAGTGTGCTCTGGGTAGGTACAGATGCAGTGGCACA
ATCTGTCCTGGAGAACCACCATTTTGATGGCCTGGGCTGGGGCA 
CCGTGCCTCTTTCACCAGGTGCCAACCCTTGTGAGCACGCAGGCAAATG 
CAGGGCTA(^CGGGACCCCGCTGTGAGATTGATGTTAATGAGTGCATCT 
GATTGGGGAGTTCCAATGCATATGTATGCCAGGTATGTC1TATGCCATTTCCCACCATGGGG 
AATG AGTCAAGGCTG CAT AGCTTCAAGTAACATTTCTGTCATGGTTGGTAG CATCTGGGAAGC CGAGG C C AGG AT AACAGG AC AAA
GTGGAGAATGTACCCAGAGTTGGGTGCATAGACTCAGTAGATATAGACATGAACC^ 

TATGAAGGTGTATACTGTGAAATCAAGACGGATGAGTGCGCCAGC^ 
TGAGTTCCAATGTCAGTGCCCCAAAGGTGAGGCCACCTACTT^ 
AGGGAGGACTTGAGTTGACTAGGGCGTTTTCAGGCT

GGCACCGGAAACCCTTTACTAAAG<3AGGGAAGGATGGGCAGGAGCTGC^ 

GT CCTCTTCCTGATGGT CATGGG AGT AAG GACAT AG TGTGGGG ATCTGAC CT CT ATG CCTGGGAAGGACTTGG AGGAACTG AAG AG 

GGGACCAAAGTCCTGGGAATCTGTCTCrTGCCTACTCCAGGGCTG 

AGACTTGGTCCAG CTAGGGG AGTCCCATTCACAGCCTAGTGCIAG 

TCTGTTGGGCCAAGK3AGGCAGCCACTGAGCAGTGCTAAACAAGGAGCTGA

CTGGTTGAAGCCTGX5AGGTTAGTGGCCCCTGTTATCAAAGTACCCTCT 
GTGGGTAGCAACCAAGGGGTTAGGATACTGTCAGATATAACCATCTCGTCTTCCCrc 
GTATGATX3TGGATGAGTGTGCCAGCACACCATGCAAGAACGGTGCCAAGTGCCTG^ 
CAGAAGGTACAAGAGGCAGTGC^AGATCTGGGGGAGGGAACTCTTGAGGCT 

ATGTGATACCCCACATCCIATCGATGACCAGGCAGTGATCCTGGGCCT 

GGGAGTAACTGGCCCTGAAGCTCACTAATGTCCACCTTTCCTGCCATCTAGGTTAC^ 

GAGTGTGACCCTGACCCCrGCCACTATGGTTCCTGTAAGGATGGTGTGGCCACCTTTACCTGCCTGTC 

C CATCACTGTG AGACCAACATCAATGAGTGCCACAGCCAACCGTGCCGC CATGGGGGCACCTGCCAGGACCGTGACAACTCCTA.ee 

TCTGCnTATGCCTCAAGGGAACCACAGGTGACTGGCCGGAGCrcGCAGTGGGTGGA

ACATGTCATGCTGCCCCTAGGGCCCAACTGTGAGATCAACCTGGATGACT 

ACAAGATTGATGGCTACGAATGTGCCTGTGJVACCAGGCTACACAGGTGAGTGGCTGAAGGCAGAGGTC^ 
TGGGAGGGACIX5TGGGGCTAGCCAGGGTCCAAGGGCTGGATGTCACAAGCTGG 

GTAGAGGTATCTATCATAAAGACTGGGTGGCCTCCATATGCCTCCTTTCCCCTGTCAGTGTCAGCCCT 

GCCCCAGTAGCCTGAAGTGGGGTGGCCAGAGCTCCGAGGCCAGGCTTTTC^ATATGTGTT^

CTGAGCACCGAGGACATTGTCCACCAGAGGATAGAAGAGGATGGGTGGTCTCIAGGCT 
GTTGGTAAGGTGTTGACCTAGCATGTGTAAACCCTGGGTTCAGTCCTCCTGCC!ATGTTAACCA 
TAACCCTAGCACTCTGACATTGGAAGCGAGAGGATCAGAAGTTTAGGTCATTCTC^ 
CACATGAGGCTCAGAAGCTACTGGCACTACGGTCATGTAACTC!ATCGATAC^ 

TTAGCACAAGGGGAAACTTAGACCCAGCTAGGTCTAGCTCTAGGAGGGGTAGGCTCTG^

CCCACCCACAGGAAGCATGTGTAACGTCAACATTGACGAATGTGCGGGC^ 
TCGCGGGCTTCACTTGCCGCTGCCCCGAGGGCTACCATGACCCCACGTGCCT^ 
ATCCACGGAGCTTGCCGGGATGGCCTCAATGGGTATGGCAGACTGACCTGGGAGGGACTA 
GGGTGAATGGGTGCTCCTGTGAGACCAGTGATCTGAGGTCAGCCCCT 

TAGCATCAGCTCAGGATATAGACTCTAAGCACACTGTGGCTGCTC

GTCTGTGGGGGACGAGGAGATTACTAAGGTCTCCX3TTA 

GTTCCAGTCTGGAACCCTTGCACGGTCnTGGGCCAGGTGGGTACCTCCAGCCCAGTGGAT 
CTGTAGTTTAGCAGGAGAGCTCTGAGCCCAGGCTACAGCTCAGGCCTCTCCCTAGGTACAAGTGTG 
GG AACAAACTGTGACATCAACAACAACGAGTGTGAGTCCAACCCTTGTGTCAACGGTGGCACCTGC 

CGTATGCACCTGCCGAGAAGGCTTCAGTGGTAAGTGTGGCGGGTGACACAGC

CACCTGATGAGCTAGCATTTCTCAGGCAGAGATCCTAGTTCTCrrTGAGGAAAGGCATAG^ 

GCTCAACTTCCTTACAGGCC CT AATTG CCAGAC CAACATCAACGAATGTGCCTCCAACCCCTGCCTG AACCAGGGGACCTGCATTG 

ATGATGTCGCTGGATACAAGTGCAACTGTCCTCTGCCATATACAGGTACGAAC^GCCAGGGA 

AGTGGAGCTAAGAGTATATGGGCTTACCAGGCACTGAAGAAGGTTCTGTTCCTCGCTGAGGGTCTTAGGTCCT 

TAGTTAAGCAGGAAGGCTCTGAGCCCAGGCTACAGCCCAGGCCTCCTGGGCCTGGGGATACT

TGCCTAGGAGGAAGCTCCAGGCCGTGTGTCCGGGTGTTTATCTGGTATC 
GTCTGGGAGGCTGACTGGTAGGTACTGCATGGATCAGGGTCCACTT^ 
GCTGGGCCTGGGATTCCTTTGATGGCAGCTTCTTTGTGTGAGGAGA 
ATGGGCACTGCCTGAGGAATGGCGAGAAATAGACCCCCTGCAGGAGAGA^ 

CAACCCTGCIAGCTCCCAATACTGCCACAGAGTACrGGTGGCrrCATGCT

GAAGGTGGAGGCCCTTCCAGTGTTCCTATGATGCCTGGCAGGATGCTGTTTTCrTr 

CTGTGTTGGTGAGTGAGGAAACTGTCAGCAGCCAGGGACACCCC^ 

GTCAAGCCGCGCTGACCAGCTTCCTGCTTCCCTTTGCACAATTGTCGGAAACT 

GAAACCGAT AGAGAGAGAGG CCCTGGAGAGTCTCACCAGTGGGCTGGGG ACATGGTGGG AGACAAGG AAG ATAAGGTTTCCT CT 

TAGCTCCTGGAGCCAACCATGGCCATAAAC^GGAAAGCTAACTGGGATTGCAAGGAGAGTCAC

AGCGGTTATCAGATTTCACTGCCCTAGTGTGAGGCTGCACCCCATGTTCCCCAAGTAGGATATGCTTGG 
TGTCTCAGGAGTGAGCCTGCTGTGACCTAAAGCTTCAGAATCAGCTCAAGGATTTTC 
AGGCTCAACCACCTGTTATAGTAGGGTCTTGTTTTGAGGTTTTCTGGTTGGGCAA 
ATGAGAGATGGCTTCCGGGACTACATAGATGGCTCATGGAGAAAGAGTGCTTGTG^ 

TACCTCTAAATGCCCATAACCCCAACATTTGGGAGCAGAGACATCTTGAGAGCT

GCATCTGATTCAGTGAGACTCrGTCTCAAAGGAATAAGATGCGGGGTTGGTAG 
CACTCTTACACACCTGCATGCAATATACTATGCTCGTTAAAAA 

TCTAACCTCCTGTCAGTAATCGTCATTCCTCCCGGGATCAGAGTCTGGCTGGCCAATCATACAGCATOT 
CCCCCCTTTA.TGGLAAGACCCTATGGGCCAGGCTCACTGG^ 
GTGTTGGCCCCATGTGCTACCAGCCCCTGCAAAAACAGC^
TCCCACAGGCTGGCAAGGTGAGGCTTGC

CTGTGGGGCAGTCTCGAGAAAGGAGCCCATCAATGCTGAGCTACCTGCT 
CAGGAGATGGTCCTTCCCATTCCAACTCCATATCCTTCTGTTGCCCAGAGGTCAGGGGA 
AGGACAGAGGCTGAGAGGGTGGGGGCAGGTAACCAAATGAGTTTTTTGTCT 
TGTGTGAAAAGCCCATGTCGCCATGGGGCCTCCTGCCAGAACACC

TCGCAACTGTGAGAGTGACATCGAATGAGTGCCGCCCCAGTAAGTGGTACCCAGCT 

CCTAAGCACACATGACAGGGAGCTGCTGAGAGTCTGAGGTGGAAAATAGCCCTGGTGCCCGGCA^ 

TGTGCCCTCAGCTTAGTCCACTTGCCTGTGCCTTAAAAAGAGACAGGCCAGAGGC^ 

AGTGTGTGCTT CTCACGGG CTTTGCTG CT AG ATTTCCCAGCCCAG TTG AGTGAT ATGG AG CAG AGG CAC CAGGCACTTCAAAATGA 
TGGCATGGGG C CTAGTTGT ATT CAAC CTTGACACTCGG C CCTTGTCAGGAC C CATTGAGTTGTG TAGGG CTTTAG AG ATGC CCTG C

ATCCTGAGTCAAAGCTAAAGG CAGAGAGAGATTGAX5GX3AGGTTCTGCCTGG CACCATCAGTGTGTGTGTAGGGGAATGGGCTTCCC 
TATGACTGCCTTGGGCCTGAGGGAAAAAGTGTATGCGGAGCAGATTCCAGGCTCAGTGCTGGGATC 

ATCAAGGCTTGCTGTGAGCACTTG CCACCCCATACCACCTTCTCAGCATCAGGT CCCCG AAGCZATGGCCAGTGGCCACCTAAGG CT 

CATCTTTCTGTCCCTCCAGACCCGTGTCACAATGGGGX3TTCCTGCACCGATGGCA 
CTTCCAGGGTGCCTTCTGTGAGGAGGACATCAATGAATGTGCCAGCAATCCCTGCCAAAAT

ACAGCTACAC^TGTACCTGCCCCGTGGGCTTCAATGGCATCCACTGCGAGAACAACACACCTGAC^ 

GCCTCAGCTGAGACAGGGAAAGAAATGGACTGTGGAAGACACAAAGTCTTGACAGCAT^ 

CCCAGTGGGCATAGAGCACTAATTGAACTGAATCCAGCGT^ 

CCAGTGACTTAC4AACAAGGAAGTTTGTGCAGAGGAATTGGGGAACTAGCCTC 
CCCCTTTTGCCCTGGTTGCCAGGAAGCTTAGAGTTAAATCAATCTCAAGCCT

TGCCCTTGATTTTGTTTT T CTCAACCACCACGTGTCTATATGCTATAAAATCT 

TGGTACCTGTGTGGATGGTATCAACTCCTTCACCTGTCTGTGTCCACCTGGCTTCACG^ 
AGTGTGATTCACGGCCCTGTCTGCACGGTGGTACCTGCCAAGACAGCTATGGTACTTATAAGTG 

GGTCTCAACTGCCAGGTGAGCTGAGGTTACAGGGTATGACCCAG

ACACTACCTTCAGAGGGCCAGGGACACAGAAGCAGAGAGGCTCCCTGGTGTCATCATTGGTTGATGAGCCAGA^ 
CTCTGGGAAAATGCCAAGTTGGGTGCTTCTGAGGGAGACAGGTGGGCATGGGTGC 
GGCCATCTGTCTGAAGCAGCCTC-GGGCAGTGCTCACTTCTGGTGGAGGATGT^ 
TTGAAAGAGAGCAAGGAAGACGTGGAGCCCAGGCCTATGTTGTTCGGAGGGCG 

TTAGGTGATATTCTTTCTGAGATGGTGGGGGTGGGGGTGGGGAAGTCAGAGCATAAAATGCCTTATTCCA^

GGGTCATGACCCCAATAGGGGTCATATATCAGATATCCTGCATATTTTTGCTACATC 
CAATGAAATAATTTTATGGTTGGAGGTCACCACAACACGAGGA 

GTGCTTCTCAGAGCCAGCGAGGACCAGGGCAGGAGGGTGCCCTAGCAGTCAGAGTTGGTCAG 
CCTCAGAACCTTGTGCGCTGGTGCGACTCGGCrCCCTGC^ 

GTGCCGCAGCGGCTGGACTGG CGTCAACTGCGACGTGCT CAGTGTGTC CTGTG AGGTGGCTGC ACAGAAGCGAGGTAACCAACCGG

CCGGGCCCACCTGTCCTTGCTCAGGCTCAGCTTAGGCCAGGAGTGCCCCGGTGCCT 

CGAGAATCGGAG CCCAGGCCTCAG CTCCTTCCCGTCTTGTCCCCACAGGCATTG ACGTCACTCTCCTGTG CCAGCATGGAGGGCTC 

TGTGTGGATGAGGGAGATAAACATTACTGCCACTGCCAGGCAGGCTACACGGGCAGCrA 

ACCTAACCCCTGCCAGAATGGAGCTACCTGCACTGACTATCTCGGCGGCTTT^ 

GAGGGT AT CAGCGTGAGTGTTGCTGCAGGGGAGGG CTTTTGACTGGG AGCTAGTG CATCATGGCAGACTGAGCACATACCTGTG

GGATCCGGTCACATTACAGATGTGCTGAAGGAGGAGAACTATGCCAAGCAGTGGCTTAGGGCACCCCTTTCCTATATC 
GTACCTTAATACCTGCTCCAAGACCTCATCCCTGGGGGACCAGGTGTTGCCCCACACACACACACCT 

CTGGGGTCT ACTGGGTGGAGCCCTCG CTGCAGGGTGGG ACAGTGTAGTCT AACTGGGT AT ATGG CCTGCATGG CAAGTGGAGG ACA 
TGATACCACAACCCCAGCTAAGCTGTGACCTGTGCCATTTCTACAGTGTG 

TCAACGAGTGCCTGTCCCAGCCCTGCCAGAATGGGGGTACCTGCATTGATCTGACCAACT

ACACAGGGTAAGGGTAGGGACCTGGATGATGGGCAGGATCCAGCCAGAGATCACTCAGGTGATTGAGAGCT 
ATCTCAGAAGGGAGGTGGGCTCCAGTGGCACTGTGCTGCCTCCCCCGGGCCCCAGGTCCT 
GGACAGAGCACAGTCAGCATCATCCGACGGTGTCTCTGGTCACTCTGCAGGTGT^ 
CCCCCCTTGACCCrcCCTCCCGAAGCCCCAAGTGCTTCAACAAT 

CCACCAGGCTTCGTCGGGGAGCGGTGTGAGGGTGATGTCAATGAATGTCTCTCCAACCCCT

TGTGCAGCGTGTTAATGACTTCCACTGCX^GTGCCX3GGCTGGCCACACT 

GGG CAGGGTCAACAGGGATGGCTCCTTACTGTCCCTTCTTCACACAGGACGCCGCTGTGAGTC^ 

AACCTTGCAAGAATGGGGGTGTCTGTGCCGTGGCCTCCAACACCGCCCGTGGATTCATCTGTAGGTGCCCTGCGGTAGGTGTCCCT 
GGCCCTGTGCGGCTGCAAGGGTGCCAGGTCTGAGG^TGCCTGAACCCTGACCGACTCT 

CACTGTGAATAGTCAGCCTGAAGACCTCTAGCAGAGGCTAGGGTCCAAGCCACATGCTTCCTCX3TG

TAAAAC CCGAG CGGG CACGATCCATACATG CCTTGCTCCCAG TGCTCTGCAGCCTCAGGCTGAAAGATTGCCTTGTATT^ 
CCTTAGGCTTCACAACAAGACCCTGTTTAAAAAGTTTATTTCCTGCTGTGCT 
TTGCCA.CTGTGTCTTAGCCCAATTTTTTAAATAGATGTTTATTTTAGAGCGGTTTTAGGTTTGCA 
AGGTCCCCACCCTGCAGGCCGTTCCCTATGTTTGCTGCCTTGCGTGGAAGGTGTTGAGCTGGCACTAATATGCAGTATT^ 

GGGTGTTGACATCCAGGGAT CTGT ACACAT ACAATG TACCAG CAT CAG C CTTGGGTTT CAT GG CTCT AAAAACTC CT AGGGTTCTC

CCCATTCACCTATCCCTCCAGTTTGAAC^CCTTGGCAACCCTGACCATCCT 

AAGG C CAT AGACACCTTT CACAGG C CAC AG ACTGGGG C CAG GT CTTTG CTG G GG TG TGCAT G GTGT C CCTGATTG AATGC ACCAT C 
CTCTCTCCACCACAGGGCTTCGAGGGTGCCACATGTGAGAATGATGCCCGCACT^ 
ATGCATCTCGGGCCCACX3TAGTCCCACCTGCCTATGCCTGGGATCCTTCACCGGCCCTG^ 
GTGTGGGTAGCAACCCCTGCTACAATC^GGGCACCTGTGAGCCCACATCCGAGAACCCTTTCT
TTCAACGGGCTACTGTGCCACATCCTGGACTACAGCT^ 
CTGTGAGCTGCCTGAGTGCCA.GGTGGATGCAGGCAATAAGGTCTGCAACCT 
GCGACTGCTCCCTCAACTTCAATGACCCCTGGAAGAACTGC^CG 
GACAGCCAGTGCAACTCGGCCGGCTTGCCTCTTTGATGGC^ 

TTTCITTAGAGTCCCAGGGCTCAGGATGCTACGGGAGGACCTAACCAAACA

TGCGAGTCGCCAAGCA.CATTTCCCAGATCTGGCCTATTCAAGGTATC 
CTATACCTCCTCAGCCCCCTGTATGACC^TACrrGCAAGGACCACTT 
ATGTGAGTGGGATGGCCTAGACTGTGCTGAGCATGTACCCGAGCGG 
CCCACCAGCTAC^^.a^-CTCCTTCC^CTTTC^ 
CAAGGCCAGCAGATGATCTTCCCGTACTATGGCCACGAGGAAGAGCTGCGCAAGCACCCAATCAAGCG
CACCTCTTCACTGCTTCCTGGTACCAGTGGTGGGCGCCAGCGCAGGGAGCTGGACCCCATGGACATCC^
TCCTGCTGTTGTGGGCTGTTTCCCAGTGTGTCCCTGGGTTCTCTAACCAGCTTA^ 
TTCACCTTATTTTATAGTCAGAAAGCACCTGGGATGAAGAGGGCATCCCTCTGATC 
ATGTGATTTTGAGGCCCCACTGTGTACCAGGTGTTGG 
CTAGACCAATCCTGACCCTCAGTCACTACAGCCCTTATCCCTACTACAGAGG

AGCCTGGAAGGCTGAGGAGCCGAGGGAGGCACAAATAAGAGCATCTCCTGGATGCT 

GCACCAGCCAGGCATTTGGAGGTTCCAGGAGCCTTCG 

TTCCTGGTCCTGGTTTTCTTGTCGTGGGACTGT^ 

AGAGGCCAGTGACTGAGAGTTGGTGACTTGGCTGGCATGAGAGAGCTGACTCT 
ACCATAGGGAGTCACAACTGCATCCTTGCAGTTCTCTGGGGCCCCAGGCA
TCTTTGAAGTAAGGAAAAGGGGTGCTGTGCACCTAAGGGG 

GCAGCAGGGAGAGTGTCCCATTGCGGGGCTGTCGTCTGATAAGCCCTGTGGGTCAGGGTGATGGGCTA^ 
GTAGGCAGTGTACTGGGGTGTATCTAATCTCACTGTGCCATGTGTGTTCTCAGCTCCATTGTCT 
ATGTGTGCAGTCATCCTCGCAGTGCTTCCAGAGTGCCACCGATGTGGCT^ 
ATATTCCTTACAAGATTGAGGCCGTGAAGAGTGAGTAATCAGGGGCTGGAGGGATGGCT
GCAGAGGACAAGGGTTCAAATCCCAGCACCCACGCAGCATCTTATAACTATGTGTAACT 
GACCTCTGCAGGCACCAGGCATGCACAGGGTCCTGACGTACATGCAGGC^ 

TCCCTACCCCGACCAGCCCTGCCTCTTCAGGGATGGGTTCATTGGGTGGCGGGTCACAGGCAGGCGGTCT 
GCTGCCATTCCCAGCTCAGGAAGTGCTTTTCTGGAA 

TTGGATATTATTTCAAAATCATTACTAGAGTTTTTTTGTGTGTGTTATTTT^ 
GAAGTGTTGAAGTTTGTATCTTGAAAATCCCCCTAAATCCTTTGTCTTAACAGCrCAA 
ATCCTCCTTCTCGAAAGGGGAGAAGTGAGCACTGTCTCCAGACAGATCAGCTGGTGCA^ 
CTGATTAATCACGTAGAAAATGACCITATTTTG^ 

TCCGCCCCCCGCCAGCCCACAGCATCACCACGCCTGACGAGGGGTGCTTGCCTGCCGCCCCrrGCCCX3

TCCGCTGCCCTCGCAGCTGCACCTCATGTACGTGGCAGCGGCCGCCTTCGTGCTCCTG 
CCCGCAAGCGCCGGCGGCAGCATGGCCAGCTCTGGTTCCCrGAGGGTTTCAAAGTG 
CCCCTCGGCGAGGACTCAGTCGGCCTCAAGTGAGTGGACACTGCTCCCACTGTGTGTGG^ 
TGCTTAGCTCCAGAAGGCCCATGGGCCCATCCATCCCT^ 

GTGGCTTCAGGGCATCTCCTGGAGTACCTGCTCAGTCT CCCTACCCCCATACCCAGGCCC CT C CTCAG ATGGTGCTCT

GATGG ACGACAATCAG AACG AGTGGGGAG ACGAAGACCTGGAG ACCAAG AAGTT CCGGGTGAGTCCCGCAGG CTCCCAAGCCCCCC 
CTGGGTGGCACCTCCTGCCTTGGCCCCAAGTGATGAGGCTCTG 

TGAC CTG AGTGAT CAG ACTG ACCACAGGCAGTGGACCCAGCAGCACCTGGACGCTG CTGACCTGCGCATGTCTGCCATGGCCCCAA 
CACCGCCTCAGGGGGAGGTGGATGCTGACTGCATGGATGTCAATGTTCGAGGACCAGGTAAGGCCA^ 

TCCCTAGCATAGACTTAAGAGGCCAGTAGGACTTGAGGAGGGCTGGGTCACT

CTGATCGTCTGGCTCCTCCTGCCCACACAGTATACTTTTTCATGAAATGCTGCITT 
TTTTTTGGTCTCAGTTTTTAGTATGCTGGGATTATAGGTC 
ATAGGTGTATATAAAGATATCTTTTATTTTATC^ 
GTGGGGAGCCTGAAGAGGCCATTGGATCCCCTGAAGCTGCAGTTACAG 

TCAGGTCCTCTGGAAGAGAAGCCAGTGCTCTTAACCACTGAGCCATCT

TTGAGTGTAAGGGGCACACATGTGCCATAGTACACACGTGGAGGTCAGAGGACTACTTGTATGAGTC 
ATGTTGGTCCCAGAGTGCCTTCCAGGGAGGCAGCACTGC^ 

CCAGGG CT ATGGTTG AATGTTG AC CCTCC CTG AGG CTGAC AC CAT C CTT CCCACACCT ACCCTTGTTTCTTG TCCTCT CTGCAGATG 
GCTTCACACCCCTCATGATTGCCTCCTGCAGTGGAGGGGGCCTTGAGACAGGCA^ 
ATCTCTGACTTCATCTACCAGGGCGCCAGCTTCCACAACCAGACAGACCGC^
CTCTCGTTCAGATGCTGCAAAGCGCTTGCTGGAGGCCAGTG 
CAGCAGTTTCTGCAGATGCTCAGGGTGTCTTCCAGGTACGCAGCACTGCAT 
ATGGACCCTGCAATCTCnAATTCCAAGTGTGACTGGGCAGGGAAAC^ 



TACTCTCAGTAAAACCCTCCCAATCTCAAGGCTATGGCCITGACTTGAGGTCTCCA

CTACCCCATTCAG CTGTGGCTGC AGGCATAGCCCATGGCGGCAG AAAGAGAGCTTTAGAGGCTGCTGGTC CTGGCCT 
TCTCCTGGCTTTGCCTAGAAGCCTGGAGCCAGCCTGGGTGGATT^ 
GTTGCTCCTGCATGTTGCCAGCC!AGTGACCCCTGCTACCAGACCTGTCCTCT 
ACCAGGCATCAAAAAGCTTGTCATTTAATCATTTCTTAACTT^ 

GCAATGTACACCTTTTATCCCAGCACTCTAGAATCTGAGAGGATTGTGAGCATGAGGCTAGCCTG

TT CAAAAT AGCAAAGAG AGGTTG AGACCAGCCACAGGTTATGTTG ACTG CCTGGTGTACAGCTCTCCACATX^CCTt5CCTTGCTTG 
CATAGACAGATGTGTGTACGTTTTGTAGGAATCCAGGCAGTGCTC 

CTTAACAGTTCCAAGTAC^CTATGGGCCATGAGCGGGATTCCTAGCCTGGrrACTCrCCCCT 
GATTTCCTCAACCTGGAGGTAGTTCTGGCAACCATAACCAAAAGATGAGGTCC 

AAGAGCCCCAGCTCTTGCACATGGAATGTAGCTCCTGCCTTCCTCAGGGGCTCAGCT

ATACAGGCTTGCATTTAC^TCACCCTGCCTGAACCCATCCCTGCCTTCCAGATCCTGCTCCG^ 
CCGAATGCATGATGGCACAACTCCACTGATCCTGGCTGCGCGCCTGGCCGTGGAGGGCATGCT 
CTGACGTCAATGCCGTGGATGACCTAGGTAGGCCCGAGCTACAGCCCAGCTACCGAGGACATCA^ 
CCCAACCCACTCTTCTCCTCTCTTTGCITCCTrTCCAGG 
TGTTGTGCTCCTGAAGAACGGAGCCAACAAGGACATGCAGAACAACAAGGTAAGCAGACGGGCTGCTTAACCGC^

GGGACTCCGGGG CACCCATACGTTGACCACTTAGAGGTTTAAG AGTGTCTGGGACAACAGCAG CCTAAAAT AACCCGTGCTCTTGC 
TGGAGAGGAGCTGAGGCGTAAGCAGCTGTCCCATGTCACGGTTACTTGAGCCTTGCCTTCTGTCCCCTTCTGT 



ACCACTACCATGCTCCCTCCTTTGGGATAAGAAAACTGCTGAGCTGGGCTGCGTGTCT 
TGCAGGGCTAGTTGCTTGATCAGGACTCTGCTTAGT^

GGAAATGGAGCAGGAGTCAGGTACCIGGAGTTGCTGTTGGGCACAGGTTCTCC^ 
AGCACCGAGAAGCTGAGGCAGGACGATACTGAGTTCCAAGTACGGCTGTGGGCCCTGAGTGAGATTCCTAGCAACATAGGAAGG^
CCACTATAAATACTCTCTGAGCCTGGTGGTCTAGGATGCTTGGGTCTAGATAGATCTCTCCAGATAAGAGAAGAGA 
ACCTGCTTCCCTAGCCCCTCCCCCCAGGCCATCCATCATCAATGACTCAGCAAATTCTACACCTGCCCGCT 
TGTCCGTGTCGTACAGGAGGAGACTCCCCTGTTCCTGGCCGCCCGTGAGGGCAGCTATGAGACTGCCAAAGTGTTG 
TTGCCAACCGGGACATCACGGATCACATGGACCGATTGCCGCGGGACATCGCACAGGAGCGTATGCACCACGATA
TTGGATGAGTACAACCTGGTGCGC^GCCCACAGCTGCATGGCACTGCCCTGGGTGGCACACCCACTC^ 
GCCCAATGGCTACCTGGGCAATCTCAAGTCTGCCACACAGGGCAAGAAGGCCCGCAAGCCC^ 
GCAAGGAAGCTAAGGACCTCAAGGCACGGAGGAAGAAGTCCCAGGATGGCAAGGGCTGCCTG^ 
CCTGTGGACTCCCTCGAGTCACCCCATGGCTACTTGTCAGATGTGGCCTCGCCACCCCT 
ATCCATGCCTCTCAGCCACCTGCCTGGTATGCCT^

CAGCACTGGCTGGAGGTAGCCGGTTGGCCTTTGAGCCACCCCCGCC^CGCCT 

GTGCTGAGTACCAATGGCACGGGGGCTATGAATTTCACCGTGGGTGCACCGGCAAGCTTC 

GCTCCAGAATGGGATGGTGCCCAGCCAGTACAACCCACTACGGCCG 

TCCAGCATAGCATGATGGGGCCACTACACAGCAGCCTCTCCACCAAT ACCTTGTC CC CGATTATTTACCAGGGCCTGCCCAACACA 
CGGCTGGCAACACAGCCTCACCTGGTGCAGACCCAGCAGGTGGAGCCACA

ATCACAGCCACACCTCAGTGTGAGCTCGGC^GCCAATGGGCACCn^CCGG^ 

TACAACCGCTGGGCCCCAGCAGTCTGCCTGTGCACACCATTCTGCCCCAGGAAAGCCA 

ATGGTCCCACCCATGACCACTACCCAGTTCCTGACCCCTCCTTCCCAGCACA 

CCACCAGCTGCAGGTGCCAGAGCACCCCTTCCTCACCCCATCCCCTGAGTCCCCTGACCAGTGGTCCAGCTCCTCCCCGCAl^ 
ACATCTCTGATTGGTCCGAGGGCATCTCCAGCCCGCCCAC^

ACAGAGATGTGGGATGCAGGACCCCAGCTTCCGTTCCCAAGCCCTGTTGGGAGTCCTTTCCAGTC 

AAAGG AG C CTTTT AAAAAATGTTTTT AT ACAAAATAAG AGG ACAAGAATTT C GATTTTTTTTTTTT AGT ATTT ATTT ATGT ACTTT 
TATTTTCCACAGAAACACTGCCTTTTTATTTATATGTATTGTTTTCTATGGCACTAGGGAAAAA 

ATGCTGTTTATTTATTGATTGCTTCTTTCCAAAATCGAAAAGAAAGAAAAAAGAACGTGACAGGAGAAGG^

ATGGCCAGAATTGCCC CTCCCCCACACTCACTG CCCCTCCCCCCAGCGTCACCTGGGATTTGCAGATGTGTTTAG AAACACGCCCA 
GACCTTGAACCTTGGGTT(^TGGATTAGTTTTGTATCT 

ACCAGTGTGGACTTGAAGAAGTGTCCTCAGCATGTGCAGAGTCTACTACCCAGTACCAGTCGTGAGTCTG<^GGCTCCAGTGT^ 
GTAGTAGTGTTTATGGGCCTTGGGAGTACTTCTCCCCTGCCCTGCCCCACTGTCCCCTTCCTGACAACTTGAGCC^ 

CAGGGTGTGGTGCCTCCTAGAGAAAACACTGCCTGGACTGTTCTGT^

AGACGGACTGTCCCGGCCTGGGCCTGGGCTCCTAACACCTGACTGCCAAAGGGCTCCAATGTC 
CTGCATTGAGACTCCAAGAAAACAGAAGCTATGTGGCCTCTGATCCCCAAACTGGCCTGGGTGGGGACATGCCTTC 
AATGTGGGTGGAGCCTGCTTCTGGGCCACCCCTCCTGGTTCAGGGCTGTGCTCACAGCAGATT CTTC
GTGGCAGAATAAGTATCTGTAAATACATGTTTAAAGATGGATTTTGTTTAAAAAATCTAAAGGAA 

TGATGAGGACTGTCAGACTGTGGCTTAGCTCAGTGTGACCCAGACCTTGTGA

CAACCCAGGATGGCCCATCTGCTGCCCACCAAGTCCCTTTCCAGCCACTGTGTGCT 
TCAGGGCAGCTCTTTCTGGCCTTTTGGGGGGCAGTGTCTGTGCCATGCCTAATAGATATGACCAG 
CTTACTGTGTTGTATAAAATAAAGTGTAGTTTACAAAAAAAAGAAACGTAAAAAAAAAAAACT^ 
AAATGATGTATlTTTTTCATL'l"rri"rriGTTAACTAATTTGCAATAAAAAT^ 

AGTGTGGTTT GG TT C ACTrTTTTGGGG AGTGTG ACAG AG AAAGGGC CACC CAG CACTG AAAT AAC CAAATC CAAAC CCTCT CTTACA

CAGAAAT ACAG AGGTTG CCATAAGGACACACCATTTTCAG AAG CTGCACAGCGCAGGG CATCAAGGTGGG AAG ACAGTTTCCAAAA 
GAGTTGATGCAGTCCTCTCAGGGATGTGGATCTGATTGATGCCAGCTG 

TTGTCATAGATGACrATCCTCTGAGCCCAGTAACCCCCAACCTCTTCAAATTCAGCCATGCCTTGCAAAAGGT 
CTTGTTGACTGCTTATAGACCCCCTCAAAGAGTTGGGAAGGTCATGGGACTCAAAGGACCCTTGCCTGATGATCCAGT 

TGTATGCAGTCCTGCCCTAATTGTGCATGACCTCAGGGAGGTACTAGAGTATATAGTCTGGGAAGACTAGTGGAGAGACT

CTTTAATCCTAGCACTCAGGAGGCAGAGGCAGGTGGATTTATGAGTTCGAGGCCA 
CAGGGCTATACAGAGAAACCCTGTCTCGAAAAACCAAAAAAAAAAAAAAAAAAAAAAAA 
GCAGGTCACTATCCAACTCAATGGTTTATGAAAGACACACAACCrrTTCTTTATATTTTATTATGT 
GTAGCTGCCTAGCCTCCATATTGCTAAGACTCTGTCTCCCCATC

CTAGACCAAGTTGGGCAGCCTTATAGGCCATGCTCTCCCAGCTCCTTCAGGTGGTAGCCTT 

GGCATGGCATCTCTCCTCTCCCCAACTCTCCCTTCTCCCACTGCCTCAAGTCTCAAGCCCAGGAGTCCT 

TGTCCTCCCCAGCTACTGGCTATTGGTATCTTTATTTACCAATCAGAAC^ 

TTCCTGTGCAGTTCGGGGGAACATTAACATTTGTAATACAAACAGCTACAAGTAAAGAGTTCCAATGTGGACTC 

TTTACCTATGTTCTGCTCTTGTCAGCGCTGCAAATCCATTGGTGCCCTAGTCAGCTTTGGTGCCT

AGTGTTGTCTCTTCCTGGGAAGG AATTTT AGCAACACT CATCT AGTCCG AG AG CCCAGCAGGTACAAAGGCATCTTAGCCCTAAGA 
AGAAATCATAATAGCACAGGTGTGGATCTCCCTTGGCCTGCTGTACTGGCTAGTTTTGTGTCAA 

CAAAGAAAGG AGTTTCAGTTGGGGAAATGCCTCCACGAG ATCCAG CTGTAAGG CATTTTCTCTTAGTG ATCAAGGGGGG AAGGGCC 
CTTTGTGX3GTGGTACCATCTCTGGGCTCGTAGTCTTGGGTT CTAT AAGAGAG CAAGCTGAGCAAGCCAGGGGAACCAAGCCAGTAA 

AGAACATCCCTCCATGGCCTCTGCATCAGTTCCTGCTCCCTGACCTGCTTGAGTTCCAGTCCTGACT^

GTATGGAAGTGTAAGCCGAATAAACCCTTTCCTCCC<^CTTGTTTCT^ 

AGACACCTGCCTAGGCCTCTCCATTTGGCCCACACCTTGGAGC 

CCCTCTCTGAGCCTGACATCTGAGGACAAGTGTGAATCCATCAAGTTC^^ 

GCGGAGGGGGCCCAGGGGCCCAGAAGGGGTGCTCACACACAGAATTGTGATAACTTGTCTCATTCTTGCCCT 

GAGCCTGAATTCTGCCCGGAGAACTAGCATCTCCAAAATTCCATCCGGAATGGAAATGTCAGCT

TATGCCTGGGTGTTCTCTAGAGCATCATCCACAATGGCATGTGCAGCCCAGGGTTCACAACT 

TCTGGACTGGGTTCGGGGTGAAAAACATTGATGTTTGCAGATCAAAGCAAGGGCCTACCCA 

GACCAAACAACAAAAGGTCAGGGAAGAGGGGATGGATCCTGCCTACTGCGGCAGAAC^ 

G AAAGATTG C CTG TTCTTGGGTCTCCAG CTTG CCAGG TACAG CT CTG GGGACTTG CCAACCC CACAACCTCT H n G T ATCT AC CC 

TAGTAAACATTCAGTGGAGTATTTTTCAGCCATTAGAA

CACAGGGCTCGTGAAAGA<^TCTCAAAAATGACGACATGATTT CATCT 

AGAACAG AGTGG CCATTCCTGGGTCCAGGGAAATGGGTATGACTGACAATGCCT CATGGATAGTCCAGCAC 
TGTGTACGTCT AAGG CAGAAGG CTGCATGGCACTGTGAAGGTCC CAG ACCCGTCAATGCTAGAATTCCACAATCTGTG AGTCTACT 

TTTATTTTGTGACAGGATCTTGCTGGGTATCCCAGGTTGGCCTCAATCTCACCATCCT
T CT AGGTGTGTACCACG C CACCCACTT AAAC CT AGT ACTCTT AAGGAG CATCT AT CTGCC CCT AAAAG CCACTGG AAAT AAAT AAA

ACATGT AAAACAAT ATTTTT AAAGGG CT AAAGATGT AGCCT AGT AGT AG AGGGTTTG TTT ACT CTGTG CAAGGCT CTGGATTT CAT 

CCCCAGGACCAAATAAAGTAATCAATAAATTGGCTTGAGACAACCTTCAATAAAT^ 

TGAGTCATGTAAAAAAGTTGGGTATGATCATGTCCTGTAATCCCAGCATCTGTGA 
TTAGCTTTTGTCTAATCAGTGAGTCGCAGGTCCCAGTCAGAGACCT

GATGGCTCAGATAGTAAAAATTCCTACTGCACTAGCACAAAGACCTGAGTTT^ 

ACCATACATCTGTAATCCTGTGCTGGGGTGGGGAGGGAGACAAGATCCT 

TAGGTCCAGTGAGAGAGACCCTGCCCCAAAAACCAGGATGGAGTCAGGCATTGTGG 

GAGGCCCAGGGCTAAAATAGTCTCTGTCTCAAAAACAAACTA 
TGCCATATATACATATACATGCTCACACXSCACAGGCAC^

AGCCCAGAGAGAAAATAGAGTCTTGCAGCCTCIAGCCJTTCCT 

CAGAAATACTCCAGGACCAAATCCCAAGCAGACAGAGGTATGGAGATGAGCT 

AGACATGGGAGCAGAG^CAACTGAGTGGTCITGGAGCAACTTGC 

TGTGAAATCCAACTGGTGTGGGGCrcTTTACACAGCAACCT^ 
GAAATCrCTCAGGGACACAGGCCAGTTCAGTTCCCTAGGTACTGCCITTCCCCC^

AACACATCAACTTGACACACAAACTACAGAGTTATCTGAGAGGA 

AAGACATTTTCTTAATTAGTGATTGATGGAGGAGGGCCAGCnTATGGTCCT 

GAAACAAGCCAATAAGCAGCACCCCTCCATGACCTCTGTCTGTATCA^ 

ACTTTCTTCAGTGATGAACAGTGCTGTCAAAGTGTAAGCCTAAGAACT 
CACAGCAATAAAAACCTTAACTAAGACACC^CCACTCCCCAGKXSATCrGC

GAAT AGTT ATT ATGT ATAT ACT ACT AGAGGGCAGTG AG TT AGG CAGTAGTG AG T AAGCAGGGG CACACT ACGT CCTCTGGCTGG CT 

GAACCTTGTGATCCTTCTGCCTCAGCCTCCTGAGTTCTAGGATTGCAGGTATGCC 

AACATTCAAGTATGTAAGTAGTTCCAGACAAGTATGGTAAATCCAACACTCTAGAGG 

GGCCAGCCTGAGTTATATAGCTAACTTCTGTTTCAAAAATAACAAAAGCAGATAT 

TGCACACCCTT AATCCCAGCACTCAGGAGGCAGGCGGAGGTAA^

AGGACAGCCAGGGCTACAAGGAGAAACCCTGTCTCAAAAAACTGAAACCAAA 
TCTCTTCCTGTCCCAAGTGTCCTrCGCrCCACIGTCTGTGTATATCT^ 
CATTCAGAGAAGCCCATTCCAATAGAAATAAGACGGAAACCACATAGGTCATTCA^ 
GAGCAGGTGAATTTATTTCAATACTAC^TCTTC^TTCGTCCAATGAAA 

GTATGAAAAGGACAGAGTTGACATTCTTTGT ATT ATT ATTTGTATTTATTTGCTT ATGT AT ATGT

GTAGAG^TCAGAGGACAAGTGATGAGAGCCAGTCTCTCCCTCTACCATGTGG 

CAG CAGGCAC CCTTACCCACTGTG CCAGCTCAC CTGCATT AT CACCATCTGCACTGG CCATGCTG AAGG T GCTT AG AGG C CACACA 

TGGTCAAAATGTCATCCCAGACAGGCTTAATCrCCACCAGCACAGATAAAC^ 

GGGGCGAAGGGGAGX3ACGGTAAAGGCAGGAGGTTCAGGAGTTCTAA 

AAGGTGAGGGGAGAGACTGCTATGCTGGTGTTGTTTCT TC

AAGAGGGACCCAAGGGTATTAGTGCCATAAACCCAGCCAAAGCAGCAGGGTTATGCCrA 
CCTT'CTGAGACTCCGGXSAACCAGCCAGCCATGAGTCTTGCAAAGTTTGCT 

ACTCCAGACCCCACCCTGTGGAGCTGTGGAGACCCAGG CCAATCCTCAG CTG CATACX1AGTTAGGAAACAGGGCTCTTTT ACTACA 
GTATCCTCAAGCCCGACTGAGCTATGTGTACTGTGGATTTCC^AA 

GATCAAGGACAGGGATGCTGCTCTCTCCACCTGCACTTTAAAGATGATGTG^

TGTTCTCATGGACTACCTGGCAGCTGTGGGATAGAAGGTCCCTAGAGTGTC 

TGGATGCTTTGGCCATCTGTGTAGAACTCACTTCTGOTAAGGACCCAGAAATAGCA 

ACATTCCTTTAGCAGCCAAGCCCAGCATACACAGTCACTGGTC^ 

ACTGTGAACAGATATTATGAGCAAGGCAACTCTrrri-1 11 riAAAGATTTATTTATTATTATATGTAAATAC^CTGTAGCTGTCTT 

CAGACATTCCATAGTAGGGCATCAGATCTCATTGTGGATGGTTGTGA

GGAAGAGCAGTCAGTGCTCTTACCCGCTGAGCC^TCTCACT 
CTGGCTTACAGGTTCTGAGGTTCAGTCCATTATCATCAAGGCAG^ 

AGAGTTCTACAACTTCATCTGAAC^CCACTAGGAGAAGGCCACTGACTTCCAGGCAGCTAG 
ACACTGACACACCCACTCCAAAAAGGTCACACCrCCTAATAATGCCACTCCCrGGG 

AAGAATGGTTGCTTCAATG CTTTT ATGTG CACATACCACCT AG AG TGTT CCACAAAGTTCAGGGTT CAG AG CTTT CACACAACTTT

GACCTCAAGGCTTATTTTTAGATTCTAGTGCrTGTGAGATCCATTCTCTACAA 
AGACTGTGTTATGCCGGGTGTGGTGGTACATGCCTTTAATCCCAGCACTCAGGAGGCAGAGTC^ 
CAGCCTGGTCTACAAAGTGAGTTCCAGGACTTCCAGGGCTACACAGAGAAACCCTC 
CAAAAAAGACGACTGTGTTCTGTTTCTCATGAGATTCTTCTGC^ 

ACT CAGAAACAAGTCTG ACCT AAGCT ACTT ATTTACTC C C CAT CAAT ACAATCTTCTT AAG CTGGAAAAGGGAAGGGGAG AG AAAA

GGTCTCTGACCTTAGCCTCAGCAGGCCAGGTGGAAGGGGTGACACAGAACAAGGCCCTTTC 

GCTCCCACAAGGTTGAGGATTCTGTCTCTGACCTCAGAGCTTGGAAATCCATTGT 

AGCATTTATGCAGCACCTGCTATCTTTAGTAAGCCCACCCCCTCCAGGGAGTCCAGAAGTAAAGACT 

ATAGGATCTGGAAGGAGCCCTCAATGGATGGATAATTTGTAGGTAGATTTGTAGGTGGCT

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NWJNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNN^^ 

MOUSE SEQUENCE - mRNA 

GTGGTGTGCGTCAACGTCCGATCCCCGCCGGCCACCCCAAGAGGCCGCCGCCGGG 
GCTCCTGAOSCCCCTTCTCTGCCTAACGCT^

TGAATGGAGGT AGGTGCGAAGTGGCC^GCGGCACTGAAGCCTGTGTTGCCAGCGG CAG CTTTGTGGGCCAACG ATGCCAGGACCCC 
AATCCTTGCCTCAGCACACGGTGTAAGAATGCTGGAACGTCCTATGTTGTGGA 

ACCTGCTCACTCTCACAGAGTACAAGTGCCGCTGCT^ 

AACCCCTGTGCCAATGGTGGCCAGTGCCTGCCCTTTGAGTCTTCATACATCT
CAGGCAAGATGTTAATGAGTGCAGCCAGW^CC

GTGCCTGCTGTGCCACCCATACTGGTCCCCACTGTGAACTGCCCTATGTGCCCTGCAGCCCCTC^ 
TGCCGTCCTACAGGGGACACCACCCACGAGTGTGCCTGCTTGCCAGGTTTTGCTGGACAGAACTC 
TCCAGGAAACAACTGCAAGAATGGGGGTGCCTGTGTGGACGGCGTGAATACCTACAAT^ 
AGTACTGTACAGAGGATGTGGACGAATGTCAGCTCATGCCAAATG

TGCCACTTGCCACGACCGTGTGGCTTCCTTCTACTGCG 
GCATGAGCAACCCCTGCAACGAGGGCTCCAACTGTGACACCAACCCT 

ACAGGGCCAGCCTGCAGCCAGGACGTGGATGAGTGTGATCTGGGTGCCAACCGTTGTGAGCACGCAGGC^^ 
GGGTTCTTTTGAGTGCCAGTGTCTACAGGGCTACACGGGACCCGGCTGTGAGATTGA
AGAATGATOCCACTTGCCTGGACCAGATTGGGGAGTTCCAATGCATATC^ 

ACGG ATG AGTGCG CCAG CAGCCCCTGTCTG CACAATGG CCACTG CATGGACAAG ATCCATG AGTT CCAATGT CAG TG C CCCAAAGG 
CTTCAACGGGCACCTGTGCCAGTATGATGTGGATGAGTGTGCCAGCACACCATGCA^ 

ACACCTATACCTGCGTGTGTACAGAAGGTTACACAGGGACCCACTGCGAAGTGGAC^TTGACGAGTGTGACCCTGACCC 
TATGGTTCCTGTAAGGATGGTGTGGCCACCTTTACCTGCCT

TGAGTG CCACAG C CAAC CG TG C CG C CATGGGGGCAC CIX3CCAGG ACCGTG AGAACTC CTACCT CTG CTT ATG C CT CAAGGG AACCA 
CAGG^CCCAACTGTGAGATCAACCTGGATGACTGCGCCAGCA 

GAATGTGCCTGTGAACCAGGCTACACAGGAAGCATGTGTAACGTCAACATTGACGAATGTGCGG 
CACTTGTGAGGATGGCATCGCGGGCTTCACTTGCC^ 
GCAACAGTAACCCCTGCATCCACGGAGCTTGCCGGGATGGCCTCAATGGGTACAAGTGTGACTGTC
AACTGTGACATCAACAACAACGAGTGTGAGTCCAACCCTTGTGTCAACGGTGGC^ 
CACCTGCCGAGAAGGCTTCAGTGGCCCTAATTGCCAGACCAACATCAACGAATGTGCCT 
GCATTGATGATGTCGCTGGATAOU^TGCAACTGTCXrrCTGCCATATACAGGAGCCACGTGTC 
ACCAGCCCCTGCAAAAACAGCGGCGTATGCAAGGAGTCTGAAGACTATGAGAGTTTCTCCT 

TCAAACCTGCGAGGTTGACATCAATGAGTGTGTGAAAAGCCCATGTCGC

GCTGCCTCTGCCAGGCCGGCTATACAGGTCGCAACTGTGAGAGTGACATCGATGACTGCCGCCCCAACC 
TCCTGCACCGATGGCATCAACACAGCCTTCTGCGACTGCCTGCCCGGCnTCCAGGGTGCCTTCT 
TGCCAGCAATCCCTGCCAAAATGGAGCCAATTGCACTGACTGTGTGGACAGCTACACATGTACCT 
TCCACTGCGAGAACAACACACCTGACTGTACTGAGAGCTCCTGCTTCAATGGTGGTACCTGTGTGGATG^TA 

TGT CTGTGTCCACCTGG CTTCACGGGCAGCT ACTGTCAGTATG ATGTC AATGAGTCTGATTC^ CACGGTGGT AC

CTCCCAAGACAGCTATGGTACTTATAAGTGTACCTGCCCACAGXK5CTACACT 

ACTCGG CTCCCTGCAAG AATGGTGGCAGGTG CTGGCAG ACCAACACGC AGTACCACTGTG AGTC 

CTG TGT GGATG AGGGAG AT AAACATT ACTG CCACTGCCAGGCAGG CT ACACGGG CAGCTACTG TGAGGAC GAGGTGG ACGAGTGCT 

CACCTAACCCCTGCCAGAATGGAGCTACCTGCACTGACTATCTCG^CGGCTTTTCCT

AACTGCTCCGAG<3AGATCAACGAGTGCCTGTCCCAGCCCTGCCAGAATGGGGGTACCrc 
TTCCTGCCCCCGGGGGACACAGGGTGTACACTGTGAGATCAATGTTGATGACTGCC^ 
CCAAGTGCTTCAACAATGGCACCTGTGTGGACCAGGTX^TGGCTATACCTGCACCTGCC 
GAGGGTGATGTCAATGAATGTCTCTCCAACCCCTGTGACCCACGTGGCACCCAGA^ 

CGAGTGCCGGGCTGGCOVCACTGGACGCCGCTGTGAGTCAGTCA^

GTGCCGTGGCCTCCAACACCGCCCGTGGATTCATCTGTAGGTGCCCTGC^ 

ACTTGTGGCAGCTTACGCTGTCTCAACG^TGGTACATGCATCTCGGGCCCACGTAGTCCCACCTC 

CGGCCCTGAGTGCCAGTTCCCAGCCAGCAGCCCCTGTGTGGGTAGCAACCCCTGCTACAATCAGGGCA 

AG AACCCTTTCTACCGCTGTCTATGCCCTGCCAAATTCAACGGG CTACTGTGC CACATCCTGGACTACAGCTTCACAGGTGGCG CT 

GGCCCG^ ACATTCCCCCACCGCAGATTGAGX3 AGGCCTGTGJVGCTGCCTG AGTG CCAGGTGGATGCAGGCAATAAGGTCTGCAAC CT

G CAG TG TAAT AAT CACG CATGTGG CK^SGATGGTGGCG ACTGCT CCCT C AACT TCAATGACCCCTGGA G CACGCAGTCT C 

TA(ZAGTGCTGGAAGTATTTTAGCGACGGCCACTGTGACAGCCAGTGCAACTCGGCCGG 
CT(ZACCGAGGGACAGTGCAACCCCCTGTATGACCAGTACTGCAAGGACCACT^ 
TGCCGAATGTGAGTGGGATGGCCTAGACTGTGCTGAGCATGTACCCGAGCGGCT^ 

TTCCACCCGACCAGCTACG^3AACAACTCCTTCCACTTTCTGCGXK5AGCT

GATGCGCAAGGCCAGCAGATGATCTTCCCGTACTATGGCCACGAGGAAGAGCTG^ 

TTGGGCIZACCTCTTCACTGCTTCCTGGTACCAGTGGTGGGCG 

TCTACCTGGAGATCGACAACCGX3CAATCTGTGCAGTCATCCTCGCAGTGCn^ 

GCrCTTGCGTCACTTGGCAGCCTCAATATTCCTTACAAGATTGAGGCCGTGAAGAGTGAGCCGGTGGAG 
GCTGCACCTCATGTACGTGX3CAGCGGCCGCCTTCGTGCTCCT

GCCAGCATC^CCAGCTCTGGTTCCCTCAGGGTTTCAAAGTGTCAGAG 

TCAGTCGGCCTCAAGCCCCTGAAGAATGCCTCAGATGGTGCTCTGATGGACGACAATCAGAACGAGTGGGGAG 

GACCAAGAAGTTCCGGTTTGAGGAGCCAGTAGTTCTCCCTGACCTGAGTGATCAGACTGACCACAGGCAGTGGAC 

TGGACG CTGCTGACCTG CGCATGTCTGCCATGGCCCCAACAC CG CCTCAGGGGG AGGTGG ATGCTGACTGCATGGATGTCAATGTT 

CGAGGACCAGATGGCTTCACACCCCTCATGATTGCCTCCTGCIAGTGGAGGGGGCCT^

TGCACCTG CTCTCATCTCTGACTTCATCT ACCAGGGCG CCAGCTTGCACAACCAGACAG AC CGCACCGGGGAG ACCX5CCTTGCACT 
TGG CTG CCCG AT ACT CT C GTTCAGATCGT CG AAAG CGC CTTG AG G C CAG TG CAG ATGCCAACATCCAGGACAACATGGG CCGT ACT 
CCGTTACATGCAG CAGTTTCTG CAGATGCTCAGGGTGTCTTCCAG ATCCTGCTCCGGAACAGGGCCACAG ATCTGGATGCCCGAAT 
GCATC^TGGCACAACTCCA(TIX1ATCCTGGCTGCGC 

TCAATGCCGTGGATGACCTAGG CAAGTCGGCTTTGCATTGGG CGGCCGCGGTG AACAATGTGXjATGCTGCrGTTGTGCTCCTGAAG

AACGGAGCCAACAAGGACATCGAGAACAACAAGGAGGAGACTTCCCTGTTCCTGTCGATCCGCC^ 
AGTGTTGCTC^ACCACTTTGCCAACCGC^ACATCACC^^TCACATGC 
ACGATATCX3TGCGGCTTTTGGATGAGTACAACCTGK3TG<^TCCCCACAGCTGC^ 
TCTCCCACACTCTGCTCGCCAAATC^CTACCCTGGCAATCTCAAGTCTGCC^ 

AGGGCTGGCTTGTGGTAGCAAGGAAGCTAAGGACCTCAAGGCACGGA

GCTCGTCGAGCATGCTGTCGCCTGTGGACTCCCTCGAGTCACCCCATGGCTACTTG^ 

TCCCCATTCCAGCAGTCTCCATCCATGCCTCTCAGCCACCT 

GGCAGCCAAGCCTGAGATGGCAGCACTGGCTC^AGGT 

TAGC CTCCAGTGCCTGCACAGTG CTGAGT ACCAATGGCACCGGGG CT ATG AATTTCACCOTGGOTG CACCGGCAAGCTTGAATGGC 
CAGTGTGAGTGGCTTCCCCGGCTCCAGAATGGCATGGTGCCCAGCCAGTACAACCCACTACC^
GAGCACACAGGCAGCTGGGCTCCAGCATAGCATG^^
ACCAGGGCCTGCCCAACACACGGCTGGCAACACAGCCTCACCTGGTC 

CCACAGAACTTACAGCCACCATCACAGCCACACCTCAGTGTGAGCTCGGCAGCCAATGGGCACC^ 
GGAGCCCAGTCAGGCAGATGTACAACCGCTGGGCCCCAGCAGTCTGCCTGTGCACACCATTCTGCCCC^ 
CAACATCACTGCCATCCTCCATGGTCCCACCCATGACCACTACCCAGTTCCTGACCCCTCCATCACA^
CCTGTGGACAACACCCCCAGCCACCAGCTGCAGGTGCCAGAGCCCACTTTCCTCACCCCATC 
CAGCTCCTCCCCGCATTCCAACATCTCTGATTGGTCCGAGGGCATCTCCAGCCCGCC 
TTCCAGAGGC^TTTAAATAAACAGAGATGTGGGATGCAGGACCCCAGCTTCCGTTCCCAAGC 
TTCAGGATGCTGGGGCGACCAAAGGAGCTTTTTAAAAAATGTTTTTATACAAAATA^ 
TATTTATTTATGTACTTTTATTTTCCACAGA

ATCTGTTCCAAGAAAATAAACTAGTTCTCAGAGCCTTGATTCT 
AAGGATTCATGATTTGTAAATGCTGTTTATTTATTGATTGCrTCr 

MOUSE SEQUENCE - CODING 
ATGCCACGGCTCCTGACGCCCCTTCTCTGCCTAACGCTGCTGCCCGCGCGCGCCGCAAGAGG

GACCTGCCTGAATGGAGGTAGGTGCGAAGTGGCCAGCGGCACTGAAGCCTGTGTTGCCAGCG 

AGGACCCCAATCCTTGCCrrCAGCACACGGTGTAAGAATGCTGGAACGTGCrATGTTG 

TGCAGCTGTCCCCTGGGTTTCTCTGGGCCCCrCTGCCT 

CACCTGTGACCTGCTCACTCTCACAGAGTACAAGTGCCGCTC 
GTGCCTCCAACCCCTGTGCCAATGGTGGCCAGTGCCTGCCC

CCCACCTGCAGGCAAGATGTTAATGAGTGCAGCCAGAACCCTGGGCTGTGCCGCCATGGAGGC 

CTATCGCTGTGCCTGCTGTGCCACCCATACTGGTCCCCACTGTGAACTGCCCTATC 

GAGCAACCTGCCGTCCTACAGGGGACACCACCCACGAGTGTGCCTGCTTGCCAGGTTTTGCT 

GATGACTTGTCCAGGAAACAACrGCAAGAATGGGGGTGCCTGTGTGGACGGCGTG 

G ACGGGTCAGT ACTGT ACAGAGG ATGTGGACG AATGT CAG CTCATG CCAAATGCCTG C CAG AATG CGGGAACCTG CCACAACACAC

ACGG CGGCTACAACTCTGTGTGTGTCAATGGGTGG ACTGGCGAGG ACTGCAGTGAG AACATTG ATGACT CCTGT 
TTCCAGGGTGCCACTTGCCACGACCGTGTGGCrTCCTTCTACTGCG 
GCATGCGTGCA.TCAGCAACCCCTGCAACGAGGGCTCCAACTGTGAC^ 
CGGGGTACACAGGGCCAGCCTGCAGCCAGGACGTGGATGAGTGTGATCT^ 

AACACACTGGGTTCTTTTGAGTGCCAGTGTCTACAGGGCT

CCCATGTCAGAATGACG CCTACTTGCCTGG ACCIAG ATTGGGGAGTTCCAATG CATATGTATG CCAGGTTATGAAGGTGTATACTGTG 

AAATCAACACGGATGAGTGCGCCAGCAGCCCCTGTCTGCACAAT 

CCCAAAGGCTTCIAACGGGCACCTGTGCCAGTATGATGTGGATGAGTGTGCCAGCACA^ 

TGGG CCCAACAC CT AT ACCTG CG TGTGT ACAG AAGGTT ACACAGGG ACC CACTG CG AAGTGG ACATTG ACGAGTGTGAC C CTGACC 

CCTGCCACTATGGTTCCTGTAAGGATGGTGTGGCCACCTTTACCrGCCT

AACATCAATGAGTGCCACAGCCAACCGTGCCGCCATGGGGGCACCTGCCAGGAC 
GGGAACCACAGGGCCCAACTGTGAGATCAACCTGGATGACTCC^ 

ATGGCTACGAATGTGCCTX3TGAACCAGGCrTACACAGGAAGCATGTGTAACGTCAACATTGACG 
AACGGGGGCACTTGTGAGGATGGCATCGCGGGCTTCACTTGCCGCT 

CAACGAGTGCAACAGTAACCCCTGCATCCACGGAGCTTGCCGGGATGGCCTCAATGG^

GTGOAACAAACTGTGACATCAACAACAACGAGTC 

TACGTATGCACCTGCCGAGAAGGCTTCAGTGGCCCTAATTGCCAGACCAACATCAACGAATGTGCCTCCAACCCCTG 
GGGGACCTGCATTGATGATGTCGCTGGATACAAGTGCAACTGTCCTC^ 

CATGTGCTACCAGCCCCTGCAAAAACAGCGGCGTATGCAAGGAGTCTGAAGACTATGAGAGTTTCTCCTGTGT 

CAGCTACCGCTCCCTCTGCCAGGCCGGCTATACAGGTCGCAA 
ATGGGGGTTCCTGCACCGATGGCATCAACACAGCCTTCT 

AATGAATGTGCCAGCAATCCCTGCCAAAATGGAGCCAATTGCIACTGACTGTGT^ 
CAATGGCATCCACTGCGAGAACAACACACCTGACTGTACTGAGAGCT 

CCTTCACCTGTCTGTGTCCACCTGGCTTCACGGX3CAGCTACTGTC

GGTGGTACCTGCCAAGACAGCTATGGTACTTATAAGTGTACCTGCCC^ 

CTGGTGCGACTCGGCTCCCTTGCAAGAATGGTGGCAGGTGCTGGCAGACCAACACGCAGTACCACTGT^ 

CTGGCGTCAACTGCGACGTGCTCAGTGTGTCCTGTGAGGTGGCTGCACAGAAGCGAGGCATTC 

GGAGGGCTCTGTGTGGATGAGGGAGATAAACATTACTGCCACTCCCAGGCAGGCT 

C?GAGTGCTCACCTAACCCCTGCCAGAATGGAGCTACCT

ATGGGTCTAACTGCTCCGAGGAGATCAACGAGTGCCTGTCCCAGCCCTGCCAGAATGGGGGTACCTGCATTGATCTG^ 
TACAAGTGTTCCTCCCCCCGGGGGACACAGGGTGTACACTGTGA^ 

CCGAAGCCCCAAGTGCTTCAACAATGGCACCTGTGTGGACCAGGTGGGTGGCTATACCTG 
AGCGGTGTX5AGGGTGATGTCAATGAATGTCTCTCCAACCCCTGTGACCCACGTGGCACCC^ 

TTCCACTGCGAGTGCCGGGCTGGCCACACTGGACGCCGCTGTGAGTCAGTCATCAATGGCT

GGGTGTCTGTGCCGTGGCCTCCAACACCGCCCGTGGATTCATCTGTAGGT^ 
ATGCCCGCACrTTGFTGGCAGCTTACGCTGTCTCIAACGGTGGTA 

TCCTTCACCGXSCCCTGAGTGCCAGTTCCCAGCCAGCAGCCCCTGTGTGGXjTAGCAACCCCTGCTACA 
CACATCCGAGAACCCTTTCTACCGCTGTCTATGCCCTGCCAAATTCAAOX^ 

GTGGCGCTGGCCCGGACAT^CCCCCACCGCAGATTGAGGAG

TGCAACCTCCAGTGTAATAATCACGCATGTGGCTGGGATGGTGGCGACTGCTCCCT 
GCAGTCTCTAC!AGTGCrGGAAGTATTTTAGCGACGGCCACTGTGACAGCCAGTGC^ 
ACTGCCAGCTCACCGAGGGACAGTGCAACCCCCTGTATGACCAGTACTGG 

TGTAACAGTGCCGAATGTGAGTGGGATGGCCTAGACTGTGCTGAGCATGTACCCGAGCGGCTGGCAGCG^ 

GGTGCTGCTrrCCACCCGACCAGCTACGGAACAACTCCTTCCACTTTCTGC^

TCAAGCGTGATGCX3CAAGGCCAGCAGATGATCTTCCCG 

ACAGTGGGTTGGGCCACCTCTTCACTGCTTCCTGGTACCAGTGGTGGGCGCCAGCG 
CTCCATTGTCTACCTGGAGATCX3ACAACCGGCAATGTGTGCAGTCATCCT 

TCCTAG^TGCTCTTGCGTCLaCTTGGC a J5C 
GCGCCGG CGCCAGCATGGCCAGCTCTGGTTCCCTG AGGGTTTCAAAGTGTCAGAGG CCAG CAAG AAG AAG CGGAGAGAGCCCCTCG
GCGAGGACTCAGTCGGCCTCAAGCCCCTGAAGAATGCCTCAGATGGTGCTCTGATGGAOT 
GACCTGGAGACCAAGAAGTTCCGGTTTGAGGAGCCAGTAGTTCTCCCTGACCTGAGTGATCAGACTG 
GCAGCACCTGGACGCTGCTGACCTGCGCATGTCTGCCATGGCCCCAACACCGCCT 
TCAATGTTCGAGOACCAGATGGCTTCACACCCCTCATGATTGCCTCCTGCAGTGGAGGGGGC

G AAGAAGATG CACCTG CTGTCAT CT CTG ACTTCATCTACCAGGG CG CCAGCTTG CACAAC CAG ACAG AC CG CACCGGGG AG ACCGC 
CrrGCACTTGGCTGCCCGATACTCTCGTTCAGATCGTCGAAAGrc 

GC CCG AATGCATGATGG CACAACTCCACTG ATC CTGGCTGCGCGCCTGG CCGT GGAGGGCATG CTGG AGGACCTCATCAACT CACA 
1 0 TGCTGACGTCAATGCCGTGGATGACCTAGGCAAGTCGGCTTTGCATTGGGCGGCTO 

ACTGCCAAAGTGTTGCTGGACCACTTTGCCAACCGGGACATCACGGATCACA 
TATGGACCACGATATCGTGCGGCTTTTGGATGAGTACAACCTGGTGCGGTCCCCACAGCTC 
CCACTCTGTCTCCCACACTCTGCTCGCCAAATGGCTACCCTGGCAA^ 
1 5 AGCACCAAAGGGCTGGtTTTGTGGTAGCAAGGAAGCTAAGGACCT 

GTTGGACAGCTCGTCGAGCATGCTGTCGCCTGTGGACTCCCTCGAG 

TCCTCCCCTCCCCATTCCAGCAGTCTCCATCCATGCCTCTCAGCCACCTGCCTGGTATGCCTGA 
TTGAATGTGGCAGCCAAGCCTGAGATGGCAGCACTGGCTGGAGGTAGCCGGTTGGCCTT^ 

CCTGCCTGT AGCCTCCAGTGCCTGCACAGTGCTGAGTACCAATGGCACCGGGG CTATG AATTTCAC CGTGGGTGCACCGGCAAGCT 

2 0 TGAATGGCCAGTGTGAGTGGCTTCCCCGGCTCCAGAATGGCATGGTGCCCAGCCAGTACAACCC^ 

GG CACACTGAG CACACAGGCAGCTGGGCTCCAGCATAGCATGATGGGGCCACT ACACAGCA 
GATTATTTACCAGGGCCTGCCCAACACACGGCTGGCAACACAGCCTCACCT 

CACTCCAGCCACAGAACTTACAGCCACCATCACAGCCACACCTCAGTGTGAGCTCGGCAGCCAATGGGC^ 

TTGAGTGGGG AG CCCAGTCAGGCAG ATGTACAACCGCTGGGCCCCAG CAGTCTGCCTGTGCACACCATTCTGCCCCAGGAAAGCCA 
25 GGCCCTACCAACATCACTGCCATCCTCCATGGTCCCACCCATGACC^ CCTCCATCACAG CACAGTT ACT 

CCTCCTCCCCTGTGGACAACACCCCCAGCCACCAGCTGCAGGTGCCAGAGCCCACTTTCCT 
CAGTGGTCCAGCTCCTCCCCGCATTCCAACATCTCTGATTGGTCCGAGGGCATCTCCAGCC 
CACCCACATTCCAGAGGCATTTAAATAA 

HUMAN SEQUENCE - GENOMIC

CAGGCCAGGGAGATAATGGGGGACAGGGAAGATAACGCAGTCCC 
AAAGGCCCTCGGCTGTGCACGTGGTTCCTACATAGCCTTTGCCAGACACCCT 
GCCCCGACACGCAGGATGTGCTGGGGGCCTGGGCCTGTCCCCTCCATre 
CTGCCTGTGACACGGAGCCCOkGGATTAGTTTCAGGCCTCGGTCTC^ 

3 5 TGCGTCCGGGGTGGGCCGAGGTCCCTGCAGTGTCGTGGAGCCCACCCGACT 

GCACGCCTGGTGGGGTGAGGCCTAGGCTGTGCCCCGCTGCCAACCGGGA^ 
TGTCrTCAGCGTCCCCCC^ATCTGTGCTCCACTCCTCCCTGGGTCA 
CATTTGCCTCCCGGCGGGACCAGCAGACGGCACTCTGATGGCGGAAAGAC 
TGAGGCTCCCGGGGTCZACGTGTGTCTCCTTCCCGCAGG 

4 0 AAACGGCGGGGAGTGCAGGCAATCCGAGGACTATGAGAGCTTTC 

GCCCGGTG AGGGCTGGG ATGGGAGGTCAGGATGTCTGCGGGACACAGGCAG CTCCCAGGCAGGCTAGATGAGT CTTTGAAG AGGAG 

CCGGTGGGTGCTGAGGAGGCCCTGGTCGGAGAAGTTCTGGAATCAGGAATTGACCTGGGAGCACCGTTCCCAACT 

GACCTTCTTAGGCCAAAATTAGGGGAGAGGGGATGGTCCTGGGGTCACGGAAGCCCACTCCTGGGTCGG 

45 GGTCGACATCAACGAGTGCGTTCTGAGCCCGTGCCGGCACGGCGCA^ 

AGGCCGGCTACAGTGGGCGCAACTGCGAGACCGACATCGACGACTGCCGGCCCAGTGAGTAGCCCCGCGGCT 

GAAGCTCTCAGGCCTCAGTTCCCCCGGGCAGGGTTGGTGCGCATGGTGCTGGCC^ 

GGAGGGTTGGGTCAGGTGTGGGGGATGTGCTGAGGGATGGCCAGGGTGGTTCAGGAGGGC 

ACTTCGCGGAAGAGTCACGAGAGCCCCTCCCACGGCTCTTCACTGAGCCGAGGATGGCTCGGGCCCGGCCTGCACT 

5 0 TCATCGGGCCCCGCTGCAGGCCAGGGCTCTGAGGCCrCCCAGCCCTGCTTCACAGAGGTTGA^ 

CTGGCATCACCTGGCAGGCAGAGGCAAGAACTGTCTCTCCTCCCCTGCTCGGTCTG 

AGCTTGAGGACAGGAGCAGGTGGGGAGAGAGACCCCAAGCACAGGAGACGGGTGTGACGCAGCCTGTGGGTGCTGGGGCTC 

CAGACACCTTTGTCACAGGGCTGCTTGCCGCAGCCTCGGCAACGAAAGGCTGGGCTGTCCCCAGCCATGCAGCC^ 

TCCCGCAGGTGTGGGTTTGTGCCTGCCCCTCACACTCACCCTTCCGTCCTCrCCCAGACCCGTGT 

5 5 QACGGCATCAACACGGCCTTCTGCGACTGCCTGCCCGGCTTCCX^ 

CCCCTGCCGCAACGGGGCCAACTGCACGGACTGCGTGGACAGCTACACGTGCACCTGCCCCGCAGGCTTCAGCGGGATCCACTGTG 
AGAACAACACGCCTGACTGCACAGAGAGGTGTGCGGGGCTCGAGTGAGGCCGTGGAAGGGAACGGG 
AGCTGGGAGCCTGGCACACCATGCAGGCGTCCGCCTAGGCCGGCTGGGCACTT^ 
TTGTACCGACCTGTCCCTCCTGAGAGG)GCCATTGTGACCTCCTCCTGTGTCCCGC^ 

6 0 CTGGTGCGCTTCCTTCTATTAGACAACGAGAACTCT^ 

AACGTCAGTCTCGTCCCTGGTTTCTGACGTAATTACCTCCCGTGTTACCAGGACGGTTTCTGGTT^ 

ACCCT ATG C C CACG CGTTTGG T AAAACCACTCT CG AC CTG ACTT AT AAAG AAAGCAG GGG AGGG AGG TGTGACGTGGT GTG AG AGC 
CGGGCCCO^CTTCCCACTGGCCTCCCTGGGTCAGCCAGGCT 

AGGGG AGGTCCTGGCGAGGCAG AATCTGCTGGAGCGGGGACCC^ CTC AG CCCCCGCCTGCCCACCCCCTTGCA 

6 5 GCTCCTGCTTCAACGGTGGCACCTGCGTGGACGGCATCAACTCGTTCACCTGCCT 

CAGCACGATGTCAATGAGTGCGACTCACAGCCCTGCCTCCATGGCGGCACCTC 

CCCCCAGGGCTACACTGGCCCCAACTGCCAGGTGAGTGCGCCGGCCACAGAGGTGCCCGAAGGAGGGGCC 

CTGCGGGAGGTGGGAACGGTCACCCCCAGGTCCCACTGTGTCGGCrrCGG^ 

ATCTGAGCAGCCGGTGAAGGAACCTGATGCGGAAGCAGCAGGCAC^ 

7 0 OU^AATTGGGGAATTGAGGAAAAGTGTTCCTTCTAGCAAAGGCCGAGGGTG^ 

GCCCAGGCTGTTCCTGGTGTGGGGGTGGTGGGGAGTCCAGGGGCGGC^ 

AGTAGGCCCCTTGGCGGATGTGCGTTCTGAGCTACCGGGGAAATGGCCGGGGCGCGGG<^CCCAGCTGACCC 
AACCTTGTGCACTGGTGTGACTCCTCGCCCTGCAAGAACGGCGGCAAATGCT 

« — r*r>*r*r< ^r**nr*n^n*r*r^ri^nTLttTcxzcTrzrn.riiarciTLr a nnrrr a ar CTGCTfrror r r a r 
CCCGCCTGTGCCAGCATGGAGGGCrCTGTGTGGACGCGGGCAACACXSCACCACT

TGTGAGGACCTGGTGGACGAGTGCTCACCCAGCCCCTGCCAGAACGGGGCCACCTGCACGGACT 

GGTGGGGGTCCCTCCTAGGGT^GGGTTGTGGCCGGCACGA 

AAAACTGAACCTGATAGGCCCATGCACTGTTCAGTCTCATTAGGGGAGGGCTGGGGTAATC^^ 
5 CCTGTGGAAAGCCTGAAGGAGGGTACCTATACTGAGAAGTGGGATGGGGTTTTCCC^ 
CACCTACCCTGAGGCCCTCACCTCTATCCTATGGGACGGGGTCC^ 
CCAGGGTCTGTTGGGCTGGGTCTCTCTCCAGGTCTGACAGGAGCGAGGGGCCC^ 
GCCCTGTCAGCCCTCACAGTGGGGTGTGGGAGCACTGCATCCTGGCGCCGGCTGAGCCGAAG 
AGTGCGTGGCCGGCTACCACX^GGTGAACTGCTCTGAGGAGATCGACGAGTGCCTCT 
1 0 CTCGACCTCCCCAACACCTACAAGTGCTCCTGCCCACGGGG CACTCAGGGTAAGGG CCGCTGCACGGAGGGCTGGTGTTGGCCATC 

CATXKSCCAGGGCAGGGGCAGGGCAGGCACCCCGGGACCGGCCA^ 

AGGCCAAGGCCGACTCACAGCTGCCCAGGGAAAGGGCCCAGAGCGGGGGTCCCAGTGGGA^ 

CCTGCCT CTCCCG AGTGTCCGTGCAG CCCCAGACCTGAG CG CTTGTCTTCCGGGACGGACACGCGGCACGGCAGGGCCGGGGTGTG 

GCGGGCHTGGGCCACTGAOSAAACCTGGCCCCGCAGGTGNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNJNNNNNNNJNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN1W 

NNNNNNNNNNJNNNNNNNNNNJNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNJNN 

NNNNNNNNNNNNNNNONNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
. NNNNNNNNNNIJNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNJNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNJNNNNNNNNNNNJNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NINNNNNNNJNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNN1NNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNIJNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
• 55 NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNIJNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNNNNNNNNNNNNNJNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

GCCTCACTTCTCGACCCCTCACCCCCCAGGCCCCTGAAGAACGCTTCAGACGGTC 

G^CGAGGACCTGGAGACCAAGAAGTTCCGGGTGAGTCG CGAGGCTCCCGGGCTCCTGGGCTCCCGGGCACCTGCTGCCGGG CTGC 
CCTGACAGCCTCTCCTCACTCC CTC^ATGTAGTTCGAGGAG CCCGTGKTTTCT^ 
7 5 TGGACTCAG C AG CACCTGG ATG C CG CTG ACCTGCGCATGT CTGCCATGG CC CC CACAC CG CCCCAG GGTG AGGTTG ACGC C G ACTG 
CATGGACGTCAATGTCCGCGGGCCTGGT^

AGCCCCGAGGCTGGGGTACAGTTGATTCTCTGGAAACTTAGAATTGGGGGTGAGAGCT 
GTAGACGTGGGGGGGTCCTGGAGTCCTCCTGTTCTTCCACCAACCCCTTCCTGGGGTGATACCGCAGGGCC^ 
ATTACTCTTTTCTGAAACCTTCrTGAGACATGGAAAGCGTTGAl-rrrCll'l T T CTTTTTTTTI 1 1 rCTTT TT TTTTTGTTTTTGAG 
5 ATGGAGTCTCGCTCCGTCACCCAGGCTGGAGTGCAGTGGCACGATCTCGGCTCCCTGCAACCTCCACCTCT 
CTCCTCCCTCAGCCTCCCCAGTAGCTGGGACTAC^ 
TTTCACCATGTTGGCCAGGCTGGTCrCGAACTCCTGACCTCAGGTGAT 
GTGAGCCTCCGCATCCTGCTGAAGCATTAATTTrCTAACTC 
GGTCTTX3G<3CAGGTCAGGAAGCAGTTGCCCGCCAGGAGTTT 
1 0 GCTGCCCCTTGGTGGCTGTGTGCACGGGGCCACCAAGTGCTGGGTGGTGCATCCACAC 
CCCCCTCCCCCTGGGCTGCAGCTGGGGACCGGCCCT^ 
GATCGCCTCCTGCAGCGGGGGCGGCCTGGAGACGGGCAACAGCGAGGAAG^ 
ACCAGGGCGCCAGCCTGCACAACCAGACAGACCGCACGGGCGAGACCGCCTTGCACCT 

GCCAAGCGCCTGCTGGAGGCCAGCGCAGATGCCAACATCCAGGACAACATGGGCCGCACCCCGCTGCATGCGGCTGTG 
15 CGCACAAGGTGTCTTCCAGGTAGGCAGTGGCTGCCTGTGTGCCCACCT 

CAGGCTTACGTGGCCCTGGGAGCCTGACCCCGAGCACAGCTGAGTCCGGGACAACrrGGTGCCTCCACCT 
CGAGGTGCGAGGGGGAGGGCGTCGGGCCCATCTGTGTTCTCCAGGGAAGTC^GG 
CTGCTGAGTGAGGCAGCACCTCCCCACCCCCAGCAAAACAGGOTCCATCAGGGTTGTGGG 
CTGCAGCCCCTCXSCAGCCCCGCCCCTCCCTCAACCTTGGCTGCCGGCGTAGCCTGTGGC^ 

2 0 GCTCGGTGCCTGCAGACCTAGGGCTCAGCTTGCCGGTGAGCTCGTGGCAAGAATGGATTTAGGG^ 

GGAGTGTCCCTGGCAGGGGCTGCCTTTGCAGTCACCCCrGCTGCGAGTCCCCAGGCTCC^ 
ATGGGCACAGCCCGGGGAGTTACCACAGAGCTTCTCATTTCCTGATTTCTTAGCTCAGGTGACAC^ 
ATGTAGGTGAGCAGAAAGAGCX3TGGAAGGGAGCCCCGTGCGGGTTTGGTGTGTGCCTCTCTCAGCTGCLTU 
GAAGTTCCCAGGTGCTTTGCAGGAATCAGGCCAAGTTGCCTTTTGCACCCGCCAGTGAGCAGGGG 

CCGTGAGCCCGGCTTCCTGAGTGCCAGCTGTAACCGCCGTTGGGGGCAGGGACTTGACCTCTCTGTTTTGTAG^ 
CTAGGACCACAGTGAGGATTAAATGAGGCCACACGTGGTCACGATGGATCCC^ 

GCCCCTGCCCCCATCCGGGCCCCTCCTCAGGGGCAGCCCCATGGCGTTTCGTCTTGCCCGGCCGTGCCGATCTCAGGAGGGTCTCG 
TCTGTGTCCGGAAGACAGTGGGGCTTCCCGTCCAGGCGTTCGTTCTGGGCAG 

3 0 GAGCCTCTCCCTGTTGCCCAGATCCTGATCCGGAACCGAGCCACAGACCTGGATGCCCGCATO 

CCTGGCTGCCCGGCTGGCCGTGGAGGGCATGCTGGAGGACCTCATCAACTCACACGCCGACGTCAACGC 

AG CCCACGGGGGCACGGCTGCTCTGTCGTGGGG CGGGACCGCCACAGGG ACG<K3TGGG ACTGGGTTGCCTCCAGCTGGTCTCCAAC 
CTACCCCATCTGCTTCTTTCACGCAGGCAAGTCCGCCCTGCACTGGGCTC 

T G AAG AACGGGGCT AACAAAG AT ATGCAG AAC AACAGGGTG AGCG CG AGG CTGGGATG CCAGGGGAGACGTGAGGG CTG AATCCAC 

3 5 AGAG AAGTG AGGG CCAAACCCACGGGGCX3 CAGGGACACAAGGGCCTGACCCACAGGGTCT 

CCCCACCCGCATGATGCGGGCACCTCGTTAGTGCTCCTGCCCTCCTTCAGCCCCTGTTTATGGAGCCCTTCCAA 
CTGTCAGGACACGGCGGCTCGCCCCTCAGACCCAGCCCTACCCTCCTGGGCGGCAGTCATGGCAG 

AACGCTGCTGCTGCCACTGCTGTGTCACCGTCACCTGGACCTCACCTTCTGTCCCCAGCTGTCACCAGGCGGGGGTGGGGATGGGA 
ATGTGGTGACGCAGGTGGTCCAGATTGAGCCAGAACACGCGTGGCGGCAGCTCCCTGCGGGGCGG 

4 0 AGGCCAAGG ACCTCAAGGCTCAG AGGAAG AAGTCTCAGGACTATATCCAAGGGG AG CCACC CCCAGCCCTCATCCTGGCCCTGCAG 

CTCCTGGCCCTGCAGCTGCTGTTGGTTTCCCCTGGGTGCTCAGGGCACAGGTGCAGAC^ 

CACCCCCGCCCCCAACTCCTGCTGCCCCCTTGGCATGTCAGGCTCAGGCGTCTCTCCCTCCTGGGTGAGGGCACACAGCG^ 
GGCACCGGGGCATGTTGGCCCAGGCGCTCCCCAGTCCGTGGCAGCATGGCCCCACAGAGACTGGGCCCCAGAGGGCATCAAGGCCT 
GGGAAGCCCCTTCCCACTCCCACACAG CAGCCTCACCCAGAC CTGTGACGTGTCCACCGCACAGAGGAGACCCC AGGAAGG AG CTT 

4 5 GGTGGCCACCAGGGTGGCTTGATGGCCGTCCAGATGACGAGCTCCCTCCCTGGTGCCCTTC 

ATTTCCTTTTGTGCCTTATTGGGGGCAGGGAAGAGGGCCT 

TGGAGCCCCGCTGATCTAGGGTAGAGGACTGCACAGATCCCCTCTCTGGGTGGGTTTCAG 

ACAAGAGTCAGGCTAGGTGGTTGCAGGACGCTGGGGTGGGGTCCTGAGGAGCAGCCTGCCT 

ACTGCTGCTTCCTCrGGTGATGGAACCTTGGGGAGGGTCCCCACGCCTGG 

5 0 AGGAGGAGACACCNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNIWNNNNNNNNimN^ 

ACATCACGGATCATATGGACCGCCTGCCGCGCGACATCGCACAGGAGCGCATGCATCACGACATCGTGAGGCTG 

AACCTGGTGCGCAGCCCGCAGCTGCACGGAGCCCCX3CTGGGGGGCACGCCCACCCTGTCGCCCCCGCTCTGCTCG 

CCTGGGC^GCCTCAAGCCCGGCGTGC^GGGCAAGAAGGTCCGCAAGCCCAGCAGCAAAGGCCTGGCCTGTGGAAGCAAGGAGGCCA 

5 5 AGGACCTCAAGGCACGGAGGAAGAAGTCCCAGGATGGCAAGGGCTGCCT 

CTGGAGTCACCCCATXSGCTACCTGTCAGACGTGGCCrCGCCGCCACrGCTGCCCTCCCra 

CAACCACCTGCOTGGGATGCCCGACACCCACCTGGGCATCGGGCACCTGAACGTGGCGGCCAAGCCCGAGATC 

GGGGCGGCCGGCTGGCCTTTGAGACTGGCCCACCTCGTCTCTCCCACCTG 

AG GAG CGGAGGGG CCCTGAATTTCACTGTGGG CGGG T CCAC CAG TTTG AATGGT C AATG CG AGTGGCTG T C CCGGCTG C AGAG CGG 

6 0 CATGGTGCCGAACCAATACAACCCTCTGCGGGGGAGTGTGGCACCAGGCCCCCT^ 

TGGTAGGCCCGCTGCACAGTAGCCTTGCTGCCAGCGCCCTGTCCCAGATGATGAGCTACCAGGGC 
ACCCAGCCTCACCTGGTCCAGACCCAGCAGGTGCAGCCACAAAACTTACA 

GCAGCAGCAAAGCCTGCAGCCGCCACCACCACCACCACAGCCGCACCTTGGCGTGAGCTCAGCAGCCAGrc 
GCTTCCTGAGTGGAGAGCCGAGCCAGGCAGACGTGCAGCCACTGGGCCCCAGCAGCCTCGCGG 
65 AGCCCCGCCCTGCCCACGTCGCTGCCATCCTCGCT 

CTACTCCTCGCCTGTGGACAACACCCCCAGC CACCAGCTACAGGTGCCTGAG CACCCCTTCCTCACCCCGT CCCCTGAGTCCCCTG 

ACCAGTGGTCCAGCTCGTCCCCGCATTCCAACGTCTCCGACTGGTCCGAGGGCGTCTCCAGCCCTCCCACCAGCATGC^ 

ATCGCCCGCATTCCGGAGGCCTTCAAGTAAACGGCGCGrc 

CGCTCTGTGGATGCCAGGGCCGACCAGAGGAGCCTTITrAAAACACATGTTTTTATA 

7 0 TT AGT ATTT ATTT ATGT ACT TTT ATTTT ACACAG AAACACTG CCTTTTTATTT AT ATGT ACT GTTTT ATCTGG C CC CAGGT AG AAA 

CTTTTATCTATTCTGAGAAAACAAGCAAGTTCrcAGAG 

TATAAACAAAGATTCATGATTTATAAATGCCATTTATTTATTGAT 

GAAGTTGAACGAGCATAGTCCAAAAAGCTCCTGGGGCGTCCAGGCCGCGCC(ZTTT 

1 ^Wt^.\^1UWll W*VS» * WW*. * * *« »•*».«. — ■•» — — — ~~ "~- ' ~ - - 

7 5 GATTAATTTGCATCTGAAATAGGAAACAAGTGAAAGCATATGGGTTAGATGTTGCCATGTGTTTTA^ 









CTTGTGAAAATGTGTTCTCGG AGTGTGTATG CCAAGAGTGCACCCATGGTAC CAATCATGAATCTTTGTTTCAGGTTCAGT ATTAT 

GTAGTTGTTCGTTGGTTATACAAGTTCTTGGTCCCTCCAGAACCACCCCGGCCCCCTGCCCG 

TGTCAAACATGAGATGTGTGGACTGTGGCACTTGCCTGG^^ 

GGGCTGACCCCGGTGGCGGCCCCAGCACCTCAGCCTGCACAGTGTCCCCCAGGTTCCGAAGA^ 
5 GCCCCAGCTCGCGGGACCCGACCCCCCGTGGGCTCCCGTGTTTTGTAGGAGACTTGCCAGAGCCGGGCACA 
CCGTGGGCTGarrCCTTTGGTCCTGTCCCCGCAGCCCTGGCAGGGGGCATC 
CCTTGGGCCACCCCTCCTAGTTTGGGAGGAGCAGATTTTTGCAATACCAAGTATAGCCT 
TTTTTAAAGGTGGATTTTGTTTAAAAAATCTTAATGAATGAGTCTG^ 

CTCGGGG AGCCTTAGCCG CCCATGCACTGGGGACGCTCCGCTGCCGTGCCGCCTG CA CTCCT CAGGG CAGCC TCCCCCGGCTCTAC 
1 0 GGGGGCCGCGTGGTGCCATCCCCAGGGGGCATGACCAGATGCGTCCCAAGATGTC 
AGTTTACAGAAAAAGACTTTAAAAGTGATCTACATGA^ 

AATAAAAATGATACTGATGGTGATCTGGCTTCCACTCCCCTCTGCTCTGGCCTTTG^ 
TATGCTCTGAGX3GAGCCAGGAGTCGAGGGCCCCTTCTCCT 

GGGACCACGCTGCCTCCAGGGACTCC^TGGTGTCTGCAGCCTGCCTGGTCCAGGCCC^ 
1 5 GGGGGTTGCAGCCCTATGGGAGGTGGGGCTCGTGCCTGGGGTGACACGGCTCCCAGGAC^ 

AGCTCGGGGCTCTTCTCCTCTCAGCCTCGGAGGGATGAAGGTCCCACCCAGC 

CAGGCAGGGGGCTGCATAGGGAGGGCCTTGAGGCAGTGGCATGACCCCTCT 

ATAGGAGGGCAAAGGGAGCTGTCGAAGCCTATGAGCAGGGCAGCATGGGTCGTGG 

ACTGGCTGAGCGAGGCAGAGGCGGCCTGGTCCK3GGGCCCTC 
20 TCTCCCCACTGCCTGGGCCTCTGCCCAGCCCCAGACCTTCAGGGCAGGCCAGTGGCT 

CCCTCTTTGGATTGCAAAGCACTGCCTG CCCTGGGCCCAGTCTCT CCAAGG AGGGATGTGAGCCCGAGGCCTCTACT ATGGCTGGG 

GGCTGCGNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNmnnWNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

2 5 NNCCGGCATC^CTCAGTACAGCCACAGACAGCCTGAGCGTCCACTGCCAACG^ 

ACCTTAGTGTCAGCTTGGCCGGGCCATTTCGCCAAACGGAGTGTGTCTCTGAGGTGTTCTGGATGAGG 
GCCTGGGTAAG<3CAGTCCGCCCTCCCCAGTGTGGGTGGGCCCCGTGCGGTCCGCTGAGGCCCGGGGAGAACAATA^ 
AGGGGTCCTCCCACCCAACCGCTTGCACTGGACATTGGTCITTTCCTGCCTT 
CCTGGAGCCCACCGGCCTTCGGACTCGCAACGCCATCAGGCCTCCrcCCTGCAACTGCAGACC 
30 CGTGTGGCCAATTCCCTGCAATAACCTCTCCAGTGGCACGT CTG ACGAATGCG CACGA 

GGGAATATGATTCCACCGTAGAAAAGGAGGCGCTGACACCCACCATG^ 
CGGCCGCAAAGGGCCGCAGGTGCAGTGACTCCATTTCTATGGGGTGCCTCTGGT 
TGGTTCCCAGAGCCGGGGGGAGGAGGAAAGGGGAGTGGCTGCTAAACGGCCCGGGGTTTTTTTTT 
TTAAGTTTTAGTTTATTGTGGCACTGTTCACAATAGCAAAGACTTGGAACCAAC 

3 5 AAAATGTGGCACATATATACCATGGAATACTATGCAGCCATAAAAAATGATGAGT^ 

C CAATAACG CGGACA CTTTTTTT TTTTTTTTTI f GAGACGGAGTCTCGCrCTGTCACCCAGG CTGGAATGTGGTGGCGTGATCTTG 

GCTCACTGCAACCTCCGCCTCTGTGTTCAAGTGATTCTCCTGCCTC^ 

ACCCGGCTGATTTTTTTGTATTTTTAGTAGAGATTGGGTTTCACCATGTTGG 

4 0 CCACCCGCCTCGGCCTCTTAAAGTGCTGAGGTTACAGGCGTGAGC 

TTT AGGTTCTGTGAGTCTCACCTCAATATGTTATTTTTTTTTTTTAATTA^ CTTGCTCTGTCACC 

CAGGCTGGAGTGCAGTGGTGCCATGATAGCTTATTGCAGCCT 

CAGCTGGGATTACAGGTGCACACCACCTCACCCACCCAATTTT^ 

4 5 CTGGCCACATATGCAACTTTAAATATTTCAGTAGCCACATTAAAAAGGTAAA 
TTAGTTAATCCAATATATCAAAAATATTATTTTA^ 

CTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACCTGCCACCACGCCC^ 
TTTCACCATGTTGGCCAGGCTGGTCTTGATCTCrTGACCTCGTGATCGGC 
TAAGCCACCACGCCCGGCCGTGAATGAGATGTTTTATATTC^ 

GTACAGTACACTCACAGCACCCTAGATTGGGCCTGGCCACACTTTAAAAGGTGTTCAGTATCT 
GGACTCTTCAGCAGGATGCCAAGGGGGCTCCAGACAGTTCCACCCAGCG 
ACTCCACGCCGCTGCCGGCGTGAACCCCGAGTGTGGCGCTGCGTCCCTCT^ 
ACCAACTCATCAGGGCCCACAAATCAGGCCCCCGGCAGGCAGGGCTGCCTCCATCCAAGAGGG^^ 
GCTGCTCCAGGGATCGACAGGCCTGTGCTOCCTTCATCCCAAAAGTCTCACTTGCAAAGGGTG 
CAGCGGTGGACTGATTTCGTTCTGTGTGGCTCCre^^ 
GGGGGCAGCACTGGGCAGAGGCAGACAAGGGGGAGCX3TCACGCGGCTC 
GGGACACCGACGTATTCGTCTGCAAAAGTAAAAGTGTTTCAAAAC 

GCGTTGGCCAGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTG 
ACCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCGGGCGTGGTGGCAGGC 
AGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCTGGGAGGCGGAGCTTGCAGTC 
GCCTGGGCGACAGAGCAAGACACTCTCACAAAAAAAAAAAAAAAAAAAA 
TGCAGGGCCTCCCACGTTACTCAGGCTTTGCAGACGAAGACGCAGGGTC^ 
TCTGCATCTTGAAGGTGGCTGGGTGGCGCCTCAGTCGCCTGCCC^ 

ACCTGGCGGCTGCTGTCCTGGCTGTTGCTGTACCTCAGTGTCCTCGTGGGTCAAGTGGGGACCT 
ATGGTGAGCTGCTGGTGCACGCTGGGCATGGGGTGAGGGCTCAGAAAGGGAACAGCTCTTGTTATCGTTA 
ATTCTITGTTCTCCACCTGCATGCCGTGTTGGCACAGTTAAGCCTGGCGGCCCCTTCAGCAG 

G ATGGACAAGTCT AGGCTCATGGAGATTGCAGGACTGAG CTGCGAAGCTTT ATTCATGCC CAACACGACAGGCG CCAGGTGCCCAG 
GGCCTCAGAGGTGGGTTTCCGTTGTTCTTCACACAACCITGAACTTGTGGCCCATCCCTGGACCT 
AGTGTTTGTTAGGAATCTATAATTTTCTGCGGAGTCACGCGTGATATTTGCTGTGGCTTTTGGTAT 
GGATTTTCTGGTGAAAATGTGCCCTGGGTCCACACAAACG^ 

TTTAATAAAATGAACCTTAGAAAAGTTATAACTACGGCCAGGCGTGGTGGCTCACACCTC 

G CG AG AG GAT CACCTGTGGTCAGGAGTTC CACACCAG CCTGGTCAACTGG TGAAAC C CCC CT CTC CACTAAAAAT ACAAAAATT AG 
CTCCGCACCATGGTGGGCACCTAGAATCTC^GCTACTTGGGGGGGCTG 
CAGTGAGCTGAGATTGCACCACTGCACTCCA^ 
TTTAC TTTTTTTT Cl 1 T rTGTGGAGATGGAATCrTGCTATGTTGCCCAGGCTGGTCTCAAACCCCTAGGTTCAAGTGA

CTTTGGCCTCCCAAAATGCTGAGACGACAGATGGGAGCAACCTCACCTG^ 

GGCTCTCTGCCTCCCCCrGGTGGCACTGCTCTGCCAG 

CCACGC C CAGGG CCAT CCCCCGCG ACACTGCT ATT ATCCTGTTTT ACACATG AGGAAG CTGGGG CT CAAAG AGG TTAGG CG ACTTA 
5 GCCAAGTTCGCCCATTCACTGAGCCAGGGCTATOTGATTCCCGGTCCGTGT 
GGAAGGCCTGTAGAGTTGATGCC^TCCAGGCTGGTGACTTTTC^ 
GCCACTTTCCCAGGAGTCAATACCATGGCCAGCCTGGCATCGGATAC 
TACTGGAGCTCTTCCCAGCTCCACCCGGGCCCTT^ 
AATCATCACTCAATGCTCAGGACrcTCTCCTGCCTGCTGTC 
1 0 CGGGTTCTTGGGGACTCTCCTGGGTTCAGCGCCTCTTTCTCCTAGGCCAC 

ACTTCCAGACACCACCTGCGGGTCTCTTAGCAAGCCTACAGAGTCCACACTAGCCT 
CTCATGTGGACTGTTCACrrrCAGCCAGAAATGCTCCTTCTCCr 

GCACCTCTGTGGCCAGCCAGCATCCATGTTCCCCGGAGGCAGAGTCTGGGCTTGTCTGTCTC^ 
1 5 GTGCCTGGGGTACAGCAGGGACTCAGCAGACATCTGTGCATGGTTTGGGGATAGAG 

GCTCAAGGAGAGCCAGCAGTTCCTCCTGTGGCCCT 

CTGCTTCATTCCTAGTGCTTTITAGCACCrrrACATGGTAAGTACT 

TGTGCCCAACA.TGCTTGAAGATGCTGGAGATGAGGAGCATGACGCT 

CTGTGCCTCCGGGGAGGCCTGGTGTGtGAGGATACGCTTGCCACCCTCTGAGGTCAGGT 
2 0 AGGGCAAGGCTGTCACCTTCTTTTACAGGGGAGGAGGCAGAGGCTT^ 

AGTGGCTGAGGGTCTCTGACTTCTGGAAGCTTGCAGTACAGCTGGA 

TGTGCTCTGCTAGCCAGGTGGGCGCCCAAAAGGAGATGCCTTTGCCTGTGCTGGGAGG^ 

CCGAGAGCAAGGGCACATGTGCCAGGGCGGGGATGCACCTTCAGCCTCATGTCAGCCCTGGGAAGGCCT 

GTCTCGCTGCACACCCCTGGCGAGCACACCGGGCGAGCACTCCAGGTTAGGGTGGGGCT 

2 5 GTGGCAGCGTGGTGAGCATGTTTGCGGACCCTATGCTGTGTtSCTTGGCAGGCCTTGTGCATGGAA^ 

TCCACACTGCCCACCATGGCCCTGGGGGTCTGGCCTTCCCCTC^ 

CCTGGCCCACCCTCTGG CCAG CCTGGTGGCATGCTG ACAGGG AACCCCTG CAGGGG CCTG ATG AACAAACG CCACAG AACAAACTC 

TACGGACCCTGCCTCCAGATCTGCTTCCTCTCCCCTGATTC 

TCAGTGCTCTCCACCTTGGAGGCCTTCATATGTTTTTT^ 

3 0 CTCCAGGCCAAGCCTCCAGCTTCAGCGTCCAGGACTAAGGGTCTGTGTCAGGG^ 

CCCAGTAAACCTCCACAAGCACCCAGACATACTCX3CCACTGCTATTTTACACCAGGTAGGAAA 
GGGCCGTCCTGCGTTCACTGAGACTGCTGCTCTCCTGGGGGCTCCCTGGAGGACCGCCACTGGC 

CAGGTCCACCCATCAGCCTCTGG CATTTCAGG CCCCTATCAT AGG AGGCTAT AAGCACAGGCT AGGGGTCCTGAGTGCCTGCCTTC 
TAGACCAACACTGTCCAGTGGAAATGTAACACCAGCCCC^TGTGGAGTTTTACAT^ 

3 5 CAGGTGAAATAAAATTAAATAGTATTTTGTTTAATCT 

TCGATATTTGCATCCTTCACAACCTACCATGTAGTTTACACTCAGAGCCCAG 

CTGGACAGC^TGGCTAATGGGAGGAGCTGACCCGTCGGAGGCACCAAGGTCACGCTGTGACCAAGGCT 
CGGGrTTCCTGGCACAGCCTGGAGGTCCACCAGCCTGTCCTGTGTGCTCTGCATAA 

4 0 ACTTGGAGAATGGAGAAACCAGCGGATG CTGCAGGCAAAAGCAG CAAAGCCC CACCAGG AATAGTGATGGCCACCACCGGCGGGAC 

ACCTCCCAGCACTGGGTGGAGGCCTCTGCTGGCTCGCAGTCAGTCACCTGATGTCCCCATTCGCCCCTGCC^ 
GCAACCCCATTTCAGCCCTGAAACX3TTTAGGTGCTCATCTTGCTCTATCT 
TTGAAAAAGGGCACACAGCGTCT CAGGGG CAGGCTGGGCCTTCAACGGAAGGGTCT 
AGGGTGCAGGAACCCTCCACTCCACATCAGGCTACAA^ 
45 AGAAAGACAGCCGCGCCCCCTGCAGGTGATGTGGTGATTCACACCCCTAGGAGCCAACAGAAGCCGCTGTCTA^ 
GAGGTCCAGAGACCTTGCATTTTATGCCTAGGGCAGGGTTGGTTTCTGCAGCT 

GCGAGCCCCACGCCTGAGAGGCAG(^TCCACCTCTGCATCCCCCTCTTTGGAAAGGGCAA^ 
TGCAGAGCGCCTCGACCACTGGGAAGAAGCAGATGGGGGCAGCGGG 

5 0 AAACAGGTCAGGAATCTGACTCCTGAAAGGCAAATGTTAGCTGAGCAGATTGGCCCTGGTC 

CCTGCACCGCCTGCGGCCCTGCTGGCGTGGCACGGATATCACCCCCTCCCTCACCAGCCGCCGGCCCAGCT 
TCCCACCCAGGGAGGACCTC^GCCGCTCCTGGGAGCTCAGCCTCCCGCGATGCCATCGGGCGG^ 
CTCTGG AAGG AGCAG CAG AGG AAGGC CG CGG CTCTTCC CTGC CCTTG C 

HUMAN SEQUENCE - mRNA

AGGCCCGCGATGCTCCCAGCCCGGTGAGACCTGCCTGAATGGCGGGAAGTGTGAAGC 
GCGGGGCCTTCGTGGGCCCGCGATGCCAGGACCCCAACCCGTGCCTCAGCACCCCCrGCAAGAACGC 
GACCGCAGAGGCX3TGGCAGACTATGCCTGCAGCTGTGCCCTGGGCTTCT 
CCTCACCAACCCCTGCCGCAACGGGGGCACCTGCGACCTGCTCACGCTGACXMAGTAC^ 

6 0 GGAAATCGTGCCAGCAGGCTGACCCGTGCGCCn'CCAACCCCTGCGCCAACGGTGGCCAGTC 

TGCCACTCCCCACCCAGCTTCCATGGCCCCACCTGCCGGCAGGATGTCAACGAGTG 
AGGCACCTGCCACAACGAGGTCGGCrCCTACCGCTGCGTCTGCCGCGCC^ 

CCTGCAGCCCCTCGCCCTGCCAGAACGGGGGCACCTGCOSCCCCACGGGCGACGTCACCCACGAGTGTGCCTGCCTC 
ACCGGCCAGAACTGTGAGGAAAATATCXIACXBATTGTCCAGGAAACAACTGCAA 

6 5 CTACAACTGCCGCTGCCCCCCAGAGTGGACAGGTCAGTACTGTACCGAGGA 

AGAACGGCGGGACCTGCCACAACACCCACXXSTGGCTACAACTGCGTGTC 
ATTGATGACTGTGCCAGCX3CCGCCTGCITCCACGGCGCCACCTGCC 

CCGCACAGGTCTG CTGTGCCACCTCAACGACGCATG CATCAGCAACCCCTGTAACGAGGGCTCCAACTGCGACAC CAACCCTGTCA 
ATGGCAAC^CCATCTGCACCTGCCCCTCGGG^TACACG^ 

7 0 CCCTGCGAG CATCCGGGCAAGTGCATCAAC!ACGCTGG<3CTCCTrCG AGTGCCAGTGTCT CCCCG ATG CGA 

GATCGACXjTCAACGAGTGCGTCTCGAACCCGTGCCAGAACGACGCCACCTGCCTGGACCA 
TGCCCGGCTACGAGGGTGTGCACTGCGAGGTGAACACAGACGAGTGTGCCAGCAGCCC 
AAGATCZAATGAGTTCCAGTGCGAGTGCCCCACGGGCTTGACTGGGCATCrGTC 
CTGCAAGAA7GGTGCCAAGTGCCTCGACGGACCC a ^ 
75 TGGACATCGATGAGTGCGACCCCXSACCCCTGCCACTACGGCTCCTG^ 
GGCTACACGGGCCACCACTGCGAGACCAACATCAACGAGTGCT

CAACGCCTACCTCTGCTTCTGCCTGAAGGGGACC^ 

ACTCGGGCACCTGTCTGGACAAGATCGATGGCTACGAGTGTC 

CGATGAGTGTGCGGGCAACCCCTGCCACAACGGGGGCACCTGCGAGGACGGCATCA^ 
5 ACCACGACCCCACCTGCCTGTCTGAGGTCAATGAGTGCAACA^ 

TACAAGTGCGACTGTGACCCTGGGTGGAGTGGGACCAACTGTGACATCAACAACAATGAGTGTGAATCC^ 
CGGCACCTGCAAAGACATCACCAGTGGCTACGTGTGCACC^ 
AGTGTGCGTCCAACCCATGTCTGAACCAGGGCACGTGTATTGACGACGTTGCrc 
GGTGCCAOrrGTGAGGTGGTGCTGGCCCCGTGTGCCCCC^^ 
1 0 GAGCTTCTCCTGTGTCTGCCCCACGGGCTGNNNNNNNNNGGCAGACCTGTGAG^ 
CGGCACGGCGCATCCTGCCAGAACACCCACGGCGGCTAC^ 

CATCGACGACTGCCGGCCCAGACCCGTGTCACAACGGG<3GCTCCTGCACAGACGG 
GGCTrCCGGGGCACTTTCTGTGAGGAGGACATCA^ 
GGACAGCTACRCGTGCACCrGCCCCGCAGGCTTCAGCGGGATCCACTG 
1 5 TCAACGGTGGCACCTGCGTGGACGGCATCAACTCGTTCACCTGCCTGTGTCCACCCGGC^ 
GTCAATGAGTGCGACTCACAGCCCTGCCTGCATGGCGGCACCT 

CTACACTGGCCCCAACTGCCAGAACCTTGTGCACTGGTGTGACTCCTCGCCCTGCAAGA^ 

CCCAGTACCGCTGCGAGTGCCCCAGCGGCTGGACCGGCCTTO 

CAAGGTGTTGACGTTGCCCGCCTCTGCCAGCATGGAGTC 

2 0 CTACACAGGCAGCTACTGTGAGGACCTGGTGGACGAGTGCTCACCCAGCCCCTGCOIGAA 

GCGGCTACTCCTGCAAGGTGCGTGGCCGGCTACCACGGGGTGA 

GAACGGGGGCACCTGCCTCGACCTCCCCAACACCTACAAGTGCTCCTGCCCACGGGGC^ 

TACAGCTGCACCTGCCCGCCGGGCTTCGTGGGTGAGCGCTGTGAGGGGGATGTCAACG^ 
25 TGGCACCCAGAACTGCGTGCAGCGCGTCAATGACTTCCACTGCGAGTGCCGTGCTGGTCAC^ 
TCAATGGCTCCAAAGGCAAGCCCTGCAAGAATGGGGGCACCTGCGCCGTGGCCTCCAA 
CCTGCGGGCTTCGAGGGCGCCACGTGTGAGAATGACGCTCGTACCTGCGGCAGCCTGCGCTGCCT 
CGGCCCGCGCAGCCCCACCTGCCTGTGCCTGGGCCCCTTCACGGGCCCCGAATGCCAG 
GCAACCCCTGCTACAACCAGGGGACCTGTGAGCCCACATCOTAGA^ 

3 0 CTCTTGTGCCACATCCTGGACTACAGCTTCGG^ 

GCCCGAGTGCCAGGAGGACGCGGGC1AACAAGGTCTGCAGCCTGCAGTGCAACAACCA 

C CCrCAACTTCAATG AC CC CTGG AAG AACTGCACG C AGT CTCTG CAG TG CTGGAAGT ACTTCAGTG ACGkSCCACTGTGACAG CCAG 

TGCAACTCAGCCGGCTGCCTCTTCGACGGCTTTGACrrcCCAGCGTGCGGAAGGCCAGTGCAACC 

GGACCACTTCAGCGACGGGCACTGCGACCAGGGCTGCAACAGCGCGGAGTGCGAGTGG^ 

GAGCTCAGCCGCGTGCTGCACACCAACGTGGTCT^ 

GGAGGAGCTGCGCAAGCACCCCATCAAGCGTGCCGCCGAGGGCTGGGCCG(^CCTGA 
TGCTCCCTGGTGGCAGCGAGGGTGGGCGGCGGCGGAGGGAGCTGGACCCCATGGACGTCCGCGGOT 
GACAACCGGCAGTGTGTGCAGGCCTCCTCGCAGTGCTTCCAGAGTGCCACCGACGT 
40 GGGCAGCCTCAACATCCCCTACAAGATCGAGGCCGTGCAGATG 

CX5TGGCGGCGGCa3CCTTTGTGCTrCTGTTCTTCGTGGGCTGCGGGGTC 

TCTGGTTCCCTGAGGGCTTCAAAGTGTCTGAGGCCA^ 

CCCCTGAAGAACGCTTCAGACGGTGCCCTCATGGACGACAACCAGAATC 

45 TGCGCATGTCTGCCATGGCCCCCACACCGCCCCAGGGTGAGGTTGACGCCGACTGCATGGACGTCA^ 
TTCACCCCGCTCATGATCGCCTCCTGCAGCGGGGGCGGCCTGGAGACGGGCAACAGC^ 
CTCCGACTTCATCTACCAGGGCGCCAGCCTGCACAACCAGACAGACCGCACGGG 

CACG CTCTG ATGCCG C CAAGCGC CTG CTGG AGGCCAGCG CAG ATG C CAACATC CAGGACAACATGGGC CGCACCCCG CTG CATG C G 
GCTGTGTCTGCCGACGCACAAGGTGTCTTCCAGATCCTGATC 
50 GACGCCACTGATCCTGGCTGCCCGCCTGGCCGTGGAGGKSC^TGCTGGAGGACCTCATCA^ 

ATGACCTGGGCAAGTCCGCCCTGCACTGGGCCGCCGCCGTGAACAATGTGGATGCCGCAGTTGTGCTCCT 
AAAGATATGCAGAACAACAGGGAGGAGACACCCCTGTTTCTGGCCGCCCGGGAGGG 
CCACTTTGCCAACCGGGACATCACGGATCATATGGACCGCCTGCCGCGCGACATCG 
GGCTGCTGGACGAGTACAACCTGGTGCGCAGCCCGCAGCTGCACGGAGCCCCGCTGG^ 

5 5 TGCTCGCCCAACGGCTACCTGGGCAGCCTCAAGCCCGGCGTGCAGGGCIAAGAAGCT 

TGGAAGOUVGGAGGCCAAGGACCTCAAGGCACGGAGGAAGAAGTCCCAGGACG^ 

TCTCGCCCGTGGACTCCCTGGAGTCACCCCATGGCTACCTGTC^GACGTGGCCTCGCCGCCACTGCTGCCCTC 

TCTCCGTCCGTGCCCCTCAACCACCTGCCTGGGATGCCCGACACCCACCTGGGCATCGGGC^ 

GATGGCGGCGCTGGGTGGGGGCGGCCGGCTGGCCTITGAGACTGGCCCA 

6 0 GCACCGTCCTGGGCTCCAGCAGCGGAGGGGCCCTGAATTTCACT^ 

TCCCGGCTGCAGAGCGGCATGGTGCCGAACCAATACAACCCTCTGCGGGGGAGTGTGGCACC^ 
CTCCCTGCAGCATGGaVTGGTAGGCCCGCTGCACAGTAGCCITGCTG 

CCAGCACCCGGCTGG CCACCCAG CCTCIACCTGGTGCAG ACCCAGCAGGTGCAG CCACAAAACTT ACAGATGC^ 
CAGCCAGCAAACATCCAGCAGCAGCAAAGCCTGCAGCCGCC^ 

6 5 CGGCCACCTGGGCCGGAGCTrCCTGAGTGGAGAG CCG AG CCAGGCAGACGTGCAGCCACTGGG CC CCAG CAGC CTGGCGGTG CACA 

CTATTCTGCCCCAGGAGAGCCCCGCCCTGCCCACGTCGCTGCCATCCTCGCTGGTCCC^ 
CCCCCCTCGCAGCACMCTACTCCTCGCCTGTGGACAACACCCCCAGCCACC^^ 
GTCCCCTGAGTCCCCTGACCAGTGGTCCAGCTCGTCCCCGCATTCCAACGTCTCCGACTG 
CCAGCATGCAGTCCCAGATCGCCCGCATTCCGGAGGCCTTCAAGTAA 


HUMAN SEQUENCE - CODING 
ATGTACGTGGCGGCGGCCGCCTTTGTGCTT CTG 

CCAGCTCTGGTTCCCTGAGGGCTTCAAAGTGTCTGAGGCCAGCAAGAAGAAGCGGCGGGA^ 
TCAAGCCCCTGAAGAACGCTTCAGACGGTCCC 

7 5 TTC CGGTTCGAGG AGCC C GTGGTTCTG CCTGAC CTGG ACG ACCAG ACAG AC CACCGG CAGTGGACTCAG CAG CACCTGGATG CCG C 
TG ACCTG CGCATGTCTGCCATGGCC CCCACACCGCCCCAGGGTGAGGTTG ACG CCGACTGCATGGACGTCAATGTCCGCGGGCCTG

ATGGCTTCACCCeGCTCATGATCGCCTCCTGCAGCGGGGGCGGCCT^ 

GTCATCTCCGACTTCATCTACCAGCKSCGCC^ 

CTACTCACGCTCTGATGCCGCCAAGCGCCTGCTGGAGGCCAGCGCAGATGCCAACATCCAGGACA 
5 ATCCGGCTGTGTCTGCCGACGCACAAGGTGTCTTCCAGATCCTGATCCGGAACCGAGCCACAGACCT 
GGCACGACGCCACTGATCCTGGCTGCCCGCCTGGCCGTGGAGGGCATC 
CGTAGATGACCTGGGCAAGTCCGCCCTGCACTGGGCCGCCGCCGTGAACAATC 
CTAACAAAGATATGCAGAACAACAGGGAGGAGACACCCCTGTTTCTGGCCGCCra 
CTGGACCACTTTGCCAACCGGGACATCACGGATCATATGGACCGCCTGCCGCGCGACA 
10 CGTGAGGCTGCTGGACGAGTACAACCTGGTGCGCAGCCCGCAG 

CGCTCTGCTCGCCCAACGGCTACCTGGGCAGCCTCAAGCCCGGCGTGC 

GCCTGTGGAAGCAAGGAGGCCAAGGACCTCAAGGCACGGAGGAAGAAGTCCCAGGACGGCA 

CATGCTCTCX5CCCGTGGACTCCCTGGAGTCACCCCATGGCTACCTGTCA^ 

15 CCCGAGATGGCGGCGCTGGGTGGGGGCGGCCGGCTGGCCTTTGAGACTGGC 

CACCAGCACCGTCCTGGGCTCCAGCAGCGGAGGGGCCCTGAATTTC!ACT 

GGCTGTCCCXXKrrcCAGAGCGGCATGGTGCCGAACCAATACAACCCTCTGCGGG^ 

GCCCCCTCCCTGCAGCATGGCATGGTAGGCCCGCTGCACAGTAGCCTTGCTGCCAGC 

CCTGCCCAGCACCCGGCTGGCCACCCAGCCTCAC 
2 0 ACCTGCAGCCAGCAAACATCCAG CAGCAGCAAAGCCTGCAGCCGCCACCAC CACCACCACAGCCGCACCTTGGCGTGAGCTCAGCA 

GCCAGCGGCCACCTGGGCCGGAG CTTCCTGAGTGG AG AGCCGAGCCAGGCAG ACGTGC AGCCACTGGGC CCCAGCAGCCTGGCGGT 

GCACACTATTCTGCCCCAGGAGAGCCCCGCCCTGCCCACGTCGCTGCCAT^ 

TGACGCCCCCCTCGCAGCACAGCTACTCCrOSCCTGT 

ACCCCGTCCCCTGAGTCCCCTGACCAGTGGTCCAGCTCGTCCCC 
2 5 TCCCACCAGCATGCAGTCCCAGATCGCCCGCATTCCGGAGGCCTTCAAGTAA 
Table 13

MOUSE NOMENCLATURE 
ICSGNM Myc 
Celera mCG1625 


HUMAN NOMENCLATURE 
HGNC MYC 
Celera hCG1791915 

MOUSE SEQUENCE - GENOMIC

CTTCCTTCTCCCTTCACTGAAACTGAACrcACCTTGGGATG 
GTCAAACAGCCTAGCAAGCTCTAAACCAGGTACTCTGCACCTGCCCTCTGAC^ 

AATG ACAT AATT AAAAC CAGC CT CAG AAGCTGCTG AAG GCATT AGAACT AAAG C CCAGAAAAG CA^ CATCAAAG C 

GTGAAAGAGGAAACGGTTAATCTTTTCAAGGGAAGTGAAGCAACTCGGAA 
1 5 AGCCTTGGAACTGGGGCTTTCAAGCrACATCT 

GTTTTCAAACCTGGATTTTCACACTCCCTGGAATGCCTGG 

GGTTCTGAGCCACAGCTCTTAGCTAG^ACCAAGCAAATGGTCCCCAGTGTGTTA 

ATGCACTCCTTTCCAATCCTNNNNNNNNNNNNNNNNNNNNGATT^ 

AG T AGT AG AAAGGGAAAGCCT AAT CAAG C CTGCAGCTG AGAGATTG C C CTGTGC CACAAC CC CCAAG T CCACCCC CTTATAGGACA 
2 0 GCACCTTAGTCATTCAGGGAAAGCCTTAATGGATTGCCTTCATTCGCCTTGATT 
TGCACCAACTCCTATCGTAAATGAAGACAGGGGTGAGACTGGAAG 
TCATCTTGATCAGTTTCAACCAAAAAAGAAGGAAGCTTTAAGCAGGACTCCrr 
TCCCATTATTGCTCATGAGACCTTGTTC^TATGTCACCAAGAATAT^ 
TAAGATTTCTTTATTTGATGTATGTGTTTTGCTTGCATAATGTATATGTACCACGTG 

2 5 GGTTTCAGTITCCCTGGATTGATGACTGTGAACTATCATGTGTATGCTGTGA 

TTCTTAACTGATGATCCR.TCTCrCCAGCTTCCGCTCAGAAATTTTC^ 
CGGTTAAGAGCACCGACTGCTCTTCCAAAGGTCCTGCGTTCAAATCCC^ 
GGCTCAGCAGTTAAGAGCACTGACTGCTCTTCCAGAGGTCCTGAGCTCAATTCC 
AATGTGATCTGATGTCCTTTTCrGATATGTCTGAAGACAGCTAAA 

3 0 AGAAAGGAAGGAGGGAAGGGAGGAAGGAAGGGAGAGAGAGAGAGAAAGAACGGAAGGGAGGGAGGGAGG 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN1JNNNTO 

GTCTTCTCCGGTTTGCC1TCAGAACAAGATGTAAAACTCTCAGCTCCTTTAGCT 
GCCTTGATGATAATGAACTGAACCTCTGAGCCTGTAATCCA^ 

4 5 TGTCTCCTCACAGCAATGGAAACCCTAACTAAGACAGCAAGTTTAGCAGTAAATGG 

ACTTTGACAT AG CT AGTGAGACAG C^GGTAATGACAGTAGCCAAACCAGCCTGATAAACAAAC^GCTAG CCCAGAATGCATAGCAC 

AGCAGCAAAAACATTGGAGAACTGGCTCAATGACAAAGTGGAAGGAGGAAAGGCAATTCCTAAATGTTGTC^ 

CTGCACACTGAAGCACACACACACACAAATAAACAACACTTTAAAG 

CGGAAAAG AAAGAGGAAAAAAAAT CAATGGAGCAAAG AGACCAGG CT G CACACACAGGAGGAACAT C AGG AG AC CCAAATT CACC C 

5 0 AAATGTTCTCAATGTCAGCATGTTTGCATCTAGGTCCCAACACT 

CCAATAGGAAGCCCCAACCCCCACCACCTATAACAACCTCAAATATCTCCTCCTGTTGGTCAATG 

TCTAGTTGAGAGGTACTG ATCTCT AGAATCCAAAAGTGTG AG CATTTCCAGAAG CTTTCCCAGCAAGCAACAAAAAAGAGATGGTC 

TCACG^H'CACCGGTTGTGGGGAAGATAGCTTGAGTGCCTTTAGC^AGGTT 

GGAAATGGAGAAGGTGAAAGTGAGAGCAAGCAAGGCTGAGCTGCACTCAAAATTCT 

5 5 GTTT GGGGTCTGTTCTAG CCAAT AAGAGG CACAGG CT ACT ATTG CACTGG CTT CTAAATG AAAGTCAAAG ATT ACGG ACAGAAAAA 

AAGGAAGAAAAAAAAGTAAAAAAGAAAGCAAGACCCGTCATGCACCT 

TCATGGAGAOVGATGCCGCCCAAAAGGACCATCACAAAGGCCGGCCAGTGAAC^ 

CAGAACAATAAACGAAGCCTTTGAAGCACTGTTAAACAAAAGAGCCA^ 

AGCAGAGGACTAAGAAAAGAAAGAGAAAATGCCAAACTGAGATGTGCCTGGACTTTACCGGGTCCTGCCCA 

6 0 AGGACCTAAGATCTAAAAGCCACTGGGTAAGAAAACAATCAAATATGCTTAATC 

GTTAAAAATATCTACTAATTTTTTTTCCTTGTCAGCTGCTTTAATTTATGCAGTATGGGAAGG 
AAGAACAGAAAAGAAACACCGTACATTTTCAAATGCIACAAAGATCT 

GCTAAATTTTATCTTATTTTCTTGCCTGGTGGGGGGGGGGGGGAGGGCGGGGAGGGTGACTGTG 
TGCCATTTACATGCCTGTGAGGTCCTCTTTTCCTTTAGCATGCC^TGGCTAGCTTGGTTTTTAATCT 

6 5 TGATGTTTCTGCGGCAGACAGGTATTTATCATTGTCTTTGCT 

AAATCGTAGCCCGTCCTTAGGGTCTCCTGCTTCTCTCCCGAGTAGGAGCTAGGTTTT^ 

TTTTTATACATGG AAGGTTAGCTCAGTGGTGATGGTTGGGGGGGGGGG CTCTTTGAAGCATG CAG AAAAG AGATT CAAG AACATTG 

GGGGATGTTATCTAGTGGTTTTCTCCTTGTTGTTACAAGAGTGACTTTGGAGAAGAG 

AAAATCTGGGATGGAAGAAGTCATTGTGGGAGCAGCTGGAAGCTTTGTCACTTGGGGAGCCTTG 

7 0 AAGCAGGTTGGAGGGTGGGGAGACAGA1TCTGGAGTTTGGCGCTTCT 

CCCCTGGGTTGACACAGATCACATTAGTGTGAGTAGGTCITCCTTCT 

C!AGATCTGCATCTCCTAAGTGATCAAGTCGACAATCAGAAC!TAACCATCA^ 

TA?^AGGC?J^GL n .CAGGTGGCTCTCrr^ 

aaacaaaaXicaaaaaacaaaaaacaaaaaaaaagaaaaga 
ATGTTATTCAC^CCACATATTTGGACCTCCCCTCTTGTGTCTTT

AT AT ACAGTTCAGTGACTrGAATG AATGCTG AC CACAAATGAGG C C CTGTTTC^^TACTG AGCACAC CTTCCTCTCCTT CC CAT AAC 
AGCCCTTTCTTTCTGGAGGTCTTCC r r i - n - n 'C T r A AAAATGTAAGAGCATC^ 
ACTGTGATTCrTAGTTTTTCATACTGTAAGAGGGAATGTACTCT 
5 GCCTCCACX3AAGACTAGGATCTTCTGGAAAGCTGTGTGCITTC 
CAGAGTTACAACGTTCCAGAGGAAGAAAACAGTGGGACATC^^ 
TTTGTCCTTAGTAAGCTTCTCCACAGAGGCACGCATGGCTGTC^ 
AAACTAGCTTGAACTGGCTGCATAGTGAGGCCCTATCT 
AGTGTTTGAAAGACTCTGGATTTGATTCCCAACAATAAAAACA^ 
1 0 ACACACACACACACCCAACT C CTT AAACACTAAAATTTAGTGT AGG T AG AGTGG CACAC^ AAACTCT C CAG CACTACAAAC C 

CTGAAGCAGTTTTTATTTTAGTTAGAAGCCATCCTAGGCTACCITCT 
TTATAAACTTGTGGCrrTCCTGTCCTTT^ 

TGTAAAATGGGGATGTCCCCTACCTTCCTGTTACTTAAAAGCAACAC^ 

GTCCCCAAACCACAGGCTATTCCGCTACCTGGGTTCCCACACCTA 
1 5 CCATAATTACACCTTTTCAAAAAGGACCTTITGAGGGTCAGCT 

G AGGCAGAGAGAAAG CGGGTGTGCX3CAACCTCTAAGAGAGTAAT AGTAAAGAGACAAG CCTAGGAAATCTTCCCCAAGAGGC CTTT 

TGGGGAAGGCTATCCCCCCTTCCGTGGATGTTGAATACCTTCCCrGTrGCT 

TGATGCTGAGACCACTGTGTAGGTTGGTACAAATTCCCTTTCTATTATACCC^ 

CGTTTTAAAAAAAAAAAAAAAAAGTATTGCCGGGCAATAGTGACGCAGGCCTTTAATCCC^ 
2 0 ATCTCTGAGTTTGAGACCAACCTGGATGGTCTAA 

GATTTGTTTGCTCTCTCTCTCTCCCGCrCCCCCTCTCTCC 

ATCTCTCTCTCAGATGTGTACTTGGTGACGGTCrTTTCTACA^ 

CCCTGGAGCITCTGTACAGTAGCGTGACCTGATCCTCTAGCTTCTCCATCCC^ 

CTACCTAAAATTCCAACCTTTTAATAAAATGGCTCTCTAAAAT 

2 5 TGAACCACCCAATTCGTAAGCAGTCGGGCTGAGGGTGAGCATCCTTAAGA^ 

TAGTT'GCTAGAGGGCTCACATAGCATGCCCTCCCCACCCGGGATTCCATTCTC^ 
TATGTGCTGGGATGCATAAGATCITTTATAAAGAAGAGGAAGAGGAAAAACAGTTA 
GGCTAGCAAAGAATAGTCGTGAGAATCTCTCTCAAAAAAAAAACCCCAAAA 
ATAATAAAACAAGGCRATAATAAGCTAATGCrCCTGCCTTGCATTCTGACTC CT TT T GCCC^ 

3 0 CACGTCCAGCTTACAACIACGACAAAGGTGTGAGACATC 

CACTTGTATACCTGTGACTCATTCATTTTCCCCATCCACAACTAG^ 
ATCTACTCCCCCTCTlTGCAACTGAATAGCCACCTCTGAACCATTTTTTTCrCT 
CTAAAAGAGTCATTTAAAGGATGACCGGAAGCTTGTCTTAGGCAAGGAAGCAT 
CCCATCCGACCTCCGCCCTCGTTGGCTTCGCAACGCTGTGGTCTCTGTGGCCAGTAGAGGGC^ 

3 5 CGAGAGCCACAACCCGGGTGGTGGGGGGTGAGGGGGCGGGGAAAGAGTCT 

GGTGTTTGAGG CAAAATCCT AGAGG CTGT AGTCATTTTG CAATCCTT AAAGCTGAATTGTGCAATGAGCTCG ATGAAGGAAGATAC 
T ATCATTCAACAG CTG AATC CT AAATTG CAAACT CAG TGGCT AAT AA CAACTTTG AACAATG AGCACCTTAT ACACGCT ACTGT AT 
TTTCTTTTCTTTCTTTTTTTTTTT 

GTCCGGAATAAGAAGACITCTTTGGGTTTTAAAGTGTAGGATAAGCAAATCCCGAGGGAATA 

4 0 CAATGCACAGAGCAAAAGACTCATGTTrCTGGTTCGTTAATAAG 

GGGATTGTACAGAATGCACAGCGTAGTATTCAGGAAAAAGGAAACTGGGAAATTAATGTATAAAT^ 
TTAACACACACATACGAAGGCAAAAATGTAACGTTACTTTO 
CCAGTAATAAAAGGGGAAAGCTTGGGTTTGTCCTGGGAGGAAGGGGTTAAC^ 
AGATCTGGGTTGGCAATTCACTCCTCCCCCTTTCTGGGAAGTCCGGGTT 

4 5 TAGCGCCTTGATTTTCCCCACCCCAGCTCCTAAA^ 

TGAAAJVCCX5ACTGTGGCCCTGGAACTGTGTGGAGGTGTATGGGGGTGTAGACCGGCAGATACTCCT 
CGCACCCGCCGCCACTTTACTGGACTGCGCAGGGAGACCTACAGGGGAAAGAGCCGCCT 
AACCGGAGGTCCTGGAGTGTNNNNNNNNNNNNNNW 
GTTCCACCTGCGGTGACTGATATACGCAGGGCAAGAACACAGTTCAGC^ 

5 0 GACTAGCGCGCX5AGCAAGAGAAAATGGTCGGGCGCGCAGTTAATTCATGCTG 

TGGGCTGCGGGGCTGAGGCTCCrCCTCCTCTTTCCCCGGCTCCCCACTAGCCCCCCT 

GCGAGAGGAGGAAAAAAAAATAGAGAGAGGTGGGGAAGGGAGAAAGAGAGATTCTCTGGCTAATCCCCGCCC^ 

TTCCGGGGGTCTGC^CGGCCGAGGACCCCTGGGCTGCGCTGCTCTCAGCTGCCGGGT 

CCTCCTGAAGGGCAGGGCTTCGCCGACGCTTGGCGGGAAAAAGAAGGGAGGX3G 

GCCGCGCTCCGGGGCGACCTAAGAAGGCAGCTCTGGAGTGAGAGGGGCTTTGCCTCCGAGC 
CGACTGACCCAACATCAG CX^CCGCAACCCTCGCCGCCGCTGGG AAACTTTC 
TTACAATCTGCGAGCCAGGACAGGACTCCCCAGGCTCCGGGGAGGGAATTTTTGT 

CGCGATCAGCTCTCCTGAAAAGAGCTCCrrCGAGCTGTTTGAAGGCrGGATTTCCTTTGGGCGTTGGAAACCC 

6 0 CTGGTGGTCrrrCCCrGTGTTCTTTCTGOSTCTTGAATGTAG 

CITTTCTCAAGAGTAGTTGGGGTAGGCTGGGGTAGATCTGAGTCGGGGTAGAGCGACTTGTCAAGAT 
GAAAAACCGGGATGCATTTTGAAGCGGGGTTCCCGAGGTTACTATGGGCT 
TACATTAATTGATATGTGTCCTTTGAGGGGTCAAACCGGGAGGTCGCTT^ 
TTTGGAGAAGGGATTACCTTTTGCGTTTGGGAGCGAGAAGGCTCCGTAGCITCTGAC^ 

6 5 TCAGCTTGGTCCATTTCTGACAGCCTGGGACCGACACGG 

TG AGCCTTTT AAG AAG TTGCT ATTTTGG CTTT AAAAATAGTG ATCGT AGT AAAATTT AAG C CTG ACC C C CG CG G CACT AGG ACTTG 
ATGTTGGGCTAGCGCAGTGAGGAGAAGCAAAATTGGGACAGGGATC 

CACATTCCTACTTGGGATCCGCGGGTATCCCrrCGCGCCCCTGAATTGCTAGGAAGACrcCGCT 

TAACAGCTGCT ACCCT CGGCGGGG AG AGGGAAGACGCCCTGCACCCAGTGCTGAATCGCTGCAGGGTCTCTGGTG CAG 

7 0 CGGTTT AGAGTGTAGAAGGG AGGTG T CTCTTATT ATTTG ACAC CCC CTC C CCTTTT ATTTCGAG AGG CTTGTGAT AG CCGGAG ACT 

GGAGAGGGGCXKjCGGCGCCGGCTGGGTAGGAGCGCGGCGACXK^ 
CACACGGGAGGGGGCCGAGCGACGCGGCGCCTCTCGCCTTTCTCCTTCAGGTGGCG 
TGTATTCCCTACAGTCGCCTCCCTCAGCCTCTGAAGCCAAGGCCGATGGCGATTCCT 
7 5 AAGGAGGCGGGGACTCGGAGCAGCTGCTAGTCCGACGAGCGTCACTGATAGTAGGGAGT 
AC^CACACACACACACACACACACACACACA^

GGGGTACGCGCTGCGCCAGGTTTCCGCACCAACCAGAGCTGGATAACTCTAGACTTGCTTC 

AGCCACGACGATGCCCCTCAACGTGAACTTCACCAACAGGAACTATGACCTCGACTACGACTCCGTA 

ACGAGGAAGAGAATTTCTATCACCAGCAACAGCAGAGCGAGCTGCAGCCGCCCGCGCCCAGTGA 
5 crGCTTCGCACCCCGCCCCTGTCCCCGAGCCGCCGCTCCGGGCTCTGCTCrCCATCCTATGT^ 

AAGGGAAGACGATGACGGCGGCGGTGGCAACTTCTCCIACCGCCGATCAGCT 

TGAACCAGAGCTTCATCTGCGATCCTGACGACGAGACCTrCAT 

GCCGCTGCCAAGCTGGTCTCG^AGAAGCTGGCCTCCTACCAGGCTGCGCGCAA^ 

CAGCGTCTGCTCCACCTCCAGCCTGTACCTGCAGGACCTCACCGCCGCCGCGTCCGAGTGCAT^ 
1 0 ACCCGCTCAACGACAGCAGCTCGCCCAAATCCTGTACCTCGTCCGATTCCACGGCCT^ 

TCCGAGTCCTCCCCACGGGCCAGCCCTGAGCCCCTAGTGCTGCATGAGGAGACACCGCCCIAC 

ACCCCATTCACAGCAGGGTAGGAAGCGAGAGGTTGGATGGACCTCCTTCrcCACCACT 

GGCTCCCCCTTTCTTTCCCTTCTGTCTAAGAGCTCITCATCCC^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNmWNNNNNNNNNNNNNNNNNNNNl^^ 

TTATCTTTCAGCTCCATCCACTCCCTTTACCCCTCCTAAGCATTTTAATTACCCT 

AT CTGGAGGGGGGTG AATT ACCTGC1T C1TTT CT TG ACT GCCAGAAGAAT ATTTG AATTTAATGGAT ACG TTTGTCT AAG AC CCAA 
GAGAAGCAATGACAGAAGCTGGGACAGCCTTTATAGCCTTAGAGCCAG^ 
2 0 TCrTTTGGGTGTTTTGTTTTGCTTTGT 

CGTCCCAACACTCCCCCAACACCAGGACGTTTGGCAAAGCTGCAAGACf T ^ 
AAAATAGGGAGTTGCTAAAGTCATAGCAAGAGATTTGCAGOTATCCCTCACGGGACCTGA 
ATAGGAGGTGCTTGGGAAATGTGCTITGCTTTGGGTGTGTCTGAAGCCTCA 
TCCTGGTACATGGTAATTTTCn'CACCTGTGCCCTAACCCTGTTCTGCCTTTCT^ 

2 5 TTACTTTCTTC CCTTTC CAACTTGGTATITGG AT AGCATCGGTCAAATCCT ATGTATAGCGTCCGGG ATTCAGGAGG CGTGGCTAA 

CTGTGATCTTCCACTTCCTCCCTTACAGAAGAAGAGCAAGAAGATGAGGAAGAAAT^ 

CCCTGCCAAGAGGTCGGAGTCGGGCTCATCTCCATCCCGAGGCCACAGCAAACCTCCGCACAGCCCACTGGTCCT 

ACGTCTCCACTC^CCAGCACAACTACGCCGCACCCCCCTCCACAAGGAAGGACTATCCAGCTG 

GGCAGGGTCCTGAAGCAGATCAGCAACAACCGCAAGTGCTCCAGCCCCAGGTCCTCAGACACG^ 

3 0 ACACAACGTCTTCGAACGTCAGAGGAGGAACGAGCTCAAGCGCAGCT 

ACGAAAAGGCCCCCAAGGTAGTGATCCrCAAAAAAGCCACCGCCTACATCCTGTCCATTCAA 
GAAAAGGACTTATTGAGGAAACGACGAGAACAGTTGAAACACAAACT 
TCGAGGAGGAGCTGGAATCTCTCGTGAGAGTAAGGAGAACGGTTCCTTCTGACZAGAACTC 
AAAGCCTAACCTCACAACCTTGGCTGGGGCTTTGGGACTGTAAGCTTCAGCCATAA 

3 5 AAGAACTTTTTrTTATGCTTCCCATCTTTTTTCTTT^ 

TCTATCCAATTTTCCCATGTAAATAGGGCCTTGAAATGTAAATAACTTTAATAA 
CATGTACCATAATTTTTTTTATTTAAAGACATTTTCATTTT^ 
TGGAAAAAATACAATTGC^CCAACTTGTGTTTTCTTTTTCCTCTTCCr 
ACCATTTTCACAGGGTAGGTTTACAAATATGGGAAGGGGTTATCATTGTTAA 

4 0 TCTACAGGATGCTTTCTGTGGATAGTAATAAAAACCAGAGCTGTTAGTTAGGAATGGGCAAA 

GGGAAGGGAAAAGCAAGAGGTTAAAGATAACAGCTAAATATACAGGAGGAA 
TTCCCTGGTTCACCTCAACCCAAGGACTCTGCCCTGCCAAAGAACTG^ 
TCTTGCTCCCAGGTGATAGTCCCTTCACATCAGTATCTCCTATGCrTCTGAAAAAAACA^ 
GATCCTGGTTTTCCAAACAAGGACATACAAAGGTTCAGAGGGTTGC^ 

4 5 CCCCTGT CTTGAAAACAAG CCTTGCTGCCTTGCTT AGTTGGGTGTCC1TCCX5 CT 

CCATTAGGAGCTTTGACAGAGTGCCACGAAAGAGGTACT^ 

CATCCATGCACCCAGGGGATTCAAAACCACTTTATCCTTAGTGCATCCAGGAGGAAGATCCT 
CATTAAAGCrTCACTACTCAAGAACTGGTAGACTTCAGTTC 

TGTACTCAGAGGCACATAGCTCACTCCTACCTTACCCACTTAGTTTTTAACATCAGATCCCT^ 

5 0 TT AGAAGTGCTACTGGTCAG ATCAT AAAAT AAAACCT^^ CAAGCTCG CCTTCCTCCC 

AAG CACTTT CACAC!ACT CATG AC ACIACGCTCATCAATGG C C CATGAGAAAGC CTTTTGGAAC AGGT AT CTTATT AATT ATAAG CCT 
CTGAAAAGCCACAATCCAAACCAGAAACTGAAACATGTAGGTAAAGTCCAGGGAGAAAGGTACCCT 
TTGTCTCAGAGAAGGTCTCTAGCATCCTGGAGTGTCTTTGGGACIT 
TTTGACACCCCCCCCCCATGATCTATTGAAGCGTTTAACATTTAGTTT 

5 5 TTCAAACTGTTACACCCTCTATCCTACATTCAAAAGAAAACACAAACCCCAGTAGTAGTT^ 

TTAGCAAAAGGGGGGAGCAGGGGGTGCTCTTAAATAGTTCCCTCTTTTT^ 
TGACTCACTTGAGACCTGGGAAGGTGTTAGGTTGAATCACTCAGTCCAGGCAAGCCCA 
GAAAACAACTCCTOTTCCAAATGATCAGGAAACAAGTTTAGAG^ 
CCTATCTGGGCACTTCTCAGCTTTACCAAGCCAGGGAATGGTCTGAAAACAGG^ 
60 GATCTTGACCATAACACAGGAAGGCTCTCCCAGCCAATCTGGAGTCCCAGGGGTCTC 
CAATCACGATGAGGGGTGAAGTGGGAGGAGCTTTCAGCTTGGTCTGG^ 

ATGAC^CTGGAACTCAGTTTCCCCGAGAGGATCCCTGGGATGGTGCTCTTCAGGATAAACTGAGTGAGG 

GCTATCATTCGGGGACAACACTAAAAAGCAATCAATTACACTTTAAGTTGAAC 

CATCCAAAGCTTGAACACGTATGTAGAGTCTTCTGCTGTTCTTTAAGATGGACTTTGGCACTA^ 

6 5 ACCCACCCCCCAAAAACAAACAGAACAAAAAACGTGAGGGGATTTTCT 

TTTTTTTTTTTTTTTTTAGCATGACTGCTAAGGAACCCTGAAGCATTTCTTCCCAACAA 
GGAAGATGAGCTTCAAGCCTGTCTATAAATACATCACTGGGTATCTTCAGGGAACAAACT 
GGATCGATTTCAAGAACAGAGGTTTATTTTAAACAATGAAGCCT 
GTATAAGGACCCATGCTTGACACCACAGGAATAAATAAAACCAAAAATGAGGTTCCT 

7 0 TGGTGGTGGTGGTGGTGGTTCGACGTCTTTCCAGAGAAATCCAACAAACTGCCAATTTAGGGAAA^ 

GCCAAACTGGTGTGAGTTGACTCAATGACTCAGTGGATTTCCTTTTTGGvi 
TTAGGAAGACAGGGTTGAGAGAGGAACAAGTGTTAATGGGATGTAGATCCTCTGAGGCTGAAAG 
ATGCCGAGGGGCTCAGGAACACTCTCTTCCAGTCTTCACCCGACCATCTC^ 
CTAGGAAAGAGGAGCTTTGGGGAAAGCTCTGTCA^ 
7 5 TGGAGAACAATTGTTAGATGTGACAATACTGAAGTTTC^ 
AACGATAATAGCAATTAATGTTTACTGTGTGCTAAAAGTTTGACATACATTATCTCCTTTTATTCCCAAAGCAACCCCCTATACAC
ACACACACCTC CT TT TTG GTTTTGGTTTTACT T T TT AAATTT^ 

TAGAGAGGGAACCTAGAACCTCCTGCATGCAAGGTAAGTGATCAATGGGTGAGTCACATCT 
CAAAAGTATTAGGGCATTTCCTACAGAGTGGTGGAAGTTCGAGTCCTGGGAACATGG 
5 AAAGAAG CT AAAGGAG AAAATGT CAG CAGCAG ATGGCAGGGAAGCAAAG AAG AAGG AG AG CTAAAACCAACAAG CT AAAACCAAAA 

ACGCTTGCTATTTGAACTGTGCCTAAGGTAGCAC^AAGTAC^ 
TCCTATCCACITCCCCTTCCACCCAGTATCCTCCTACTCCTT 
CAGCACCCAACrCTCCTGCAGTACTTTGGTTCTGTTTATTTGT^ 
CTCTGGAATTTCTTGCCTCAGCCTCTTAAGTGTTGTC 
1 0 TTATTCAGGAATTTCCACTGTGTGATTCTAJGGATGCAAATACT 

AGTTGGTTGGTTTTGGGTTTTGTTCATTGGGTTGGTTTGGGTTGGTTTTG^ 

ATTGTCTTTTCTACCACTGCAAACCCAACCATCTTTTCAAACCACTG^ 

TCCACTCCTATTCGATACACITACTACCTACCACAAACACGGCAGCC^ 

1 5 GTGCAATTGACTAATTCTGTAAACATCTAAGTGAAGGGCT^ 
AGAAGCAAAGTTGTTTCAAGTTGCATAGCGAGAAAGGTCA^ 
GACTTCCTtTtSCTTGATTTGGTTTTTGA 

AAGAACTGCTC^GAGAAAAGGAGGGGAATTGTACTGACTGGTC 
TGTGTGTGTGTGTGTGAACrTGACACAAGCTGGAGTTATC^ 
2 0 GCATTTTCTCAATTAGTGGTGAAGGAGGGCCTATGTGGGTGGTGCC^ 

CTGAGCAAGCGAGGGGAAGCAAGCCAGTAAGTAACATCCTTCCAAGACCT 
CAGTCCTGACTTCCTTTGGTAATGAACAGCAGTGTGTAAGTGGAC^ 
GATGTTTTGTGCAGGAATATAAACCCTGACTAAGACAAGAATCAAACT 
GTTITGCrrTGCTTGATGTCrrGGAGCCCTCTCrTAGCTGGGTATGG 

2 5 AGTATTCTTCCAGGTGGTGATGTCACTGGCCAAGCTGTTCTCCCTT 

ATCTTTACACAAATGACTCTAAACCTCTTCTCTCCAC^ 
GCTGACCTGGAACTCACTATGCAGACCAGCCrTGCATTGAACTT^ 

AGGCATGCACCACATTTAGTCCTGCTAAAC'rr CTTTTI TTGTITTTTTGG^lTTTTCGAGACAGGGTTTCTCrGTGTAGCCCTGG C 
TGTCCTG<3AACTCACTCTGTAGACCAGGCTGGCCTCAAACTTAGAAATCCACC 

3 0 GTGTGGTACTACTGCTTAGACTGACACACAAAATATTGTT^ 

GCAGACTTTTGTGAGTTCTGGGCCAGCCAGAGCrTCATATGA 

ATGGGTGTATG AAAG AGTAAATGTGCACATATTTCTGG CATCTGTTTTAAGATT CTACTTCCTT AGGCTGTCAGTCATTCTCACCC 

CATTCAACCCTTTGCTCTGCACTGTGATTGACAAATCrrTCTCT 

GGCAACAACTTCTCTATGCAGGGACTGTAAGTTCCAGACAAC^ 

3 5 AGACAGAGGACIAGAGGAAAGGAAGGAAGAAGGGAGAAAAGGAGGGAGGGAGGG AGGGAGGGAGGGAAAATGTCAATTTATGCAACT 

GTTTCATGATTCATCCAAGGATCCrCTCTC^ 

CCCTC^ACTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCTCGAACTCAGAAACCCA 
TAAAGATGTGTGCCACCACGCCCAGCTAAACCTCTTCCTITCTAGCCAGAAGAACAC/^ 
ACCATGTX3CTTACCCTTAAACCAGACAATTCACTTGGTTCCCAGG 

4 0 GCITCTGAGGCJIAAGGGAAGACCTATCCAAGATAAAAA^ 

TAGATACATTCTGGGTAGTGATGGTTCTCACTGTTCAGAGATATAAGAAAAGAGCAGATGCT^ 
CTTTTGGTAAGAGAGACTCCTITTTTTAAAATTTTATTT^ 

TCTTCAGACACTCCAGAAGAGAGTATCAGATCTCATTACAGATGGTTGTGAGCCACC^^ 
CTTTGGAAGAACAGTCAGTGCTCTTAACCACTGAGCTATCT^ 

4 5 GAATGGTCCATCTTAGAGGTATTGAAATTGAGTCTGAAGTGGGAGGCCTAGGC^CTGTTTTGAGCCAACT 

CAGTGAGGTGGGAATTCCAAGGGCAAACAAGGATCAGCAAAAGCrrATAC 
GCCTGTTCCAGCCATTGCTCAGTAAGTAAGCAAGAGTCTAGCTACCTCTTCATCTCCCT 
AGGCCATCAGCCTCAATGATTCTTTCTTTTTTTTTTTTTAATTAGGTATTTTCCTC^ 
CCCCATACCCTCCCCCTCCACTCCAACriTTTGGCCCTGT^ 

5 0 CACTTTGCTTCTGCCAACCGATATTCTTTACCTGGGACTCTCTAGCAGATGCCAACTGCCT 

TAGAAAAATGGTCTCCCAAATAAGCCCGATCAGTTTACTCCCTGGCGTAGGA^ 
AC^GATCTGTGGGCCTGGTAGTGCAGATGTTAATCCCAACTACTCTACTTGGGAGGCCTAGGG^ 
GCCTGGACAGCAAGATGAGTGCTGGGGAAGCCAGAGAAAATGAGTAAAACTTTCTG^ 
AAAGGTCCAGAGAACTGGGGAGATGATTGGCTGGTTAAGAGTACTTGCTCTTCT 

5 5 AGTAGCTTGCACCTTCAATTATGGCCCCCTCTTCTGTTTTCTCAGGACCAGGTGCACA 

ATACTCACACACATAAAATTAAAAATAAATAAATCTTTCAC^ 
GGATGCTAGCAGCCCTGACTTCCATCTACAGTCCAGAAAGACAAGTAAATCAGA 
AGGTTAAGAGCACTGGCTGCTCTTCCAGAAGACCTGGGTTCAATTCCTAGTACCTGCATGAGAGCTG^ 
TGACCCTATCACTCAGACACTGTTCTTGTCATCrGCACACGGGATGATCAC^CACATATCA 

6 0 GAGAGTCAGCMGTAACTGTCTCAACTTCAACTAAGCAGGGCAGTTTGTTA^ 

CTTCTTCACTTCCTGCTGCCTCTCTACAAC^ 

TGAATCTAGCTGGCTACCTGACCTGATTCTCTCCTATCAGATGGATGACTTGCTC^CCTC 

ACAC^GTATAGAGTTTTAAATGAGTGTAGGCAGAGTGCATGAATCATCACACTTTTAC^CAGAG 

TACGCCAGGGAGTAAACTGAGCACGCATATTTTGGAGACCAOT 

6 5 AAGCAATTTTATCTGCC^TTCTGTTGAGTAGTGAGGCAAGGATCATG 

GTCTCTGGAGAAGTAGCCGTCTTCCTGCGCTGGGGATATGATTAAGCAGTAGAACATGCACATAATTTCATGAGG 
CTCTTCAGAGATCAGCCTACTTATTTATGACACAATTGTTTT^ 

CCTGGTACCCAGAATGATCTCCTAAACCACAAGTAATTGGCACAGAGATGGATCTTCCAAACTGACCC^^ 
TCTTCATACCCAAGAACrTCCCCAGTCAATGGATATTTGGGGCATTTGCTATGC^ 

7 0 GCACTTAGTAGAAAGGATGTATACTC^CGAGTATTTACAAATAATTAACCAGATT^ 

ATTGCAGAGTTACATGAACATCTGATAGCATTGCGGTTCAC^ 

TGCTCAATATCTTGAGGCGCTGCTATGAGAGAGAATGAAACTCTCTAGGG^ 

AGCAAAAGGACTTCAACCCTTGGATGAGACACAGGGTGATCCAATACCCC<^ 

G CaaCCT AC CCACAG T CGTG AG T ATGATG CAGTG GG AAGT CACAGT ACTG AAAAC CAACACACTT AACTTG C CTGTGCTG CTG TG A 
7 5 ACCTCCATTCAATGAGGGTCAGGAAGCTCAAATAAGATAATCGCA 
AAGTGCTGACACTTGCZAGAATGTTTATGTTTCTAATGCTGGGCTAAGTACTTCACCT
TCTCAA 

MOUSE SEQUENCE - mRNA

GATTGGGGTACGCGCTGCGCCAGGTTTCCGCACCAACCAGAGCTGGATAACT 

AGACAGCCACGACGATGCCCCTCAACGTGAACTTCACCAACAG^ 

TGCGACGAGGAAGAGAATTTCTATCACCAGCAACAGCAGAGCGAGCTGCAGCC^ 

CGAGCTGCTTCCCACCCCGCCCCTGTCCCCGAGCCGCCGCTCCGGGCTCTGCTCTCC^
CCCCAAGGGAAGACGATGACGGCGGCGGTGGCAACTTCTCCACCGCCGATCAGCT

ATGGTGAACCAGAGCTTCATCTGCGATCCTGACGACGAGAC

CTCAGCCGCTGCCAAGCTGGTCTCGGAGAAGCTGGCCTCCT^

GGCACAGCGTCTGCTCCACCTCCAGCCTGTACCTGCAGGACCT

CCCTACCCGCTCAACGACAGCAGCTCGCCCAAATCCTGTACCTCGT^ 
GTCCTCCGAGTCCTCCCCACGGGCCAGCCCTGAGCCCCTAGTGCTGCATGAGGAGACACCGCC

AAGAGCAAGAAGATGAGGAAGAAATTGATGTGGTGTCTGTGGAGAAGAGGCAAA 

CCATCCCGAGGCCACAGCAAACCTCCGCACAGCCCACTGGTCCTCAAGAGGTGCC^ 

ACCCCCCTCCACAAGGAAGGACTATCCAGCTGCCAAGAGG 

GCAAGTGCTCCAGCCCCAGGTCCTCAGACACGGAGGAAAACGACA^ 
GAGCTGAAGCGCAGCTTTTTTGCCCTGCGTGACCAGATCCCTGAATTGGAAAACAAC

AAAAGCCACCGCCTACATCCTGTCCATTCAAGCA 

AGTTGAAACACAAACTCGAACAGCTTCGAAACTCTGGTGCATAAACTGAC 

TTGGGACTGTAAGCTTCAGCCATAATTTTAACTGCCTCAAACTTAAATAGTAT 

TCTTTTTCCTTTTAACAGATTTGTATTTAATTGTTTTTTT

TGAAATGTAAATAACTTTAATAAAACGTTTATAACAGTTAAAAAAAAAAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING 

ATGCCCCTCAACGTGAACTTCACCAACAGGAACTATGACCTCGACT 

GAATTTCTATCACCAGCAACAGCAGAGCGAGCTGCAGCCGCCCGCGCCCAGTGAGGATATCTGGAAGA

CCCCGCCCCTGTCCCCGAGCCGCCGCTCCGGGCTCTGCTCTCCATCCTATGTTGCGGTCGCT 

GAT G ACGG CGGCGGTG G CAACTTCT C CACCGCCG ATCAG CHK1AG ATGATGACCG AGTT ACTTGGAGG AG ACAIX^TGAACCAG AG 

CTTCATCTGCGATCCTGACGACGAGACCTTCATCAAGAACATCATCATCCAGGACT 

AGCTGGTCTCGGAGAAGCTGGCCTCCTACCAGGCTGCGCGCA

TCCACCTCCAGCCTGTACCTGCAGGACCTCACCGCCGCCGCGTCCGAGTGCATTGACCCCTCAGTGGTC^

CGACAGCAGCTCGCCCAAATCCTGTACCTCGTCCGATTCCACGGCCTTCTCT 
CCCCACGGGCCAGCCCTGAGCCCCTAGTGCTGCATGAGGAGACACCGCCCACCACCAGCAGC^ 

CAGCAAACCTCCGCACAGCCCACTGGTCCTCAAGAGGTG

GGAAGGACTATCCAGCTGCCAAGAGGGCCAAGTTGGACAGTGGCAGGGTCCT

CCCAGGTCCTCAGACACGGAGGAAAACGACAAGAGGCGGACACACAACGTCTTGGAACGTCAGAGGAGGAACGAGCTGAAGCGCAG
CTTTTTTGCCCTGCGTGACCAGATCCCTGAATTGGAAAACAACGAAAAGGCCCCCAA 
ACATCCTGTCCATTCAAGCAGACGAGCACAAGCTCACCTCTGAAAAGGACTTATTC 
CTCGAACAGCTTCGAAACTCTGGTGCATAA 


HUMAN SEQUENCE - GENOMIC 

ACTCCCACCACACCTATTCAACATAGTATTGGATTCCCT 

GAAGAGAGAAGGTCAAATTATCCCTGTTTGC!AGACAATATGATTCTGCATCTAGAA 
GAGTTGATAATCAACTTCAGCAAAGTTTCAGGATACAAGATCAATG 

5 0 AGCCAAGAGGC!AAATCAGGAACACAATCCCATTCACAATTGCCA 

GAAAGATCTCTACTATGAGAATTAAAAAACACTCTTCAAA 

GGAAGGG AAG AAT CAAT ATTGGT AAACTGGCCAT ACAC CT (2AAAACAATTT ACAG ATTCAATTTTATT CCT AT CAAACT^ 
ACATTCTACACAGAAAT GG AAAT AACT ACT CAAAAATT CAT ATC^ 
AAAAGAGCAAAGCTGAAGGAATCACATTGACO\ACTTCAAATTGTACT 

5 5 CAAAACAGACACAT AG AC CAATGG AACAAAAT AGTG AG CACAG AAAT AAAG C CACAT ACCT ACAAC CATTGG ATCTTTGACAAAT C 

TGACAGAAACAAGCAAGAGGGAAAAGACTCCCTATTAAATTAATGGTGCTCT 
TAGACTTATGCCATATATAAAATCCATATGCCGGAAAGCTGAGGCAG 
CCGAGATCACACTACTGCACTGCAGCCTAGCGACAGAGTGAGACT 
AAATAAATATCAACTOUVGGTGGATTAAAGACCAAAATGTAAAACTCTAAAA 

6 0 CATTTTGGACATAGGAGCTGACAACGATTTTTTGAAAAAAACACCA 

ACTTCTGGACAGCAAAAGAAACTATCAACAGAGTAAATAGGCAACCT 
AAAGGCCTAATATACAAAATCTACGAGGAACTTAAACATACAAGG 

CATAAACCG ACACG TTTCAAAAG AAGACAT ACAAG CAT AT ACAT ACAAG CCAACAAG CATATG AAAAAATGCTCAT CATGACT AAT 
CATTAGAGAAATGCAAATCAAAGCAGCAATAAGATACCATCTCAC^ 

6 5 GATGCTGGTGAGGTTGTGGACAAAAGGGAACGCTTATAC 

TGGTGATTC CT CAAGT AATT AAAAACAG AACT AC C ATTTG ACCCAGCAATC C ACTGG AT ATAAACCCAAAT AAATAT AAAT C 
TTTCCACCACGAAAACACATGCAAGCATGTGTTCATTGTAGC^ 

CTACTGCAGACTGG ATAAAGAAAATGTGGT ACATACACACGATGCAATACTAAG CATCCATTAAAAAAAAAAAAG AAATCATGTC C 
TTTTCAGGAACATGAATGGAATCAGAGGCCATTATCCrTAGCAAA 

7 0 AATTGGGAGCTAAATAATGATAACCCATGGGCAGAAAGAGGCCTCACACTGTGAGGCCTGCGTGA^ 

AGAAGTCCAGGGGAAAAAAACTTCGGGTACTGTGCTTAGTACCCAG^ 

AAAT AAT AAGAGCT AT CT ATG ACAAACACACAG CCAAT ATCAT ACTG AATGGGC AAAAACTGG AAG CATTC C CTTTGAAAACTGG C 
ACAAGACAAGCvATGCCITCTCTCACCIACTCCTATTCAACATAGTGTT 
;^^GGCTATTCAATT7lGGAAJL B -^^ 
7 5 CAGC CC CAAAT AT C CTT AAGCTGATAAG CAACTTCAG CAAAGT CT CAGG AT ACAAAATCAATGTGCAAAAAT C GCAAGCATTCTT A 









T ACAACAAT AACAGACAGAGAG CCAAAT CCTGAGTGAACTCC C ATT CACAATTGCTT CAAAGAG AAT AAAAT ACCT AGG AAT C CAA 
CTTACAAGGGATGTGAAGGACCTCTTCAAGGAGAACTACAAAACACTGTTCi^ 
CATTCCATGCTCGTGAATAGGAAGAATCAATATCGTGAAAACGGCCATATTGCCCAAGGTAATT^ 
TCAAGCTACC^VATGACTTTCTTCACAGAATTGGAAATAACT 
5 T CAATCCT AAG C CAAAAGAACAAAGCTGGAGGCAT CACGCTACCTC ACTTCAAACT AT ACT ACAAGGCTACAG T AACCAAAACAG C 

ATGGTACTGGTACCAAAACAGAGATATAGACCAAAGGAAGAGAACAGAGCCCTCAGA^ 
CCTTTGACAAACCTGACAAAACCAAGAAATGGGGAAA^ 
CAAAGCTGAAACTGGGTCCCrrCCTTACACCTTATACAAAAATTAAT^ 

AT AAAAACCCT AGAAGAAAAC CTAGG CAAT ACCACTCAGG ACAT AGG CAT GAG C^U^GG ACTTCATG T CAAAAACA CCAAAAG CAAT 

1 0 ggcxa(2aaaagcc^iaaattgacaaatgggatctaattaaactaaa 

ggct^cctaaagaatgggagaaaatttttgcaatctactcatct 

atttacaagaaaaaaaciaaacaaccccatcaaaaagtgggtgaaggatatgaaca^ 

caaaagacacatgaaaaaatgctcatcatxiactggccatcag 

ttagaatggtgatcattaaaatgtcaggaaacaacaggtgctggagaggatgtggaga 
1 5 gactggaaacaagttcaaccattgtggaagtcagtgtggcgat^ 

cccattactgggtatatacccaaaagattgtaagtcatgctgcta 

gaatagcaaagacttggaaccaactgaaatgtccatcaatgatagacrrgg^ 

tatgcagc cat aaaaaatgatg acttcatgtcctttgt agggacatggat gaagctggaaaccat cattctcag cacact at ct ca 
aggacaaaaaaccatacaccacatgttctcactcataggtgagaattaaacaat^ 

2 0 C^CACCGGGGCCTGTTGTGGGGTAGGGGGAGGGGGGATGGATAGGATTAGGAGATATATCT 
GCGGCACACCAA<ZATGGCACATGTATACATATGTAACAAACCTGC^ 
AAAATCTGTACACTAAAACTCCAAGTCATGAGTTTACCCATATAAAAAACCT^ 
AATAAGTAAATAAATAAAATGACAAACCGATGGCTAACATCATGTTGAATAAGGAAAAGTTGAA 
GAGAAGACAAGGATGTCCATTCTCTCITCTACrrCAACTVTGGTACTGG^ 

2 5 AGGTCATTCAAATTGGACAGGAGGAAGTCCTTTTCTGCAGTAACATGA^ 

AAATAACTGGTAGTATTGATAAATGAATTCAGTAAAACTTCAAGTTACAAATTTAATGTACAATAATT 
GAACAGCAAACTATCTGAAAAAGAAATCAAGAAATCTATCTCTTTC 
TAGAAATAAATTTAACCAAGGAGGTGAAAGATCTCTACAGTAAAAAATAATAAAATATG^ 
TAAATGTAAAGTTAACCCATGCTCATAGATTTGTAGAGTTAATGTTGTTAAAATGTCCATACT 

3 0 ATGCAATCT CT ATCAAAATACT AAGG ACTTT CCT C1ACAG AAGT AAAAT AT AT AT ATTTTTT AATAT AT AAAATG TAT AT AT AAAT A 

TAATATATAATATGAATATTATATATGATTATAATATTATATGATTATATACTATTAATATTA1TAATAATATATATTA 
TATTATATATAITATTATATATTGTATATAAATATATATTATATACTATATTATTATATATTGTATATAAATATATATTATATATA 
CrrATATATTTATATACTATATATATAGTGTATATATATATTTATATAGTATATATAATATATACACTATATATTTATATAGTATGT 
ATATATATATACTATATATAATACATAAAATATATATACATATATAGTATATATATGAA 

3 5 CCACATGATTCCAAACAGCCAAAGCAATCTGGAGCACAAAGAACAAAGCTAG 

AAAGCAATAGTAAACAAAATAGCATAGTACTAACATGAAAACAGATCCATAGACTGATGGA 
TCCATACAATTATAGCCAACTAGTTTTTGACATATGTACCAAGAAA 
GGATAAACTGAATATCCAC^GCAGAAGAAAGAAACTAGGTGCT 
GACTTGAATATAAAACATGAAACTAAGAAGATCCTTGAAGAAAACACAGGGGAAACAA 

4 0 TTCAAAT AAGAC CTCAAAAGCACAAACAACC AAAG CT AAAAT AGATGAAT GGG AT AAT ATCAAG CCIAAAAAG CTTTT ACATCTT AA 

AGGAAACAACAGAATGAAGAAATGACCTACAGAATGAAGAAAT^ 

GT AATCAATCCAAGCAACTCAATAGG AAACACACAAATAATC ^ CAAAGGATTCGAAT AG&TATTTCTTAAAA 

GGAGTCACACAAATGGGCAG CAAGTAT ATGAAAAAAGGCTCAACATCACT AAG AACCAGGGAAATG CAAATCAACACCACAGTGAG 
CTATCCACTCATCCCAGTTAGAATAGTTATTATTTAAAAAAATAAAAT 

4 5 CTGTAATCCCAGCATTTTGGAGGCCAAGGAGGGCAGATCACTTAATGCC^ 

CCATCTCTACTAAAAATACAAAAAATTAGCTGGGCATC^TGGCACAT^ 

ATTGCTTGAACTGGGGAGGCAGAGGTTCCAGTGAGCTGAGTTCGCACCACTGCACTCC^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAGJVCAAGAAA 

CTATACACTTGGTGTACTACTATATTATCCAGCAATCCCACTGCTX5GATCCCCCT 

5 0 AGTT CIAGAT CCCC!ATGTTT ATTG CAACACT ATTTG CAAT AG C CAAAAT ATGGAAT C AACTT AAGTGCT CAT C AACAGATG AAAC AG 

GAATGAAAATATGGTATATATACACAATGGAATGCTATTCAATCTTTAAAAAAGAATGAAACCCTCAA 
GAACAGACACTTCTCAAAAGAAGACATTTATGCAGCCAAAAAACACATGAAAAAATGCTC^ 
AAATCAAAACCACAATGAGATACCATCTCACACCAGTTAGAATGGCAATCAT^ 
TGTCGAGAAATAGGAACACTTTTACACTGTTGGAGGGACTGTAAACT 

5 5 GGATCTAGAACTAGAAATACCATTTGACACAGCCATCCCATTACT 

ACACATGCACACGTATGTTTATTGCGGCACTATTCACAATAGCAAAGACTTGGAACCAA 

ATT AAGAAAATGTGACACAT AT ACAC CATGGAGT ACT ATG C AG CCAT AAAAAATG ATGAGTTCATGT CTTTGG AGGGACATGGATG 

AAATTGGAAGTCATCATTCTCAGTAAACTATCGCAAGAACAAAAAACCAAACACCACATG 

AATGAGAACACATGGACACAGGAAGGAGAACATC^CACTCTGGGGACrGT/TGTGGGG 

6 0 AGATATACCTAATGCTAAATGACAAGTTAATGGGTGCAGCACACCAGCATGGC^^ 

TGCACATGTACCCTAAAACTAAAAGTATAATAATAATAATAAAAAAGTAAAAATGATTAAGGACTATAATAGGA 
GAAGATGATAAAGTATGCCCAAAAGAAAGCAATGCTATAATAAAAACAAAATAAAACA^ 
CTGTCATTTGCAACAATGTGGACAAACCTAGAGGACTTTAT^ 
CrCAGGCATATGTGGCATCTAAAAAAGTTGACCTCAAAGAGA 

6 5 GGAG-GGGAACAGGTTGCTCAACAGCTACAAAGTTACACTTATAT 

TGTAGTCAACAATAATGTATTGTGTACTTTAAAATAGTTAGACGAGAGGATTTTGG 
TTCAGGTGATAGACATATGCTAATTATGCTGATTTGATCATTACACAATATATATGTATCAA 
ATGTACAATTATACATCAATTTAAAACAAAATAAAATGTATTTTATAAA 
TTCCCAGTTTTGGTCAGCCCTACAGTGTATACCAACCAGGCTGGCCTGGGGCT 

7 0 GGAATG AACAGATCTGG AAG<5 AATCCT AAATTTT CCAAT AAAG TCT CTG T ATT CCTTT CAT CCTGGT CCCT ACCCAGG AAG CAGTC 

TAGAGGAGATGATCCTTITCTAAGAACAGCAACATTCAGTAAACAAACTTCC^ 
AAGTAGAGTCAGTGATATTTTTGAATCAATTATGAGAAGGCCACCATGCT 

ATTCATOSATGGTCCGTCTTTATTATTAC^TCATTACTG^GTGGCTTTTGCAATAAAATACACCATT 
CTAATGCAAAAAC^TAGCATGGATTGTTTTTAAGGGGC^ 
7 5 AAAGCTCAACAGTGGTGAGGCATATTCAAAAGCCCCAAGAGGAGAGTCAACAGTCAGATCCATCAGAGT^ 
ttatgtaatgacgtctccacgccatgaa

attagatatataaagtcaggcccaatcatccccttgattagttagtctcatcagagaatgagccct^ 
ggaatccaggcagttcagtaagtagtcaagatttgtttccacatgcggtcccaaagaagtcttta^ 
agtggcataccaaaaggctggaccattggtagtgtcccaagaattgggttct 
5 ttt ctg cat ag ctggttcagagg ctgaacag acacagg agccag tgaac accatcag ctt agccaat cctt cactg agg 
caagcccagatatttggggaaacctcitjtaaacagggtccaattgaact 

ttt cc c c cagag agacagttcccactctt acat at aaag cagag atg cagaagaaattcg agttgatg ccat cgcc cac ccac caa 

cccacccccgcatttccgccctgagactaggatcccagcgccagggacccagacc 

agagtgcgccggggatgagacggaatctcggggccactcgagagct 

1 0 TGTGGAGCAGCTATTTGG CCTGGGAACAGCTGT AGAGAGCGGGGAG CGACTGGCTCGCCACTGGCGCGCCCTGGGGGAACCCX3CCC 

AAGGTGCCCTCCTCCCCAGACTGCCCTCCCAGCCrCGAAGACGGCAATCCAGCGCCC^ 
GACTCAGGCCTGCTCCAGATTTGAGAACCCAAACAACTCAGAGGGTGAGGA 

GGTTTACAGAAGTCGATCACCATCATGGTTCGAGTAGACCGTCTGAACCCCTGCATGAAACACTTCCATAC 
ACAACACAACTGCGCTGCCCATATTTCTCCAGAAAGCTGCT^ 
1 5 AGGGAGATGCTCCAGGGGAAACGGAAGATGAGGAGGATGAGGGGATTGTGAGCCCCTC^ 
TGCCAGTTCTGACACTGAGGATGTGACCAAGAGGAAGGGCCACAGCTTCCTGCAGCA 
GTTCTTGGCCCTGAGGGACCAGGTACCCACCTTGGCCAGCTGCTCTAA 
ACTTGCAAGACCTGGCGTGAACCGAGAAAAAGATGTCTATGGAGAAAACGCAGCTCCAArc 
ATTGCGTACCTCAGTGGCTACTAACTGACCAAAAAGCCTGACTGTC 

2 0 TCCCCTTTAGTAATTTTCACATTTTGGTTATGACGGGACAGTCTTT 

AT AAGGG CTTG CATTCTTGGAAAC CTTGAAACCCAGCTCITI'CTCT CCCCTGACTCATGCTGTGTCTTCTLTGGCGCCTT'1'GG CTT 

CTCAGCAGATAGCTGACTGAGGAGATTTGGGGTCTGTTTACCTCACTAGCTCCGAAGAAAAGGCT 

GGTGTATGTTGTCGGGGACTCCAGCCTGCATGAAATCTCACACT 

CTGGGGTGATGCTAGAATGCTCCCTGGTATGTCTGGGGTCATGCAA.GGACAGCTGGGCCrc 

acagtatctgaggagttggaaaccittaacgtatatattgtx3atattagctgacacct 
gagaacatttaaagag cttg cactctaggttcx5ttgt ctt cagagctct 
gagagacttttgtttcattttgccittgttgtgcaatgggcittacagcatcct 
cagggaagtggggatcctagcctggggcctgaggaaatcttgg 

3 0 ggcgacccatcttataatctcttctcaagggagatc 

tctcatctcaatctttgactgatgagtctgaa^ 

ggggtctgcag catacttggctcc ctaccagcctg ccatgagtcat ai'l , l'cl"lttccagaaggttk5acntgcttcctggt atgt^ 

taaaggagcctgcaggagctctgcttagccaatcatc 

acagcctccattcc&ggaaagacatccagcccagcagttc 

3 5 acaagcccctccagaactctgcccaaaatagcaggtgcttgatggttctgaatttggaaagc^at 

gtggctctggaaaaccagctgctacattcaaatcrrattt^ 
cagaggatgttttgaaatagcctctctacccttcaggagcatggtagt 
ggagaacaggccaggtgtaggttactaatgtcctacatgaatagcttggttttataagc^ 
ctttttaatatattgtatttttgtacgca.ttttgcaaagtggagttaact 

4 0 tcttctgtttcjagggtctg atttatttggaaag ccaatttt aaaatgttg c 

cttctgggacatcttgacaaaaaatatttctcju^acatg 
tgttcttcattttaccgtttccatttcataagccaagcritattgacc 

tcaaagatggaaatatcgag ccag atggtaacaaattct ccatgaat cgattcttcagttttgacagaatcccaattgcaagccat 

4 5 ctgcttccccitgccaggtccccccatc 

GGTTCTGAG CCT C ' I TTTTCGGTGG CAGGTTGTATG AAGAGACGTCAATTTTTTCCTGTTAGAGGGATCTAATGTTGTGAATTTGCC 

TGCTTAGTCCCAAGAGATTACTTGGCAAGCAGGCCCCTGTAT^ 

CACTGCATTGAGGACCCTrGAGACTGAGACTTCTAGCCG^ 

GAAGGTAGAGACACTAATGCITAAATCCTGAGCCAACCACTGCT 

5 0 CTCTGTCTCTCTGTGATGGGGATGACrTTATATGAGACCTGTAGAATGAGAAATTACAAGC^ 

TATAAGTAGAAGCAACAAGAACAGTTTATTCAACTTT 

GAACGATGCCCAAACCCCAACAATTGCITTTCTGGACTTTTTTCCCTTAAAT^ 
GGGGTTACAGCAAGAGCTGTTAGGGGATTTCCCTATGTCCAGGCCAGAATGTAGGGATTC 
GGTGGGGACTGTAAGCCTTTTTTGTAATCCCTCTGGCTCITTC 
55 TG CAGAACTGTAAGGGCTCAGGGATTCTTCCATTCTGTCCCTTGATT^ 

TACCTCATACCTATTACCCCTTCCCTCCACTCCCCAGAAAATCAGATAAGATAGTGTC^ 
AACATGTGAGTCCGTCCCTTTCCTGJ^GTACCCTACCATGC^^ 

AGACTCTGTCTATACTAGTAAGCAAGGCATGTTTGCATTTGGTTTCCAGTAG 

6 0 GGTTTTTTTTCCACTGAAAACTCTATATCTTCACT 

TCATCCCTTCTTCCTCCCATGCAGAGGAGAGTGAACAACAACCAAACCATGATTCAAACA.GCT^ 
CTCAGCTTCATTAATAGGTAGGAGAGTGAGTGGGGAAGGGCCTTTA^ 
CTCCJ^TCCCGAGGAGTCTTCTCATATCCCCTA^ 
TCTTCATTACCAAAATTGGCAAGAAATGTGAAGAGTTTGAGACTGTACC 

6 5 ACAGATGTTGGCAGAATATATGAGATTCTTGGGTCAGAGTTTGTTTTGGTTCATGT^ 

AATCACAC^GGAGAAATGAGGAGCAGCCAGTCAAATGTTGACATACAAGTAT^ 
AATAAATGTAGAAAGGCCATGGACAATTATCTCAACAATATCTGACCATTGTTTCT 

ATT AAGT ATGTGTGTTGATTTT CTGG AT AT CAGGG CTGAGCAAGGAAAAAGTCATGACCATTAGATGGCAGTAGCACA(IA^ 
TTTATTTGGTTGGCCCTTTGAAAGGTTTCAGGAGAGTGGAACITCCCT 

7 0 ACCACCCAGAAGAGGAAGAGTTCAAGGCAACTTCCAAAGGAGAGAAGAATTGGACAGGAGG 

AGCAGCCAGGTGTGGAGTCTCTGTGTCAGAGAGTTCCCAT^ 

GCCATATTTAAAGCAATTGGGTGTGTAAACATTTGATTTTAGTTCCAATG 
ACGGAACAACATACA?JGGCACC^TCTAAAGATATTA^^^ 
7 5 AAGTTGTATGCTATTTCATGGASTCTC^ 
GGATTTTAAAAGCCAGTTCTCAGCCAATCTGCCCACTTGGGCAGCCAGTAGAATAAATTTGAAAGGAGACAATTTTGTTTTGACCA
GTCCGTTTAGTCAATAGAGTGGCATTTATATCCCCTTTGGTACATGACCCCAGCTATCT 

GCAGGAAGATAATTTTAGCAAGCACACAGCCCCTGTGC CCTGTG ACCCCTCCAG CAGCCTGACCT AGGGCTATGCAGTT ATCTCAC 

ACCATGGCCACTAAAGTACAGTGAGATTCAGCAAAGAGTTTCAGTCCTTGGGCTG 
5 CATGGCTATGTGTTTCTCTTAATTGTGGTGTGCTCTTCCACACATATG 

GTTAGTACCATGAGGAACTATTTGGAAAGCTCATTTGTGACATTGGGGAGTCACAT 

AGGGAGGGGGTTCCACTGTTGAATAAAATTCTACTGGAATTTTGCA 

GGAGAAGATTGTGATAGGGAAGCCATCTGATTCCTTra 

AAGGGAAGTTCACATGGCTTTGTGAAGTTCCAGAAGTTCCAGGTTGTG^ 
1 0 TGTTATCCACCTTGCATTCAGCACCTGGGTTAACTAAATTTGTCGTGGGCTCAGCCC^ 

GCCTCTGGATAAGGTTGTTGAATCTCAATGGTTAGGGGAATCG 

AAGAAGTATTACACTATGTTTTCTAAAATCAGGTTGGCAGAAGCAAACCCTC 

TGGCTAAGGCTATAGGCACXnTTGGTGTAGTCGGCAATTGCATGTTCCTCCTTGCT 

TGTTGATTGCCAATAATAAATAGCATTGATGTGTTCCCATGAGTGTGGGATAGGTTTTG^ 
1 5 CGTTGTTAGTCTAGCCACACACCAAATTAGATTCTG 

TTTTAAAAAAATAGATATCAGCTGGGCACAGTGGCTCATGCCT 

GTCAGGAGTTCCAGACCAGCCrGGCCAATATGGTGAAACCCCGTCTCTACTAAAAATACGAAA 
TGGTGGCATGCTCCTGTAGTCCCAGCTACTTGGGGGCTGAGGCAGGAGAATCACTTG 

GAGAT CACGTT A.CTGCACTCCAGC CTGGG CGACAG AT C CAG ACTCTG TCTCAAAAAAAAAAAAAAAAAAAGATATCAATTGT ATCA 
2 0 AGGCATAGGTCCCTTTTATGTTCCCTTGTATTGCAACATTATTCT 

CGGAAACTTT CAT AGGGAAGAGGTTTGTAT AC CTTTAATAAC CTCAGG AGTGG C C CCTTGGG AGG TGG CAGTGTGACTACACCAAT 
AAATTTCCTCCCAAGTACTCATGGACATAAAGTGTCTACTGAGACACTGGAGACrACT^ 
TTGAAAAACAAACTGTTGGGTACTATGCTCAATACCTGGGTGATC 
C^GGTAACAAACCTGCACATGTGCCCAATGAATCTAAAATAAAAGTTGAAAAAGAAAT 

2 5 ATCAAAACAlTCCTCCCAAGTTAGTTTCATTACATGTCrT 

TTTAAGGAGCCCTCAGAAAAGTAACATTGGGCAACIAACTGCCCT 
TTTCACAATAGCTATAAGCAAATAGTTATCAATGTGGGTG^ 

AT ATGTTT CTTGT AAG AA CTGTCTG TG CTG T G ACAC CACG AGCT ATGAGTTGTTC ACTGTGGGT AG C CAG CATTG CAATTG CTGT A 
GAGTTCACTTCCATTGAGTGTTATGTAAAACTTGCAGTT^ 

3 0 GTGTGTGTGTGTCTGTGTGTGATTTTTTTTTTAAGAAATGTC 

ATAATTGTAAAGGTATACACTGTGGGTGTCGATTGCACTTATTATAATTCC^ 
TCCCGTAATGGCTTCTGGGTATTGTATCTGGATTATAGTATTAATAAC^ 
CATACTATCTCTGGGCATTCCTTAAACGAGGCTCAATAGATAGTGC^ 
AGTGGGTATCATCTGATCTGAAGGAAAAGGCGGCCCACAGAGCTTA 

3 5 CTTAGCTGGAAGGATGTTACTAAGTGGTAGTAGAGAGAGGAGGAGAAGTGCT 

GAAATGGGAGCATGGTTCCCTTTGTTCCCAGTCTGACCrATGTTGGGTGTCCCTGTGGTG 
TACTCTGCCAGCAGTAGGGTGTGGATTACCAGGGATAGGGTGATCCACCAATCGAGGTATAAT^ 
TTCTCAGGTAAGGAAAGGGCAGAACAACAGCATAGGCATAAATGTTGGAGGTCATTTGCA 
TCTGGGAGTCACCCAAATGGTCTTACCAGATACAAGGGTTGTTGGTTCAATCT 

4 0 ATAGTGCCCTT AGTGG AG CAAAGTACATGGG CAG AAAATGAGG 

TGCTGGGTAATGAGCAGACTAAGACTGAGAATGCACCCAGAGTGAGTGGTCCATGCCAGATCCTTCT 

AAATGGCCAGGCTCTC^ATTTTCTATGAGGGAGAAGGAAGAGGTGAAAGAGC 

CCCCATGTTCGTGAATACTGGTAAATCATGTTGTGTCCAGTGGGTAC^ 

CCTCATTTGCTGTGAATTAAAATATGGTGCATTTATGCCAGAGAGGTATGGATTTAATGTAAGCATGACT 

4 5 GCTACGGGGAATGCTATTGTCCTGCATCTTCTTTCAATATCCCTTGTTT^ 

TGTAGGACCAGAGAATAAATGAGAAATCCAAGCAGTGTTCTTTG^TAATGCCT 
CTTCATCTGACTGGATAACTTTGTCTAACTTTTCTACT 

GAGCTTGGTGCCTTTCGTGGG T AAG CCAAGAGTAAGC CTGTGT AGGAATTACT ACATTGTTGTGG CAT ATGTAATAGTT CCTG CTG 
CTTCAGAGAGGGGCCCAATAAAATTGACTGGATTTGC CAGCTCTG AT ATAG CCCTTCTCCTGTGCAGT AGGTCCAAGTCTCTAGAT 

5 0 TTCTGGTCTATGGTACC^CCCTAACAGCTTCATGTAAATGTCTCTCTGCTGGGTCTATTTCT 

CTCATGATGCCCAAATGCCCTGACCTCTCCTTTATAGGGTGAAGGGTAGCTCTTACCAAAG 
AGGATTAACTACTTTC^TACTTTAGCTAGTTTGTCAGTTTCCAGGTTGTATTGTTTG 
GTGCAGAGTGGAGGCCAATGTGCTGATATTTTCCCCCACATCTCCTTACCCCAAAT 
TTGTCCAAGTCCCAACCCAATAGCCTATTAGCCACTGACCAGGAGTTGCCAAAAATGTGAA^ 

5 5 CTTCTCTATTTCTAGGGGTGTTCCACACAGCCTGGCCCATTGGG 

AGGGTAGTAGGCTGTTGCTGTCCTTTATCTGACTCCCTTATACCCTGTGCTCACTCCATTGGTAA 

TTATGAAAGTGCTGGCC ATACTCCCTCCATTCCCCAATGAGCAAGAG CCAGGGG CATTTCCIAGGGGTTTTGGCAAGTT AACCTCAC 

TGACAGAGTATGTGGCCATTTCCTCACAC!AGTAGTGACACrCCAGGAGGGCCAGGT 

TTTAATAAAGAAGCTTCTGTTGCTGATCCAATTTTATGGGCA 

6 0 GGTAATTTTCTCTGCCCCACCAGGGCTTCAGTTTTGATAAGTGCCCCATATGCTGCCAGT 

GCTGAGGCTGAGAGCTACTTACACCAAAAGCCCATGGGT 

GGGAGCAGTGGAAACTTATAATTGCAAGTCTTGTCCTTGGGAGCACTAGAGGGAGAGCCAAAGTAT^ 
AGTGACTGTAGATGTTCAGGTCCCTACTGAAAAAAGGAAGCITTACAGGTGTCT 

TGGAGGTGAGGAAGGCATTGTGCTATAATTCCTAATGAATGTTGGGGGTTCTTTACTGATGTGAGCGGTTTAAG 
6 5 GTCTTTTATCAGATTGGAAATAGTCTGAA^ 

AGGAATCGGCTCAGATAATGGGCCAGOSTTTGTGGCrGACT^ 

ATGCTG CCT ACCG ACCG CAAAGTTCTGGAAAGG TGACCGAG TGTGG ATCAAAG ACTG AAACAT AG CCCCTTTG CATGCACGGTGGG 
AAGGACCCCAGCCCATCATCCTGACC^CTCTCACTGCTGTGAAGGTAGAAGGAAT 
CCTG CAGCG CCTG AAACTTAGGAGGCAAGACCAAGACCGGATAACTCAGGCCGT 
70 TGTACCTGCTGTCAGCAGATCTAGGCTTTCTTCTTGGACACCTT^ 

CTTGGCAAACCCGACTGCTGTGGGGTGTCCTAGCAGAAGGTCCTCGACGTACTGGAGCAACAC 

ACTTCTGGAAGTCCCGAACCAATGTTTCCTCGAAGAGATG 

GAGTGGTGACACCCGACCCCAGATCTTCCCACTGAAAGGCAAACAGTTC 

TCTTTCACC TCCAAG CAGGTGAACCAGCTGT CCTC AG CTGACAG CAA C CCT AA C AAC ATC5T ACAGTT AGG CACTGTTGGATGTAAA 
GTAGTATTCTGAGGTCTGAACAGGCAGGAGTGGAAAAACACCCTTCC
CTTAGATCGTGTAATTGCCAATCCACTCTCA^ 

GTTCTGCGAGGAGGATGTCACCGATATAATGCCAAAGCATAGTTTCTG^ 
TGACTGTGCAATTGGAGGGCTATTAAGTATCCCCCAAAAAGCTAGCATCACAATGAAGATG 
5 AGTGGGTAAAGGCGTACTATTGACTCTCAAAGGTGGAAGTGAAATGTGTT^ 
TTAGCCAAGTCTACAACTCCAGATTCAAATTCTGGTGGCTTTAGATGCCCT 
ACACCCATTAAGGCAACAATGTCCTATACAGGTGCTCTAATAA 

TGATTAGTAGTGTCTCTTAGGGCAGGCCATATGGGGAATTGATAGGTGAGGTAGTAGGAGTCAGTGTT 
CCTGTGACAGGCCATAATCCTTCAGTGTCCTGCTTCAGCTTCTGCTGTGGTATATGAACCACC^ 
1 0 TGGTTCCCTCCGGGAAGTGCCCACCAGTAGCATAAAAACCTTAGTCTTATGTCATGA 
CAATAATGGGGAAATGGATAATCTTGGAGGGGAGGCAAGCAGGACAGA^ 
AAGAGGGGAATTATCCCTTTAATTAATCCCTTTAATTAACCCCC^ 

CAGTTAATATGGTGAAAGGGCAACTGTCTGTCAGAAGGGGCCTTCCGACACTGCAGGATCCCTGTG 
1 5 TTTGGCAAGCATCAGAGACAGGTGTGGAAGATAAGGTT^ 

CCTAGT AAGT ATTGCAATTGCT C CAAACATTCAC CAGAG AT CAT CCAATTTGGG AC CTTCTTGTGCCTT AGGGT CT AACAT AACCT 

CnTTTCGTCATCTCGTGGTTGATGCATACATGGGATCCrTGGCCACAGG 

TAGTAACTTTGATTTAGGCCAGGTAGGGTATTATAGTG 

AAATCCAGTGCAGCTATG^TCTCTTGAATTGTTTTAGCAGTAGCCA 
2 0 CCGAGGAACAATTGAGTGGACrCAGGACTCAATTCAGACTCCTCCAGTOT 

CACTCCAACATGTCCAACTGGTGGAGGAGATGTTTGGCCTTGGCTAACC^ 

TAATGGTGGGCAGTCTTCTAATCCACAAAAATACAC^TCAGCAATAAA 

CTGCTCTAATTATACAGGTTCCnTTAGTGGGGACACAATTTTTC 

TTAGGTGGAATCAGTTCCCTCCCAGATGCATTCTGTAG 

2 5 AGAGACTGGGGCrrCGGGTAAGAGAGAGGATAGGACrCCrCCrCTGAGCAGG 

TTTTCCATTTTTATTACTTTCTGGGTGGCTACAGGCAAA^ 
CTTTrCTTCTGGAGAGOlCCTATTCTCCTCrCCAGGGACCr 

T C CAGG TGG CATCCTGG TG AT AAATTG CT GC CAG CAAGGT C AG CACATGCCT CATGGTT AGG AAT C CTTCTTCGAC CTGACAGCAA 
CCTCCTGTAGGGTTCCGGGAGGGGTAGGAATATAAATTTCTTTGCCT 

3 0 AGGTTGAGATATTAATATCCCTCTGGAGAGGGGATAGCATGATGACAACTGCAA 

TGCTACCTGCTTGCAAAGGCCTTTCCACCAGCTGCCCTTAGGAGACCAG^ 
TTGCCTTATCTGTCTCCGGGGAGGAGGAAATCCTCACATCTCAGACTACCCCA 
CAT AG CCTGTTATGCAGTCCAGGGTGGTTCACCCTTATTCCAGAGGCATGCACrCC CTCT 
ACAATGTCCAATGTCAACTGAGGGGGGGACCCAGGTAAGTGGGCTGACCT 
" 35 TCAGTTCrAGTGGTGTGGGTTCACTCAACTACTGGTGCT 

TATTGTTGOK3TACAGCAATGAGCGATGCTAAGCAAAACTGCTACAG 

TAGGACTGATTTTTCAATTGATTGTCCTACTGACrGGGTAGACGCTATGCCAA 

ACTTGTGGTGAGTTCCAAGATTTCTGGGCTGATCAATGAAGAAGACATG^^ 

TTTTTTTTGAGAAGGGAGAAGG AGTCTCACTCTGTTGTCCAAG CTGGAAT ACAGTGGCG CAACCTCTGCTCACTGCAACCTCCGCC 

4 0 TCCCGGGCTCAAGTGATTCTCCAGCCTCAGCCTCCXSJ^^ 

TTTTTTAGTAGAGACGAGGTTTCACCCTATTGGCCAGGCrc 

AAAGTGCTGAGATTACAGGTGTGAGCCACCGCGCCCAGCCAAGAAGCT 1 T TTATTTCGGCGGTGCTTTGACAGGTTGCCAGAAAGG 
GGAGGTCTCTCACAT CAG ACCTTC CAG AGGCAGCAGCAG AGGGCCACTGAGCAG AAGGGGAAGAAGGCAAGGTAACTCCTGGGGT A 
GAGGAGAATAGGAGAGGGCCTTATGTGTCTGGGGGATATGACTCAGCAGCAGGAGAGGGAGTCTCT 

4 5 (3UU3CAGTGGCATGGGGTCTTTTATAGCCC CTGTGTTTT CTTTAAATGTGGCTAGCAGATGTC^GTGCAGTTTTGTGGGG CATGCA 

AGG CAGG CAAGCCCTAAGTGGCTACACATATGCn^ACTTGAGCTATATT^ 
TTGGCAGGCTTTCTGCCTGCTGTGAGGAAGTATAATGGCAATATAAAGGCTAATATATGGAGGT 
TAACATGCCACTTAGTCAGCCACTCTGATTAATCCGACTAGCATGGTCTGAGAT 
TCTAAGCAGCAAGGTGAGATCAATCrGCAACGTTAACCCGAACTTAAAACGTTAAGCGC^ 

5 0 TTTTATCTAAGAGAAATGAAAGCATAGGTCCACATAAAGACTTGTATGTGAATGTTTATAGCAGCTTCA^ 

CTGGAAACAATCCAAATTACCATACAATGAAATACTATACT 

AATCTGAAAATAACTACGCTGATTTTAAGAAGAOUIAAAAGAGTATACAGTGTATAATTCT 
AAACTAATCTATAGTGAGAGAAAGTAGATCAGTGGTTGCCTAGGGACAGGGAra 

AAGGGGCATG AG AAAGC CTT AGGAGG TGATGAAAATG AT CGT CATTTTG AATGTGGTAAGGGCTT ATGATTATTGGT AAGG AG TT A 

TGCTTATAAAATATAAATAATTTGTATTTTATAAATTATATAA^ 

TATGTCTTGCATCAAAGAACCCTCAAAACAAGACCTTAAGTACTG 
AACATCAATAATAAATAAAAATGGTCAAGACTTATTTGTTTCTATGTG 
CATATTTTCAAAGTATTATTTCACATTGTGTAAATGTATTTTTTTTTCACTA 
CTACTTTGATGGTATAGGAGCATGTTTGACTCAGATATGAACCACTTGTGGAC^ 

ACACTCTGGATTAGTTTCCACTGGTGAGAAAATAAAAAAGTGATGAGC^CTTTITC 

ACATGCTAACn^CATTTAGACTATTCTATTACC^AAATAT^ 

CTCCT CATTCTTCAAGTTCCCACTGAAAGGTCTC AT AGCTT AT CAG ACCTTC 

AGTCATGTGCAGGGCAGAGTGAGGCAAAATGCAGCCAGTGGTTAGGACCTAAAGACCTCTCCTT^ 

GAG AAGCT CAGG AAGAAAAGCCAACCATCTGAGTCCAGTGGGCTTTGAAGCAGTGGTCTTGCATCT 

GTGAAAACCATTAACTCTGCTGCTGATCCAGCTGCAGGGAAAAGCATTTCCTCTG 

TTTTGCACCAGGTCCTGATCGATGAGCTGAGCCTGAGTCCACAGTTCACCTGATGGAGAACTCA 
GAGGCTGTCTGACTGCTGCTTGAGGGCCTAGGGACCACACCCCTGGAGTCAGGAGGGCAGAGGTG 
GAAGGTAAAGATACATGTGAATATTCTATCTCCATTAGGTCAGATGGATGAAA 
AGTTAAAGCTGATAGGACACAAAGATGGTTGATTGTTCTACAGACGTGGA^ 

GCAAATAACTGAGACAGTACTTCCTAGTGATGCATTCTGCTGTCAAACAGTGTGAAAT^ 

CCCTGGTCCATCTTCTGGCCCCACAGAGATGCTGATCGAACACCCTCCACAGG 

ATTCTTCTAAGCCAGTCAGAAGAAGACAAGCTTGCAGATGGAAGGGAATCAGAACCTGGGCAATATCT^ 
CCTGTGTGTGGTTTGGAGTGTTTTCACTAAGACAAAACAGTGGGAGAAAGAAAAACCCT 
TGG^GCTTATTGTAACiaj^ACACT^ 
TTTCCGTGCTGCAGTAATTGTCAAGCTTGCGAAAA 
ACCTCTACTCTCATCCTCCACCTGGACCAGACAGAGAAACAACTT 

CATCCTTGGCCAGAGCATGGAAGGTTGGGGTGAGGGACACGGTGAACAAAAACTGCATCCTCGAGCT 

CTCTTTGGTCTTGTGTGTCSGGGTCAGAGG1TACCAGGCAGGGGTCACTAAGG 

CTCCCAGGCAGCTCAACATGGGAGGCTCCCAATTCCCCTTTCTCCT 

ACTG CAGATTGG AGGTCTGTTTTTCCACAAAAC CT AGTTCT AAG ACATTT CT AGCAAG TG AG AGTG ACTGATTGG AAGTCT AG TGG 
TGACAGCTACCCATGGGGCAGGCTTAAATAGCGACTCCTCCTTC 

TGGG CT ATTTT CTC C CATG AATTTTTGCCTATTTTGGG C CCTAAAT CAAATTGAGAAAAGTGTG ACT AAAACACT CAAACTGCCAA 

TATAGTTAGAGATATACACTTTGTGAGACTAACrCGGGAATCTACTGCCCTG 

TGAAATGAAATCCAAGCACCTTTTATCTGTTTAACTCTGGTC 

TTCCTGAAGAGAACATTTGCTACTCACATTTATGGTGGAAGATGCAATCTC^ 

GCCAATACTGGGTAGGGCTAGACTATAGGGCTCAC 

TGCAACAAACrCAGTTCAGGTCTGGGAACACTGGGCCC^ 

GTTTCTTTAATCATTAGTATTGTTTGCATACCTTTAGGAGGTACAAGCG 

TGTAGGCTTACCAAAGCAAAAACAGGGTGGTAGATAATCTCCCTTGTGCAATCAACA^ 

TATTCATACTGACTACAGAGAGCCTGGACTCCTCTAACCTGATAAAGGATATGAAGCCAGTTATCT 

AAATTCACCAGTCAATATATATTCTTCGATAATAGAGGGATTCTTT 

TACACTTACTTAATAGAGGGCTCTGGAGGGACAGACATTGAAAAAGGAAG 

GGTAAGGAC^TTCAGGATGCTTTGTCCCAGGCCTGGGCCCAGAACA^ 

GATCAGCTTCTTGTGGCCCTCTTGATAGGGACATCCTGTACTCCAAGTCATCT 

GCC^CACACACACACACACACACACACAC^CACACACACAC^ 

ACCGAGCCCCCACCCTGATCCreTGCAAGGGGCATTGTTTACTCCAGGCCTCCAA 

CCACATACTAGGCACCAGCTATGGCTTGTCCACACTACCATGCTCCAA 

TCTGCCTCTGTAGACCAGCTTAGGTACTGGCAAACCAGTC 

TCCATCCTTGGGGAGAAGCTTTCATTCTAAGCTTGTCTTTCCT^ 

CTTGTATCCTATACAAGATCTTTTATCATCGCTTAATGATTC^ 

CTCCTGATTGGATCCATATTGCTGCAGAACACAA 

AAGAG CT ACATCAAGT AAGG ATATC CACT AT CAC CACTT CT ATG CAAT AGTTT ACTGGGGGT AGGGGC GGGGTCC CAGGCAGAG CA 
GTAGGGCAAGAAAAAGGAATAAGATATTTGGAAAGGAAGAAGAAAGACTGGAA 

TT CT ACAAAG T ATT ACAG ACAAATT ATT AG AAT CGAT AGAAGAG TTT ATATGCTTAT AGGATAAAATCTT AACGAACAAAAGCCAA 

TTGTATGCTATTTAGAGCrATAAATTTAGACAATAAAAATTTCAAACATATC 

TTGCCITGTGGGAAAAATATAAAATAATATTTGGGCAAATTAAAGGTGATCTTATT 

GAACATTCAAT ATTGAG AAGTCAAATCTAAAATATCAGCACAATTCAAATT AG A^ CTGTTGGTTGAAGTTTTTAC 

G ATT AGTTT AT AATTT AAATGG AATTG AAATGGACCAAAAGT AG CC AAGACACTTTT G AAGAGCAAGG T AGAATTGATTTG CCT AA 

TCCAAATATCAGAATTGACTATGGATCTCTAGTAATTAAGACAGTGTC 

ACTAGCCCACATATGTGTAGATACTTGATTTATGGCACTGTAAATCAACT 

GAAAACTGGATATTCACTTACTTACATACATATATACATAAATGGAACTCT^ 

ATTAAGACCTGACTATGAAGGACAATAGCAGACATATTTTAGAAGAAAATATAGAGAATGC^ 

CAGAAAAGATTTCTTAAAACAAAAAAGCACTAACGATCAGGAA 

GACAAACCC^AGCACAAAGCAAGTGAATATACAAACAGA 

TAACAGACTCATTAAGTCAGTATCAGATGAACAATAAAATAGAAACGTGGGTTAAAAGAGGTA 
ATAGAAACTCCAAAAGGCTO^TTAACACAAAAATGTGCTAAAGCITAATAG 
TTTAAATTCGGGCAATACTTGGTGTAGGTGAACATGTGGATCAGTAAGGACTCTCTATGCT 
ATCACTACAGAGAACAATTTGGCATCATCTAGTAAAGCTGAGGATATCCATTCC 

ACC CT AGAGAG ATTTTT ACACGTGTCC CATT AG AAATG TGCAAG AATGTTC ACAAT AATCATTGAAAAGT AG A^ C 
TCCATCTGCTGTAGAATGGAGTG^IAATAATCATACAATAATCAC^ 

AGGATCT ACTTACTT AT AGCT ATT ATG AACAACATG GACGAGT CTCT CAAGT AT AAT ACAG AGTGAAAAAAGCAAG CT AT AAAAGG 
AAG C ATGCCGTATG ATTCCACTrATACAACTTCATATC^J^ CT ATAAAGGA 

AAG CAGTG AAATGATTATTACAAAGTCGGAGCAATTACTATATCTATATAGG AGAGTGTGGTTTGTG CAG 
CTGT ATT CTGGG AGGTTGGCAGTGG TCTATTG CTTG ACCTTGGT CCTGGTT AAGTAGGTGT CT G CTTTTGT AAAAT AATCTAAACT 
AT ACAGATACGTT AT ATG CACT CTTCT AATTT CAGATTTGAAAAGTGCTTCCTC 
GTAGAAATTTTGATGAGGTTCGAATAGTTATGGTTGAAAAACTTGATTTTCAAGAGACCA^ 

G AGAGAG CTGGG AG TATTTAAACAACAGG TTACACC CCCCCACC CCCAC CAC CACCAATCTGTTCAT CACT CT AAG C CTT CT C AT C 

TCAGATATTTCACCCTATATTTGTGTGAATTCCAAACTCATTC^ 

ATTAGGCACCAAAGGCCCAGCAAGACTGGGCCACCACACTTAAATGTG 

CACATTACTTGG CAG AGTCCATTG AGTG ACTG AGGGTTTAACCTTAAC^ 

GAATGCAGAATAAACTGAGCAATGCCCTCATTGTGCTATGAGAGTGGACA 

AAAGAGTTGGTTTAGGTTGTGCACTGTATAGGGAGAAGCCAGGGGCGGGTTGGGTTTT^ 

TCCTCCAGACTTTGCATTTGTAGGCTGAAGAGTGCTAATCTGTTTTGGCCGCATCTCTCACT 

ACAAAAATTGAACTATCTGGCAAATTGTACCGAGCACTTTGCT^ 

AAGGGGAAGGGCAATTTCCGCAGAAATGCCAGCACTATGGTGGTTTAATGCTGACACAGAGAC 

TTTCAT C CAT CAATCCAG CC AGGT AGC CAGCCAAT AAACAAATTTT AACACACAAAAAGT ATG AAG CAACATGTCT AACACTGGG A 

GGGGCTGGCGTGTGTTTATTTGGGTATATACAATGAGTCACGGATTAAAATTGCTTTTACCAGATAGATACCTA 

CATTATTGAACAAGATGTCGTCATTTCCCCATTACTTCAGAAGACCTCCT^ 

TACCAGG CTCTACCTCTGGG CTTTCT ATTCCGTTCCAGAG ACCCAGTACCAGAAT ATTTAAATCACTGTGG CGTTGAATACATTTT 

AATATCTGATAGGGCCAGTTTCTGCGCATTGCACATTTTTTTT^ 

TCGCGCCTCCTCCTCGGCTCTGGTTCCAGCCGAGCCTCTCGGACGCAGAGATGGAA^ 

CTACAGGGCGGCGGCGGCGGCGGTATCCCCGCGGGTGGCGGCCGAGTCCACC 

GCGCCGCCTCCTGCTGCCCCGGGGCCCCCAAGATGGCGGGCCCGGG 

GCCCAAGCCTGTTCGOSCCGAGGCCCCATCAACCTAGCGGa^CGGC^ 

GACGTGGAGACCGCGGGCTCCGGTCAGGCCGCAGGGCCTGTGTTGAGGGA^ 

GAGCGGCGCGAACCCCGCGGGCTGCTCTGCGCAGGGCCCCCACTGCCT 



ACGCCCCCACCACACGCCTGGGAGCCAGGGGCCGCTCCTCCCCAGCAGCCCAGGTGTC^ 

AGCCGCGCACCCGGGTGACTCCCCAGACCTGCGGTCCGACCTCCTG CT AG CCG AGC CCGCAG AACCCG C^CCTG CT CCGH CG C CCC 
GAGGCGCTGTGCGGGCAAACCTTCGCCAAGAAGCIACCAGCTGAAGATGC^ 

CTTCAAATGCCCCCTGGGTCGCTGCGGCTGGACCTTCACCACCTCrr ACAAG CTCAAGAGG CACCTGCAGTCGCACGATAAACTGC 
GGCCCTTCGGCTGCCCTGCGGAGGGCTGTGGCAAGAGCTTCACCACCGTGTA 

GAGAACrCGTTCAAATGTGAGGTGTGCGAGGAGAGCTTC CCCACG CAGGCCAAACTCGGCG CCCACCAGCGCAGCCACTTCG AACC 
CGAGAGGCCTTACCAGTGCGCGTTTTCTGGCTGCAAGAAGACA 
TCAGGGAACAGGAACTGTTTTCCTGCTCTTTCCCT^ 
AGTCACACCGGCGAGAGACCTTTCCTTTGTGACTTTGATG^ 

AAGGAAGCACGACGATGACCGGAGGTT CATGTGCCCTGTGG AAGGCTGTGGGAAATCTTTCACGAGGGCCGAACA 
ACAGCATTACCCACCTGGGCACAAAGCCTTTCGTGTGTCCTGTGGCAGGCTGCTGT 
ATTCACTCCAAGAAACACCTGCAGGATCTGGACACTTGGA 

GCACAGCATGAAGACGCACATGGTTAAAAGGCATAAGGTGGGCCAGGATCTCTTAGCTCA^ 

TCTGCTGCATTGCTGGACACAGCATTGGTGAACTCTGGAATCTTGACT 

CCTCCCTGCTAATAATAATAATTCCGTAGGGCAGGCTGTGGACCCTCCGTCCTTGATGGCCACCAG 
ATACCTCTCTCTTTTTTGGAACGGCGGCCACTGGTTTTCAGCAGAGCTC<^ 

CCATTGGGATCTCTGGACTCTTTGGCCATGAAAAACTCCAGTCCAGAGCCT 

CACAGATACTCTGACTCCTTCGAGCACCCTTTGTGAAAACAGTGTCTCAGAACTACTG^ 

ATCCTAACTCTGACTTCTTTGGACAGGAGGGAGAAACCCAGTT 

AGAAATCTTATCACTGTGACTGGCAGCTCATTTTTGGTATGAAGCAACT 
GTCAATTTTGAGGATATTCTGGACrAAATATTTAAGTGCAGTCATTTCTTT^ 

GTTTGGAGATCTAAATTCTGATCTTGAGTCTGGAACTGACAAGTTGTGra 

CTTTTTCTTGAAAATTTTCGGGCATTTTGCAATTATTATTGTTTGTA 

AAAAAGAGCTAAAAATAGGAGTTGGTTCTTCTTAGCTGTTTATCCTTCTAACCTTT^ 

TGGAAAATACAAGGGCTTTTGTTATCACTCAGACCCAGAGTGAAAT 

TATTTAGTCTCTCTTAGCCTCAGTTTTCTATCTGTAAAATGGTCATAGCAATGTGAACATATGTGTTACATCCT 

CTACCCCAGTAGGTTGTATTGAAAGGAAAATGTTATATGTGTTTCAGCATGTTTTGGTGAA 

TTTTCCCTCCTCAAAGGGGAGAAATTAGCACAAATTAGTATCAGGATTGTGCAGGAAATAAACA^ 

AAAAGGAAGTTATTTCTTCAACAGATTAAGGATTTTTCTAC^ 

TGATAGTATCTTCGGTCTGCATTCAAAGCCAGCrCATGCAATGAGTATTCAGCCTATTCTTC 
AGGGAGATTTTTCTCCCCTCTGTGCCCTAGAGAGCTCATTGCTGGCTTACTCTTAGAGTTGAAATC 



HUMAN SEQUENCE - mRNA 

CTGCTCGO^CGCCACCGCrcGGCCCCGGCCGTCCCTGG 

GGAAAAAAG AACGG AGGG AGGG AT CGCGCTG AGT AT AAAAG C CGGTTTT CGGGGCTTTATCTAACT CG CTG T AGT AATT C CAG CGA 
G AGGCAG AGGG AG CGAGCGGGCGGCCGGCTAGGGTGG AAGAGCCGGGCG AG CAGAGCTGCGCTGCGGG CGTCCTGGG AAGGGAG AT 
CCGGAGCGAATAGGGGGCTTCGCCTCTGGCCCAGCCCTCCCGCTTGATCCCCCAGGCCAGCGGTCC 
GAAACTTTGCCCATAGCAGCGGGCGGGCACTTTGCACTGGAACTTACAACACCCGAGCAA 

GCTATTCTGCCCATTTGGGGACACTTCCCCG CCGCTG CCAGGACCCGCTTCTCTGAAAGGCT CTCCTTGCAGCTGCTTAGACGCTG 

GATTTTTTTCGGGTAGTGGAAAACCAGCAGCCTCCCGCGACGAT^ 
CTACGACTCGGTGCAGCCGTATTTCTACTGCGACGAGGAGGA 

CG C CCAG CG AGGAT AT CTGG AAG AAATTCG AG C TGCTGC C CAC C CCG C CCCTGT CC C CT AG C CG CCG CTCCGGG CT CTG CTCG CC C 
TCCTACGTTGCGGTCACACCCITCTCCCTTCGGGGAGACAACGACGGCGGTGGCGGGAGCTTCT 

GGTGACCGAGCTGCTGGGAGGAGACATGGTGAACCAGAGTTTCATCTGC 

TCCAGGACTGTATGTGGAGCGGCTTCTCGGCCGCCGCCAAGCTCGTCTCAGAGA 
AGCGGCAGCCCGAACCCCGCCCGCGGCCACAGCGTCTGCTCCACCTCCAGCTTGTAC 

TCTCTCCGTCCTCGGATTCTCTGCTCTCCTCGACGGAGTCCTCCCCGCAGGG 

CCGCCCACCACCAGCAGCGACTCTGAGGAGGAACAAGAAGATGAGGAAGAAATC^ 

TGGCAAAAGGTCAGAGTCTGGATCACCTTCTGCTGGAGGCCACA^ 
TCTCCACACATCAGCACAACTACGCAGCGCCTCCCTCC1ACTCGGAAGGACT 
AGAGTCCTGAGACAGATCAGCAACAACCTGAAAATGCACCAGCCCCAGGTCCTCGG 
CAACGTCTTGGAGCGCCA.GAGGAGGAACGAGCTAAAACGGAGCTTTTTTGCCCT 

AAAAGGCCCC(IAAGGTAGTTATCCTTAAAAAAGC CACIAG CATACIATCCT CTCATTTCTGAA 

GAGGACTTGTTGCGGAAACGACGAGAACAGTTGAAACACAAACT^ 
AACGATTCCTTCTAACAGAAATGTCCTGAGCAATCACCTATGAACTTGTT^ 
CTGAGTCTTGAGACTGAAAGATTTAGCCATAATGTAAACTGCCTCAAATTGGACTTTGGGCATAA 
ATCTTTTTTTTTTCTTTAACAGATTTGTATTTAAG 


HUMAN SEQUENCE - CODING 

ATGCCCCTCAACGTTAGCTTCACCAACAGGAACTATGACCTCGACTACGACT^ 

G AACTTCT ACCAG CAG CAGCAGCAGAGCGAGCTGCAG CC CCCGGCGCCCAG CG AGGATATCTGG AAGAAATTCGAGCTGCTGCCCA 
CCCCGCCCCTGTCCCCTAGCCGCCGCTCCGGGCTCTGCTCGCCCT^ 

GACGGCGGTGGCGGGAGCTTCTCCACGGCCGACCAGCTGGAGATGGTGACCGAGCTGCTG^ 

C^TCTGCGACCCGGACGACGAGACCTTCATCAAAAACATCATCATCOIGGACTGTATGTGGAGCGGCTTCT 

TCGTCTCAGAGAAGCTGGCCTCCTACCAGGCTGCGCGCAAAGACAGCGGC 

ACCTCCAG CTTGTACCTG CAGGATCTGAGCGCCX3CCG CCTCAGAGTGCATCG 

CCCCGCAGGGCAGCCCCGAGCCCCTGGTGCTCCATGAGGAGACACCGCCCACCACCAGCAGCG^ 

GAGGAAGAAATCGATGTTGTTTCTGTGGAAAAGAGGCAGGCTCCTGGCAAAAGGTCAGAGTCT 
CAGCAAACCTCCTCACAGCCCACTGGTCCTCAAGAGGTGCCACGTCTCCACACATCAGCA^ 
GGAAGGACTATCCTGCTGCCAAGAGGGTCAAGTTGGACAGTGTCAGAGTCCTGAGACAGATCAGCAACAACC^ 
CCCACCTCCTCCGACACCGAGGAG AATGTC A AGA GGCG ftACACACA ftCGTCTTGG fi.GCGCCAG B GG AGGft a CG ^GCTA A^ACGG AG 
CITTTTTGCCCTG CGT G ACCAG ATC C CGGAGTTGGAAAACAATGAAAAGGC C C CCAAG GT AGTT AT CCTT AAAAAAG CCACAGCAT 
ACATCCTGTCCGTCCAAGCAGAGGAGCAAAAGCT 
CTTGAACAGCTACGGAACTCTTGTGCGTAA 
Table 14 

MOUSE NOMENCLATURE 
ICSGNM Bach2 
Celera mCGi2764 


HUMAN NOMENCLATURE 
HGNC BACH2 

Celera hCG33075 



MOUSE SEQUENCE - GENOMIC 

GTGCGCTCAAGATAGAGACCAGCTACACAGCAAGTTCCAGGTCGGTGT^ 

GGGGGGGGGJ^CAACAGGTAAGGGTGACCATGTAAAAATTAAAAACTTCT^ 

ACAAACCTTATTCATGAGATAAAATATGTGCAACTTGCATATC^ 

TTCAACACT CTT AAAGCAAAT AACT CAAT AACAAGCAAAG AATT AAAACATTTT CCAGAAAAT ATGT ACATG AAACGACACT CG AC 
ATCATTAAACATTAGAAAATATAGATTACAGTCATAATGAGATGTCACCTT^ 

AATGCCTGTGGGGGTGTGGAGACC CT ACTGATGGATTGTAAAAGAGTACTAT QITGGGAAGCGTTTTG AGCAGTTGTTciuUUVreT 
TAAAGAATCAGTGTATATCCCAGTACACTAACCAAGGTTCTCCAGAGAAATAGAAGGGTGTGTGTGTGT 
AGAAAGACrCGAGAATAAACGAGAATAAATGTGTTTTCAGGAACTGGT^ 
CAGGATCTAGGATCATTTGTGAGACAACTCTATGGGTCTACCTGTGGGGATT 

TGCAT AAAGGGACAG AGGGGGGGTGGCGGGGGGG CGGGGGGGGG AG T AATCTGC AGACCAAC AG TCACCATTCTG TTTC CCCACTT 

CAAATGCAGTATGGCCAGCTCAAACCTACCATGACAGACTGGATCCTTGA^ 

GCTTAAGGGCTCCATGTCTGTGTGTGTTAATrriTlTCCAAGGAAGGAGTCAT 

TT AAAACATT ATTT AG AG TG CC AGGCT AATTTT CTCG CCGTGTGAT ACA.TTG CACT AAGTGGT AGGAAGATTTT AT CACAG CAACA 

GGAAAAGTGTTGAGAAAATTAAGACAATATGGAAGCTAAGAAGTCTAAACTCTGTT^ 

CTCAGAAAGGCCTTCAGCCGATTGAATGAATCCCGCTTGTATTCTAGACAGTAATTT 

CTAAG CCCATCTTGCTTCAAGAGCAGAAACCCT AGTGTGGACTGAAT AAAG ATTGCCT CACTCAGTGGTCAGAGTGAGCAGATGGC 

ATACTCCCAAGAAGCAACATAGGAAAGACTAAATGATTTTTTT^ 

GCGTGGTGGCGCACGCCTTTGATCCCAGCACTCGGGAGGG^ 

TTC C^GGAGAGCCAGG ACT AT ACAGAGAAAC CCTGT CTCGAAAAAT AAAACAAAAC CAAAACAAAAG AC CAAAAAACCCTT AAATT 

CAAAGGGGGGGAGGGTTGTTACTATTGTTTGTTTGTTTGTTGTTTGTTTTTA^ 
GAAAGAATCAATGTCAGCAAAAAAACAACAACAAAACAACAGGAAGT^ 
CAGCTAGACCAACACAGCAGAGACTGACCAGAGGAGCGGGTGTTCCCAACACT 
G^ATTGACCAGAAGCCAOTTGGCGATGTCCCTAAAGCCAAACCAG 

CCGGGGCCTTAGTGCCAGTG CTGGGGCCTGGGAACAACATG AACTCTAGTTCTTCTACAGATTTTAATCT 

TAGAGGACCCCTAATAGACTTGAAAAGGTCCTTGAACCTTTGATTACCACAT 
GAAGGAGTCATCAGGTTGTAAAAGGACCTTTGATAG^AGAAATGTTAAAACATTATTTAGAGT^ 

TG AT ACATTG CACT AAGTGGTCTGG CACTCCCAAGTCTTGG ACT AAAAATG AATTCTTAGTG CATG ATCT CACTGCCTGCAGGGCA 
GTTTGATTCTGCCCTGGCATTTGATGGGACTTGTTAGAGGATTCGGAAAGGCA 

AAGGAAGAAAAATATATTGTGGAGTTGAATTAAATGCTGAATGGCTAA 

ACCCTGCTTAGAAACAAGTGAGTGACAAACCAGCTCCGTCTAGATGCAGCATGT^ 

TCCAAAGGTC CT r T TCCATATGGAGACCAGAGTTCCTACAGCTCTGGTGTCAAATG 
G CCCTG ATCTATAGGAGAGATTCTATATAAAATG AGGCACAGCCCTCA 

TTTTGGTCTTGATCATTAATTCCCATTACAGTAGATGGACAGTGGTG 

CTGTTTATCTAAAATGATGAAAATGGCCAATTATAAACTCTGGATACT^ 

CGGTCCACTTATCTTATGTGAGAAGATTTCTGTGCAGGAGAACTAAGCGTTCIAGAATCTTGTC^ 

ATCCTGAGGGCCCTGCCAGCCAGGGAAGAAGAGAAAAAGGACAGGCCAATT/TCT 

TCACAGATGATAAATAAAGTAGAGACGAGGATGX3GTAGATTGATGTGAGATGTAAGGTGAAGATAGAACTGA 

GGCCTAGCCAGC&GTGAGGAGCAAAGATGGTTCGTGGTTTTTTCTGCT 

GCTGCCTC^TGCCTCTCITACCTGCCGCTCGCCTCACTTGTCTTTAAGGGCTCTCG^ 
AGCTGCCATCCTTGTGTGGCCCCACAGGGTCAGCTCAGAAACTTGTGAGTGTGGGATCrc 
CACGAAGCAAGCCAGTCTGCAGATGAGACTGAGAGGCTGTGTGGCTCTCAAGAAAGGGTCAGAACT 
TCTAGAAGAAAGATAGCAGAGGTCTTAGAAGACTGCAAATCTTAGGAGGAGGCA^ 

GCTAGGAGGAAGAGTTAAGGCATTGTTGATGACCATCCAAGATCCAAGAAGAAACATTCTTC 

AGGTATGC CTGCTTGCAG CC CACAGTGAACTTGATGT CTGG CTGTGTGAT AT AGT AT C CTTGAAGT CACGTGG G CACT AAT ACC CC 
GATG CCATTTGAGAACCGACTCTGCGGATGTTTTCTGGGTGGGCGTTG CCAGGGCTGATGATCCTCTCTAGAGGCTGG ATTT CAGG 
ATGAGATTGCTATTATGGGTTTTCCCTGTGATACTCAGCAACCA 

AGATGATGATACTGCTTTTTCIAAAAGAAACTACAACAGAAGGGGAAGTGGGGGGACACACCT 

G CATGAGCACT AACAAACT AAG CAG TCGG CTTGTGT ACAAACTG CCATGGCAT CTG CTGCTTTGCCAC CTTG CTTTGCTG TTTT AC 

TGAGTACACAGAAAAAGTGAACACCATTTTTTTGAGGAGGGGAACGTACTCTCTGACTGG 
AGAGGTGGCTGCTTCCTTGGGTGAGCCTGGGTAGrrCCCTTTGTCCAGCGGTGTGCCCTTAGGTAAGGCA 
AAGGTTCTTTTATCCTCCAAGTGGGCTAATAACACTGAAAATCCCTAGCTCAGTCCCTAATAGAA^ 
CGGTAGTAATACAGACGCTGTACCCACAGACTGGTGTCTGTTAGTTACAGAGTCAATGAAAGTCTAGGAATTAGT^ 

AGTTTTCACTCATTTGTTTTTGGTTACTTCTCCTTTTCTTATAACT 

AACCTCTTTTAGGATAGGATAGATAACATCTGAATTTTGCTATATCTGGGAACATAAACACC 
ATTCAGGCTCAGGAAGAGAAGGACCGATGGTCTGAGCACTGTATCCCAGAAACTC 

TCTGTGTTGTATGTGTCTCTCTCTGTCTCTGTGTGTGTGTTCAATCTTAAT 

TTCCCCCTTTTTGCTCTGTGATAGCACAGGGCACATTTCCTTGTGTTGCTT^ 

TTGCAAC C CAAACAT CT CCAATCCACATCATAAGTGG ATGT CTAAATTCTG TCTGAAAGATAATTTTT AAATT CAT ATTT AT AG AT 
TCCAAAATTTAATCCCAAGTGAACCCATTACTTGTTAAA 
GAGTAAGGACTTTAGAACCACTGTCAGTAAATCTATAGCTAATATT^^ 
AATCTCTAAATGTTTTTATTTTTGGAATTGAACTTTTTGTTTT 
AATGCCCCTGTGTAAATGGGGATTCGTCTCAGTATTAGCCCTAAAAACCAGTAAGCTGGTTTTC 
TAGAAAACTCACCTTGACCTCTCTGTGCAGCATCTCTGCTGAGAAAACTG 
ATAACATTGATAAAGGAGAGAGAAGGAGAAACTTTTAATCCCACAGCCAAGGGGTGCCACC 
GGCTCAGCACAGACATGGACCGATGCTCTGAATACTTTAI'TITCT 
CAATTAGGGAAGGAATTTAATCTCAGGAAGACACCAGAGAATCAGTGCTGGGAGTGAGAG^ 

AAGAAGTGATTCTGCTCGTATGAGATCTATGTGTGGGTGAAAATGCCTCCACCTGCCGTCTCCT 
CTCTTAAGCCCACTAAGACCAGCTCAGCTAGGAGTTAAGCT 
GAAAACCCAATCGTTTCGCAGCTAATTGGTACAAATGTTCCCTTGCCCGCT 
TAATTAAACACCATTTACTCTTCCTACCITAAAAATGCAGGTCCTGCCGACTC 
GGAATTATAGACAGATAATTTCCCACATTTGCAGTACACTACTTTATTTGGAAATGAATC 

CTGCAGTTCCCCCTGCTTCTCTTAACAGTAACTATGAAGAAAGGAAAAGGCGGTGTTGTCATG 

CATCACCCACTGTTTCCTGAGAACTGTCCACTGAGCCACTCT 

GAGGCAGGAAGCGTCCAGCACTCCTGTATTCACCGTTGTTTCTAAGAGGCC^ 

ATATGTGCTATCCTGTGGGCCACTGGGCCTGGAAGACCTCCAAAAGAGCAGTGTCCTATGGTTGTGTTC 
GCTATTTAAATTTGACTTAACTTAAAATCAATTGCTCATTCTTAGCAG^ 

GTGCTGTGCTGTGCTGTGCTGTGAATGCTGAGGTCCCCATTATCATAGAAACCTAAATTAGTGGCCC 

CCTTGTGACTATAGCATATAGTTAACACAAAAATCCTATTTACCAACCTGCTTG^ 

GCAAGACACCATOCAGCAGTATGAATTTGTTTGACTATACCAGTTGGTGAA^ 

TCTAAGCTCTTOVCTATCTGCTGAGGCTATTTCCCTAAGGTTCTCXS 
CTGGAGGGTCACAACACTGCAGGCCAGATTGGTGGCTGGTGCTCTAGGTC 

AAGGGTCATTCTGTAGGCTCACATACCACTGTCATGGCCATGACCCAAATGACAGAA 

ATCATCTCATGACCCAAGAGAGTCACTCAGCTAGTTTGTGACCAACTGAGAAATACAACACTTAG 

CATCTATTGGCrrACCTCTTTCTGTGTCTACTGTAAAATGGGGATTA^ 

ATGTTTCTCAACCTTAAATGTATGTCTTATAATTTGAAAAATCCCCCCAA 

TTTTCCACCTCATTTAATCCTAAGAAGAACCrGTGTGAAGAGTTAGG 

GTGACACAGAGGGCTGTGTCTCrCCGGACTTAGGAACAATAATCTCTTGTAAATGTACG^ 

TTTGGG CCTTGTGGCA TlTr GCAGTTCAGGAGTT AGCGATGTTGGAAGTT ATGG CT'rTGCTCGAGTGGTG CATTTTTTCTTTTTTAC 

actttgataatctgccagttgcacagactttggttcggcatgggcgg 
taattagctttgcaaatgctagttgtacagtttcagcatccgaggagct 

gattttttttttattccctctccgtccc ccaag caatt agg ag aaacaaagtctgctctctgaggatcacttcct cacctcttcat 

ttgtataaactatgagcttggtg ac ctcaccgaatt aacgat at acctgattgcaacaaaggacaa^ ccactctac 

cctcttaggctcctaattcaggacannnnnnnnnnnnnn^ 

nnnnnnnnnnnnnnnnnnnnnnnnnnnn^ 

nnnnnnnnniwnnnnnnnnnnnnnn^ 

nnnnnnnnnnnnnnnnnnatgtgtat 

agaccatgaatttgaggattttggaggggtatgcggaaggctttggagga^ 

aaaatctttaaataattttttagagaacaaaagaagaaccgtggttctgcatccaaagtaccc 

tggaggctcccaact<;tctataactccagtctcaggggatctgacaccatctttt^ 

g catgcacatggtg cattgttatacattcaaacaaacactcatacacat aaagtaaaaat at at aaaatct 

aaagaaaaacttgtg agg acatgcttgaag ctcacat cac caacgtggt cgcttgtt agcttt ct ag ag acccaag7vaatggtcag 

aaactcaggcagcctcacactcaggtgaccctccgtgtgtcgcttgcagtgacaatcaaggaa 
agaaaaacaattgaagtgtttctatgaaatattataatactacagggattc^ 
acatgttctctcttatgtaactcatatttaatttcatatttaataattaaataa 
tggtcciacaatccaattttaaatcgaacacctttcagggtgtggagtg 

catgcatgtacatgtgtgtatccaaccccagaagttacactgaatgggcagtgagcttcaac^ 

TGGTCAAAGTTACCACTGTTCATTATTTTTTTTCCTTTACTCTT^ 
CAGAGAAGAGGAAATAAATATGTCTAATCTTATGAATGAAAATTTTATCTGAGGACT^ 
TGTCTGACTTGAAAATAAACACTTTGGAGGCATAGATGTCAGCCA 
CCACrTCGGCCTCCATACTGGGGCGTTTGAATTTGATGAAGAACGCCITCTC 
TAGCCACATCCAAGCTGTGGGGGTTCAAGTGAACACCT 

CCCGAATAGAGCCTAGATATTGGTACCTAGGGGGGATTGTCTTACTCTGGCCTTGTAGTTCGTTGAAGAG 
GTGGAGTACTGTGTTGTTCTCAAAGTTGAGTAGAGTTTGTGAGGCAGCAGGCAGGAGAT^ 
CGAACGGGCTTTCCCTTCCGGTCTAGGTGGACACTCACTGATCTATCACAAGATTGCCTCAC^ 
TGGATTGCTGTAGCCTTCTCATCrCTTCCTCTGCCCAGrrCCCAGCATCCTAAGACTC^ 

TATGTAAGTTTTTACTAAGATTT CTTTTCTTT AACCCAACAAGGCTCAGTGTG AGGCAGT CATGCCAATAATCCAAGCACACAAAA 

GTTAACTTTCTTTTAAGAAAGTGTCATGAGAAATGTAAATTAAGCAAGAAACAAGGGCTGG 
CTCAGTGGTTCTTATTGTCAGCTTGCATCAGAGTCACACTGGCTGTITTGAGCCAAA^ 
ACTAACATTTTCCAAGTGATGCTGATGTCAGTGCCCCAAGGCCCTCACTTTGAGAATCACC^ 
GTGAATGGCAAAGGCGGGAATCTATATGCTTTAATATTGAATACATTAAGAGGTAGAACCTGTACTTACCTCATTG 

CAGAAAACAGCTCTAGGGTCTGCACGGCGGCTTCAGCTCAGAAGTCCAGTGGTTACCACGCGCC^ 

CCACTCTTTGCCTTGGACTITAGGAGGTCAGAGTAACGGTGCAGCTGTTAGC^ 
CACACCATGTTTGATGTCAAAAGGAGAATCCGTGGCTCCITTGCAAAGTGAATCTTC^ 
ATGTAATTTCIATTTACAAAAGACACATCAGGGGAGGTAC^CCTCCA 
CAGAGCCTCAGTCCTGCCTTACTTTGAGGTCAGCTCATTACCATGTTAGCTTCCAAAACT 

CCCTGCACTGATTCAAACCCATGAAGAGTTTGGCTAAGTAAATCCTCACTT^ 

GCTTGTGAGCATGCTCTGTTTCCAAGCCAATGTAGACATTTAAAGTATTTTTCTTTCATT 

AGTAATGGTGATACTGTCCTGTCTAGGAAACATCCAAGAAATTGCTGGGCATTTTAAATTTGTGGTAAGGCA 

GTTTTTCTTGGATGTGGTGAATGTTGTCATGCAAAGTAGCACTTGGGATTTGCA^ 

TT AATTG AG AATGGC CG CTG TCG ATACAGT ATTTGC CA C C CAGCT ATG TT AT CCT AG ATG AAAT AGG C CAGACAT ACTT AGTG CTG 

TTGCTCATAGGAGGTACCTGGGGTTGTCAGACTCACACAGTCCATAGGGAGAAT 

GTGTATTTTTCAATGGGTACAGTGTGTGAGCTGGAAAGGTrATATTTTGGTAGAAGGTGGTGATGITGG 
TGTTCTAAAGTCTCTTATGTTACAGTCTAACATAATTTTTTrAAAGCAT 

CCTTGG CAG CAAAGT CACAG TTTG AG AGG CTG CGGT G C CATCGT ACAC CG CACTCTCATTGTGCTACTTTTTGTTTT CAATATTTT 
TCCCCTTATCCTCAGAGAAGGAGAGCTCCCITGGGTGC 
ACCGAGGCCCAACCAGGCAGTTCAAGTAGAGGGAATGTCCCATGGCAGG 
AGGACCCATATGAAGATCAACTAGGTCCAGTCCCTGATGTTTGGCT^ 

CCTCTCAGGAGACAGTCATGCTAGGTGGCTGTCTGTAAGCCTAGCAGAGGATCATAGTCTC^ 
GGCTCTCAGTCATTGGTTGGCTGTTCCCTCAGTCTCTGCCCCATCTTTATCCCT 
GAAGGCTTTGTGGGTGACTTGATGTCCCCCTC^ 

CCCCTGCTAGTAGGAATCTCAGCTAGGGTGAACCCaTACACTCCCCATATCCITr 
TGCCCTCCCCCATGATTTCCATTCTCACTCCCAGCCCTCT 
CTCTCTCTCACCCCATCTCCCACCCAGTTCCCTCTCT 
GCATCCTCCCTTGGGACCTCCCTATTACTGTGAGTTGTAGCATA^ 
GTATGCCGTAAGI ^ TTITGTCATAAGTGGGTCAGGGCT^ 

CTCATGATGTCTTTGCTCTTAATAGCAGAATAGTATTCCATT^ 
AGCTGCTAAGGACXJrACITGAACAAGTGTCTTTGTGGAA 

CTTATATAGAGCTATTCCCAGTTTTCTGAGAAACTGCCAAATTGATTTCCAAAGC 
GGAGGTGTGTTCTCCTTGTTCCACATCCTCACCAGCACATGCTGTGGC^ 
TGGAATCTCAGGGTCATTTCG ATTTGCGTTTTCCTAATG ACTAAGG AAGCT CATTTCTTTAGGTG CTTCTCAG ACATTAAAG A 

TTCCTCTGTTGAAAATTCTCTGTCTAGCTCTGTACCCCATTTTT^ 

TTT ATG T ACTTTGGAT ATT AACC CTCTGT CAG AATT AGAGTTGG T AAAG ATT CTTTT CT AAT CTGGTG ACTGCCGCTTTG TCCT AT 
TGGTGGTGTTTGTAGCCTTACAGAAGCTTTTCAGTCTCA 

GTTCAGGAAGTGTACTCCTCTACTAATATGTTCAAGGCTATTCCCACTTTCTGCATTA 
GTCTTTGATCCACTTGGACTTGAGTTTTGTGCAGGGTAAATATGGGTCTATTTGCA 
CTAGACCAGCACCATTGTTGAAGATACTTTCTTTITCCCAATGTAT 
TGTGGGTrTATTTCTGAGTCTTCAATTCTATTCCACTGATCACCCTGTCTA 
CCCTTTGAAATACAGCTCGAGATCAGGGATGGTGGCATGTGTGTTTCAGGAATAATCT 
GTGATGTCTAAACTAACTAACCCTGTGGGCCTGTAGATTTAGCCT 

TGTTGATAGTTTTTAGAAAAATGTTTGATTTGATCATTT^^ 

TTAATTTTTTGTATACATGATAGTAGTTCAGGGGATGTGATGTTGGCAGCACG 

AGAGGTTTTCCCCCAATCTCCAAATAAGCTGAGCTGGAGCGCACCGTAAAGTTCCTACCAAGTAATAA 

GTTACCAGTTTACTCTGAGTCTCCAAGCGGCTGAGTGCTAGAGAAGTC^ 

CTCACTATTACAGGGGAATTCAGGGTTCCCTTTTGT^ 

CAAGGTGGCTCAGCAGTTAAAGTAACCTGACAACTTCAATTTGGTC^ 

AAGTTGCCCTCTGACCTCCACACCTGTACCATGGCACGCGTTCATGAGCATGTGTATACACA 
CACACACACACACACACTTGCACGCACACGGATTTTTTTTA 

TGAGAGAAAAACAACTGGGTAGGCAAGCTATAGATCTCAGAAGCCGCTGGCAGGAGGA 
ATTTTAAGATGCATAAAAGCCTCCATGTTGAGCATTGGCATTC 

CTGGAAGGTCAGTGAGCATTCTCCAGACAGTGAGTGAACTGGGAGGAAATGCCCCA 

AGAGGCCATTTCACAGCACACACCTCCCTTGTGTGTACTTTACACACTTTT^ 
TTTGTTAGATGAAACCCAGGTGGAGAAAGGCTTCCAATCTTTCTGTGCCAACTG^ 
TTTTAGCATATATTCGCTTACCAGATTGCCTCACCCTCACATTAATGGTGA^ 
CACAAGAGGGCCTGGTATCTGCCACACTGGTGGCATACACTGCAGAGAATG^ 
GGACAGCATCTGTGACGCTCTACAAACCAGTGGGTTTATCGAAGCCC 

TCCCGTATTCTACCCTTAGTTGACACCAAATCAGATAATTACATGAGTCT^ 

G AATCCAAAAATAATTGTCTCCTTGTGGCAAAGCAG AGGTCACTTCGG CGTATACCTTCTTGAAAAAGT AAACT CATGCATGGTTC 
ACTTAACAACAGCTTTGCTCTGAGACCGCCAGTTTATC 

TCATCTTCTGGAACTGGTCTGAATTCTCATTAGATCCGTGTTATCCTACATAATCGTCAAA^ 

GTTCCCTCTAATCTAGCCACAAAGATACAACTCACCTACATCAGGTGCTTTTGTGTGT 

CTAACTTCATAAGCCATGCTCTGCACTGATTTGTACTGAAGATC^ 
TACAGGACGAAGAACTCCTCTTTCACGTCTTCCCTCCAGGTTCCATTACTCAG 
TCCTTATCTAGAGGATGCTTCTGATAGCCGTCATTCTCAACCCTACATTCGATTTAAAAACC^ 
TTGCAGCTCCTGGGATCATAAAAGAAAGATTGGCTCTGTACAGATGATTTTAGTGTTAGCT 

AAAGTCATACTTAATGCACAATAAGCCAAATTGATGGGTATGGGGAAGGCTGACTTAGCTCAGTATTAT 

CACCCTCCAGTAAAACTCTAATTCCATCCAAGTGCCTGCCTTCTCTTGTTCT 

TCCCCTCCCAACCTCAAAAGCAGTGTCTGTTCAGAGCGCTGGGCTCTGGTGTGCGGCTCTCTTACTGCCCT 

CTTAAGGGTTGGGGTGACGCCTCACACTTAGACAGGGTTAAGGAGGTTCTAAA^ 

GTTGCTTAAGCCTCCAGAGTAGTACCATCCACAGTCCTTACCTTAGAATAGGTATTT 

TACAAAATGGGTCTTTGGAAATTTTTACCTTTCCGTGTATAAG 

AATCCTGTGTCCTTCGTGGAACATAAACTCTATATAGGTTC 
TACATTCCACTGGGTCTTTTTTTACTATCTGCACCCA^ 

CGTTTTCGATTTGGTTGCTTG&CTTCACTG 

AGATTGCTGGTGACATTATATATGAATCTCTGGTGACATTCT CG^ 

GTTAGCTTTGTCATTGACACGGAGACTAACTTTCTCTAACCCAGACTAGTGAAAACTGAATTGTGCC 
CTTTCACTGTGCCCAGACCCTCCCATCTGTCACAGCACAGTCATTTTGTG 
GTGTGACCCACTTTTTCATAATACTTCATTCCATGTCTTTTTACAACA 
AGCATCTTCTGAGGCTTTCTCGCAAGCTCTAGAACCACATTCrAAG 

AGAGTGATGGGCCAGTTCCTCTAGTTTCTGGATAATGAGCCAGTGCACAG^ 

TGCCAATTATTTTACCAAATTAGACCTTTAAAAATACTAGCCACTTACCTGCATAG 
AGAGTAGACATTTTCATTGATACTTTGAGACCTTAGTAACTTTTCCTT 
GTGTGCCCCTAGCAGACGTCCTCTCTGATGGTTGGTTGTTTCGGTTTGGTTCTACTGCTTG 
GATACCTCACTCCGCTGTGACATCTGGGTGATAGTTAAACCTTGATTAGCTTCTGCCTGGGGTATGGGC 

GACAGAAAGCTTCAAAGAAGAGAAGGGCTAATGAGGGTCCGATGTGTCTTGCTAGA 

AAGCCAAGTGCTATTGGCTTATGTTTGGCAAAAACCATTCCT 
ACCAGAGGATTCTTTTAAATATCGTCGTTTCAAAACAAAATGTTGAAGGTGT 
GACAGTTTTTGGTGAACATTTATTTACCAGGGTCTGTCTGGCTTGTGGCAG 
ATGTGGAGCATGCCCAGAGATAAGTCAAC CTTG TT T TCCATGAAAGTCCTGAGACAG 
GAAAAAGTGTCACACACTTGAGCCACTTTGCATCCCTTGTTAGGACAAACCCT 
TAGGTCX5ATCAGTGTGGGAGGTTCAGAAAGAGTAGCG 

AGATTCATACAAATCAGAATGTGTAGTGCAAGCAAAATCATTCTGTTAATTCTGATGACT 
TGTTAAGTGGTCCATTAGGAAATAAATGTCTCTGTTTAAATAGCAAAGGCAACCTAGGTTTTTU" 
TGTCTACACATGATAGTCTGTCATTTTTCCTCTTCTAAAGGAAG 
AGTGGTGAGATCCTTGGCTGTGTCTAAAAAGTACTGGCCTCGTTTGAAAGGTCAATTTACGGTG 
AGAAAGTTATCTCTCATGAGCCCATACAGTGGTGAGGAGAGAGGGTCACTTAGCT 

TGCACACCCGTCCTGCAG CG AGCTCCATCAGCACAGATGTTGGGAGGAG CAGTTGACCAAGTG AAAACTTCCCTTCATCTGX3CTTC 
AGAGTCTCCTGCAACCACAGTAAGTGATAATGTCTCCATTTGTGGCT 
GGAAATTTGTACCTCAGTGATCCCAGCATTGATGCTGTCCCATCGCACCGACT^ 
AAGCGCACCGAGCCTTTCAGTTTGATGCGGTCCATGAGGTAGAT^ 
CATTCAGAAGCCTTATCAATGGGAGCGGCAAATCGCACTCC^ 
TGGGAGACGACAGACCCCrGCACAAAGCTTAGCCGGGAGAC^ 
CTTATCTGGAAATGAGAAAAGGTGCAGTACCTGCCCCGACGGGTGTCTGTG 

AAAGCTGTGCG CAGTGCG CTGTGCGCGTGCGGCTGCTCATT CATGTTCCCTCCTGCATCTCGAAGGTGTGCTGG AGCACGGCAG AG 
CATCCGAGGTTCTCCCGTATGTGTACAGTAGAAATTTTAGGATTTAATACACAAAGGCTC^ 

AGAAGGTGAGAGCATlTATAGAGCAGTTTCriTTTTCTTTTCTTTTT^ 

TGATTTTACTTTGTAATGACATGGACATGGTGTGCAGCCCTTCCTTTGTAGATTA 

ACACTATGCTTTTCATTTAAACCCGAGTATTTCAGTTCTAACATCTGTGCGTGTTAGGTGTAGTA 

AGCGGCTGCX3CTAATGCTGTAATCAAGCGAGCTTTTAAATATTTCAGGTAGGAACT 
TGTAGTTTCGCTCCCTAGCGAAAACATTGCCTACGCAGTGCTGATCCATCATCAGGG 

AG CACTGCGTTCATTAG TT1 1 i GTTAATTATTTT CTGTTTCATGT ATATGTGCCCTTCTCTGTc T 1 1 T i CCTCCCAACAAGGTTTT 

CTATT ACAAGTTAAATATTATTTTATAATGTAAGGGCATAG CTATAAGCAG CACATAG CTCGTGGTCACCCTTTCGAAGC CTTGGG 

GTTGGACAGTGAGAGTTCAGACTAGACGTTGCTTCAGCCTCCTCT 

AGAAATTGTG ACAG T ACCTTG ACTTTG C CT C AC CTGGGGCG CGATGAAAG TG ATGAATGGGAGCAAT AAAC ACAG AAG TT ACAG CA 

CAGCCCAGAAGCAGCGAGGGTTCACTGTGAGCAGGGAACGGATTTTAACTGCAGAGTCCCTT^ 

GTACCAAGGCTGATTGTAGCATACATATTAAGGTCAGCAGATTCT 
AGACATTCCTCAGGTTAAGGACCAAGATGGTTTGCCTTTTCrTACAGCCATAT 
ATATGCGrTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAAGAAAATCCTTT^ 
TTCCTCAGAGAAAATCCTTCCAGGTGACTTTCAGAGTTCCrrGGCATTACAGTGCATGCATGCAC^ 

ATGCGCACACTGGCACACAGTGGCACACACATGCACAC^ 

TG AAGCG TCT ATGAG AATT AT AG ACTGGAGAAGG TGAT AAATTAAACTC TGG CTGTG ATTTCCTACCTCAC CTTTGGGGGGGATGA 
CAAAGGATCTG AAATTCATC CGGTTGATACACTCT ACTTCT CAGTTTG A 

GGGTAGTGGCCAGGCTGGGCCCATTATTCCTTTGGTTGAATAATATAAAGAATTTGC^ 
AGTGCATGTCCGCATTGGTAGTTTAGTGCrCTCAGTCATGAAATGCTTTGGAGTTTATTT 

GAAATTTTCAATAAGATTGACATTTGGAGTAAGAAATATAAATCTAATGAGAGCATT 

AGAGTAAATTTATACATGCTCGCTGCAGAAGGGCACGGCGGAAGGTGAAATTGCATGCT 
TGTATCAGTGTCAGGCCGTGACACCAGAGTATAAATGACACTGTCACCAAGTGCTGCACT 
AAATTACATTACTTAGACAGACCTGAAGGCACCGTGCTGGAATGAG 
TTGCCATATAGCCAACTGCAGGATGCACAGCGCTGGCTGTGTGGGACCTGGAAGGGACA 

ACGTGACAGATATGTACAGTTGACCTAGAGAATGGGTATAAGCTGATTTCTGTGATCTGGTCAGCA 

ACAGTTGGGTGGCAATAGGGGGTC AGTGTACAT^TCCCATTCTTT CTTrTTCTCTGTTGTGAGAATCT AACCCCTGCC CC CTCGGT 
AGCACTGTTCACCTCATTTCCAGCCCCACTGTCATAAAAC^ 
TGGAGAGACAGAGGCAGGAGGATTGCATAAGAACTCTTTTTCTCAC^ 
GGCAGGAACTGGGAGAAGAGATGAAACATCAGTGATGACGCCATCTCCTCTCCT 

TCTGGCTGGGATGCCAGTCAGTCCCTGAAGAAGCCAATCACA 

TGTCCCAGAGAGAGTAGTCTGAAACATCAGGGTGGCTITCTCAGCCATGTGTC^ 
CCTCTTGGGGGACACGCTGCATGGAGAACATGGACTCTGC^ 
TAACACAGGGCACATTGGACATCAGAGGTGGCACTCAG^ 
TCATCTGCGTTTTGTGCCTGTATAGACTITATTCAT^ 

GGCAGGAGAAGAGAAAATGACATATTTTCTAATTTATTTAAGTATACCCACCT1T 

GAGGTAGAGTGCTTACGCAGGACATGCAAGGCCCTAGGTTCCATCCC 
AGTGACATCTGCGTAAAAGCTTCACACGGTGTCCATGTGAAGCCTGAGCTCACT 
CTCGCCTGCACTGGCATGTCCCCGTTTCCATCGCA 
GTGTTTACTACAAAGTTACTACGTGACGTTTTTCCCCTGGCT^ 

TGAGCTGAACCCTGGTGAAAGAGCTTTGACAGCATCTAAAATGTTTGGAGCCTC 

TGCCCTACCAGTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTCTTCTTCTTCTTCTTCT^ 

GTGGAGGATTGCGAGCACTCATTCAGAAGCTATTCTTAGTTCCCAG 
ATTAAGGAACGATCCCTCTACCTGCCACCTGATTCTGTTTTCCCCAAGGACGCCT 
TGGCTTCTGAGTCAGGCGCATCTTGCAAGGCAGTGAGT 

GCATTAGCAG CGCTTGCTGCAG ACCTGGATTCAT CGTCAG AAGTCAGCCTGTCCACGGCAGCAACITACCC^ 

TGTTGG CAGCTGACCACTGCTG CCTTGCACTGCTGTGTTCCTGGGCTCCCACTGCTGC AGATG AGAAAAAG AATGCAAACAAC^ 

GTCCCCCACCAGAGCCCATAGCCATCTTCCAGAGGACAGTGAAGTCATTGTT 

TAAGCCCTCTTGTGAGATGTGGTGAACTATGCACTGCAGACATTGTGAAGTCCT 

TCAATACCTTCTTGAa^GCCATTTAGGAGAGGAAAGTOlCTTGTCTCACAATAGCA 

AAGTGTGTTCCTTTATATAAAATACACACTGTAATTTATAGAAATCACAGTGCTCCATTGAAT^ 

CTTACCTGAAGTAGAAACTCCATCTGTTTGGTAAAAGCCTTCTTTCTTTGAAT^ 

TGTGTGTGTTGTGCACACACATGCACATGTCACTGAACACGTATAGCGA 

CTCCTGTATGTCATGGGGATTG AACTCACATTGTTAGGG CTGCTACCAGGCCCCAT ATTGAG CAGCTCGCCCACCCCTCTTCATT C 

TTTCAAT AATATTT ATTTCTTCCAAATO CTCTGAGGCTTG AT AAT A 

ATCCTCGGAACACCTTGTCACTGTTGAAGCATGGGAAAAGAGGAGCCCT 
ATATTTTTTGGTTTTGTTTTTGCATTGAGCCTTTAAGGGAGGCTTGGTATAT 
CTGGCAGAACCCTGGAATTGTTTCC^TTTTGATGTTGAATATATGACTrTCCGTGATGG 
GCCAGAGGTGTTCTCTCACGATATAAATGTAGGATTA^ 
ACCATTGCAGAATCCTCCTAATTGAAGAAATGOT 
TAAGTGTCTGATAATATGCGAGGAGTGAGCGCATTCTGTGTTAGGTCATC^ 
AGGATQVGATCAGATCCTTGCAGAGTCACTTTGGGGAG 

AGTAAACATTTCTCATCACCTGAAAGCGCCTCGGGGGCATGGCTCATGAGGCACCAGCCACTGGCACATCT 
CTCACCCAAGCAAGATAAATTACCTCGGAGACAAATTACGGAGA 
CATCTGTGGCACTTCCriTTTAI'TCl'l N i'CTCATCCTTGTAAAGGAGCCAGGGAGAAGCTC^ 
GACAGGAAATATGATAATGATGTTAGATTTCATACATTTGGGGGATTTTTT , i' , ri'CAGT 
ATGTGGGTTGGAAGAAATGCTTGCTACCCTCTGTCAGCTTTTAGTAGTTTTAGTCT^ 
AGGGATGTTTAGGGTTCTTTACCGAAAGTTCTGTTCCCTCCACTAAGAGGTGTTTT^ 
AGAGTGAACACACGTTCACTCTTTCATATAACAGGTGG^ 
GGGCAGGCGCAGGGAAATGACTGAACATGCTTTATCCCATTGTGCA 

TCCAAGCTGGGGGGGTGGGGGGAACGCTCCTGAGATGGCTTTGCCITTGACCC^ 

ATCCAGCTGAATAGAGGGACAGGACCGCCATAAAAGTGGACCTCCAGCCCCACAGAGCATTGGGTTATGACTCT 

ACCACCATCCAGTGCCTAAGGTGACGTTTACTGCAGTGGGCATGCATCCAGGTAGCACAGAGTAGGTCTGA^ 

CCAGCGTTTGTCACTTAGTGTGC^TTGCACAAGTAAAAGTTACACAGGACTGTCAGCTCCACA 

TGACTIGCAACGTGGCCATGCCCX5ACAGAAGAAGATCCCAAAGTTAAGTCCAGGAGAA 

GTGCCACCATATTTTCTCATCCCCGCTCGTCTTTGCCAAACCCTTTCCATCTGCCTGGGAAGATTGTGTC 
GACTCCTATTTTGAATGTTATCTCCrrCTAAGAAGCATCTTTAAATTTATTCTTTT 
CTCAACTGCCCrrCCCTCCCCAGTTAACCCTCTCTTAATGCCTTCTCCCATCACCAAATCCCTTCT^ 
CTTTCATGCTTTTGTGTGCGTGTAAGACGCACTGCATTTAATTAGAGGTGAACATGAGTGAG^ 

GTGCCAGGCTCATTCACAGAGGAATATGATTGCCATCTCTCTGTATCCTTTAGCTTATTACA 
TATGAGCCTCTCTCGGATCCATGATGGAGAACTAACTGGTGGGCCCAGCTCTAC^ 

TGCXjAlt^CCATGTCCTGTCCCCT AG ACAACACCrATAGTACTCTTGG CATCTTCCAACTCTTACT T TCTTTCTGCTTC CTCTTTC 

TTGGCATTCTCTGGGCTTCAGAGGGATTGATACACATCrCCCACTTAGCT 

CAGTGATGGGTCCCTGCGTTAACTGTCACCCATGTGAATAGAAGCTTCTCTGAACAAGGC 

AAACATAAATGTTGGGAGTTCAGTTTGAACAGCCAGGTGTGGTGACACACGCCTTTAATCCCAGCACT 

ATATTTCTGAGTTCGAGGCTAGCCTGGTCTACAAAGTGAGTTCCAGGACTGC 

AAAAAC AAAAAC AAACAAACAAACAAACAAAACC ATG CAGTTTG A C A CTG TG CT CAG TT AG CAAAG AAAG TT C C C CT ACTGGGT CT 

ATGACTGCCATAATCATGGAGTTTGATGAGGTTTGTAGAACTAAGTATGAATTCCTCCTACAGG 

AAGTAGCTGGCTATCCCTAAAACAATTACACCACTGTTGCAATGGTGGGCATATCTTGCCT 

GTTCACAGTTAGGTAAGACCAGTGGTGACTTTCTTCCC 

CAGCTGGATTTCTCTCTGTTTTrGCAATCAAAATGTC 

AGAGGAATGACAAGGGCCCCAGGGATCCCTCTGATCAACAACTAAGGAAGTACCCTATACCTCCGACTGG 

CAATGTCTTAATCAACATATTGTTTTCTTCCTCCTGCCCATCACCAAGTCCCTAGCAGACT 

TCATAGTATTCTACAAAGTATCCTTTATGGAGTTTTCTCTGGGGAAAAGAAGA 

ACCAAGCAGTAATATGTGGTCCACAGTCCTCAGGTTACACTAAGGATATTATGACCATACATTAG 

AGCGCT ACATGCAGTG CAC AG TG CAGGGATGAAGGACTGCGCCTGGGG AAG AGGG AG AGG ACGG AAGC CACAG TGG CTGTGTTT AG 

ATAAATGACAGACTTGCTTAAGAGATGGGCATGATACAGAGGAGA 

TACrrCTGGAACCGATGACTTTCTTTGGTAACAGTGGCTCTCGTTT 

AG AATATAAATTTCTGGGCTGTGGGGGAG ACACAAATCAGATAACCT AACACTG CTAAGATGG AGTGTCC CTGTT CCATG CAGCCT 

CCC^GCCGGGATGGAACACACTCCTGACTTCCATGTTGTATCAGTTTGTCTAAAACATATCCATCCA 

TGGGATGTGATGGAAGACTTCCTTTCCCGATGAGGGTACTGTGAGAGAGATCAGCACTCAATCCAG 
GGCTCCTTGCTCGATGGGACTGAGAGCAGCTCCTCAACACCT 
CTCCTGGGAGTTGCCATCAGGGTTAATGAACTCCGTGA^ 
CTCCTGTGTGTAGGGATTGOSGTGATGAAACAGTGTCATGTAGA 

TCCTCAGCCCTACAGAGGAAGAACAAGAGACTCCCAGCTGCATACCCGCTGCAGAGTAGGTTCCTGGGAGT 

CAGAGGTGTTCCCAGCATTCCCTGGGACCTGCCAGTGACCTGCAGTGTGCGGCCTCT 

TTGGC CG CGT CTT CTTTCCGTGT AAGATTGTTCTG ATG AC CC CAG CATGT CAAATG ACT CTCGG T AACT CTTTT ATTCT ATT GT AT 
TAATTTTTTAATTGAATGTATAATCAAACCTGGGTCCTCTAGA 

CTACATATTTGTCTTATTTTCCATGGTTATC 

TGTGTGTGTGTCTCTCTCTCrCTCTCCCTCTCTCCCTCCCrCCCTCCCTCCCTCCCT 
CCTCCCTCCCTCCCTCCCTCCCTCCTTCCCTTCGTCCCTTCCrCCCTCCTTCTAACTCCCCTCACCCCTCT 
GTGTTGTAGGTGTGCATCCGGAGGTCAGAGGAGAACTCAGCGGAACCTGGAGCTAGAACTAAGGTCGCC^ 
TCTTTGTACACTGAACCATCTTCAAGCCAACGTTTT™ 

ATCTTTTCATGTTCCTCTAGACTGCCTTCIAGTTCTTTTC 

GTCATTAACTTGCCAGACACTCAGCCATGGATTGTCT^ATGACGTTGCTGAAGTTAGGAGCG 

CATATATTAACTCAGGCACAGCAAATTG CTAAAT ACATCCCTGTAAGTCAAGAGCTTCAGGATGACAAT CCAACCACTCCT ATTGG 
AAGGGGGTGACTGAAACAGACTGGCAGTGGGCAAGGGAGGGTGAGTTTCTGCCACAGAACTC 

GGACACAT ACAGGGGCCCCAG CAGAG CATTTCCCAGGACGTG ATGG CGAGGCCCCAGTCTAACAGGAAGTCAGTGGGCTCCTGCCA 

GCCTCCACTTGTTCAGTCCTTGTCCIACAACTTGAGCATCC^ 

CTCCTCCCAGGGCCTAGTTCAGTAAATGTCAGAGTAGCTACAAAAGATGAGC^ 
GACCAGATCGAGAGATGCCGTACAAGTGTTGAAGAGC^ 
CCCTGATCCACGTGTCTGTGGATGGATCAGGAATGGTGCA^ 
GCTCTTCAAAGCTTTTGTTACTAGTCTATGCTACATTITATTTAAGAATG 

CAGAAGGTGGTATTGGTTCTCAGAAAGGAAGACAGCGATCTATTTTCCCAACT 

CCTGATGAGAAGCCCAGGGCTCTAGGTGCTGAGGTC^ITACCCACACCTTATGACT 

TCTGTGAGTGGTGAAGGAAAGAGTCCTGCCTTCTGTATAGCCCTGGGCTTTGCCAGCA^ 

CACACAGAAAGCAGGGTTGCTGCATGTATGTAGAATCrCACT 

TAAAAGCTTCATGCAACACTTGAGGTCAGrTGCTGTGTGCAGTCTAGTCAGCCACCAAATCCATGTAGA^ 

CTCTTTAAAAGAGAAGTAACTGCTGATTTTTCTCAAAATGTTCTGACCTTTGTCT

TTTTTGTTTGTTTGTTCTTCCTAAGCTGGTXSCAAAGGC^ 
AAAGAGTCACCACAATGCTTATAAAGGGCTTTCAGAGTTCAGTCC^ 
GCCGTGOSTGCAGGTGGTAGGTTCCAATGTACTTGGGGCTA^ 
GATGCCCCAGGTGTTGTCTTAGTGTGTATATCTTTGTATCTGCC 
TCCTGGTTCCGCGCTGTGTTCTGTGGTGCCTTAGCAAATCGATGCTT
AAAATAGCATAACAGTTTTTTAATGTATTTCATT

GGGGAAATGCTAAGTCATGAAAAGTTCTCAATCATTGTAGAAAAAGTGATTTTGTT^ 
TTTCAAAGAACCTTGGCGATCACTTGCITAGATTCTCTCGTCTAGCCGT 
AATAGATATTTGTTGGGTGTTGGAGAGATGGCTCAGAGGGCAAAGGCATCTCT 
CCACAGCTACTAjGAAGGAAAGAATCGACTCCCTCAAGTTGCCCTCrr

AATGAT AAT1TTT1TTTAT AAAAT ACA'l'l'l'G'ri'GGATG ACTG AG T AAACAGCT AGTGAG CTG CT CCTGTGTG CT AAGT ACCACATG 

ttaaattctggagactgaaataataaataatggatttgcatacataccccagtacttt 
ctagtgacccctctagccatcctcattcagtgtgag 
taaccagatatcatggttatcacttctaatcccataagaaagt^^ 
gctacttagcaagactatgttttaaaggtccagtggctgtggacctcagtataagagga(^
ttcccgaatacaaaaccctgaatttgatcctcagcactgcccaaaga 
aacaaagagtcggcttatgtcactctcctcttacgcctgtcttacrraaccttt 

GATCAGGATTGCCCCAAGATACAAATGGTGCTAAAGGTAGCATGGAAAAGCTACACCTCAG^ 

TAAGTTAGTCAGGCTCTCTAGATGAACAGAGTTAATAGATCTTACGGGGCATCTATCAGATCA^ 
GTCCAAAGTTGACGCTAAGCATTCTGGGAAGGCACTTGTACAGGAAACTAATGA^

AGAGGAGGGAACTACAGTCT^ATAGTTTAGCCTTGGGAGTAAACGTATAATTGAGCCCTCCCT 

ACTAATGTTTTGGGGTTTCTCTCTGCAGCCTGTGGTTTGACAGCTTCGTTCCCTAGC 

CCGCTCAACCAACTGGCACCCTTGGCAGGCTGCCTCTCCCTCCTGAAAGATCTTCCC 

AAAATGTGATCATTTAGCCCCTAACCCTGAGCTTACGCTCATTCTTAGTGGGTTGCACAG^ 
TTCTCTGGTGCTCTCGCCAGGGGCACCGCTGCATCCATCTCTCATCTCTCCCAGTCT

CAGTGTAGTTAAACACAGGAGCTGGCCATGGCTCCAGGCTGTCCCCTGCCTCCCTTG 
ACCATCCTGACACCGTAGGCACTACCTCATTCCTGTCCTC^ 

GTGTGATCGGTAGAAATGTCATCCTGATCACATCATAGCTGTCATTAGTGACTCCTTGCCCTAGTGTGGAGG 

CTGACTGTCTCTCATCAGGCATGTGCAGACAGATTAAACTCCACAGA

CCCCTTTCTGGAGTGGCCTGATCTTCTTTCCACACrTGTCAG 

CCGCCCCACCCACTACATCCACCGTGACTGTCCACCCTGGGCCGGGAATGCGTGCACACAAAGCAGAGGTGTTTTGGAGAG 

CATAAACCTCITCTTTGGAACCTACCTTAGTCTGGTTTAATTTCTCAAGAAAACACT 

GATGCATTCTGAGATATAACCATATATTGCCTTCTGTTGT^ 

CTGGGTGTCTGATACCAGCGTCATCAGCrGCTCTTTGCTGCCTAGCAAG

GGTGAACAGTGACCCACAGCTGCCACCGCTTCGGTGTTTACAAACAAAAGAAAACAAAC^ 
ATACTGCCAGCACTGCCTTTACCTTGTGACAGCAGAGGCAAGTC^ 

GCTCCCTCAAGGAGACATTTACTAAACTAGAAAACCTTGCCGAGCCCTGCTCTGGGATGGCCAG^ 

CAGCTTGTCTTTCTCTGTGGTCTCTCAGCGCCAGGGGACTGGCCTGGTGGTT^

ATATACCTCCAT AAGCCTCTGCCTCCCAAGTG CTGGGGTTAAAGG CATG CGCCACCATTGTGTGCAGAAAATGTCCTTG ACGTTGG 
CCTCCCCAGATATTTCTAGGCCTGAAGACTGCTTCCCTACAAAGACGGGTCCTACCCAGATTATCTAAACCTGTTTGCTAGATC 
GCTGTATAGTATAAACCCTGCCTCCTTGCTTATC03TGTGTAATAATTTCCACCTCTCTGCT 
CTCTGTCCAACTTAAAAGACCGACATGGTGAGCTTCCAGGGTGTTTTCATTTCTCCATCAG 

AGAAACAACTTTTCTCAGTGTCCATGTGTCAGGTC

TGTGATAGCATGGCATGGCTCTTTGGGGAGAAGAAATTCAGGAATAGTATTGTGTAACTACT 

GCGGTGATGGGCAGGGTACACCTACAGCCTGACTCGGCTTTGTGTCGTTGTCATCCAGTTGGGAAA 

TTCAAGGAATCCTTCAGGAAGGAGCTATAGCATAGCTATATAACTCCATGGCGATATGTGATGGCTGGTTCT 

TAGTAAG CACATGAGAGACTAGTAAAGTGAACCTCTTTCTG CCT AGATCITTGG AG ATGTTTGCAGACATGGACCATGAGGTCTCT 

GACCTAGTCAGTGGTTTAGTCCATTGATACATTCTAAGTCTGAAGAGATTAATC

AAGCAGG CCATGTAGGAAATGCTCT ATGGGACTGTCTTGCC CTGGTCCCTTTCTG AATTC CTCCTCTGTTTT CTGAATGCTGTGAT 
AT AAGCTGTACCGCCATGGCCCTCTCCTCCATGATG AGTTAGAACTTCTG AACTCGGG AG CT AAAATGAACCTTT CCTGCTTTAAG 
TCGTTTCTGTCAGGCATTTTGGTTTAGCTACAAGTAGCTAACCCAGTCTGCCATATTCACCCTCC 
AGCTTTTCAGATGCrGTGTCTTGAGGCTCTTTCTTGCTGGTCGCCTC 

GTAACTCGGAAGTCTGATTCACTAGATTGAGTCCCTCAGGTTTCTCCT

CATGCCTGCCTTATTCGCCCTCTGGATACACAATATTCCCCTTCTCCACATGTCTGGTGAG 
TGAAGTACCACTCACCCGTGCTGGTTCCCTGGTTC 
AGTGTGATGCAAATGGGGTGTAGCAGACACTGCTGTTGTCTT 
TCTGAATTCCTTTAACAGGGGAGACGTCTAGAAAAGTCTTGGCr^ 

ATCTGCTGATCTGTACCCCAGTTCCACTTrATGAATAATGGACAACT

CCAAGAGTTCATTGGGTTAAATGTATTTCATACTCATTAATAAAGAAAAAATTGCTGC^ 

TGAAACTATGTTTCCTAGCACTAGAATATAGATTATTAAGTCTAATTCAGATGGATATAGGTGACTCAATATATTTTGA 

GAAATATACCTGTCATTAGJUITCTCAAAACAGCTGTXK3TAGAAAGGTTGGTACATGGTAGTATCCT 

TTTAACTTAAAAGAAACTATAATACTTGTACAGAATTAAGTAAATATATTTTTC 

TACGATATATACTCAAAGTCTAGGCGTCTAGCCTTATGATTGAGACAAAGCTTCCTTTCTA

CAGCTCTAAGATTGGTATCAAAACTGATCTTATTTATAAATTGCTATGC^ 
CATCCCGGTATTACTTGTATAAC^TAATGTGATGTGGTATTACATGC^ 

TTTCCGGGGGATGTAATTAGGCAGATTCTCACACCCTCATCAAGAGATTTCCCCACCATCATTAATCT 
TCTAOVTCCAACATTAOTrGAAGATTTGTAGACGCATCCTCTAGAGTGG^ 

GGTATTTGTTATACTTTAAAAACGAACATGCACACTTTGGCATATGTAACTATTAGAC

ATTAGTTGTATTTTTGTTTTACATAGTTTGCTAGGTGGAAG^ 
GAATGTGTTGATATCTAGAAGAAGCAATGTAAAGAAGAAAAGATTTGAGTTAG 
CAGATTTCACTGATGCAGGAACCATGGTGATGCAGGGCATCATGGCAATGCAGCAGCCTCT 
GTCAGGAAGTGGGAGTGGAGATGGGATGCAGGAGTAAGAGACAGCACCTAAGAGC^ 

CTATCTTCTACAGTCCCCCAACCTCCCAAAATAGTGCCACCTTATGGTTATAGTTACTTTTGTGT

CCAGGGCAACTGACAGAAGAAACAGTTTATTGAGGGATTTTATTTTCAGAT 

AACAGG CAGGCAAG CAT AGTGCTGGGG CAATAG CTG AG AGCTT AT ATC CTGTTCTG TGAGG CAG CATGGGGGCAGAGAGGG AG AG A 
GAGAGAGAAGGAGCAAGGGGAAGAGGGAGAGAGGGAGGGAAGGAGGGGAGGAGGGGAGGAGGG 
GTGTTCTtrAAACCTCAAAGTCTGTGCCTCTGGTGACACATCTCC^ 
ACAGCCTGAGCCCAAGCATTCAAATGTATGAGCCTACGAGGGTCCTTCrCA
ACAAAACAAAAATCCCAGACTTTTGGGCCACATTC

GCTAAGTTATATTAGACCTCATGAGATTTAACTAACCGGTCCTTGTATAATATA CGTG TCCTGGCCTTTCT^ 

CGGATGACAGATTCTGTGTTACTGTCTTTTTACAATAATTTt^ 

CTTCCTTATAACCAATCAAAATCCTCCTTCCCACTCACTTCAGAATCCC 

TTTTA rrrrrri 1 1 1 1 t t ta ttttttttttat ttt tt t tt ttag^

CCAACTTTTCCTGGATCCAGCCCCGGTCTCTTCCCACTGACTTTGTT^ 

TATTCATGTATATATGGCCTTCCACTGGAGCATGATTAACTTATCAGGGGCTGCACTCTT 

TGATTTTTTTCAATAATGTTriTCTTlVlTATTACT 

GGATAAATi'T'l'Tl"! I'l TTCTATTTAGTTTGAAGGATTCCITC^CATGTCTACre 
CTGTTTTCACTATGGAAAGTTTATATTTTCTATCATATTrTCCT

CTTTCAGGGCTCAAAAAACACCATCCCACGCCTTTCTGAATT^ 

GTTCATGTAAGACTTAGCATTTCTTGTCACTTrCAATATACTTCCnTTg 

GAAGGTTCTTTTCTGATTTTCTCTGTTTGGTCCTAAATGCCrc 

ATTAT^TTCTGTTGAAAACATTTTTATGTCrTTAGA^ 
TGCTGCACATACCGTGAAACTCTTTGACGTCTGTTCT

TGATTTTCCTTG AC CCATTTTATTAGTGAGACTTT CCACAGAACTTTTTATTGGTTATGTTTTTAATTTCCATCATTTCW^ 

TTTCCATCACTGTClTlXjrriTCATAGTTCCTCTTCCGTATCTTGCACT 

TAGTCAGGAGCTTATTTCCTTTCATTTCTTGAACAATAATTC^ 

TGTTACTGAAAAGTTAGTTGTTCTGGAGAAGTTATAGTGACATGATTTT^ 
CTCGGATTATCTrcTTGCTTGGGGTTTTTTCT

AATTGGTAACGTGGGGTATGTTGTTTAGAAATTAGTTAAGTCCGGG 

ACTATTAACTGCAAGAACAACCACGGATTCTGGGTATTGCCTCTAGATGTAAATGTTGTAT^ 

TTAACTTTAATAACTTTTGGAAAATGAACAACCAAAGATAATATGAAGTATAGTT^ 

GGAAGGAGAGTGGGTGAGGCTCAAGACAAGATCAAAGGACAGTGGCAG 

GAAACAGTACCAGGGTCCACAGATATTAGAGATTGATAAAGCTAGGAAAGGGCCAGCCT

CAGGACTACACAGAGAAACCCTGTCTCAAAAAACCAACATAAAAGAACCAT 
AAATGACAGAAAGAAGGAAAGTTGTCAAAATAAAAGGAATAAAATGAG^ 

G AG T AAAGAAACACAT AG CAATCTTAAAGTCTAACTGAGCAAT AT ACCACAAATCCAAAACCATC AT AAGATGTA^ CAAGGGT 
ATAAAGGGTAATAAATAATATAAAATGAGCTGGGCAGTGGTGGCACACACCTTTAATCCC^ 

ATTTCTGACTTTGAGGCCAGCCTGGTCTACAAAGTGA

AAGAGAGAGGGGGGTGGAGAGAGACAGAGAAGCAGGGXSATAGGGTGCTGGAGCAGTGGT^ 
ACX5CGGCTTCAAGTCAAGCACCCACATGGCAGCTCA 

GTATGGGCACCAGGCACATGCAAGGTGCATAGACATGCATGGAGCCAAATCATCCATGCACATA 
CCAAAGGGGTCAAGGATGCTGGGAGAAAACAGCTCGCAGAATCAACT 

ATCACGGTG CCTG CATGGGT CTGTG CT AAGTCCTCTG CAGATG TT ATG ATTGTTTGCTTGGTG TTTTG TGGAGT AACACTGGG AGG

GGGTGTCTCTGAGTATTTTGCCTGCTCTCCAAACCCTTCT 
CTAGTTTTATTGGAACTTGTTATGCTGCATTCGGTTC 

TTAAAGAAAGTGGATAAGAAGTGGATGTTGTGGGGGGAGTAGAGAGGAGAGGGAGATGAGAGGAGTGG 
TTGAGATGTATTATATGAGAGAATGAATAAATGAATGAATGATCTTTGGGGGGGGGGGTTCGAG^ 

TGGCTGCXTCTGGAGCTTACTCTGTAGACCAGGCTGGCCTCG

AACTAAAAGTAAAAGAATACGTATGGGGTGAGTTGCAGGTACACCTGTGCTGAGTTCAGA 
TGGCAAGTTGCAAATGGTGCAGTGTGGGGAGAGGAACCTGTGAGCC^TTAATCTCTGTGACT^ 
TTGGGGAAGGGGAGGCTGTAAGTCCTTGCCTGGTCGCTTGGCACTCCTCTGTGGGCT 
GGAGCAGAATTTTCCCTCCAGTTTCTCCCGCTCTCTCTAAGCAGCTGAAT^ 

GGGTCTCCATGGGGCACTTCTTAAGGGCraSAATGTGTGGC^^

AT AGT CACTTTTTCTCTTTG TT AAGT CTTT CCCATTTT CAT AAAAAAT CT CTTTTTTTTT CTTGAT ATT AT ACT AT ATT ACAACAT 
TCCTCCATTCCITTTTCTCCCTCrGAATCGTCCCATATACCTCTC 

TATTATTGCATATATGTATACATATACATATATTTCTAAATATACACATTACTAAATGTAACCT 
TGTATGTGTATTTTCAGGCCTGACTTTAGTCTCCCACTCCT 

TGGGCTTTTCC CGGTCCATCTTAGCATGTCCATTGGTACT ACC CTTGTTCGGTTCATGGTTAGGCAGACGTGTTGGGGAGACGAG A

TGAGTATAGCTTCTGAC^rTGCTAGGAGACATAACTTCTGCCACCTTCT 
ATGGTAGTTCTCAAAATTATGGACTCAAACCATATGCCAAAATATCTATCT 

TGATTATCACACTAATTT ATTTITATTT ATATCCTGAAGTGCTG CCTTCCCCCTCCTTGGCCCTGTGTTrrCATCCTTrr CTCCCA 
GATTCACTCTGTGTACAGCCACTAATACGAGTTTTTCCTAAGTAGT 
GGCTGTACCCTACGGTTCCAACTGATGCAAAGAAAATGTCATCTTCrrAGGATACT^
TCA.CCAAGTATGGAAACAAAATGCCATAAATCAAAACCAGCAAAAACAACA^ 

GGATG CT AT ATGCTTG AATT AACAAATT AAG AAAT AAGAAT AGTT CTG CATTT AAAATTT AAG TTT AAGATGG AAT ATT AAAAAAA 
TAAACAGGAAGCAATGAGCTATCAGATCACCAGGCATCTTrCAATATC^^ 
ATAACTACTGTGGATGCCCTCACTTGTAATGAGTAAATTAGTAAAATGGAAGATTAAACTA 
AACAGACAAATGAAGCTAAAGAAAAAAGCTCCTAGATATATCAGGCATCG

CAACGTGAAGCAGAGTGCTCTCAGTGGTGGTCCATACTGCTGCTGGAGAGTTCCCT 

CAGCCTGAAGACCCTCAAATGCTACATTTCAGAAAGACTAGGGGA^ 

NNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNKNNNNIWNNNNNNNNN^ 
nnnnnNNNNKNNNNNNNNNNNN^

AGAATAATACCTTAAACITCCAAAAGCATTGTTTTAGA 

TGTGCACTACATGCATGCCTCTGACCTGGTGCTGGCAGAGGCCAGAAGTGGGC^ 

GTGAGTCCAGT CCTCTG CAAAG TCAACACTG CTGTT AACCACAG TTGT CTCG CCAGCTCCTTG AG ACAT ACTTTT AAAGTT AACAA 
TGGAAGAAGATGTAAAATCATTTTTrAAAAAATCAGAGGAAATAAAAAGC 
AACAATCCCAAAATATATAAATGACATGCATAAATC^CAACATACATAAATTATAAAAGAAATGA^
ATGGATTGAGATTAGCGGGTTTTGGTTTTTTGTTTGCTTTGTC 
GGTGGTX1ATGAGCTGAACAAGAAGTAGTCAGAATTTCTTAGCT 

AAC^TGGACTGAGAAAAGTC^TAAGTAAAGGGGTGCAAATCATTCCAGCAAAGACTAATC^ 


ccagttTCTTAAATCTATTTGGAAACATTAAA
TGTGACAGGTTCCTGATTAGACAGAAACCATTCCATCT^
TAGTGTTCATTCTGTTTCTATTTTGTGGAATAGTTTGAAGAGTATTGGTA^ 
TAAACCCATTTGGTCCTGGGCTTTGTGTGGTTGGGAAACTTAATGACTGCTTGT 
GCTTATCTGATCCTGATTTAACTTTGTTATTTGGTATCTGTCT^ 
ATAAGCTTTTGTAGTAGGATCTGAGGATTTTTTGAATTTCCTAGTTTCTGTTG
TTTGGATACTGTCTCTGTGGCCTCTGGTTAGTCTGAGTAAGGGTTCA 

TTTGCTTCTTTTTGTTCTAGAGCTTrCAGGTGTGCTGTCAGG 
TTTTTGGAGGCACTCAGTGCrGAGTTTTCCTCTT 



ATGTGGGTTTTCTGTTGTTTTrCTTGTTATCGAAGACT 

TT CTTGT ATCTGTTG AGGCCTG TTTTGTGAC C AG TAAT ATGGTCAGTTTTGG AGAAGGTAC CATGAGG TG CTGAGAAG AAGG TATA 
TC CTTTTTTTTTTTT AGG ATAAAATGTT CTATAGAT AT CTGTT AAAT CCATTTTG TCCAT AACTT ATG TT AGTTTCACTG TG T CTT 



TGTTTAGTTTCTGTTTCCATGATCTGTTCATTC 
GTGCTTTGAGCTTCAGTAAAGTTCCTTTTATGAATGTGTGTGCCCTTGCATTT

TGGTAGATTTCTCCATTGATGAGTATGAAGTGTCCTTCCTITACCTTTTTGATAAC 

AG AAT AG CTGCTCCAG CTTG TTTCTTCAGACCGTTTG CTTGG AAAATTGTTGT CCAATG TTTT ATT CTG AGGT AATGC CTGTTTTT 
GTCACTGAGGTGCATTTCCTGTATGAAGCAAAATGTTGGGTCCTGTTTATGTATCCAGTCT 
AATTAAATCCACTGATATTAAGAAATATTAAAGAATAGTTATTGTTGCTTCCTGTTATTT 
TTATCnTCTTTTGGGTTTGTTAAAAGAAGATTAC

TTATTATCCTTTGTAGGGTTGGATTTCrGGAAAGATATTGTGTGCATTTGTTTT^ 

GTAATTGAGAGTTTTGCTGGGTATAGTAGCCrGGGCTGACATTTGTGTTCTCTTAG^ 

AGCTTTCATAGTCTCTGX5TGAGAAGTCTGATAGX3ACTCCCTTTATATGTTACTTG 

TCUXiGTlTlGTGCATTTGGTGTTTTG ATT ATTATGTGT CAGG AGG AATTTCTTTT C CCACT CT ATTTGGAATT CTGT AGG C 

CTCTTATATGTTTATGGGTATCTCTTTCTTTAGGTTAGAGAAGTTTTCTTCT CTGAAGATATTTACTGG CCCTTTAA

GTTAGGAATCTTCACTCTATTCTATACCTATTATCCTTAGGTTTAGTCTTCTCATTTTGTCCT 
AGGAGCTTTTTGCATTTTGCATTTTCTTTGACTGTTGTGTCAATG 
CTCTTATATTCTGTTGGTGATGTTTGCATCAATGACTCCrGATCT 
ATAGTTTTGTTCAATTCCTTC!ACCTGTTTGGTTGTTTTCCTG 

GCTGTTTATCTGTATTCTCCTATACTTCTTTAAAGGAGTTATTTATGTCrrTTG

TTTAAATCAGAGTCCTGCTITTTCTGAGGTATCCAGGGCTCACTGTGGTG 

TT CTGTTG CTTATGTT ATTCCCCTTGCCTTT AG ACATCTGGTTATCTCTGGTGTTAG CTGGTCTTGCTGT CTCTGACTGTGGCTTG 
TCCTTCCTGCAAGCCTGTCTATCAGAACTCCTGGGAGACCAGTTCTCTCTGGGAGGAA 
TCAGCTCTGGGGTGCAGACAGAAACTGGAAGGATCCTGTCCCCAGCTGCTCC^ 
CCAGGAATTTGAGCAGAAGTGGTGGTCTTACCTGTGCTCACAGGC
ATAGATATGGAGTGCCATGGCACATGATCTAGAATTGC^ 

GCTAGAACTCTTACATATGGCTTGAAATATATAAAATGTTAAAACTACACATGACATTGGAAATGT 
TCAATATATGTAAAACAAAAATTAATAAAACTTTAGTGAAATTTCACATATTTCTTTAAGGTATC^ 

TAAT AAACATGG AATTT AAAATGG T AGG AG AAG ACATTTTTTTTTCAT AG ACACAGG AAACTG CAC CAAG CACCT AGAAT ATG CAG 

ATC CACTT CTGCATGGAGG AATTATT AAAAAAAAAAAAAAAAAG TTG ACCT CACACT AAATG C AAAACTGG TCTCAACAGATTTCCA

GGAATCACAATTAAGATAATCATATATTTTTTAAAGATTTATTTATTTACATAT^ 
AGCCTTCATGTGGTTGTTGGGCTCCAGTCAACCCCAATTGCTCTGATCAGCCCCGCTTGCT 

AT AAAT AAGTACACTG TAGTTG TCTTCAGATG CAC CAG AAG AGGG CAT CAAGT CTCATT ACGGAT GGTTG TGAG C CACCAT ATAGT 
GCTGGGATTTGAACTCAGGACCTTCAGACGAGTAGTCAGTGCTC^ 

ATTCAATAACCATACAGTCAGCCCCCTCCCAAGTTCCCCTTAAATGTGGAAATTTTCAAA

AAATTTTTAAATACTTAGAACTGAATGGTGATG 

CTGTGGGGCAGTAGAGGGCAGGATGAGGCGAAAACAGTAGGCAGGAGTGACATTG^ 

ATCCCAGAAGATGCTTCTGTGGATAGGCTCATGGTACAGGGATCAACCAGACTTTTT^ 

TTTGTAGGTGACACTGTCACCACTGCTCAC^CTTGCCACCACAGTATGAAAG 

CTGGTCTAATAGTTCCATATATAAACTAGCAATGGACCATACT

GTCTAAAACTGACCGTAAATGATGGGAAATTTGAATAAGATTGCTGGATATAAGATCAGTA 

GTCAGTATCATGGAAAAGTAACTTTTTAAGAAGTGGTCACTGAGCCAGTGGTAC^ 

TGGTCATGTGATTTAAAAACTAGCCTAGACTACATAAAAAGAATCTACCTTAA 

AAGTAATAAGTGTACTAATGTTCAGAGTCTAAAGAAACTAAAACAATTAATTGATCTGCTTCAAACCAGTAGTTCT 

ATTTCCAGATCCTTCTATGCTCTAAAAACRGCTGAGAAGTCATAAGATCTCTTACTTAATTGG^

GACAITTAAGTCAGCTACTAATTTTTAAAAATAATGAAT 

TTTCAATTTAATTGATGAAAATGTTTTTATATTTTTTCCAGCTCT 

CTCTTTCAATGTTGAATCrGCTGTGCTCrrGTAGTTTTACTTTTA 

GACCACACTTTAATAGCTCCCCAGTTTCCAAACAATGG AAGGT^ CAGT CTTTGGCAGGTCATAACT 

AAGAAGGTAGTTGCAATATGGGACCTAAAACTGTATCGCTGAGCTTGTCATGTTCCATCATATTAAAAT^

ACCCTTC^ATGCATCTTTTATCCAAGGATGGTTGTGTGAAGGCAGTTGACTATTTC<3AAAAGGC 
CTTCAAAATGATGATAClXjTTTTATGTCIAAATTTTTCA 
ACTGTOUVGCrAACAGTAGCAAATGTACACTTACCAAAACTCTGAATTTCA 
CTGTCACCTGTTTTTCCTGAAGTGACAAATCT 

ACCTCAGTTTGTCAGCCGTTCTTCCTAGTAAAAAAAAAAAAAAAAAAAAAATCACATTATGTGGGAGAAAATGTGT

TTAAACTTACAATrTGCCTGAGCACAGTGGCATCCTCCT 
CCCAGCCTGGTCTACATAGTTTCAGGCTACCCAAGTTACA^ 

GTAAAAATATTAAGATAAACTTACAGTTCAAACCACCCC^CAAATGCTCTTCCATGAGCTAA 
TACTGTACATTGCAC!AGAACIATTAAGATGTTAAGTACTCGA 
TTT ATTAAACT AATAAAATTTATTT AAAAAT ACACAACTT AGCATT A^ T CATCAAAAT AATTT AT AAT AGTT AAAT

GCCTCTTAAATGCATGTGACATCTTCTGAAACTAAAAGTAATATGCACT 

GATCGAGGTAGAAAAAAAAATCTCTTATGAAGTCCTCrTAAGAAAGGAAATTGTGACAGGTTCCTGATT 

CTCCAAGGGAAAATATGCAGTGTAACTCTGGCTTCATAGACTATATTGGGT 

GAACACTATTCCTAGTAGGT CTTTT TTGAAGO^^ 

GACTTTTAATGACTGCTTCTATTTCTTTAGTGGTTATGGGACTGTT^
TATCTGTCTAGAAAATTGTCAATTTCATCCAGATTT^
TTCCTCAGTTTCTGTTGTTGTGTCTCCCTTTTCATTTCT 

CTAAGGGTTTATCT ATCTT GT TG ATTTTTCT CAAAG AACCAG CTCCT AGTTTTGTT AAl'l CTTTGTATAGT T CT TTT TG TTT CTGT 

TTGGTTG ATTTCAG CCCTGAG TTTGATT ATTTC CTTC CATCT ACTC CTTG AG TGTATTT ACTTCT^ CTTTC 
AGGTGTG CTGTCAAC CTG CTTGAGCATG CT CTCTCCGAG ACACT CGG AGCT ATGGGTTTTC CTCHT AGG ACTG CTCTCATTGTG TC

CCATAAGTTTTGGTATAATGTGTTTTCGTTTrCATTAAATTCTAAAAG'1'ri'I' 1 AATT'rcrriATTTCrrCATTGAGCAAGTTATCA 

TTG AGTAGAGCGTTGTTCAGCTTCCATGTGTATGTGTG CTTTCCCTTGTTITCT 1XX3T ATTGAAGACCGGCTTTACTCCATGGTGA 

TCTGATAGGATGCATGGGATTATTTCAATCITCTrGTATCTGTTGAGGCCT 

GGTACCATGAGGTGCTGCGAAGAAGGTACATTCATTTTGCTTTAGGA 
CTTAACTTCTATTAGTTTCACTGTGTCTCTGTTTAGTTTCT

CTACT ATTATTGTGTGGGGTGCAATGTGTGC 11 TGAGCTTAAGTTTCTCITATC 1 l"i CCTTGCATTTGGAGCACAGAT 

CTrCAGAATTGAGAGTTCAiriTGGTAGAi TTT1 CCTTrGACTAGTATGAAGTGTCCTTCCTTATCri'l M l"ri'GATAACTriTGGTT 

GAAAGTTG1T1TTATTAGATACTAGAATGGCTACTCCAGCTTGTTTCTTGGGA 

ACTCTGAGGTAGTGTCTGCCTTTGTCACTGAGGTGCATTTCCT 
TAGT CTATGTCTTTTGTGGGGGAATTGAG C C CATTG AT ATT AAG AG ATATT AG AG AAAAGTAATTGTT ACTG CCTGTT ATTTTTGT

TGTTAGAGGTGGAATTATGTTTGTGTGGCTTTCTTCTTTTG 

CCCTCCTTGTGTTGGTGTTTTACAACTTTGTAGGGCTGGGTTTATGGAAAG^ 

TGGTTTCT CCAT CT ATGGT AATTG AGAGTTG TG CTGGG TAT AG T AG CCTGGGCTGG CATTTGTGTTCT CTT AGGG TCTG T AT AATC 
TGCCCAGGATCTTCTAGCTTTTATAGTCTCTGGTGATGAGTGTAGTGTAATTCTGAG 
TTACTGTTTTTAATATTCTTTCTTTGTTTTGTGCATTTGGTGTTTrGAT^

CTATTTGGAATTCTGTAGGCGTCTTGT ATGTTCATGGGCAT CTCITTCTTTAGGTTAGAGAGGTTTTCTTCr ATAATTTTG CTGAA 
G ATATTTACCGGCCCTTTAAGTTGGG AATGTTTG CTCTCTT CT ATAC CTTTTAACCTTAGGTTT'CATTTTCTCATTGTGTCATGGA 
TTTCCreGATGTTTTTGGGTTAGGAGAGTTTTGCTTTTTGTGTTTTGACTGT^ 

GGTTGTCTCCCTTTGTGATTTCITTATTGTTTCTATTTCCATTTTT

TT ATGTTTTCCTG CAATTCT CTAAG AG ATTTTTGTGTTTC CTGTTTTAC CTATG CTCTC CTG T ATTTC 

TTT AAGGGAGTT ATTTATGT C CTTCTT AAAG AC CTAT ATCATCAT C ATAAGAAATGATTTTTAG AT C CAGATCTTGCTTTTCCAGT 
GTG ATGTTGTTC CAGGACTT C CT ATGGTGG AAGAATTGG ATTCTG AT AATG CCAGGT AACCTTGGTTTCTGTTGATTC 
TGCTTACCTCCTGCCATCTGCITGTTGCTAGTGCr^ 

GTGTCAGAACTCCTCAGAGTCCAGCTTTCTCrTGGCATCCT

CTGGGTATGTCAGAGCTCCCATTAGTCAAACTACCTCTGGGACCCTC 
TGCGATCTTGGTCATGTTAGAGGCCCTGAGAGTGGAGCTTCCTCTG^ 
GAACAAGTACATTTTAAAGTAAAACTCATGTCTTTATGTT^ 
ATACCAGCTTCCTGCTGAGGC^CCAGAGTTTTACCAACCATTGTGTTTGCT^ 

TTAAGCTTC TT TGTATTGCAAGGTGAGCATGAGTACn'GTGAA

ACAGAAGAACAAAAGTCCATACGAAGTGCAAA^ 

TCAGG AG CCATGGCTGGGCGGTTATGAG CACTAGCTGCTCTTCCAG AGGTTCCTGGTTTAATTCCCAGCACCCAAATC 
AACTGCCCAT AACTCT AGTTC CAGGGGATATGATGCC CATGTCTCACCTCCTCGAGCACCTG AAAC!ACAC^ CACAT ACATA 

CATGCATACAGTCCAAATACTCATACATGTAAAAATAATTTACCTTCAG^ 

TCITTCTTCAGACTTAGGTCCTGTCCCAAGACAGCTC

ATTCTCTTGGTCCCTAGGATTTTGCATAAGTGATACrTCACATGTAGTATGA 
CAAACAATCCTAGAGATACAGATAGATCCITTGTC^ 
AGACAGGCXSTCATCTGTAGTCCrCCGCTGACTTTCTTTTGTGTCAC 
GTTTACTGTTCATTTGCrrTGGGTGTTTATTGTCTGTCTCAAGACCTGACATATGAGTCT 

GAAACCTAAAGCACTCteGTAAGTGTCCATGGGTTAGTTAA^

TTCGGAGAGGTTCATGGCTAGAAATAGCTAGTTTTACrrCATTCTGGCTTCTCT 
CCITCCAGGATTGACTCrGTCCCACCAGGATCCTCGAGTTGGATAT^ 

AGAGATGGATGGATACCGAGCCACTACCAATGACAGCTCTCGGT^ 

TCATTCAGAAAGTGTTTGTTAGAGGGATTTTTTTTTCAGAATTAAAATTGAAACT

AGAATTAATT TT TTGTACATTTTAGAAACGTTTG CTT CAGTCTTCAATTCCCGAG AAACCATTGTATCTATTCAGTCTTTCGCACA 

GAAGTTCTC^AGGATCTTTCCTAAGAACTGGCAATGAGATTCATAGTTCATTCGCC 

GTC^CTGATTATCGTGTCTGGTTTAAAGACAGTGGACCAAACCCTCGTCT 

TGTAGTGGGATGCCTGGGAGTCAGGTGGAAGCCAACAGTAGAATGTGGGATGTGCATCCAACCACT 

GTTTGTTTCTG^jTGTTCAGTATTTGAAATCTTTAACAGCAAGAATAGATGTAGA

TTCTCTAACTCAAACTGAAGGTCAGTGCCCAGCCAA^ 

AAGAGATAACGGTGTGATTTCTGAAAGTTACACTG CTGTCT ATTTCCAC ACGTATCATTT AGCTTAACTTGG CTTTAG TTACACTG 

TAAACTTGCCIXKKaAATGTCAAAAGTGGTGAGTGTTGGAGCCAGACCCAAGT^ 

CACTCTAT(ZAAGGAAGCC!ATAC1AAGTGGGAGCAGGGGGAAGAAGTGATA 

AGAATAGCACTGGCGCTGAGTCCTGGGTGGGGAGGGAGC^TGCCTTTG^

TGTAt^CGGGCCAGAGTAGTTGAAGCn'CGGGTTAGACTCITAACAGCTTGGAGAAAGGAA^ 
ACACACAGGACACCCACACACAAGCAGCTCTCTCTGATTAACTTTCT 
TTAAGGTCCAAGCTAGGCCTCTGCCAATACCTGCTCAGGCTTCC 
TCCACATGGGAAGTGAGGTATTTGAGGTATGTAAAGGAACCATCTAATGTGCTGCT 

AGAGGGCATCTTTGCACTGTGGACATCCCCCATCAGTAAGACAAGGCCC^

GTCCTGAGGCTCACTGTCTCCCATTCCTCCCCCAACCCTAGAGAACACTCACCATG 
TACTGGCAGCTAATGCTCATTCTAAGTCAACATTTTACCT 
CTTCATCATCGGGTCGTCGGTGGTCATTTTGAGAACTACACA.GGGATGCAGTAC^ 
CAAGTCACCATGCCTTTAGAGAAAAGAGGCCTAGAAGACTACAGAAA 

CATAAGGCAATTCTAACGCCTGTGATTAGAACAAAAAATAGTGGTGTGCGAAACT

ATGTGTGGGGAAGAAAACTTCACAGAGGCGGTTCTGTCAATGTCATCT^ 
GCCTGAGAAGCTAGTGTTTGGGTGATACAGTATTTGAAATGATTTAA^ 

r^AGAAAf^ATGACACACTACAGATAAAATGCCCAAGATGTTGAAATTG 
CATGGTTCATCTCATTTCTTCCCG AG ACT CACTCTGGTGATCTCTG CCTCCCCCTAC^CACITACAAG ATGTTCACTGGG AGAAGT
GCTTAGCTACAAAGCTGCCTAAAGTAAGCTGGGGTACCCCACAGGTACAGGGC^^
AGGCCAAAGGGGAAAAAAATCTAAATGGACCATCTCTGCAGGAAGAAAATAAA^ 
CTTATGTCCAATTTCTGATAGACTCCAATTATATCAACTTGTCGACCTCCTCCTT 
AGGCCACGTGAGCTCGCTCCCAGGATGATCTCATCCAGTTGTGTGGGTTTTAAATCAT 
CTCTAGAATTAACCCTGTC CCCTGGTCCTTTTG ATCC CTTCCTAGGG CCATCCATGGGATGTTGAGTCTGAGCACACCTCCTCTCA

CTCCCAAACCCGAGCCTCTCCCTGGGGTTTCCCGTTCACCAAAGCCACCCCGGGCC^ 
CCATCATTAACTCTATTCCTTTACTTCCTCCACCTACTCATTAGCACATCT^ 
GCCACTTTGCCATCTTTTTTGCTGTTTCCTCCCTCTTCT 

ATG CCTCCCTG CCAG C CCATTC CACTT AAA(^TCAAAGTGATCATTTTGAAG CAT AAATGAAAT C CT ATACAACT CAC CCACT ATT 
GTT AT ACAT AGACTT CAG ACTACT CG CAAACCTCCACGT ACCCTG CCAGCGCTTCACCTTTGT AACC CT CT CTG TC CATGACCT C C
AGCCACTCTGGAAGGTTAGCATAGTGTCAAGCCTGCTCAT^ 

ACATGGTCTGCTCACT CG AGTCACACAGAATCTGCTCAAGGG ACTTACTGTGACCAAGATC 
TCATCCA.CTCTCTTGTCATGCCTACACTTTTCTCTCTATCAC^ 

T AAGTGCTCCCTGAG ACATG TTCCCT CG<J'l ITGTT C CTG CTG CATGCC CAGGTACCTAGAGG AAAGT ATG ACAT AAGACATGGAAT 
GTTGGAGAC^GCCAGTATTCACTGAATGCCCATCTTTGTTGTAGGTGTT
CTGCCCTAG<XiAA.CTCACCCCTTGGGTGGCAGAGACAGACAAG 
TTGAAACCCCCAACTGTGCCCGCAGGAATCGGAAAGGTAGCCAGAAAGAG 

GTTTT CAAGGT AGAGGAG AGTGGGAAGGACACTTCAGGTGTGG AAGTC CAGG TG CAGCAG CG TGG CACTGGG ATAGAT AACATTC 
AATGGACCGGGAATGTGACCAGCTTATGAGAGAGCTCAAAGATGTC 
TTCTTGGCGCTGAACGTGCTCCTTGGAATTTTCCCTTTTAGTAAGC^
GTCATCCTTGTATTTTCATTCTCTTACACTCGACGTCT 
GCCCAGCCTCTGCTCGCCAAGCTTATATGAATGTCTGACC^ 

TCTTATGACCTTCATC^CCCCATTCCAGTCAAAAGCCAGCACTGTCATTGTCAACTGCT 
GCATC^ATCAACTACGTGGGTGCCTAGAAAAGAGTCTCTGTGTGTGTT 

GCATATCAAAGTCTGTGTCTTCTATATCTATATGTCTTCTTTCCTTCCCCTCCCCT

CCTCTTCAACCCTCTCCCACACCACCATTCTCCTCCTCACCT^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNTOINN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT

CCTCCTCCCACITTTCCCTACTCrCCCCACTGGC(^CCATCAGATAGGAA 
GATGTCACACCTCACACAGAGACAATGGAGCTAAGTGATTGTAGACT 
CrTTGGGTTGTTTTATCTrAAGTAT rri 'C ri 'CACAGTGACAGA^ 
AAAAAAAATCAAAATAAGCAGAAAAGTTTAGCTTACTTGATCAATAAA 

TAAAAGGAATCATGAAGTCCATTTC^CTCAGATTTCTATAAATAAAATTTTA

GTAATTCAGCCCTCTGAA^i liAAAGAGAAAGTCTATAAGATTGACCACTC 

AGACAAAAAGCCTC!ATAGAGGCAGCTTACAGGAATAAGAGTTGTGTCAGCTTGTGG 

AGCCITGGAGGATGGGAAGTTGCCACTGCCAGTGTGGGACACGC^ 

ATCTGGAAG ATGAG AAAC CAGAGTX3TTAGAAGGTT AGAACCTTCGAAGCTG ACCCTAAGCATCTCGTGTCT CACA 

GTCCCTCCCAAATGCTGCACAGCCTCCAAAACAAAGCCTTCAATGAGGCACCGC^

ACACIAGCACCAGCTGTGGGCCATGCGTTCACATGGAAG 

CAGTGTCATGGGCAGATGGATTTTATTTCAGGAGTAAAGGATGTTACAATAGAAA 

AC CTT AT AATTT ACT AAGTTTAT CTTTT AT ACCAAACACT AT ACTG CTTTT AT CAATTCTTT AAAGGT ACAG AT AT AATCCAGTGT 
CCACTTTTCATCAAAATGTCTTTAAAATTAAAAGTGATCCTTCCTTAACACAAC^ 

CCTCAGGCTGGCCTTGAACTTGGJATCCTCCCCCTTTGACTACT

TAAGTTCAGCATTATTCTGAATAATTTTGTCCA 

GTTAAGCTG ACATGTTTT ATTTGTATAGGGTATGAGTTG CTTCAATTTCATGCTCATTTCTATGAAG CAACTAAAATGTAGTATAA 

OUVAACTCAAGCAATGAGTTTATAGAATGAGTATCCTTTGTGT^ 

TCCT ATTT AC AACAG CAC^ AAGTT CAAATGTCAGTT AT AAAT CTG AA 

AG CTGGGTG TGGTGG CG CATG CCTTTAATC CCAG CAG TTGGGAGG CAG AGG CAGGCGGATTT CTGAG TTTGAGG CCAG CCTGGT CT

ACAT AGTGAG TTC CAGGACAG CGAGGG CT AT ATAGAT AATC C CTG TCT CCAAAAACCAAAAAC CAAAAACCAAAAAAAAAAAAAAA 
AAAAGAATCTTTAATACATATGTGTATGTATGTATGAATGTATGTGAATAAAAATTTTAAA 
AAAAATGTAAATAAATGGAAAGGCAATCCCTATTTTTAAATAGGGAATTTCAT^ 
CAAGTAAAAAGACTTTTATAAACAAACAAAATGGTGCCTAATCAATGTGGGAAG 

AAATTATGATTTAGATATTAAAATGTATTATATTTGAATA^

ATTTTAAATTTTAATCTACATTGGGAGGTTCTCATGTCAGTAATATAAACCT 
TAGCTGTCAAGGGGAAATCCGCTTC^CAGTTTCATTTTATTGCAAAATAAAATCTAACTA 

T AAT AT CT ACAGGTG CAGGGGACCTT CCAAAGTACAAT AGAAAAT CCAAAAGCT^ CATAAAATG AATT CAGTT 

CCAT AAAAATT AG ACACTTG AGAGG CAGAT AAATAGAACTTG CAAAGG CAAGTG ACAAAGT AG AGAG AACATTT CTG CAACT AAT C 

TATATAAAATTTTTTAAAAAGCTAACTCCCCAACCAATAAAG

AAAAATGTAAGGTTATGCCGGGCGGTGGAGGCACACTTTTAATCCTAGTACTTGGC^ 
AGCCAGTCTAGTCAGCCTAGTCTATGGAGTGAGTTCTAGGGCAGCCAC^ 
AGAACACAGXICACAGAGTCAACrAATCAGGGCTCCCAGGGCCCACAG^ 
GTCCGAGCTGGGGTTGTTTAGCTTGGGCTTTTGTGGGAGT 

CCTGGGTTC^CTCAGCCAGCCTTGAAATGAGGGTTTGAGTCTAGTCT^

GGCCTGCTCTTCCTGAAGGAAAAAGGAAGAACAGTGGATCT 

TGCGGTTGGGATGTGTCTTAAAGAGTAAGTCTTAAAAGCAGAAGGT^ 

CATACAAAATATAATCAGCTCACTTATGAAAAATGAATGCAAATCAAAT 

GTTTAACAATATGATCTATTAGCAAGACTATGGATAAAGGAGTACTGATAAATACCACTACTGCTGCC 

CT CT C^U^G AACAGTTTC^ CAGT ATCAGAACT CT AAAAG CAG T CTCCAAAG T ATATCCCAAG ATGT CACATACACCTG AG ACAGG A

AATAGCGACAGACCTTGCACGCCAGTGTCTTTCCCCCCTAAGTCACCGTCTCACT 
CTCCTGTATCGGGCTCCCAAGCACACACCGGCTCACCTGTCT 
GTGTCCTGAGGTGTGATACTAAAGGTGATACGCATGTCTTCATCC 
TCTTAGCAACCTCAGTATATGAGTTCTTTATCTCACCAAGAATTCCCACTT^ 
CTGAAAAGCTATCAGTTTTAAAAAAAAGTCGGGTACATAGTTATA
ATTCATTCATTCATTOVTTTTTGCTT

CCGCCCCTTCCCCTTCTCATCTGAGCAGGTGGGCCCCaSGGTGTACCCCCACTCT 

AT CCTCTCCCACTG AAAC CAGACAAAG CAACTCAGCT AGAAG AACAT ATT C CACATAC AG GCAACAG CTTTTGAG AT ACC CT C CC C 
TGCTCCCATTGTTCCAGAACCTCATGAAAACCAAGCTGCACATCTGCTATGTATG 
TTCTTTGATTGGTGGCTCAGTCTCTGAGAGCCCCAAGGGTCCAGGTTAGTTAACTCTGTTGG
ATAGGAGTCCGAAATCCTGGAAGGACTCTTTTACCCATGGAATXIATACTCAAATT^ 
CATATTTATACATATGTGTGTGTGTACGTGTTTGAATATGTGTACATATGTATACAGACACACATArc 

CCAGAGGATAACTTGGATGGCTTCCTCAATCATCACCATATTA'I'r'l'l 1 A 1 1 1 GAGGCAGGGTTTTTCGCIt^CCTAGAACTTGCTG 

CAGCTGGCTTCTTTACATGGNNNNNNNNNNNNNN^

AATGAGAATTAAAGTGGATATCCAGGTGCACATCTCTATGCATATATATGGAGACCAAAC^ 

AGCTTCTGCCTGTCTGCCTCTCCAGTCCTGGGATTCTAAACACACACCA 

TGATCTGTCTCTCCAGCTCACAGGTAGGGAGATTTTAGA^ 
AGTAATGGGTTTCATTGTGATGTTTCCATGAAAGCCTTTAACACAA1TTG

CCTTCTCCTAATCCCCTTTTCTCTTCCTGGAAAGCCACCTCCTCCTT^ 

TGATATTTATCTTTATGAG<rCTGGCTTTATATTACTTAATATGATATTATCT 

ATC^GTGGTTGGGTCAATGGCTGGGTAGATGGATGGATGGATAGATGGATGGA 

CAATTCTGTATACCTTT<K3AGACTACATAGTCTGATT 
AAGTATTCTGACTTCGACTI'CITTCACTGTATATCCAGCAATGGCATAGCTAGATTG

T ATT CTGATTTCCATGGT ATCT ATCCTGGCTTCTATT^ 

TCTCTGTTATCTTGGTGGCAGCCATTCTCACTCGGGGAGATGAAATCTCAGTGAAGTT^ 

TG CTGGACAT ATCT ATTG CTIT ATT AGT ATTT CTTCTTT AAAAAT GT ATCTGTTCAG CTG^ CACA 
TGTTCTTTGTATTTTCTGGATACCAATCCGCTGTTAGATGAAT^ 

CGCTAATGGTTTCCTTTGTTGTGCAGAAGCTTTTCAA

GGCGGCCTATTTTAAAAAACAATAATCACTACCTGTGCCTGTGAGCT 
CTCAGGTCTCTAG CTAGGACTGTTTTGAG CTGATTTCCGTTGCITCAGTCTTCTACCIX»CT 
TCATTCTTCAGGATGCTGTGTTGTCACCTTGTTGGGAACGAGATGGCTG 
TTTCCATTCTGTTGCTGCAGTAAAATATCCTGAACAAAAGCAATTCA^ 

GGTGCACTGAAGGGGGTTTCTTACCTGGGCAGTAATTCAGCTGGCGG^

AAAGGGGGGAAAATGAGTTTCTCCTCTCCTTGGGGAGCCT CCAT CATGATTTATGGTTCTGTAAAGTAGGGGGTG CTTCACAGAAA 
CTGTTTCTCTTCATTGTAGAGAAGGAGTGTTGTGAAGATGGGCTTTT^ 

ACATGGCCCCATCTCTCCCTGGGCAGCAGTGTGCATTTGTTTTTGATTGACAGGTCAGCAGGTTTA 
TGTGTGGCTTCCCATTGGTrATGCATCCATGTCACCCATGAGGTATAATTAATCTCATAGTTCTCACCA 

CTAATTTAATGAAATAATTATCIATGTTGTCCCCAGCT^

AC^AATTGAAAAAGGAGTTAGCTACTGTAGCCATGTTGAAATTCAACCCTTA 
TGGCTGACATTGGTGGTGGAATAACTGGGATGGGACACAGAGAATCCTTGA^ 
TGGATTTCACACATATATTCATTTATCAAAACTCAATGGCTTTTTCCACTCAAGAT^ 

CAAGAAAAAAAATC CATCAATCAG TCTT AATTTCT AG T ACAT ATT ATGC CT C CTGAAATATTT ATGGAAGAGTG TGGTCATTTCTG 

4 0 CAATCAG CTTTG CTCTGCAGTG AAAAAATAATG<3 ATGAGG ACATG^ ATGCTAGAG ATGTGTAAAGAAATGTCT CAG A 

TTCTAAACAGAGAGTATGTATGGATT<K3ATGTTCCTTATCCA 
GAGTATTTGCATATGCTTAGTGAGCTGTTAGATATGAGATTCCACT 
ATCATACTGTGAACCCCAAGATTGTGTTGTTTAATGAAAAAAATCTGTTTCTAGTTG 
TTAATTGAAAACAATAAGGTACAGTTAGTTTGTAGAA 

GACTAATCAGAGAAAGATTTGACAGAATAAGTCAGAGGTAGTATATACCCIAAGT^

TTAAGTGGCAGCATGGAAAGAGTAGGACAGGGGGCAGTTTACCAGCAGAGTTTTACAGAGATGGTTG 
GCACAGGTGAAGACAGAAGGAGTCAGAAGATGAGAACATATTGCC^^ 

G AAG CCAG TTTG AATCAGT CAACTTGG AG ACG AG TTTTGAG CCAG AACAG CTG AG TTG AACCAG C CAG C CAAAGTT CAG AAAG AAC 
TAGGAAGTGTGAGCTTATTCAGCAGTGACTCTCATAGGCTGAAATTATTCrAAGCCT 

CCAGGACTAGGCCTACGTTAGCAGACAGAGGCAGTAAGCCTCGGAGACATTTACT

GT AGTTTACATAT AT AG ACAAAAG T CACTTCT^TCIATAATT ATT ATTTTAT ACATGAAACGAAGTGT CATGGT ATC CAGTTTTCTTT 

TTGTGGTGACATATCAGCACrAAATATGTTTTAGAGTTTGAGGATT^ 

TTAATATTATGTGAGTCTTTTTATAAACTTGAAACCITTTTTTTA 

GAGGGAAAACTAGGAAAGGGAATAACATTTGAAATGTAAATAAAGAAAATTACTAATAAAAAATAT^ 

TTATTATATGTAAATACATTGTAGCTGTCCTCAGACACCC

GTGGTCGCTGGGAATTGAACTt^GGACCTTTGC^ 

GAAATCTTAAATAATTTTTCTAAAAATCAAAGGGCAAAACCACT 

TTACCCACATTCTTCAGGTTTTCACATCTGCTCCTTC 

AGATCTTT ATG ATCT AT AGAG GG C C C CTTTT ACTGGG GGTG AGG ATTTG AAAGACAATTTT AAAAGGAC^G A 

AGCGTGGCGACATAOVGGATAGTCTTCTGCATCACCACTAAA

ATTTTTATTGTCACCTAAGGAAGACCTGAAGAACTACCATTCT 
TGTACATAACTCCCTTATAGAAATGACGTGGAAGTTACCAAGCT 

TTCCTCTTGGTGT ATTTTTCCCTCGTGCAATAT AGT AGAAAG CT AAAGCC AG ACACATGTG CACC CCTCTTGTAGTTACCAAATGG 
CrcrcTGATCTTGTGGGCATTGCrTACTACCTTTC 
AGTCCCTTCTACCATTGGGCTGCACCTGCTGTAGTGAAATCTTGAACCC^

CCAGATGAGATACAGATCTAC71AGAGACTCATTTACATACTGCTTAAATACT 
TCCTTACTATCTGAACAGCCAGAAAGC^AGATGTATTCTAAACCTCTACTAA 



ACX5CTGCTCTGTCAATTTACAGCCAGAGTCITTGCCTTCTCA 

GTGTTCAAACCACTCAGAGCCATTAGCTGGGTAAACTTTGACCT

TTCAACTTCTAGTGIAATATGTCCTTGGCTTACTGTAATTACCCAAGGCTGG 
ACAC^GTACATTATAGTCCAGATGTGGTTGTATCCACGAGTCTTTATTGTAGTGTACATGC^ 
CCTATGTATTGTGTCCATTTGATTAGCTGTAAACTGGATGTCAGTAGTC^AGCTGCr^ 
ACTTGACTTTACTAATGTACTCAGGCTGAAGATGTGTTAG^ 

ATATTTTCAATTATCTTTCATATTCAAAGAGAGACATTAATATAGCTGAGGA
TAAATTGCTAATACTGTTGTAATCTGTAATTATAGAAATt^GTGTGAGAA
TCCTGAATGAATTATTAATCAGAATCAGTCT^^ 

TGATTTTTGTGTTAATGTTTTATGATGTTGTAATTGATAATTAATTGGCCA 
GGAAATGTTCAGTGTGCTGAGTTCATTAACTAAGCAGTAATCATTCTGCAAAGAC 
TGCTTAAAAGTGGTGTCTTAAGCTTGAGCTTTTC^CATT^
GAAGACATCTTGATAG CT1T CAGATTGGGTCAATTTT 
TGGTATGCCTCTCTATATGTCAATTTCCATCCTATTCTA 

CGGCCATAGACCTAAGAGCTCTTTCCTCCTGGTCTTGGTGAGTATAACACATCA 

CATCTTTCCCTATTGTTCAAAACACTATAT^ 
TTGACTGGCATTCTAAGCAGGAATTCCTGGAGTGAGTTTCTm

TAAGAATGCCTTTCACCCAGCCTGATTCCTATTGAAAAGATGCATTGTTGGG 

AAGAGGCCAAGGAATCTTTTTGGTTTTCAAAGTATCTGCTTTAGAAAC^ 

ATTCTGAGGAATTCTGAAACATTGTATTAGACAATGTTGACAGTGC^ 

AGCAGATTAGAAAACCATGGCGGGGAGGAACAAAGGGGTTAGGGGACTGTTTCTC 
CAAGAAGCTCTCAACTGGCTTATTTTTGTGAGTTGCACTTGGTTCT

CGGACCACAGGAAAAACGCACTTGCTAGATGGTCTTCATTGATCTTATTG^ 

TCTCCTCTCCACTCCAGGTACTCCACAGCC^TGTTGATCCCA^ 

ACTAACCGCCTTCTGAACTCTCAGCTGAGGACCTGTGCCGCTGTCACCnTCT 

TGTGTTTATAAACAGGCGGGGTCAGAGTCATCrTTCATATGCAGAAAT 
TCTACAAGGATCCACCTTGTGTGCAAACAGCGGATAGAAAAGGCTCTTTGG

TTGTATGACTTTAAAATAGGAAGAAATCTGGAGTATGCAAGCCAGAGACTGGAGACACTGGG 

ATGAACAAAGACCTGAAGTTGATTAGAGACTAAGCAATCACITTC^^ 

CTCACATATGAGGAGACTCGGAGAATGTACA.CAAGCTGCTCAGACCACCCAGCCCT 

GAGGAGGGCAGCCACTTCAGATCAACTGTTAATGGGGGTAAAATGCAGATTTCT 

AACTTGAGAACTATTAT(^TTCAAAGAGATAGGAAATTAACTACAGAGTCTGAGGTGGGCAG

ACAGGTTACAAAAATATAAGCCACTGTCAATCTGGGAATGAAGGGTGGAT1TTAA 

GATTACTTACTAGGTATTGAATGAGTCAACTCATGAACTAATGAGCAAATGTGCAATGC 

TTACACCAGTTTTAAAGAGTTGGGGGGAAGGAGGCAGAGAAG^CT 

CTGAAATCCAAGCAGTCTGGCTCCAGCCTGCGCCCATCACCATTGCGTTCCTCTACGGO 

GCATTCCTGCTCAGGTCGATGTCCAGGCGCCGTGCATGCTTGTGTGTTAAGTCCTTCCCT

ACACCCTTACCCAGAGTTTCCATGGTT I'lCTGAGATGACAGCCrGATCAGCATTCAGATTCAGGCAGGCAGGCCCCAGAGAAAGCA 
CACACATCTCTACTCATGAAAGAAGAGTCAGATGTATATAAATGCAGTGGAGGGA 
TGCAAGGCA.CGTGCGTGTAACAAATAGGAAGGGGTGAATGTCTTAGGAAACGTTATTAGATAA 
ACCAGGCACTCCCTTTAACATCAGAGAGAAGTAGATCAGGAGTTACATGTCCATAAGA 

TTTAGCCAATCCGTGATGAGTGTGGTTCCTGAGTGGTTCATTCAGAGTGCTC^

GTCCCAGGGGGACGCTGTGGAAGGCAGGCTATAGTGATGTCATGGCCACAGCTGCC 
TCCTTGCAGCCTTAAGCAAATTGGCTTTTACCCACGTTAAAGCGAAAAATGCAAGCAAATG 
TGATCTCCATCGACCTCCATATGAGAAGTCCTCCGTGTGTACAGAATTACAAACGTATGCACTGTGCAAATAAT^ 
TGATAGTTTGGGACGGAAGATGTAGCTCAGTGATGGGGCAGTTGCCT 
TATTTGAGCACTTACTACAAAAGTAAGTAGCATCCTAAATACTCTATTGTTAGAGAGAACAAGTGTTA(^
CCAGCATCTGTCATCTAAGAATTAGACAAGCAGGAATCAAATGGTAACCGTCTGTGTAGGGAA 
GCTGTGTCKSACTCAGGAAACTAAGCCGCCTCTTACTGCCAGCCCACCCAAACAG^ 
AAAGAGTATTTTCTACCCCTGGAACAGAGCAAATTTTGACTTTTTG 
CTAAGAGCAGTCACTGCTCTCGCAGAGGATCTTGGGTTTGATTCCCAGGACCCACCT 

TTCCAGGGGCATCAAACGTTACCCCCGTGAGCAAGGTGCACA

GGGTCCTTTCTTTGAT ACAGTTTGCGTTACCCAGAAGACACTCTGT AAGG C AG AAACGTGTGAGTT CTGCGGCACGGCAGCTTTAT 

TCCAATAGCATGCTCTGGGTGGTGAAGCATCTGTGTCACATTTGGTAATCCCTACTCGAAG^ 

CACGGTTTCCCCACCCCCGCTTTGCTGACCTCTGCTT T TCCCGTGCTCTTTGGAG 

TCGGTTGG C CTCCAGT ACGG CGGCAGATGGT CCAAG TC CTTGT ATT ACTGAATGTG AG ATT AACAAACTG CTGG CCCAT CCTCACC 

TTGGGAGTTTAATCACTTTTAAGTAGAACAGAGAAAGGAAGTAAGGGAGGCAGACATCTGTAGAAAT

T AC CTGAG CTTTG ATTCGCTG CTGCCTGGTTTGGT ATGCTGTGCTTTTATTTCCTT AGTTACCAGGGGAGTCA^ 
CCTCTTCCATCCTGCTTGCATTCCTGCAGAGAAGATGTTTCAGAAC^ 
TGGGGGTGGGGACAAGTTCTCATTTGTTCAGAACCTCCTCGTGGGCCA^ 
CCTGAGGACCCATTAAACTCGTGATCCCACAAAGGAAAGAGGGAACTGATTCT 

CG CATGCT CACATG AAGGTTG T CC ATTG ACT C C CACATGCTT ACACGC ATG CTCAC ACG C C AT ACAC ACT AAT AGT ATTTATCAGT

TACATCAGAACAAAGATGTGAAGAACCTGTATTTCTGCAGCCT^ 

CCCTCTTGGG AG CCTGGGAGAGCATCAGCAGCAATC ATAAG AGTGAAG AGTCACCTAGCCG CAGATGCTCTTTGTGTAAGG CAG AC 

CTTTCCCCT<3ACTGTGTGTCTGGAGGTGGACCTGCACACAAAGCIAGGGCTTAGC^ 

AGCCAGTCGGATCTCCCTTTGGGAAGTTTCACX3TAGTAGCACATC 

TACTGCCTCTGAAATCCGCATGCAAACrTCAGTCTAGTATTG

CACTTGAAACAGACTCAATAAAAACAAAATTATTTTTAATTATAG 

CTTTATTTTCTCTCCTCTTCTCCAATATCTGGCTCCCCGACACATCTAAAGCCCTCTTAGAACTGAGTT 

CTTAGGGCT<1ACAATCATGATGAAGCTAAAATTAGGCAGAATTAGAATTACA 

TTTCITAATCTTTGATTCTGTTGTGAGTTTTTTTTGACT 

CCCCTCCCACCCGATGGGGTCGGTCTGGGGAGGAGAAGTGAGCCCCAGTTGAGCCCAGCGCTCAGTA

ATTAAAATCTAAAGTGATCCGTTGAGTCAGCCTCCCTCTGGTTTGACTCAGACGCTTTATGGTG 
TAATGATTTCCCCATGGAGAAGGGTGTCTGATAGAATCCAGGCAAGACTGAAGATC 
TAGTATTAACCCTTATCTCTCAGTTTCCCATCCTCGAATTTTCCCTGAAGGAACCTT 
TTTAGCTTCCTAACCACTGTTCTCAGATTCTTCTCACGTTAACAGTGG 

TGAGAGAGAATATTCACACAGATGCCCTGATAAAGAAAAATGTAGAAAAGAGATGACATT

AATGGGAAGCAGAAAATGAGCTATGAGAAATGAATATAAATGGAGATGGAAGAAGTGACACCAGAGACAAA 
TGGAAAAGAAAGGTGACCAATAGCAGAAAATGGCCTGACCGAGAAG;^ 

GGGGGAGGG AGG CAG C TTTG C CACAAAAG AACAACAGAGGAGGAGGG AGGG AAAG AGAGAT GGAGAGAG AGAG AGAGAG AG AG AGA 
GAGAGJUSAAGGCaC^CCATCAGAACAGAAGAACAAGGT^ 
75 CCCTCCATT<5TCCTCTGTTGCCCCAGC(^TCACTGACTGCGACCACTCATGTTGGAC^ 
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AATTTCTTTCrCTCrCTTTCTTCCTTTCTTTO 
TGTGTAGCCCTGGCTGTCCTGGAACTCACTCTGTAGACCAGGCT 

CTGGGATTAAAAGCGTGCGCCACC^TTCCCGGCCACAGGCCAATTTCAATGCTGCATTCA 
CTrACTGTTCAGTTCAAGCrGTGGCCTTCrCCCTGCCTCCTAAGCAGCA^ 
5 GCAGCACCTCTTAGGTGAAGCACTGAGTCCAGCCTGCCTCCC^ 
CrTTGTGCTTTGCTAGATGTGAAATTGGCTTCTGJVGTCAC 
TACTGAGAATCTTTCTCTCTTCAGCTCGCTCCAGATACATAGTAGC^ 
CCTTTCTTTTAAGAACTGTGGTTAGAAATC 

GGTGGATTTCTGAGTTTGAGGCCAGCCrGGTTTACAAAGTGAGTTCCAGA 
1 0 AAAACCAAAAAAAAACGAGAAAAAAAAAAATAAAAAAGCTTGTTAAAAGACTTTTC 

ATTCTAACTAGGAATACTTTGAGTTCACCTGTTTCTTCTTGCT 

AGAGAGGAAAGCATGGGATCTCTGAAAATAACTTACAGTGGGACAG^ 

TCAGTTGGGAATACAGATAACTTTACTCAGTAATACTGTGTATCATAGGTGTTTATGA 

TTGTGGTATCATGTTTTATGTCTATTATAATTATATATATATTATATTGCAATTTTTACAT^ 
TTAGGTTTGTGACGTCAATCTGCATGCTACATATGGCTCT

AGTGCTACrGACAC-GCATTGCATTGTTTCTACTAGAGGGTTATTATAAAT 

TTGGAACCTGGGAGAATCTCTACTGTAACCTAAGAATGGAATTGCAGGX3GATAAC 

CAAGTTGTTTCTAGAGATACTAAGACAAGTCTCAT 

TCAGATTTTCT AL w lTCrTT CCAAG CTGGTGGTT ATGG ACTG T CTTT AAT ACTTT ATT ATCTTT AG C CAG ATTCATTACAGTTTG CC 
2 0 CTAGCATCATATTATATTTTCCAGATCAGTGTCTTTGTCAGCTTCTGC^ 
TGAGTCGCCATTTTCTCATTGGATACITTGGAGTTTAGAT 
CAACCACCACAGATGCAGAGCTTCTCGTGACTGCTCTGTGTTCAAA 
TGTCGOUUVGGTGCTAGTGAATCCAAGTTCCTCCTGAAAGGTCAAGGAAAATG 
CTCTGAAGGACITGATGATACAATTGCAATAGCTCCAGTCTTAGCAGT 

2 5 CAGACAGATGGGGGCTTTGAGGAAGTAAATCAGAGAAACCTTTTAGCTGATCCACGCCAGG 

T AGGG ACATT AAG CACTCAATGTTTGCAITAGG AAAGACTG AAGCTTTCT ACTTTCAGTGAGA^ CTTAGAAAGAGT 
TTGCAGACCACACTAACACCGACTCTGACTCCGT1TGAATAGCTCTCTGCACT 
CTAAAAACTAACrcrTACGAATTTATAGTAAATAACTTTTTAAATGCTTGATAA 
TTTAGCATCTGATTCACAGAAGTGTTTACTTAATGCACT 

3 0 TACTCCTTTTCTTCACCCGCAAGGCATATTAATTTTTT^ 

TTCTACTTTTTTTGTGAATTGCCGCTTTTCTGATTCCCCACTGTG^ 

G TT AG AC ACTTCACT ATCTG AATATG ATT CAG T AATG AATG AAG AGCGGGTAAG CT CACAAGTG AGT CTG C ATT CATT AAT ACGG C 

ATGCTTGACTGAGAAGAGTGTGGCCTCCGTAGCTGCTTTTGAGGGGATGAGC^^ 

TGCCGAACCATATGAGGAAATGATTGGGGTGTGTGTGTTGTGCACATGTGC 

3 5 AGC CAAAACC CAAACC AG AG C CTTTGG AAAGCT C C CT CACAAT CTG ACTTCAG CT AATG TCTACAT ACTTT A^ CTACT 

GCTTAGGGTAAAGCCTGGCTGTCAGACTCCTGCGTGCTATTCTGGG 

AGATCCTGTGATCCCTGCTGCTTTGTTGCATTAGGTGGCAAGCGTTGCT^ 

GCAGGGTGCTCGCTCCTGCAAGGGAGGAAGAGCCGAAGGCAGATGAGAGTTCACA^ 

TGCAGGAATAGGGTGCCATTGTGCTTGAAGAGACCCGGGGAGGGGTGGATACAGTCAGACATGTAAAGAACAAGAGA 

4 0 GGACAACTACAGCAGAAAGAACTATCCAAGAATCAGGTAACAGCATCCAGAAAAGTCACTA 

GATAATGGATTTCAGGGTTTGCCTTGTGTGCGGGTTAGCTTTT 

CTCAATTGAGAAAATGTCCCCATCAGACTGCCCTGTGAGCGAGCCTGTGAGGCATTTTCT 

T CCTTC ATGGGT AG AG AG AC CC CTG AATGC^TGATC CCAGGTG CT AT AAT AAAGCAGG CTGAGCAAG CCAT AAGGACCAAG C CAGT 
AAGTACCACCCCTCCATGGCCTCTGCATCAGCTCCrGCTTCTAGGTTCCTGTCCT 

4 5 ATACAATTGAGATAAGCCCTTTCTTCCTTGAGCT^ 

AAGACCCTGGGTTCATTCCCCAGCACCACCCAAAATAGATTTCAGTTTCT^ 
CTTCAAATCTTAATTGATGGCTTAGCCAGAAGAGCAGATATCCTTGCGAATGTGACT 
CATATATGGCCCTATCCTCTCTrTTTGAAAAGCCAGTTCTCT 
TCTGGGAGCTCTGC^GCCITGCCTGAATCCTGGAATATGGTAATCCAGTCTCTG 

5 0 GACIATGATTCTTGTCCTCCTCrrCCAAAGTGGTTAGGACCT 

TCTGCAGTTCATATTTGCTTGCTCAGGCGATGTTCCCAGAACGGACCCTCCAGGTTAAGGTC 
CATGTTGCTCCCAGAAGCCCTTCACACTTGGGTCCCCATGTCCTACAAGGAACTTAGTTCAAGAAAGTTACTGCC 
TATCGGGGTGCIATCCTGACAGGAGTTCCTAATTTGGGCTCTGCAGAGCAGGAGACTGGAATCCTGCT 
AAACCTTTGTACATTGTCTACTTCATTTCTTAGAACCACCTCrrTCAGAGACAACT 

ACTGCACTCATAAATTGTCTTCAAAGGGCCTAATTAAGATAA

GGTCAATTTCCCTCACAGCCTCCTGAGGGTTTTAGAGCTGTGAGATCCCA^ 
CTGGATTCAGATACAAAGTGAAGCAGCAGTTTGTCCACATCCACAGAGC^ 

AAACTCTGTCTC7VGTACAGTTACAATGATAATAGATTGCCCATAAGAATATTTGAAAAATGGCACACT 

6 0 GATGACGTTAGTGTGTCAGCCATCTATTTTTTTAGAACATTAGTTGCTCAACCAGGTTTTCAGTGTTGCCT 

TTACATTTTTTTAACTCAATACATTGTGTAGACTAATAGAAAAATCAAATGTGATAGGA 

AGTCAGGATTGAGA CTGT ATGGTAG ACACAAAAATT CTGGAAAC ATTCC AAGG AAGTGG AGG CT AATT AG AAGAC AATAGAG AGG C 
TCAGCAGCTAAGAACACTTGCTGCTCTTAAGAGAGGACC 

ACTCTAACTCCAGGAGTTCCAATGCCCCTTTCCTGTTCTCTGTGGGTACCTGCATTTGTGTC 

6 5 TTGATGCAACAAAACCACCAAGACCAAAAAGCAAGTTGGGGAGGAAAAG^ 

TCAAAGGAAGTTAGGATATGAACTCAAACAGGACAGGTACCTGGAGACAGGACCATGGAGG 
GTGGCT^GCTCGGCCTGCTTTCTTACAGAACCCAGAGTCACCAG 

GTATCAAGTTGACATAAAAGTAGCCAGGATACCATATGCAGGAATACACACATATACATAATTTAA 

7 0 AAGAAAGGCAGTTTAAAAAAAAAAAAGAACTTGCGTTCTTTAACT 

CCAAGAGACCTTGACTTTATTCCATGTCTTAGGACTGTTTTGTATGAATTGTGTCAGTTAT 

TTGT AGAT AAACAA CTTT ACAAAT ACTTCGTTTTT ATTTGAAT G C OVAAG AACTAATGTGTTTT CCT CACAAAAACTTTTTGT AAT 
TAGATTCTGAAAAAAACACCACTTATAGCAGAGCATA 

AGAGAAAT(^TAACTAATAAATTGAGTGCCGCTGAGATGACTGGAGTGGGGAGGTCAGGGTGC^ 
7 5 ACTGGC^CCTGTTAGTTGTCCTCTGACCTCCACTGTAAGCAATGGCACATTCT 
TTTAAAACCTCTCTTAAAAAATAAATTCAGATATTTTGCTTCT

TGTTTTATTGTTCTTCAAAAGGGTTCAGGGTTTGGTTGTAAATCAGGCTGCA 

AACTGGCnTGAGTTAGCAATACTTCATTAAATCCAAAA^ 

AGAACAATTAACATT ATT AGGGTTG CTTT ATT CTCCTT CACACAC C CTT CT CC CCAG AAC CAAG AATGACACTATCTCAGGCAGGC 
5 TGATGCTGGTGGACACCTGCATGTGCAGACCCCATGGGAG CTATTGTGCTCATGAACAAATACACACACACACACCAAAATT CTAG 

CATGTCAAGTGGAAAAGGTTTAAGTGGTATTGATCAAAACCCTTG^

TCITGAGTCTTTGTGAAAGAAGCATCTATTGCCATGCTCTGAAGTAAA 

TAACATTCTAGGAACAGTGTCAAAAGCATTATTTTTCATTTTCT 

ACAATAGCATCAACAGTAAGCAGGCAACTCAGAAAGCCACCAACATCTGGGTATT 
1 0 GACAACACCCAGCAGTGAGTATTTTATTTACACTTCCACCTGAATAACTACT 

TCGAGGCCTGAGCACATAG CTCAGTTGGCAAGTAAATGGCATCTGGGTGATAAGACCTGAGCTTG ATCTG CGGAACCTGTGTTAAA 

CGCTCTGTGATGGCTGCAG ATGCX3GTTCAG CTGGG AAACAG CTCTGCTGACCAGCACG TACAGAACATTAGGCTTGGTTCCTG CGC 

TACGTAAAATTAGGTGTGG CATCGAGTGCCTGTAATCCAAG CACTT'GGCAGGCCAGGGCAGAAGG AGCAAAAGCCTGTT AT ATATA 

CATAAGAGAATTAACCAAAAAAGGCAACCAAGGTGGTCCATGCT 
1 5 GG C C CACTGG CCAGACAAATCAGTG CTCTCT AGG ACAGTGGGCAGGGCGG CACATG TTTG A 

CTCCTTATGAACAACACTTGAGGTTGTCTTCTGGCCTCTATAAGCATGA^ 

TGGCAACGAATAGGAGTCCCCCAAAGACAGTAAAGGCTCTAGCCCCAGAACACTGCTT^ 

AAAACAAGGCATTTTGACGCTGTCAAGTGGACAGAGCrCAGTATTT 

GGGGCAGAGAACAGAGAGCCCCAGAGGTTGCAGAAAATCTCCCITCAGTGTTC^ 
2 0 GGCATCTCAAACATCTCAAAGTCTTCIGTGGAGATAGAACriTAGTGCTTG 

AGATTTAAGCAGACATGAAGTACAGTGCTCA.CAGATGTCATTGATACATAGTGAG 

TCATACAATCTCGAATGCACAATCTGAAAGGGTCAAGCTGCTTCCCAGAA 

TGTCAT AACC CAAAAT ATC CATCAT AAAAATTGAGT CCAAAATGT AACTG ACAGTCTCTGGCAT CTGGC CAAGT ATGTCCAAT AAT 
TAAAAGAAAGGAGGGTTGTAGTGGGGGGATCTGAGCCAGAACCCATACACAT^ 

2 5 TGTCAAGTCGGAAGGTTAAAATTTTTAAGCAGAGACATAGAAGACACTGTAAAACACC 

GGTCACAATGAGATAAAAACTGCATTGCCTTTGAAAAAGATGAATGGCAGAGTAGAC^ 
AAAAAGCAGAGC»TGTTTGAATGAGCAGGGTGGTTTAGAACTGTAGACAATT^ 
AAGAGTAGATGAGACAGCAAAAATATTTGGCAGAAACAGTTGAGAAAATGATGGCTGGAG 
GACCCAGACACAGGAAGGAGAGGACATCAGTGCTCACCATAATCACACTGCTTGGAAGAG 
30 AGGCTTAAGCAAAGGGGAAAGGAGGAACAGAACTAAAGGTGCCAGGCCACGCCCT 
KNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNIWNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNIWNNNNNNNNNNNNNNNNNN^ 

3 5 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

AGAAAACAAAAGGCTATAAGAACTTATAAGCAAGTTAGGGAGG 

GATAGAACACGTGCCTAGAATTCCAGCAAGAGGCCCTGACTTAGATCTCCAGCACTCAAAAAGAAAGTA 

AATAAACAACAGTTGATGCTGTAAAATATTTGAACACTTAGGTATAAACCTGAGAAAA 

AAAATTTTATTTTAGTGGAGTITATCAAGTGAAAACTTTGAAAATAACT 

4 0 ATACTAACCCGTATTAAACATAAAGGGCTGTTGAGAATATCCATCAGTAGATGTGAATGGCTTAAAACT 

GTAAATATAATTCAGCAATAATATAACAAACTTTTCTTTTTATTAT*^^ 
TTTTATTATTTAAGAAATTGATACAATGTATTTTTATCATACTCACTCCCACT 

TTTCATAT ACTATCCCCCTTTCGTTTCCCCTCCATAAACACCCTATCCCriTCCCCTCTCC CCCTG CCTCTATGAGGGTGCTCACCC 
ACCCACCCACCCACCCATTCCTGTCTCCCTGCCCTGGCATTCTCCTCACT 

CTGGGAGCTCTGGGTGGCTCTGGTTGGTCGATATTATCGTTCTTCCTATGGGGTTG 
TAACTCCTCCACTGGGGACTCCATGCTCAGTCCAGTGGTTGGCTGCAAGCATCCCCATC^ 
CTCAGGGGACATCTATACCAGGCTCCTGTCAGCAAGCACTTCTTGGCAT 
TGGATCACTAGGTAGGGCAGACTCTGGATGGCCTTTCCTTCAGTCTCTGCTG 

5 0 GAATTCTGGATT AAT A'rrriTGAGGTGGATGGGT AAC C C CATC CCTCAAC CAGGG GT CATGC CT ATCTACTGG ATATGGTCT CT AC 

AGGTTCTCTCTCCCCTrrGTTGAGTGTTTTGGCTAATGTCCTCC 
CTTTCTAGTGGCTACCCATCCCCCCACTGCTAACCTCCTTTCAAAAT^ 
GCTCT'CTCTCTCCCAGGTCCCTCTCTCCCTCTACTTCCCAATATT'AT^ 
GATGTGATCTTCTTTCTTCTTGGATTTCATATGATCTGTGAGTTGTAGTGTGGG 

5 5 TATCAGTGAGTGCATACCATGTGTGTTCTTTTGTGACTGGGTTAACCTCAOT 

GGAGTTATCACAGAG AAAGGAGCTTCAGTTGAGGAAATGCCTC CACAAGATCCAG CTGCAAGG CATTTTCTCAATT AGTG AT CAAG 

GGTGGAGGTCCCCTTGTGGGTGGTGCCATCTCTGGGCTGGTAGT 

AAGCAAGCCAGTAACATTCCTCCATGGCCTCTGCATCAGCTCCTGC^ 

ACACTCACTCAGGATGATACTTTCTAGTTCCATCCATTTGC

CCATTGTGTAAATGGACCAGATTTTCTGTATGTATTCTT^ 

GGCTG CTGTG AACAT AGTGGGACATGTCGCCCTTGTTAT ATGTTGGG CTC ACTCTCACTCTTA 

CCCCTAACTCCTCTCAGATAGACTCITCCTCCATCCCTCTCAAACT 

AGC CCACCAAATTCAATTTCTGCTGCCCATTT ACCCATGCTG CGG AACTTGCAGTAG ACTTGCCA 

6 5 GAGAATACACCCrrAAAGAAAACTGACTTTTTCCCTTCCTAGGAAGTGACCAACTGTC 

CCTTCTCTCTCTCTATGCTAGAATTTTGACT 
GCCATGTCCAGAAGATACTGTTTTGCTCTGATTCTCriX3CAACTT 

GGG CCTTTCAGGG ATCT AACATAGCTGTCTCACTTGTTCCGAG CACCCTAGGACTCTTAGTCTCTGT ACTTCCACCJVGTTGTAAGT 
TCTTGCATTTGGCCACCATTTGCTGCACAA 

7 0 GACATAGAAAGCAGTCCAGCCAAATGGATGTATCTGGAGGATATCATCCITAGT 

T ATGCACT CACTG ATAAATGG AT ATT AGC C CAG AAACAT AG AAC ACCCAAAAT ACAATTTG CAAAACAAGG^ CAACGGGTG 
GATACTTCGTTCCTCCTTAGAATAGGGAACAAAATACCCATGAAAGGAGTC 
GACTATCCAGAGACTACCCCACTGGGGATCCATCCCATAATCAGCCACCAAAGGGACCCT 
GCC»GTCCCTGGC»JLaTACAOAAG^ 
TGCATATGTAGCAGAAGATGGCCTAGTAGGCCATCACTGGGAAGAGAGGCCCCTO
GGGAATGCCAGGGCCAAGAATCGGGAATGGGTGGGTAGGGGAGCAGGGCGGGGGGAGGGTA 
AAATTTGAAATGTATATAAAGAAAATATCTAATAGAAAGAAAGAAAGAAAGAAAGAAA 
GAAGCAGTCCAGCTAGCACTGTAATAGTAGTAGGCTCACTCTGAGGCCTAGGAGTTCTT^ 
AGTACCAGTCATTTGTTTCCTCCTGGACAAAAGGACTTTAAGTCCATCAGAAG 
TTGCACCO^TGGGTGTGCCTTGCCGTGCCGGTTATCATTGTAACTGATAGGGTTCACAGATO 
CTACCATAGCACCTTCCAGTACTGCGAAGGCTGACCAGATAGGAGAAA 
GTGACAAAATCTCTGTGGTATCTTCACAATAAGGCTCTACCATCAAGTTCTGGTGAGC^ 
TGTTGGAAGGGTATCTGGGACATCTCTGTTCAACATCATGAGGAGAAGAATCCCACGGGTGATCC^ 

T ATGGCTT CTGAGGGG AG CAT AG TCTC CT ATGTGGGGTGCTTCCAGTT AAGCAT CTTTAT ATG TTTATGTGCTT CGG AAG CTT AT A 
AAGCAGTCGATTTCCGGATGGCCTTTTCAAACATCCTTAGGGTTAGTTATCCTTG 
TCACCTTAACCTCCCTCCCTTATACAGTOTATTITCTTACATATATGTCA 
CCCTTCCTTCCCCTACCTTCTTCCCTCCAAATA^ 

TGGTATATGTATATGTGTGTGTATATTTAATTCTAGGTCCTGCATATGAAAGGAAATGAGATGCTTT 

TTT ACTTGTT CT CTTTT AAT ATAAAC ACCTTCT AATTT CAT ACTT AAAT CT AT ATT CTG C ATAT AAG AGAACATGTGCAACATTTG 
TCTTTCCAAGTCrcACCAATTTCACTTTCCAGTCCCACACATT^ 
CTCTACTGTGTATTGTGGAACAGGTTTTC^ 

CCATTTCAAAAGTTCTGTGGTAAGCACAGATGTGCATCCCAAGATAGA 

ATAAGCGTAGCCTAGTTTGAGCTTTTTGACACACTCTGTGCTGGTTCCCATAATGGCTGTACTAGA 
ACTCTAGAAGTGGAAAAGAGATGTTGTCAGAAAGCAGAGCGTAGATTTGGG^ 
GGTGGTGATGGGATGTTCATGTTGATTGGACTGAT^ 

ACT AACGCTGTTTT CAG AAG CT AG AAGG AAG AAGGACTTG CTACTGG AG TTTGT GCTGGTAGTAAGCATG ACTGG AAC CCAGC CTT 
TGGATTCAGTTATGGACAGTGCATTGTAAAATTACTCCCAGGGCrCCrTTT 
TTGAATATCTCTTTTCTCATGGTTCCTATTC 

ACTGCACTGTATTGCTCATATTCCGCCATGAGGGGGAGAAGGCAGATTGAGT^ 
TCATATCTCATGGTTAAATATTTAGTCTACTTAACATCTCTGC 
ATTTCIAAATACTTCAAAAATGATTTGTTCCTTTT^ 

CTCCTAACGAAATCAAAACTATGTAGTGTTAAGGTGCTTCGTGAAAAGATGCAGGGTCAAGAGA 
ATAGGTTGCACCTCCCAGCCATCCCIXX3CCITGTTATGCCAA 
CCCCGGCAATGAAAACACGACACAGTTAATCCTGTGCACCTAGATT 
GAGACCCCTTAGAACTTGCAGTTTCTCCAGX3CCATGTGCTAATGCTCCA<^^ 
TCTCCCTCTTCCATTTTCTCCTTCTTATCTCCCACTCACCTTCT 
TATTTTACAAATTAAGGTGGGAAGCAGGTTTACAGGAAATCACCTGAGTGCTX3AOT 
ATGGAATTAACACCAAATATAATTAGCCCC!AGGGCTATGGACAACATTTCCCCCrTTCT 
GATATAATTCAACATAAGTATTATTATTTTGTAATTTATAAAGTACAAGAGACCTAACCC^ 

G AACCTCTG TCAT CT ATTCHT ACIT AAAAG ACTT AT AATTCT ACAT^ C TTT AG ATTG AATGCCAT CTG AACA 

CCATCCTCTCAAAACTGTTCC!TCTCCAAATAAAAAGCCn'GAGTTGGCrAGGAGACT 
AGAACGACTCATATTAACTGGAAATATATAGAAAGCCTAAAACC^CT^ 
GGACAGTCCCTTACTTCAACACGCTGGAGCATCGGTCTTCAGCCTACTG 

TTAAC^ACTAGCCTACTCTGTCTCGGCAGAATCAAGCTGTCCTAACCTGCAATTGCATCCTTTCTTTGG 
ACCGGGAAAGCTGTCAGGAGCAGAATGGTCTCAATAACAA^ 

T CGGAAAACCAACTATCTATTT AAAGTAATGT AGACAGTTCTCTGTT AACTGCT CCATTTCTAATTAAAATAACTGAG AAC 

ACCCTAACAATAAACTCAGAGCCATGAATTTCCTATTTGGCCCTTAACTCA 

4 5 ATAGAAGGACTGGGTTCAAGCCTTGTACTTCAAAATTT^ 

GGAAACTGGGATGACGACTCTTCATAACTTCCTCAAGCTGATTATGGGCATTG 
AATGCGGGAGTTGGTTGGCCTTAAGAAGACCC(^C(2AATGATTGTATT^ 
AAGCCAAGAAATCAGGAGTTAGTGAGGTCAGTTCAGCATGTCTGACGATGAA^ 
TCTCCAAACAAAATTTTAGTATCATCAAGATAACCTTTTTGGG^ 

AAATACATTATTCACCTTTTCCCCAAATCAGCATAT

AACAATAATACAGCCACATTACTCAAAATCACACACATTTTAGCCTATTTAG 

AAAATTCCCGTGGATCCCACTGTGTACAGAGTATGAGTCTCCTGCAGACTTTATTATACCATCTTTTAAACCATT 
ACACITCTATTTTCCITCCTTTTTCCITTTTCCTTTTCCTTTCCT 

TTTAAGGTGGGAAGCAGGTTTACAGGAAATCACCTGAGTGCTGACTCATTCCTTGTTCC^ 
55 CATCAATATAATTAGCCCCAGGGCTATCCACAACACTACGGATGCT^ 

AGCCTGGCTCCCCCATGTATGTATATGAGTCCACAGTCCACTGTGCCAACATCCTCCT 

ATT CTCTGTGACGTGACGCTGATCGTGGAGAGG AAGG AGTTCCGAGCCCACCGGGCTGTGCTGGCCGCATGCAGTGAATACTTCTG 
GCAAGCACTGGTTGGACAGACGAAAGATGACTTGGTGGTCAGCTTGCCTGAGGAGGTACAGT^ 

6 0 GGTGGAAGTTAGACGTGCACTGTGCCACGTGGATGGCC^CGTCACCCATGTCC^ 

AATCAGTTTCTGTTTGATGCACTTGAAGCCTCAGAGGCATGAATACA^ 
GGGCCCTGGCCCTCATAGTTTCTAGTTCACTTAAACTTCGGGT^ 
ATTTGAATCACTACTATTTCTTCTCTTTCAAATGGATCATTAGATTTCCCC^ 
CCTTGCTTAGGAGAAAAACAAAGCGACCAGCAATTCTCACCATTAATGCAGTGAGATTCT 

6 5 CAGGATTGAAAATATCCCCTACTCAGGTTTCTTAACTGCTTCAGTGAGAACAGTCATTTTGG 

GCTTGTTCTGTGCAGAGATCACAGTCAAGTGGTTTGAGAGGCAA 

GTCACTCTTCAGACTGCTTATGATTCTTCAGTCATTCGTAGCCn'CCTTA 

CTTTGTGTGC7J\TGCAGAGTATTCTCTGCACAACACCA^ 

ACAGCACAG CATG CAGXIAGCACAGCTCACGTTOiCGTCCTTGCGAATGAG^ 

7 0 TTCTATGATAAGTCAAAGACCAAAAATGTAAGGAGTTAAAGATTACACT^ 

CAGAGACATAGCGGTGACATTAACAACTCAATAGTGCITTGTA 

ACGTTC^^GCCT AACACG AGAGAAACACAAAGGGGG CTGG AGATGCJ^CTTGXJAT AAATTGCCT AGAGCGTtKJTCCC CAGCAATAA 
AAACAAGACGCATGGGATGGAAGAGACACCAGATTTCAGTCCCTTGCT 
ATGGCCAGCTGATGAACCGCTGACGTTTCAA^ 
7 5 CCTACACAGCCGTGAGAGAAGGAAAAGAATAAGTCATTTCAAGATTAGGTAGCCAAG^ 
GGAGAGCGAGCAATGGCAGCTGGGTCCTTAGCAGGATGGGCGGTTCCCATCTGCACTGGAT^
TCCTGGGCTTCGCAGCTGAGGTAGTGATGTAATGCTGCAAACCCAGCCCATCCTGCAGCAAGTC 
TCTCCAATGGGAACAAAGACGACATCCTCCTCACTCAGTCCAACTCTAG 
AAAGTAGTCTTAGAGGAGGTGACACATGTTGCCTACrTATTTCTG 
5 T AATTTCCAAAACTT AGAT ATT AAAG CAGAT ACACAAACG AAGTCAT ACTTT ATGT AT CG AG T CTTCCCTCAATGACTGTGTT AGC 

CCTACCrCATACTTCCTAGAAAAGAAGACCTAAGCCCAAGCCTGTTCTCAATTCT 

G CCT ATG TGTGAGTGTGTG CATGTGTGTGTGTTTGG ATTTTGGGTT TTTTTCCC CACAC CAGATTTCTCTTTGT AGCCTT AG CTGT 
GCTAGACCAGGCTGGTCTCAAACTCTCAGAAATCTGCCTCCTAAGTCCTC 
GAAGCCCTTAACCTATAATCAAAATACAGTACCCATCCAGTAAAATGTGA 
AAGTTTCTTCTCAAGTCAA

MOUSE SEQUENCE - mRNA 

CGAGCTGCCATGTCATGCGCATCCCCTCCGCCTGCTCTCAGTGACCCCAGGACCGCCCACCACGCCG 

GGG CTGGGAAAACACGGG CAGGACCGGCG CAGGGGG ACAAAAGTTC CCGGG AAG CTGT CTG CTTCGAG AAGTG AGGGTTCTG AAAC 
1 5 AAGAAGGACCTTCAGGAGCTGCTCCACACTGTGAC^TCTCTGATCC 

CTGATCT ATCACAAGATTG C CTCACTG AGTTTGGGGGT CCATGTCTGAATGG ATTG CTGT AG C CTTCT CATCT CTTCCT CTGCCCA 
GTT C CCAG CAT CCT AAGACTCG AAG CCAGGACAGGAC CTGGX3GAAATT AT ATGG TTTGAACGG CATG TCTGTGG ATGAG AAG C CTG 
GCTCCCCCATGTATGTATATGAGTCCACAGTCCACTGTGCCAACATCC^ 

TGTG ACGTGACG CTGATCGTGG AG AGG AAGGAGTTCCGAGCCCACCGGGCTGTG CTGGCCG CATGCAGTCAAT ACTTCTGGCAAGC 
2 0 ACTOXjITGGACAGACGAAAGATGACTrGGTGGTCAGCTTC 

CCTAC^CTGCCAAGCTGTTACTCAGCAGAGAGAACATCCGGGAGCT 

CCCCCAAGAGGACCATGGGAATTCTGCAGGAGAAGAGGAGGAAGAAGAGGAAACAATG^ 

CCACAGACCAGATGCTCCCAG ACCCCATCAG CITTGAGGCCACTGCCATCCCAGTAGCAGAGAAAGAAGAAGCCCT CCTGAG 

2 5 TCTGAGGTACCCACAGACACCAAC^AAAACTCAGAAAAGGGTGCATTCACC 

TACCAAGAATGTCTACAGTGCACCCTCACACGGTACCTCTVG^ 
AGCCGGGGCTCCCCATGGGACAGATTAAAAGTGAGCCACCCAGTGAGGAGACCG^ 
GATX5AGACCGACATCAAAGACAGACCAGGAGATGTCGAGATGGACCGAAAACAACCCAGC 
TGGAGCCGCCTGCCTGGACAGGTCGAGGAGTGTGTCCTCGCCTTCCTC 

3 0 CCACTGGCCTGCCGAGTACATCTCAGCAACCCTTAGTCAGGAGCTC^ 

AAAACTGATTACACCCCTTTGGCAGGGAATTACGGACAGCCCCACGTGGGCCA^ 
ACTCAGGGGGCCCGGGCCGGAAACTCTCTGTGAGTTCTCCTCTTCGCCCTGTTCCCAGGG 
AGGAACCAGGGCTGATGGGAGATGGAATGTACAACCAAGTCCGACCCCAGATTAAATGTGAG 
GACGAGTCTGGATCATTCTCGGAAGGAGACAGTGAGTCGTGTCCTGTGCAG^ 

3 5 CGATCAGATCACAGACCTTCCCAGGAACGACTTCCAGATGATGATCAAAATGCACAAGCT 

ACGACATCCGGAGGCGGAGTAAGAACCGCATTGCAGCCCAGCGCTGTCGAAAGAC^ 
C^AAATACGGAAACTGGTGTGCGAGAAGGAAAAACTGTTGTCC^AGAGGAATCAC 
CTTTTCCTGCCTCTCCCAGGAGGTCTGTCGAGACATCCAGAGTCCAGAGCAGATCCAGGCCCTG 
TACCCATGGATCTCCCTGGAGCCAGTGTTAACCCTCCTCCrcTGG^ 

4 0 C<3AACTGTACCCTGCTGCCTGGAGCCAGGGGCGGCTCCCCCCGGGCT 

TGGGAGGAGGTTGGAAGGCAGTGACCCTGGAACATTCTCAGAACGAGGACCTCCCCTAGAAGCCAGGAGCCAGTCTG 
ACTTCTGCCAGGAAATGACTGAGAAGTGTACAACGGACGAACAGCCCAG^AAAGATTAT 
TACCCACCACCCTACCCCCGGGTCTTCACACCTCTCTCATCTGGGTGTTCTTCTGCCAGCCAGTGGCGCTG 
AAGAACCAAGAGCGAATTTGGTGCACACTACAGCAGTCTTAGCAGCAATACTGTTTCGA^ 

4 5 GTGCTAGTTACCTTCATAATGGTGCTACCCCTTGCCCTGGCAA 

AAACTTCACAG<^ATAGACTACACCTCTAGGACCCAACCACGGATTTT^ 

GA ATTl' CrU'CTGAATGTTCAATTTTTTTCATTGAAGACA 

ATATAATATATATGAAATATATATCCATATCOAAAAGTCC^ 

TCATTTACAAACAGGTTCCAACATTCCTCTTTTTGTGTCTGGTGCCAGAACTGGTTTGGAAGCTG 

5 0 CTGCACAATTCGGTTTCCATCTGTGCnTATTTACAGACAAAATTCAATC 

TACTTTCGTTG^TTTGGTTTTGGTGCCGCCGAACGGTC 

GTCTCTGCGGATACTGTACAGCAATGGTCAACTTTGCCACTTGCACTG^ 

ACTrTCGGTATAACTCAAGTAT A TTTTTTTTTTCGTl T C^^ 

AAACTTG CT AGGTAGACCGGGCTAG CCTTG AACTTAAAG AG ATCCAT CTGCCTCTGC CTCTG CCTCTGGAGTGCTGGG AT AAAGAG 

5 5 TATGCACCACCATGCCTGGCAGTCCGGGAATGCCTACCCGTTGGCCACCATGACATT 

TCTGGCGGCTGAGGGTCTC^CTGAGGAACTGAACCAGCAGATGTTCTACACCTTCTGTG 
CCAAGGCTTTGACACTGTGTGCAGGGCTAGAGGCCAATGGTATAG 
CGTCACCGCCCACTCGTACCACACTTTCTTGGAATTGCTGCAT^^ 
TGGGCTGCTTAATGGGGAAGAAGATGGGGAACCATGAGAAGAT^ 

6 0 CGGTAAAGCACAGTCTCAAAGTAGCCTGCGAAGCCATGAAGCACCGTGTTCACCT 

GTTGTCAATCATAGGATCTCX;GTTGGGATGGCTGAAGGTAAAGCA^ 
CCACATAAGX3CATCTCAAACTGTGCCrrCAGGATCGCGGTCCTTTTCCCG^ 
ATGGGAGCCAGGAAGGAGGTGTATTCTCCAGGGATGCTCACGCCATCATCTTT 
TGACAGCTCGTTGTCGGCAAAGGAACCCAGAAGCTCACTGACAATAATGTCAGC^^ 

6 5 ATGAGACAACTGTCACCTC^CTCCCCCATTCTTC^AACTGCCAGTTCTCTAGCGT 

TACAGCCTGATCCGCCGCTCGGCCTGTTTGGCTGCCCGAAAAGACGCATTC^ 
TACCTGGACATTGGTCTCCTTTTCTTCTTCTGGTACTCG^ 
TGGGGTCCTTTTCAAACACTTCATATGTCTGAGATTCCAGAT^ 
CCTTTGGCAAAAAGCTCATAGGCATTGGGTGGAGGGCGATT^ 

GAA 

MOUSE SEQUENCE - CODING 

7 5 CCAACGGAAGAAGGACATTCTCTGTGACGTGACGCTGATCGTGGAGAGGAAGGAGCT 
GCAGTGAATACTTCTGGCAAGCACTGGTTGGACAGACGAA
TTTGGGCCACTGCTACAGTTTGCCTACACTGCCAAGCTGTTACT 
CCTGCGCATGCACAATCTGGAGGACTCCTGCTTCAGCTTCCTX5CAGACCCAGCTCCT 
GAAAGGACAGTGCGTGCCAGCGCCCCCAAGAGGACCATGGGAATTCTGCAGGAGAAGAG 
5 GAGAC^CTAGGATGGCTTGCGCCACAGACCAGATGCTCCCAGACCCCATCAGCT^ 

AG AAGAAGCCCTG CTC CCTGAGTCTGAGGT ACCCACAGACAC CAAGG AAAACT CAG AAAAGGGTG CATTGACCCAGTACC CT AG AT 

ATAAGAAGTACCAGCTCGCGTGTACCAAGAATGTCTACAGTGCACCCTCACACGGTACCT 

GACAGTCCTGGCAATAGCCTCAAGCCGGGGCTCCCCATGGGACAGATTAAAAGTGAGC 

CATCAC C CT CTG CTTGTCCGG AG ATG AG AC CG ACAT CAAAG ACAG AC CAGG AGATG TCG AG ATGG ACCG AAAACAACC CAG CC CTG 
10 CCCGTACCCCCAGTACCCGTACTGGAGCCGCCTGCCTGGACAGGTCCAGGAGTGTGTCCTCGCCTTCCT 

GGTATAACGAAAGGTGTGGAGTCCACTGGCCTGCCCAGTACATCTCAGC^^ 

GGGGATCTCTCAGGGTGACCTTAAAACTGATTACACCCCTTTGGCA 

CCAACTTCGCAATGGGGTCGCCACTCAGGGGGCCCGGGCCGGAAACTCTCTGTGAGTTCT 

AGATTCCITGCCACGGAACATCAGGAACCAGGGCTGATGGG^ 
1 5 GTCTTACGGAACCAATTCCAGTGACGAGTCTGGATCATTCTCGGAAGCAG 

AGGTTAAACTTCCTTTTCCTGTCGATCAGATCACAGACCTT^ 

TCAGAACAGTTGGAGTTCATC CACGACATCCGGAGG CGGAGTAAG AACCGCATTGCAGCCC AGCGCTGTCGAAAGAGG AAGCTGGA 
CTGTATTCAGAACTTGGAATGTX3 AAATACGGAAACTGGTGTGCGAG AA CGT 
GCATGGGAGAGCTCCTGGACAACTTTTCCrGCCTCrCCCAGGAGGTCTGTCGAGACAT^ 

2 0 CACCGCTACTGCCCTGTCCTCATACCCATGGATCTCCCTGGAGCCAGTGTTAACCCT 

ACCCTCCCCGTGTGCAGTAGGAGGAAGTGTACCCTGCTGCCTGGAGCCAGGGGCGGCTCCCCCCGGGCTCCCCTG 

AGACCTCTGAGAATTGTACCTCTGGGAGGAGGTTGGAAGGCA^ 

AGGAGCCAGTCTGTCACGGTGGACTTCTGCCAGGAAA 

G 


HUMAN SEQUENCE - GENOMIC 

ATGATGTGTACCTGTAGTTCGAGCTACTTGCGAGAAAGAGGCAGAAGAATavcrCGAG 

TAATTGTGCCACTGCATTCCAGCCTGGGCAACAGAGTGAGACCTTGT^ 

AGATAGATAGATAGATAGACAGGAAAAGGAATGTAATGGACATTTCTCaUUVG 

3 0 AAATTCTCAATATCACTAATCATTAGGGAAATGCAAATCAAA 

TAAAAAAAAACTAACGTGAAAAAAAATAAAATAAAAAGTATTGGTGATGATGTGGAGAAACTGATACCTAGAGTAGT 

AGATATAAAAAGTTAAAAAGTGGTTTTCAGGG^TTGGGGAGTTGTTTCATGTTTTGCAAG 

CACAATATGAATGTACTCAACTCTCCTGAACTGTACATTTAAAAATGATTAAGATGATACATTT^ 

AATTTAAAAAGTAAATTTT AAAAAAG AGGGGGTGGAT ATAATT AT AAT AATTG CTG C AAAGGATTTAGGT CCAAGGTACCT AT AAA 

3 5 GAGGATCTGACCTGGGTAGTGGTGAAGAAAGCTCC^CTTAGCTGAGAGCAGAA 

TAGAAAGAACAATATATGCAAAATCCTTGTGAGGTGAGTGATCTTCATGCACATGGGAT 

AC CTTT ATTTT CACTTG ACT ATTT CTGTTTTT AAAAATGG AG AC AT AACTT AT C AAC ACAG ATTG CAAAAAC CTT AAGTG AACAG C 
TCAGTAAATTTAGTAAAACACTTTGAATCCAGTAACAGTCAAA 
CTAATGCTTCCCTGGGCTGAGGATATCTO^GGGGGAGCTTTCCATACTGAA 
ATGATCCTGACTCAAACTTTGCCCTAGAAAGGAAAAAAGATATT

GTGGTATTGTTATACTGATGCTTACTTTCTGATTTGGATGTTGTATTGTGGTTATGTAC 
ATGCATTTAGCAGCTGATGGAGCATCATTGTTC^CAACTTACTTTCAA^ 

TTAGATATTTCTGCAGGCATACGGAGAGAGAAAGAGAGAACAAATGTGTTAAAATATTAACAATTG 
TATGGTAGTTCTGTGTATTACTCTITGAAACTTTTCCTAA 

4 5 TGATGGCAATGGACTCACAGGTGAGGCAGCAGGTAAGAGAAGTGAGATTAGACATAAAGATAAACAT^ 

GAAAAAGCAAAGTGTCAGAGAAAAGAATTAGTGTTCACTTAGCGCTTGCCAGAATGAAGTAGGCTATC 

TGGTTATCATAAATTAATAAGCATTCCGATTCAGCATCTG CTTTCATTAG CATAAGTTGGTCATCAGG CTGGG AG CACGGGTTCCT 

CCTCTAGACAATGGCCCAGCCACCAATCITAAGACACCTTATATCCCA 

TTAAGCACCAAGGAAATGTACCAATAGAAGCATGCCTGTTGACAAGACCTGGGAAG 

GCTGGAACAAACACAGCACACCTCCATGTTTTCAGCCTTCT

GG1TTCCACACACACCTAACAATCGAAAGGCAGAGAAATTAGCTCCTCAGT 
TGGGGAAAATCGAGAAACCGATGACACCIAAATGTCTGTCATTC^ 
CTGTGCTGACCAGCACATTGCTGCAACCACrGTTCAGGTTGGATGCC 
AGATTAGAAACTGTAGGTGCCATCTTACATATTCTTTGGG^ 

5 5 AACTCATAAGCTGAGTTGAGAAATATAATTCAATCGCAAGTATATTTTTATCAAAATAAAAG 

ATTTAAAAATTGTCAGATTACTTCTGTCTACGACCAAACTTT^ 

ACCACTGATCAAAAACATCTTTTAGAGGTGGGCAGAATAAAGAGATGGACATTTAG^ 

T CACAAACCCTGAGGGT AAAAAAAGGGAAGCATT GTT AG AG CTTlTTrT AAAAAAAAAAAAAAAAAGGGAAAAAAAG AAAGT AAT C 
TTGTGATTATTATAAATTTTATTATTATTTTGCTTACCTGAGCCTGACCrAATCACCT 

AAATTAAGCTCTTACCCGAAATTTCATGTCTCTTGGC

GATGTCCCTGTAATCCAGACAGTTTTCAGAGGCGAGATCTGAGTGATCCTCCCCCGTCCT 
TGCCTTTCTTCCACCTACCCATGTTTACCCCTGCCCCAGAGTTCTGCATT CTTC 
GTTATCTGCTCCATTACCTTACCrTCTACCrCTTCCTTCACTCAGATTCCTACCTTCCT 
CTTCTCTCATTCATTCCCCCCCTCCTCTGCAGTGGGTCTAGAAGCT 
65 TTCTTAATCATGGGATGCTATTCCCAGGCCCCTCATTGAAAT^ 

TTCTACCTTCCTGTTGGCTCCATTCCTGTCTTTTTCATAAACrCCTTGTTCT 
AAACTTCAGCATTGAGGAAGTCAAACTTCCCTCCTTTTGACTCCACCCT^^ 

TGACCCCTAAACCATAAGCGCTTCCGAGCCCTGCTCACTGCCTTAACATA 

7 0 GCCTCTTCAC^CCCCTTCATATGGCCAATGCTGTGTTCTTTCCAAAACCAGATCT 

CATCGGCTTTCTACCCATGCAGCO^CTTTT 

CCrrCAGTACTTTGATTTTGGGTGTGGGAGAGGAACTTTTAGTCCTGATC 
CATGTACTTCAAAGAGAACAAAAGAAGTTCAGGTGATTTTTTA^ 
G™ATTTGGGGCa»TGTTCTCAGATCCC^^ 
TGAGAATTTTGATCTGTGGATAAAATTACACCTTCATC
TCGCTGCTAACCATCTCAATTCCTGCACTTrcgCT^
TCTCATCCTTCAGAATCCAGCTCTGTTGTCACCTCCTCCAGGAAG 
TTTTGTCCTATTCAGGCCCTTAGCATXXSTCTGCT^ 
TATGCAGTGTTTTATCTCCAAAAGTTAGAATAGTGCTAGTAC^^ 
5 GG CAAGTAACAGGT ACAAGGTCTTGGCCATTGGGAGGGGAAAGAAGG AAAAATCTAC^ 

GGCATTTTGTCACCATGGTACTTCAGTACTCCTTGCT 

CTTCTrGACATCACAACCTCTCACT CTTTCCCAACACTG CAAG AT ATGAATG CT AT AGT CAG ATG AAAGTGTAACCTCAAGGGGT C 

AAACCirTTCTGGCTACTTTAACCTTCATAATTTGTT^ 
1 0 G CAT ATTTTG AATTTT ACAACATTT ATTGG CTACCTTT ATTACAG AAGT AAT ATATTT AT ATT ACAG AAA A l"rrrriT AAAAAG CA 

AAAGT AAAAC CACTT AT AGGCTCTTTATTTCAAGAGAAC CACTTGAACACTGTGCAT AT AG C CTT CCAGATTTTTCCT ATGCATG C 

ACTGGCAATTTTTTTTCACAAATAGGATCCT 

TCAATATACATAAATTTTCATC^CrrAATTrTTTTCATTA 

TGGTCTCAAACTCCTGAGCTCAAGAAATTTGCCTGCTT CAGC CTCT 
1 5 CACCAATATTATTTTTGATGGATCCATGATATTCATATTTTATATATC 

TTTGGATITTTCCATTTTCTTTTTGCTATCATACAAAAAAGATCT 

ACAATTTCTATATAATTTTAAATATTAATATATAAAAGTAATAAAATCAAATACAA 

TAGTTTTGACTCCTTATGTAGATTT AAGTTCCTTAAGGATACAGTCCAATC CTCAACATGTAACAG ATG C 

TGAGTAGTGTATGTTGAAAAATTAGTTTATTTCATATACGAT^ 
2 0 TCAATTCTTTACTATAGAAGAAGATCACTATAGATAGTAATATAAGAAC^ 
TAGGAACTGAGGCAATCTTTGAGGAGATGAAGATGGGACAGATAAGAG 
AACAATAGCTAGAAGCAAAATTTTTGCTTTTCTGGTTTGGATTGCCC^ 
AGAAATCTCTTCTTGGCAGTGGTGTGTCCAGAAAGCCCCCAGGAGTGGGCC 
CATCCCTGGCACTGGTTCACATCTTTTCrCTATCCCCTAAATCrATC 

2 5 CTC(^CTAAGAAAACAGAATTCCTCAGACAACGGCTTCCCATACTTCCT 

TGCTGCCCTrCCTTCTGTTGGTTGGGATGAACTGGCCCCTTC 
TCTTCATCTTGCTGACCTCTAAATGTCGGAGTGCCCCAGGTCTTAAACCTTAGAC^ 
TTTCCAGTTGCTGTGTTGATGACACACAGATTTCTATCTCTAGTCTGGACTTCT 
CTGCTCACCTTTTCCACTTAGATGTCTGATAGACATCTGAAATGTACTATATCC^ 

3 0 TGCTCCGCCTGGATAAATGAACTCTCCATTAATAAGCTGTTCAGTCCAAAATTCAG 

CAATGGAATCATCAACAAATCTTGTCCGGTTTATGCTGAAAATATATCTGGATTCCATTTATG^ 
ATAGTTTAAATCACCATCACCTCTCTAATGACTTATCTCAATAGCCTCCTACTTGGTCT 
GACCCAGCAGAGTGATCCTTTAGGAGTTGAGGCCAGGACATGTTCCTTG 
TTTGTCAGATTTCATTGGATGTTAAATTTGCCCTGCCATGCCTTATT 

3 5 AAGAAAAAAT ACAAT ATTTTCTC AACATCAT C AAAT CCTGT ACAGGGTG CTGGAG AGCAAG AAAAT CAAAGGCTCAAGAACTGC C C 

ATGTCTTTAGTCCGTCTTCC l i-i CCTCATCTGTTTCCTGT ATA rn rAAi^TTCCTGTl i i ATTrTCTACAATCAAGAAGATTTC 
TATTATAGAAATAAAGTTTATTTGTTTCAATGCCATGATACAGATTTATTGTCCCTGGAAA 
ACAAAGCCCATGCTCTGGATCAACACACATGTGTAATTATGTAGGAAAAAAATC 
AGAAATAAATTGACCTCCACTCAACTTACTGCCATGATTATACAGAGGGAAGGACCTG^ 

TTGCCAGAGCAGCACAAGCCCAGGAGTGGCCATGCAAACT

TACTGTTAAATGAGTGTCTAGTACTTCACTTACCATTTTAATTTCACATAATTCT 
CCACTTCTAACTACCGAAAATAAAGTAGAAAAATGAACTCACTACCAGACTTCAA^ 

GCCAGAGGTGCATGATTGCAGAGCTGGATGTAACCTGAGAGGCTAACTACTCCTAATTCAGCCTCCTTGAATGACTGGATGATGCA 
ACCTATCACAGGTGTTTTAGGTTTTTTCATCTCTATTTAATGTCCCA 

TGAGTCTTTGGGCAAGCAGACTAACTGAAATGACCTGTTA

TGGAGGTGGGAGATCAGCTGGCAGCCTCATTTCCCTC 

ccaccgcttgcttgaaaataaatgaatctacaccactctc^^ 

GCTCCTACTAAAATGCGTAAAGTTATCTCACAAACACATACTTACAAGAATG 

50 TCCTAGGGGGCACCAGACCAGAGGTCAGGAGACTGAGGGCTACTCCTAGC^ 

AGCAGAGAAAATCTCACAGTCCCGTGCCAGACCTCCACGCAGGGACTCTGCAGTGCTGTTA 

TTGCTTi'CAG C CATCACTT AG CTTT CCACACAG ACATT C CAAATGGGG CCTTTCAAAACTCTG CCCT CAAATG AAG GT AG TTCAG A 

GCAACACTTTAGATGATATCACAAGACTCCCAGGACGCTTCACT 

TATGATCCTTCTCTCAGTAGAGCGAGCCAAACTTTTATAATATTCATTAAAAAT 

5 5 TG TACACAAG AGGGCCAAG TT AT AGG TGTTTAGGGAAACGTGG CAG AATTTT CTGG C CTTGGCAAG AGGT AAAGTTGCAGG CACCA 

GCACAACACTAACACAACCTTATTTTATTGCAAATGTACATTCAGCCTAAATCCT 

AAG CTTTCTT CTG ACAG GG CACT CACACGACG G C CTTT AAG GG CAG CAACAACAT CCTAAGAG CCG CAG AAAGGAAAGGCAGGGG A 

GAGTGGCTGGAAGGCIAGTAGGAGCTGGGGCTGAAGCCCGGGAGGTTGGCTTTTAATGTCCTCCT 

AAAACACTGGCTTGGGGGCTTAAAGGTTCGCTCMTAGGGa^GGCGGAGGGCCT 

6 0 GGCTGCTTTTACTCGCCCX3GTCCCCACCCAAAAGGGAGTCCGGTCAGCTTTC 

GGGAAAACGCTAGCTAACCCTGGGATCCCOX3TCCCGACGTCTGC 
CCCACX3TGCCACACCAGCCCTCACCACTACTCTCACCCTGCTATGCCCCC 

GCGGGGCGGTCAGCCGGGCCGGGAGGAGGGGGCTCTCCGGGCCGCGCGGCCGCACAAAGCGGGGAG 
GGAGGCGCGCCCCCTCCCGCGAGCCAGCGGAGCGCGCGCGCCCGCCCC03TC 

6 5 OTCGGAGCGGCITACGCTAGTCGCCGGGCGTACCGCGCCCC^ 

AATGGGGAGCGGGTGACGTCAGCGCCGAATGTCAACAATGTAGCGATTGAGAGTGTGGGCGTTCCGGG 
GCGCGGAGCAAACAGCTGCCGAGCCGCCGCCGCCrCAGCAGCAGCAGC^GCAGCA^ 

CCCGGCCTCCCCTCGCCCCTTCCCCTCCCCCTTCCCGGCGC^^ 
70 CGCGAGCTTTCGGTGACCCACGAACTGCCCACCTCGCCGGCTGCC^ 
CGGAAAAGGACGCAAAGTTCTCCGGCGAGGTGAGTCCCGGGCCCC^ 
TTGCGGGGGGAGCGGGCGCGCGGGGCGGGCGGGGAGGCGTGGGGAAACGGCGCG^ 
GCCCTCCCCGCGCCCTGCCCGCTITTATGGCATTTTCTAGG 
ATGTGTGCTCTeCCTCCCGGCTGTTTAC^ 

7 5 GAGGAGGAGCAGAGAGTAATAAGTCAGGCATGGGGAGGGCGGGCTGCAGGAGGCGCGC^ 
CCCGCCKSCCCTGCCCCCGGCCCGGCGAGGCCCGGGTGTCC^

TTAACCCTCGGCGTGCGCATCCCGGGCTGCTCCGCGAGGACCTTGGCCGCGGCGCCTT^ 
GGCCCGGACCACCGCGACCGGCGGGGAATCAAAGC 

CTGGATCGAATCTCTCCTAGCGAAAGATCGGGTTGTCGGAAAGCGTCCCGCGTAGCTAAGACCTAAGGGCG 
5 GGAAACCTTGCACCTCCTCTCCCACAGATCCTCCATGAAGTCAAGGTGCAACACACACACACACACACACACAC^ 

CTCCAGTCTCCAAGCCCCAGAAGCTTCACCCAGGAGACTGGTTTATGAAGGGGACACTGCCCT 

C03TGGCCCCGCTGTGCTTCGCTCACC(XX3CGGGTGTTCGGGAGCCCGGCGCOT 

ACCCTAGGTGCCCACGAGGGGAGAGACTGGAGCGAGCATCTCGGCGACCT^ 

AGGGGAAGCGTGCAGGGCCGGGAGTGAGTGTGTGAAGGGGTGGCACACTCCTGGCrTCTGGG^ 
1 0 AGCACTCTCCCGCGCGTAGGCGAGTTTGGTAACTTGAGGGAAGTTCCGGCGCGGCCGCTGCCGGGCT 

GAGGCGCGCCTCGGAGCCTGCTCTCCCGGTCACCGCCGTCGCCGTGACCTGTGGCCTCCAGGA 

ACCCGCGAGACGGACCCGCAGTCTCCCGGCGGGCCCGCGGGCGGCGAGACGCGGCGTTCCTGGTTTTCACCTTCGCT 

TGCAGTCTCTCACTCTCTGTAAATTTCATCTGTTGAGTGTATTTAACTAAAAA 

CTAGTGGGAACCTTGGC^AGATGATGCCTAATGAGATATITTC 
1 5 AACGGTTTTCTCAGATCATTAAAATGTAACATGTAGGAAAGGCACGTCTTGCTGCAAAGGCT 

CGTGAAAGX3CTGCCCCTCCTTTGGCGGAGAGGGCTA 

TGGTTCTCGGGCACCTCCCGAGTAAACGGCGTTTTTCATTCTTGCAGGTAATAATAA^ 

TTAGTCTTTAGTCTGTAAGAGAGATTG CCAAATTACAGTGTTG CGTTCAAACTGCTCCACTGAAAGGTCTGTCAGCATTTCCATTA 
TAAATTCAGTTTTTCTGGAGTTTATAATTTCACCACAAAACTTTGAOT 
2 0 AAGAAATTTATTGATTGCCCATTTCCAACGTTTTTTTTACTCTCX1AAACCTGAA 
GAACACACAACCATGGAATAAAGTATTTTTGTTAGAACATAACT^ 
CTTCAGGTTTGCCATTTGGTAATCATGAATGTGTTGAGTAAATTCTTCTGCTGTCTG 
TTGGTAGTGTGTTTGCCAGGATTACATGGAGACACACACTTCTCAGTTAAGTGTGTGTCTGA^ 
CCTGTTAGTTACTTTTTTGGTTCTTTGTATGT 

2 5 GATCAGGATTCACTTTCTGTATTACCAGTGCCTGGGAGAAAGCCATTTATTATTTTCCTATTGAATTAT^ 

CTTCAGGCAATTCTTTTCCAGTAGCTCACCTCTGGCTGCTC 

GAAAGGCACTGAATAAGTGCACAGTATTGTTATTCAGATATGGAATTTAATGATCCATT 

AGTGATGGTAGGAAATGTTTCCACCTGATAGAGGGTAATAATAACTGCTTAAAAGTAGATTCAGATTA 

CTAGCCATGTGCTCTTCAATTTGTTGCAATTCATTTTGTGATCAAAGTGAGTCACAGA 

3 0 TCTCTCCTAGGCCAGCCCTTCACTT CTGGG CCCACTGTCTGCACAGCCGACAGGACTTAGTTATTCATTAGCGCTTGTGAGCT 

TCTCAGAATTCCTAATCGAGGCTCCTGTGATGCCAACAAAAGGGCTCAAGCCAA 
GGATTAACTTTCAGTGTCTCTGGTGTGGATCCAGTCTTAACAAAAAGGAAAGAGACCCTAGT 

G CCCCTCTACTCTAGTTTTTGAACCTCAG ATTTCAGGGGTGTG ACATCTAGCTTATTGTAAATG CAGGCTTGATTGTTAATGAATG 

TTACATAGGCCCCCTTCCTCTTCCCTGCTCAG

ATTTTTGAACAAAGGGCCCCTCGTTTTCATTTTTCACTGAGCCCTGCA^ 

TAG TG TGG AG TGG CAGG C CGC C CCT ACTCAACTCATT GT AG ACT CTTG CT AGGG AACAGTTGG T AGG ACTACTGG CTTTGTGTTGC 
CACATCATTTGATTTTTCCAGGGGAAGCTGAATATTGGATATTTATGTGG 

AAAAATTTTTGGATGAAAGCATATGTAAGAGCTAGTGTCGGCCTGGGGGCTGCTTGTCTCCCACTACT 

4 0 TTTTTTGTTTAATATCTTTGTGTGTCTCTGTTTTCTAAAG^ 

TTACAACTACCTTGGAAAATAGAAAGATGCCAAAAAAACyVAAACCT 

TTTTTAACAGAGAATTCGACATGAAAGGCCGAATAATTTAATACCATGAGTACAGTGTGAT^
TTTTGAACCACCTGCCCT CT ATTTAGTTGTTACTG ACCAGATGGGGTAACTTGGCTGGGTAATG CCTGAGGACCTCCATGGGAGTT 
TGACATTTGAGAACACTCACC^AAATAATAGTTI^GATTTACTGAATCCGCACC^ 

4 5 GAATGTACTTCATTGATTCCTCAAAGCAACCCTTTGAAGTAGGTGTCG 

TGTTGGAATGAACTTAATCTGAATATAGTGCATGTTTTCTTGAATC 
TGAATCAATGACTGGGCATGGTGTAGGACTGTGACATACAGAGAACATGGTCCCrc 
TGTAACATAITTGTCAAAATCTGCAATTCAGATAGC^^ 
TGAAGAAGTTTCAGCGACATATGACCCTGTCAGAGGTACTCACAGTT 

5 0 GAAAAGAAGACAAGCTTTTGGCAGAGCTGGGGCCAC!ATATGGGAACGCAGCACCAA 

T ATTGTTAATAGO^TAT ATGACCTCTGCTGTCCT CCTATCTTTAAATTTTATGTAA^ CATGTCCAAACTG 

AAACCTGCTATGAACTAAACATTTTGTAAACTCAGTAAGATTGCCCAAAATACAACTCAA 

ATGACATTTAATTTAAAAAAAAAAGACAGTAGTTGCTTTTT^ 

TTACITTTTTTTTTGTTAATTTCAAATGTCAGTCTATCT 

5 5 AGGCTGGAAATCTCCTGCGAATCITACCACTA^ 

CTGCTAGTGACACTACTTCAAAATTCAGGCAGGAATTTTGCCA 1 1 CCAAAATAAT AAATCT 

TCCCAATCCAAAAGCATTTCACATCATCCTTCATT 

TGTGTCATTTTCCTCCCTGACCTTTTTGTTCTTTTCCTGAAGGCTC 

6 0 GTTGCTAGGTCCCTCCTGTCCTTGTGTCAGCTTTCATTGGCTCCTTT^ 

GATCTCATTATTCTCTTCArGTGTGCCTGAGCCTGTAGCTGGCATC 
TTCCAAGTCTTCCATCACTCTAGTCTTGGGCCATCTTCTATTCTCCCTCCCCT 
AAGACTTTCCAGGAGTCTAACCCAAAGACAGCTTAGACCTTCACTTCGGATTTTAAAGTTC 
TTCCAC1AAGTAGGAAACAAAGTGAATGTACATGCTCAGCATCT 

TTTTTTCCTATCCTGTATTCTTCTAAGCAACAGAAGCT

AGGCAGATGCCTCATTCATTCATTAAACAAATATATACTGAGTATCTCT^^ 
ATG^KSGAGCAAACCCAGTCCTGGCCCCTGGTCTCATGAAGCACGTCATCTGGTAGGAAGA^ 
AACCC^CTGCTCGTGAAGGAAAGTATTTCATACACCAGAGTGTCTAATGGGCA 
CCCTGAGGAATTGTGGAGTGACCTTCCATOVGGGACAGTCTGAAACT^ 

7 0 T(2ACTACTATCTTTGGCCCTTTGGAGTATTGCCTAAGCCATGCTGGAACCTCTGGGGCC^ 

CAGAGTCCAGAATTGATTTATTTACCTGGGTCTTTAGAACCTGATAAA 
ACTTGGAAAAAATACAAGTTAACATTCTTATTTTGAACATATTT 

ATATTCTCTCCCAGTCAGCTGATGTGTGTAGGGCCTTTGGTGAAAAAGTGAGTCTAAATAGTCTGTCTGAAGTC 
7 5 GCGGACCMCACAAAGOTTTT^ 
ATCATTAGTGTTGAGTGTATTTTATGTGTGGCCCAAGACAGTTCTCCTTCTTCCAGTGTGT
CGTCTTATTTAGACCATTCTTAGATGCAAAATTAAGGAATTT^ 
ACCCCTCTATriTL'Tr r rrCTTTGATGATAGTTAACTTGTATTGAGAGCT 
TCAGAACAACCCAGGGCTAGGTGTGATCACTAGAAGCCGCAGCATAGGTGTCGCTC^^ 
5 ACTAGCTGTCTGATGCTTGGGCCTGTTCTCTTACCATGTTATGGCTCT 
CTTTAGCAGTTCCCAGAAGTAGAAATAAAGGTCATCTCACTCTAAA^ 
ATAATTTCTCAGTAAATCTGAGTGAGTATATCTCCTGTCTTrCACATGGCCTGC 
TCCATTTAGGGATCGGAAAGTAAAAGTGATGCATATAAAGGCAAAAGAAGTACTAGGCATATAACT 
CCAGGGTTTACTTGTAACTCACACTCCTACCCTCTTGTTAGGT 
TTTTGGTGGGTAGGAAAAGGTAAAAAACTATACCTGTAAGAGG

TCCATCAGAAATCCTAATTAGAAATATTAGCTACCAGTAACAGATAAAAGGAGGTAACCTTTGTTCT 

AAGAGATTATCTTGCCAGATTAGCATGTTGCAACTCTC^ 

GTAACCCCCAATTACCTCAGGTTTGCAGGGTCACTGTTTAGAATGCTGA 

TTTAGTCAAGAAGCATCATTCTCAGGAGCCACTTG AAACATTAAGCACA CAAGTGT ATG 

1 5 AATGT AT AGTACTGTG CTT ATTTTT CAG CATTT AGTTCAAATTTTGG AG AAAGCTAAATGGG AG ATG CTT AGTGTGTCAGTGTTGG 

AGGTTTTAAGTGATTTAAAAAATCAAGCAGATGAGTGCATAAAGATTTCTATGCAAA 

TTTAGTGTGAGAATTACTTGCATTGATTTGTAGAATATCTATGTGC^ 

AGTCAAACTGAAAAAAAAATGAAATTAACTGTGAAAGATATTATTACCCATTCAGCAAG 

CCTTTTCATACATTTTCTTGGGAAGATAAGTGCCAG 
2 0 CAGTTCCCCCATCCTCCTTCTGTGCTCACTTTTTTGTTTCATGCT 

TCCITCCCTTCCAAAACCAATCrCAGAGAAAGATTTTATGAAC^ 

T ACTTTTCTG AAGCATCAG CATGTT AAT AGG AG ACAAAG C CAAAGAAGT AAAAC CCTAAAG ATAAAAACAATAG AATTTCG CACAC 

TGGTAACCTGGTGATAGGAGACCATTTCTGAATGGAGAGAGGGCTCGTGATAAATAAACACAAC 

ATGGATTACAGTATTTAAATCAAAGTAGTATTAAAATACAAA^ 

2 5 G CCTTTTG CCGT AC ATTGG AGTTGAT CACAT AAAC CTCT AT AGCAAG ATGG T ATGTT ATTT AAC ATTT AAACAAAG CTTCCAGGG A 

AAGCCATTTGGTTTGTGGTTAGGGATTTTTGTTATTCAGTAGCAATGAACCTGAA 
GAAGAAAAAAAAAGCAGCGGATCCAACTTCACTTACCT^ 
AAATTGGTGTGTCCCAATTTGACATACCAGAAGCATA 
TAATTAAACTATACTTATTTTAGATAGAGTTGTTGGCAGGTAAATACC 

3 0 CCACCATACCCTTCTCCAGCAACCATGCTATCTTX1AGTTGA 

ACTGTCACCTCCAGAGCTGGGTATGATGAAGAGTCCTCTTTCTCTGGGAACACACTG 
CCTCTCCTTTGTTTTTGCAGTTTAGTTATACCTCCGTGTCTT^ 
TCTTTGTACTTGTTTCTCCTCCTTTTTTGTTTTCC^ 
ATAAAGATTTTTTTTCTAACAATACTTTATGCCAAAGTAAGAGA^ 

3 5 AGAGAACCAGAATGACCCTAAAGTTGAAGTCTGAAAGTGTCACTTGGAAGACTGTXj 

CTCAAACACAGTTAATCAAGAGTATTTTCTTTTAAATGG 
TAACCTCTCATATTTTAAGTTCAAGACATTTTCACGGACTATCAA 
ATGTATTCTCAGTTCCAGACAACCAGTGTTCCTGCAGATTTTTAGGTCCCGTA 
TGTCAGACAAAGCCTGGCACATGAGATCTCAATTAGCATACAGGTTTTAAATTTCCCACATTAA^ 

TTATGTTAAGCGTX3TTGATATTGTGGGTTTG AAAGTAAGTCCC CTCTG AAAAACAG CAGTTCTTGT

C^TCGATATGGTAACTATTTTGATTTTTCAGAAGGGAGGAAAAG 

GCTCGTGAGATTTCAGTGCAG CTGTTC CACACATAGCAACACTAAGAGTG AGTTGTTTATTACCATG ATTAATTATT ATTAG 
TACTTCTTTTTGAGCAGTGGCATAGATTATTTATATTTTTGGTCTAACCTTGGAAAAT 
TTrGCTAATTTGCATGTACCCTGTTTTTGTTTTTCC^GGTAGG 
ACTTTCCTACCAGCAGTGCTTGCTGACTCATGCAGCTGTTGTAGCA^

CGCCCTTGGAGAGCTCCGGGCTGCTGGGCTGGTTAGCTACTGGGAGGCTATTTTCAGATCTGTGGTCACTGCTGAGCTGGGTCCTC 

TTCTCCTGCCAGCAGGCAGACACAGCCTTCTGGTTGCAGGGGAGAAAATTGGACAAATTTCTA 

TTCATAGCATTTCTGGATATGCCTCAGGAAAACTTTTTTTTTGC 

GGGGCTGATAATAAATGAAAAGGGAATGTGCTGTTATTTCCTACTAAGATCAGAAAGAGTT^ 

AGTGACTAGCCTGACATTGTTTTATGTGGCTGAAGTCCCTAAGTGATGACAGCTTTTG 

AAAAAACAAAACAAAAAACTTGACCTGAGGTCATTT^ 

TCAGTTTTGCTTATTTTCATCGGAAAGTGfrGAAATGTTGCTTT^ 

TCCAATCATCAT AAACTGT CTTG AAAAAGGATG ATTAAAGT AAG AT CAGCTGGGGAAGGGAAAGCTTTGCACACC CTTG AGGAGTT 

TGCCTTCTTTTTCTTAAGTTTCAATAAGCATTCTGA

GATGTTGATCGAAGTTCTTGGTTrCCGCCGATACGAATGACTCATCAGTATCAATAGAATTGTGA 
AAGACCTTCAAAACATGTGGTTTTAGCCCTTGCTCTTAGTGAAAGGTTGGAAAATGT^ 
GATTTTGCCCAAGTATAAAGTTGTTTTTTGTAGGAATGATGATCCCCCT 
ATTAAGGAGCCAAACTCAGATCATTTCATCGTTTAGAAATCTCTGCGCTAAGCATCT 

GGCTGAAAAGCAAATCCTCCCTGAATGCAGAAAGGAAAATACTGGTGCAGACTACAATCC^CT

ATCTATTGCC^AGGGCTGTGGTATTGCTAAGTAGCCCCATAACCCTGTGAGTCATTGTCAAAGGATTCCAAGAGGAAGGT 
AAAATGCGAGTCAGAGGAAACAGCAGAGTCGAGCAGCTAGCTCAGGAAGAACTG 
CAGGAATTCAGGTGTACGTTCATCTTTTTGCAGCCCArTTCCGTATGTCTACATATGGGGACATTTT^ 
GTTGGTTTTGCTGCAGTTTGGCCAGACTATCAGAGATCACTATCTTAGCTTCGTATCAC^ 

TTTGTATTTTATAGTCAAAGCAACTGAATT^

TTTATTTATTTATTTATTTATTTATTTATTC 

CCACTGCACCCTCCACCn'CCCAGGTTCAAGTGATCCTCCTGGCTCAGCCT 
CCAGCTAATTTTTGTAATCTTAGTAGAGATCGGGTTTCACCATGTT 

CTGCCTCAGCCTCCCTAAGTGCTGGGATTATAGGTGTAAGCCGCCGTGCCTGGCCAGACCTAAAAAATTCTT 

AAGTTAAAAATGCATGAGGAGGCCGGGTGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGTCGAGGCGGGTG

GGTCAGGAGATCGGGACCATCCTGGCTAAAACGATGAAACCCTGTCTCTAGTAAAAATACAAA 
GCACCTGTAGTCCCAGCTCCTCGGGAGGCTGAGGCAGGAGAATGGTGTG 
ACCACAGCACTCCAGCCTGGGCAACAGAGCGAAACTCCATCTCAAAA 
TTGGAAAAAAAAGTGAGTAGGACTTCTGTTCACTGAGTGGTAACTATGGAGAAT^ 
G TGGCACA T AAAT AAT A vJ i' GG T T CAG CACA CAT G TGAAT CAAT AAAAAAG C CAG AT AAC C AAAAAG AAAAATGAT ACT G T
AT ATTCTT AAACATGCATTTCACAAAGGAAGAAAC CCAATT G GCT AGT AAAT AT ATAAAAAGATGCTCAGCTTCATTAAT AAACAA
ACACATGCAGATTAAAATCATAGTAACATATGATTACACACTCT 
GATGTGAATGAATGGGAGTTTTTGTATTTAGTTGACAGGAGTCT 
ACCAAGTTCAACCTGCAAATAACrTACAACACAGCCATCTTACCCCT 
AGAAACATGAACAAAACAAAAAATAATTC^TGCTGGCAA

ATCAGTTCAGCGATTGTGGAAGACTGTGATGATTGCTCAAAGTCCTAAAGATAGAA^ 

TGCATATACCCAAAGGAATATAAATTATTCTAAGACACATACACGTGTATGTT^ 

GTAATCAACCTAAATGCCCATCAGTGATAGACTGGATAAAGAAAATGTG£ 

GAATGAGATCATGTCCTTTGCAGGGACATAG ATGGAGCTGGGGG CCATTAT CCTTAAG CGGACT AACACAGGAACAGAAAACCAAA 
CTGCACATTCTCACTTATAAGTGGGAGGTAAATGATGAAAACGC^

AGGGTGGGAAGAGGGAGAGAATCAGGGAAAATAACTAATGGATACCAGGCCTAATACCTC 
CCTCCATGACAC^GTTTACCTGTGTAACAAACCAGTACATGTGCATACCCCTGAACTTA 

CAAAG AAG CATTGTTCAT AAG AG CAAAAAACTGG AAGCAAC C CAAAT AAATGC CCATCAACATTG AATCGAAT CAGCACATTTTGC 
TCTGAGGTATTrATACACTGTGATACTATTTGGCAATGAAAAATCAA 
TTTCTTGGTGGTCAAGAGGAAGGATTGTGCCTGGGTTCAGATTTTC

CTTCAGTGTACCTTAGTTTTCTCATCTGTACATCAGGGAAAATGATGTCTACCTGATAGTGTTGTTAGG 
TGTTTGTGACATGCTTAGGAGAATGACTGGCAGACAATACATTCTATATGTGTTTGTTA^ 

GAAT CTT ACAAACAAT ATGG AAAGAAAG AAGCAAG AT AGAAG AAT AT ATACAGTGTGATT CT ATTCATGT AAAGTT CAAAAG CATG 
CAAAACTAAATATGTTATTTAGGGATACATACATAGATGGTAAAACTATAAAGAAAATTAATATGATTTAATATGA 
AAGGAGTATAACAGCTACCTCCAGAGGTGAAGGAGTGGGTATGATAGAG

TTTTATCTCCCAACCTGGACAGTGGTCCCAAGGATATTCTTTCTTCTTAATT^ 
GTGACCTATTAGTAAACATTTTAAAGGATTTATGTGAGTTGTTATAGTAGCAAGATGACATTA 
ACATTTTAGTTAAAAATAATTGTTCAAGTGCACATGGGAACACTTCCTTGGTAT^ 
C^GTGTGATTCTTACAGAAGGCGACACCTCCTTAAGGCCACATTAAGAGAGCTGTCC 

CTCAACTACTCTCAGGTCTGGATGTCCCATGTGTGGGATGCTATGTCTTGTCCCAAGGGATT^

AGAGGGAGCATTTTCTGTGATAAAGATGTITAGGGGTCTGGAAACCTATCA^ 

ACATGGAGAAGAGGAGATGTTGGGAGAGAAAGAGTGTCTTCCATCACTAGAAGTGGAAGCAGCCTTTC 
GTAATATACAAGTCCCTCCCTGTAATGTCTACCCITGGAGGCAGGTT^ 

GGG AC CT CC CTT AGACTTGACT GGGG CCACAAT ACCCACTGCCTGGTC CCTGGACCAG CAGC ATCGGCATCACCTGGAGCTTGTT A 

GAAATGAAGTCTCAGGGCCACCCACACCTACTX3AGTCT

AATTTGAG AGG CACTGGGCT AAATGT CT CT AAGATCCCT CATGGTTCTT AAGATTTAATG ATTTTATAGAAT AA^ TATGTTCA 
ACTGAAAATTATCAGTTCATATGGTGACATTATCAATTTAAAACACCTT^ 
TTGCTAAATTAGTTTTAAATGGATTTGTCCAGTTTCCCCCGATGTCAAAGTTAGGTTG 
AAAATATTACATAACGTTGTATTTTCTTCTTTCTAACATAC^GGATTAGATT 

TTTTCTTTTATATTCATTGAAAATTCAAAAGCAGATTAGGGTTCTGAATACCTCTTA

T AT AT CTCATT AAT ACATTG ATTTT AT ATTT AG CAT ATTTTTT CTTCTT AT ACAT AG T C CAT ATTTTGACCTT ATGGGAAC CTTTG 
AAGTCTGAGTATTTTATAATGATT CTT CCACAAT ACCTCTTT AT CTTTTAGATTCTAGAT CTG AG ATCAGG CAAG T AAAAT AGTTTT 
ACClTri ' TCCCCTCAAAAGGCCATGACTAACAAGCTCATTTACTTTTTCCTTA 
GTTCAAGTGGAATCCATGATTCATTTAATCTTTTCCCCrTTATGTAATTAGTCTVCTT 

ATTTAAAGGGCTAGTCATAAAATTCCAAAATAAAAA.GTTCTTAAATGACTTAAAATAA

AAAATGGAGAAAACGTCTCATGAATAGATGTTCTTTAAGGCTGTC 
TTGGTATATAGGACCAAGAAACTACTTTATGGCTTAATGTTTGCrGTAC^ 

TTTATAAAATGATAAAGTAGTGTAGACTTTTT 1 TT TTT T T T TT TG AGACTG<jAGTCTCGCTCTXaTCGCCCAGGCTGGAGTGCAGTGG 
CATGATCTCGGCTCACTGCAACCTCCACCrCCTGGGTTCAA 

TGTGCTGCCGTGCCCAGCTAATTTTTGTATTTTTAGTAGAGAC

TCAGGTGATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAC^CATGAGCGACCGTGCCTGGCCGTATTCTT^ 
G AG C AAG ATTT AGG AAG ATT ACTG CAT ATTCCT AT AGCGTTTGTTCG ATTT TAG ATTTTTT 

T AGCTTG AACT CTAAACCTGGTCTGTCCAGTATGGTAGCCATCAGCCACATGTGG CATTTGAGCATATGAAATGTGGTTGAGATGT 
GCTGTGCAATATACACCAGCTTTCAAACAAAAAC<jATATATATCTCCTTC 

AATGTTTTACATATATAGGATTAAOTAGTAAATATTAAGATACATTrGGCCTGTTCCT

TTTAAAATTATATGTGGTTCACCTTATACTTTGATTTCAAAGTGCTGATTTAG 
AGAAAAGAAGATCAGAGCTTAACAGAGAGTTTTAGAGATGAGGTGCCA 

ACCTTTGCCCT AC TTTT ACAG ATT ACTG CTTTCTTG AAC CC ACAAAGT AG AG CACTGGGT AG CAAG CATTT AAT AT CTACAATG T A 
CAGAGCATTGAAGGGGGATTAAAATAAGTATCACACAGGTCCCTGCCTTTAGX3 

GAGCACATGAGTTGTTAAATCATGGCTTGAGAGAT^

GCACTAAAAAAAGTGCTTCAAAGGAAGGAAAGGTCTCC^CGGCCTGGAAAGGTCT 
ATGCCTTGGCTGGGCACTGACGAGTGGGTAGAATTTCAGTGGGCAGAGAGAAAGG 

GGGAAGAAGGAGGAAAACATGGAAAAGCACAGAATGGTTGAGAAGTCAGTGAGAAGAAGAGATGGATTGAACTGT 
AGAATCATTCTCTAGTCCACTTATTATTGAAGCTTTAATATTTGACATCGTCTCCTATAATATT 

AAATTATTTACCAAGTGTCTCAAAGTAGAGGTTAGGGCA

TAGGAAGTTCTTTAAATGGCAAAACTGATAATATTATCTTGCCAGTTATGTTTATA 
AAAGGCTTCTCGGATAAGTACATTATTAGTATTTCAC^CAGTAGTGCTC^^ 
TGTTATGTCATTTGATCTTTGCTGTTGGCTCCATTTTATAGAA 
CTGGAATTTGAACCTAGCTCTTAAGCTTCTTCCATTTTACTGTGTGAAA 

AGAAGCTGATTAATTTTATTTCTrTTTAATTGAATGATAATATAAAT^

CGAATTTCCATAAAGTGAACACGCTGTTGTAACauSTTCCCAGATTAAGACACAGCATTAT 
TCCTTCCAGTAACT ACTT CACCCGTGGTAGC CACTACTCTGACITGGCAAAGCATAGATTTC 
TCAATGGAGATACAGAATGTACTATTTTGTGTCTAACTTCTTTCACTCGATAATAT^ 
ATATTGACTATGGTAGTTGATTTTTCATTCTCC^TGCTATATAGTATTATATT 

GTTTCCAGTTTGGGGCCATTATGAGTAGCAATGTGAAGATTCTAGAACATATCT^

ATATTTAGGAGTTAAATCAGGAAAGCAGTGTAATTTTTAAATGTCTrG 
GAAAAGAAGTGAAAACCTCAAACTGTTGGTGTCTTGAATAGTATATCAAAATAAATACT 
GTAGAGGAATTTTAATGCCCGAACACATTTTTTCAAGGGAG 
GGATGTTAAAATACTGACGTTATATCACTCAGGTTTCTCTTA 
GAAATAAGAAAATGGGGTTGGGTTAAATGGC^

AAATTTCCTCTCTCTCTCACTCTTTTGCTTAGCGTAATCTGTAGGCAAAA 
CGATAACTTGTTTTTAGAAAAAATCT 
QVAAAGACTTCAGTGTCTACCTTTAATGCTTATTAACAT^ 
TGTIX3TTGATCTTAAGCAGACTAGGCAGAGAAATCCTTCACATTTT

GAAATGAGAATATGTGAATAAATACTCTCATACTTTCTTGGGCCTC^TGAA 
AAATTATTTCAGAGTAGGAAATATAACATTAATTTAGAACATGGCTGTCATAGATG 

ACTATGAAAGTAGGTTTAAAAGTTACA CITITCTIT AATC 
CCCTGTCACCCAGGCTGGAGTGCAGTGGTGTGATCTCGGCTCACT
GCCTCCCAAGTAGCTGGGACTAAAGGTGCACACCACCATC 
GCTGGTCTCGAACrrCTGACCTCAGATGTTCAGGCCGCCTTGGC 

TC CAG AACAACAGG CACTGGGGC CTACTGGAG AX3TGG AGAGTGGG AGG AGGGGG^ CAGAAAAAAAAAAACCT AT CAGGT ACT 
AGGCTTAGTACCTGGATGACAAAATAATCTGTACGACAAACCCCCATGACGCAAGT^ 
CTGAAGGTAAAAGTTAAGTAAATAATAAAATATTTTACATTTATTTAAAAA
TTTTGGTTAGGATTTGAGCTTATTCCATTTrTCTrCT 

AGGGGAGAAACCyvAATAATTTTCAAGTCATATTTGAAGAGCAGGTATGTTCTCTCACG 
ACTGCTTTAAATGATGATAAAAATTGTTCAAGTAGAACAAAGCAAAGT^ 
CCTGTTTTCTCTCCACTTATGTGACTrGCATGAATTTCTCAAGCACT 
CATCAACAAAATCCAGCCTTTTCCTTTGTAGGAC^^

GCAAAGACTAGTCCTCCCTGCGTTTGGACCCAGTGACATGTTTCCATTTGTCATTGAACACT 
GGTACCCCCATTTGGTCCTCACCCTCTACCAGAGGATTTGCTGCTTCAGCCT 
GCACTTATGCTGCCAGTTATCAGGGTTGTGTGGGTGTCTTTAGAGGGGTCTCCCCTTAGCATCAG^ 
TAACACAGGTTACTCATCACGAGTTAAAGATTTTTTCCCTTTTCACATTGAT 

ATGTGCGTACTTTGTGGCTGCAAAGATGCAGCTGGAAAGTGTGTGGGGG

TGGGGGCGGCTGTGATAAAGGTACAGCACCCACTTCCGCCAGCTTGTCTTGTGCGCCCT 

ACTTCAACCG G ATTTT CTTGTTGGCACTTTT ATCTTT C ACTG CTT CTG ACTTGCCTGTT G AGGTTT ATG ACTAAGGTTGTGATT AA 
AAATTCCCTTTTTAGGTATGAAAAGTCTTTTCAATTAC^ 

TCTGGGTTTTGTAAAAATGAATCTTAAAAAAATCTTAATATTATGTAAGAGTTAGATAAAGGTTAAGTA 

TCAGGGCATAGAATCAACGTATGTTCTTGGAAGATTAAGTGTGTC

TAGGCATGGAAAAGCCCTTTTCAGAATATTGTCATAAAATCAGACT^ 

GTCTGTTTAATCTCTGCATATTACTGTCCTTGGAGCACGGAGAAACGTGAGCT 

ACTAATGTTTACGCCTGATTATATGATAGGAGCTTTACATGTTCTTAT^ 

TTTTCAGAG AAAGAAGCTGGGGTT CTG AGACATTACC CACTT AGGGG CACCCAG AGG AAAG AG CAAGACCTCCAGCTCCAT CACCA 

GTAATTTCGAGCCTGTGAGCTTCCCTCCACACTCAGT^

TTOIGAGGCAGAAAAGCTAGTTTGCAGCTGAAGTACTCAGATT^ 

GG CAGAATTTGT AT AGAGG AGC CAAG AGAAG AAG AGTTT CCACACTGT AT AAGC CAAATGATGG AAGGTGT AAT AACAAAGG AAGG 
GTGAGCACACAGGTTTAGGAGATGCTGTGTGGGCCAGTCCAG CTGGGAGCCATTAC CATTTAACACITAATGGAGTGCTTACTGTG 
CTAGGCACTCTTCTAAGTTCTTCACACCnTTTAACTATT 

TGCCCCCACTTTTTTTAGAAACAGAGTCTTGCTCTGTTC

ACTCCTGGGCTCGAGTGATC CTTTTGCCTTAG CCTCCTG AGTAGCC AGG ACTACAGGTGTGTGCCACC^TGCCKSGCT 

TTCCCCATAGAAACAGGAACTCGTTATGTTATCCAGGCTGGTTTTGAACACCTXXSCCTCAAGCG^ 

AGTGCTGGGGTTACAGGTGTGAGCCACTGCACCTGGCCTC 

ACTGTGATTCAAAGCATGGCCACCAGGATCCAGAGCCTGAGTTCITAGCTACTCrGCT 

AGGTTGCTTCTTAGAGGGGTGATGAGGATTCTCAAATTGCTGGGGATAGCGAGAGG^

AGATCAAAATGX3CAGCAGCACGCTGGCCAATTTTACAAGA 
AGGATCTGGTCAGTGTAAGTGGGTAGOAAAGATAGAGK3TGAGATGTAACAAAT 

AAG AGGGAATGAGCAAAAATG AAT CT AAGATTTGGAG CTTGG TTGGCTGG^ AGG ATAATT CTG CT ATTGCAGAAAT CAATGAGGTC 
AATAGACAAGCTCATTTTTCCACAGCAGTAGTGTTGCAGCAAGAAATGA^ 

GCAGGAAAAATGGCTCACTCGAGAGTTGAGAATAAAGGA^

GGAGAGAATTGATGTGCTGGGAACAGACGATTTCTGAGGGAGAGAGAAAAACCCAGGGCCTTAAAAAGTC 

AGAAGG AG AAAGAGG AG CAG GAAAC CAT AAAAG AAACAGGCG AAGCAG G AG AC^ACxAGGAGAAT AAAAAG AGGAAAAAAAAAACT AC 

TGAACAGTGTCAGACACTACAAAGAAGTCAAGGJU3AACAAGAGTGAAAGGGGGACAT 

ACCCTGCACAGTAAGTCTTGGTGACATGAGGAGGTGAAATCCAGTTGCATTAAAGTCTG 

CAAGTCCCTGGACCTTTTCTAAGCACTAAATTTGCAGGTAGTTCAATGTCAAGCAGAGAAGGAAGT

CATGATACTACGTTTTGTTATAGTGACCTAACTGATCTAACGTGTTTTGTCTTTCAGCGCGT 
ACATTTCGCCACCTACTGAAGACATCATTTGGGACCAAGCTGAT^ 

GTCTTTGTTTCATAAGTTTACACAGCTAG CTTACATTCCCACCAGCAGTGTGAAAGTGTTCC CTTTTT ACCAG ATCCACGCCAGCA 
TCTATTATTTTTTGTTTTTCAGTTATGTCCGTTCTTACAGGAGTAAGATGGTA 

GAGATGTTGAGCATTTTTTCTTATGTTTGTTGGCTATTTGGATGTCTTCTTTTGAGAATTGTCTG

TTGATAGGTTCTTTTTAAATTAGTTTATTAGCTTTTAATATAAACATGAGAAAAACCAC^ 

AAACTGCAATCAATACTTGAATGAATCTCCAAGCTCTGAACTTCATAAAC^^ 

TTTTTTCT TT T TT C TTTTTTTTTTT^^ 

TCCTTTGTCAGGTGCATAGTTTGCAAAGATTTTCTCCCACTCTGTGGGTTGTCTTTTTACTCTGTTG 

TCCCAACTATTTATTTTTG T 'rTTT G TTGCATTTGCTTTTGGGTT CTTTAGGC ATGAACTTTTTGCCTAAG CCAATGTCTAGAAGAGT

TTTTCCAGTGTTCTAGAGTTTTTATGGTTTCAGGTCTTAGAT^ 

AGATGAGGATCCAGTTCATTCTTCTACATGTGGCTTGCCAATTATTCCAGCACCA 

ATGTTTTTGTTTGCTTTGTCAAAGATCAGTTGGCTGTAAG 

GTGCCTGTTTTTATACCAGCACCATCCTGTTTTGGCAACTATAG CCTTGTAGTATAGTTGGAAATTGGG CAATGTG ATGCCTCC AG 

ATTTCTTCTTTTTGCTTTTGTTCTATTGGCTATGTGTACCrrTTTTTTC^

TGAAGAATGATGATAGTATTTTGATGGGAATTGCATTGAATCTGTAGATTGCTTTTGTCIA 
CTACTCATCC^TGAGCATtlAAATATGTTTCCATTTGTTTGTGCC^ 
GAGATTTTCACCTCCTTGGTTAG<3TGTATTCCTAAGTATTTTATTTGTTT 
TTTGATTCTCAGCATGGTCACGGTTGGTGTATAGCAGTGCTAC^ 
TAGGAGCrrrrlXaGAUiAGTCTTTAGA^
AGTAGAACTACCATTCCATCTAGCAATCCCACTATC
CATGTTTGTAGCAGCACAATTCGCAATTTCAAAAATATGGAACCAGCTTA 

GGT AT AT AT ACAT CATG AAAT ACT ACTCAGCCATAAAAAGGAACAAAAT AATGG CATTCACAG CAATCTGGATGGAATTGGAGACC 
ATTATCCTAAGTGAAGTAACTCAGGAATGGAAAACCAAATATAGTGTATCATAAGAATGATATAA 
GAAGTTCCACTAGGGAACTTATCCATGTAACCAGAAACCATATGTTCCCCCAAAACT
TTTTAAAAGTTCACACAGCTGCCCAGCGTGTCTCCCACTCACCG^ 

CACACTGATTGTT AT AAGG AATC CAAACAAT AT AG CACTG C CACACTTGGGG CTGCT ATTTGTTTT AGGG CTCTTTT ATT ATGGA 
AGTCCAACCGATATGAAAATAACCTAAATTTTCTCTGACTTCTCAG 
TGGTTACCCCTTGGTGGGAATCCAGGATTTCCTTATTTCAGTGTC^ 
GTTACAGTTGATTTTATTTAAGCTGCCAGATTTGGAAATG
AATTTGTATGTACAAATGATATTTTTTACTTTTGGACTG 
AACTGTAACTTGGTGTATCAACTTTTAGAAAACTCAGAAGACTTAGGAG 

C C CAG CAT AG AAT ATTGGT AGGTGAAACTTTATTAAAT AGAATCTGCACGG AGT AGT CTTAGCCC CAC CT AACCCTG CCAC 

TTGTCTCTGTTACrrGTTTTCTGTTGCTATTAGC^TCTTC^ 
TGTTCAGCAAAAGTTAAAGTGTCTG^CTGAALTCTGGGTAGCTGGTi i iGGAGTTGTATCGGGTTTCATTTATTCrCCAGGTLrTCl'

TAGTCTTCACTTGAAAATAGAAACTAGAGTTCAriT^ 
TTCTTATTACCAGAAAGCTTTCTTCrGATCTATTTAGTATCTT^ 
ATCTAATCTTTCATTATTAAAGTTCCTAAATATCTTATATTACTATGTTTTATCATT 

TTG C1TCT CT AATT ATCTTGGGTG AAGGACCAGTITTG 1TT1T AATTTCCAGT ATGTCATGGATTG AT AATI"IT ATAAAAT ACAAT 
AAAAATGATTTACTAGAAAAATGAAATACTAGAAAAATGAAATTTAAAGAA^
CITATAGTCAACAGACT^TAA^TTACTCTGGCAAGCTGCT 
GGACTAGGGCTGGTCTGCAGAGTCCAGTTCGAGITGCACTGTTGGAC^^ 
CTAATTTCAGAACATATCCTCAAAGTGGGTTTTTGGGGACTGTCCCCTTCA 
GGTAGAGTGGTTCATGGCTGTAATCCCAGCACTTTGGGAGGCCAAG^ 

GGCCAACATGGTGAAACCTCGTCTCTAAAAATACAAAAATTAGCT

GCTGAGGCATGAGAATTGCCTGAACCCAGGAGGCAGAGGTTGCAGTAAGCCAAGATCATGGCACT 
AGCAAGACCCTGTCTTGAAAAAAAAAAAAAAACAAAAAAGG^VAGTC 
AAAAAAATTGTAGGACAATGCATAAAATAATATTTAGGTGAAT^ 
TCCTTTCTCTATACATCATATACATACATACAAACATATA 

ATAAACAATACTCTGCAACT'rrCTTTAATATTTTGT^

TGCATAATATCCTATGGTATCGTTGTACCATAGTTTATTGAATTATTCCCCAGCTGATATACCT^ 
CCATTCCTAACAACACTGCAGTTAAATTCTTGTACATATATTCATGTGTATCT 
GGAGTCAGTGACTCATGCATGATGTCAGATGATACTCCAAAAAAGTTGCATT^ 
CTGTTTGCCTCACCCCTGCCAGGTGTTAGGAGTCTTTTTAAGTGTTCTCT 

TGAGTTCCCCTGATTGCCAGCAAGAGTGTGCTTCriTTCATATGTTTAA

AAAACTTGACAGATTTTATTTTATTAGGTCTTTTGTGTACC^ 

TTGTTCGTTTTAGTAGGGTAGAGGACATGGGAAAAGAACAAAGATACAGTCOT 

TCTGATTTTCTTTGTCrrAACATGTTTTTTTTCXTC^ 

CACAGGGTGGGGCATCTTTATAGCTTGGCATAGTGTATATCGTTATTCAAGATT 

TAAAAAATTGAGGTAAGTTCATATAGAATTAACCCTCCrCAAGTGTACAATTCAGTGGC^

CTGGGACCTCTATCCAGTTCTGAAACATTTTCCTCACCCCAAAGGAAACT 

TCC CT CAACCCCTGGAAACCACTT CT CTGCTTC CTG CGT CT ATGGTTTC AC CTTTTCTGG AT ATTT CAT ATAAATTGAGT CATG CA 
GTATGTGATTTTTTATGTCTGGCTTTTTTCATTTAGCA 

TCCTTATTGTGGCTGGATAAAATTTTATTGTATGCTACAATATACAATAATTTG 

TTCCACCTTTTTGGCTGCrGTGAACAATTGCCCATC

ACAGCTGAAAAGAACCTCCCAGGGAGCATCTTCTGTCrrcrAGAGCCTTCCrCT 

AGGCIGGGTCAGTCGTGGGGCOU3ATTTGGGATCTGTTTTCACT 

CACACCCTCTGGATTTAAAAAGGAATTTACATCCCACCCTCAGGGCACT 

GG AACATT CATGGCT ATT CCT C CATGG TTGTTGC CTC T ACCTTT AGCATCTGGGTTGG AGCTTGT AG AATG 

ATTGTGCCCACATAGTACCTGGGCATATGACAGTATAGGCTCATTTCCCATCT

TTT CTC CCTC^TAAAATGGCATT ACT AATACTT ACAT CATAGGGTTGTA 
TACAGTACCATGTGGTAGTACTTGATAAATCTTC^TTrcrTCT 

ATTACATACTTCACAAGGTAGTATTAAGTTTAAAATGAGTTGATAATGCAGAGGAAGTC 
CAACACTCACTATAGATAGAAAGGTACCTTACCTTGCAGTTAGGATTTCT 

CTAAAGTGTTTTAAGTGGTAAAGATTTCACCTGGGTATATTG

TAATTAAGACAGCCATGTTCAGGATAGATGCACTATCACTAAGATCCT 

ATATATTAAAGAAAGTTACTTTTATATCAAAGTGTTTGCATCCCAAAGAAAATACTTGTACAGC 
AGCTGACAAGAGACTTCTGTCTTGCCCTCCAAAGTGGTTTAGTATCCTATT^
TATTTTAGAGCAAATGGGAAGGATTTTATAAAAAAGGGAGAAGCAACT^ 

TCTCTGAGGGTGAAGTGGATCCCCATAATGTGGGCCCCATGTGTGAACCTTTTACAAGGT

TAAAGGAGAGAGTTCATGTTTTGTCTGAAATGGAGCATGCCITGAGT^ 
TAATAGCAAACAGCTTGTATAAGATTTTTACATAAACATTTG 

AATATTTGTAGCACTTATGTTTTTAAATCTACAGGTCTAACATGCAGTTGTTTATTTGG^ 
ATCTACCTCCAACCTCTCACC^TCTCCCAC^CTTCCCAC^ 

CACTGGGAGGGAAAGACGAGGAAACCCCAAAGGGACAGCTTTCTGGAGAAAAATC

CAGATACACACTTCCAC^IATGTGCAAATTGAGTCAACATCATGTCTG 

CTTT AG CCCT AGG C CT CTCAAGTTTTGG AT ATGTG ATG CTCAAG AAATG CATTTACAATTTGAAAAC CTGGTGCTCATT AAGT AAT 

TCTTTTAAAATTTTTTTCTTTTCCTTTTGAA 

TTTGTTTATATTTGCCAATTATTAATATTTGTTATTTTGTCT 

TTTTAGAACTATTTAAAAGTAAGTTGCAGACTTACTTTCATGACATTTTTCCCTGAAGAC

ACATTTTCCTACGTAACCACAATCCTGTTATAAAGTAAGTCTTAATTGAAAACTTCAGACAGAGGACAA 
AGATTGAACAAAGCTGTAATTGAACTTGGAAAGTATAAGGAGAAGA^ 
AGGATAAATTTTGTATATACCAGCAAATCACTCAATGAATGCAGATTTTCT 
CCTGGTGCTCTGAAGTCAGTGTTTTGGCCAGGGTTATAAGCTTTTCAG 
CTGAGTGACCACATCCTACTTAGACTTGTGGTTAGGAGCAAATT^
CCTTCAGTTATTACAGTCAGAAATGTTTACTAATGGGCAGTTAGTGTAAACATGGGAG
AGTAAAAGCTGATGTGTTAGCACACTATTTTGAGAAATAATTTTACAG^ 

TTG CAG ATTTAGGCTT CAGGAG T CAGCX3 CAAGAAAAAATGCT ACTCTT AT AAAGGCTG AACAG CAGGAACAAAGTT AAGTGG ATGG 

CATGTAGATGCATCACTAATTTGTCTATTTTAAAACAAAGATATACATTTT 
TAAAAGGTTTAITTGGACTGTTTCCCTGAAAGCCrCCTTCTTCGGT

TATTTTTCAAATTTTrTAAAAAAGCTATTTTCACT 

G^CATCATCATtyrCAAGAATGAGAAACTATGTGTTTCCTT^ 

GAAAAGGTCCATTTTCAGTAAACTCTTGTGCTTACCTAGTGTCTT 

ATCACAGAGAGTTTGAAACTACACATTTTGTCll'1'lATTTCCrTTTTCTT 
TTATTTTAAATTCCAATTTTACATATCAAATTTACAAACTG

AAAAAGCATTTAATGT AAAATG CATGTGT AGTTT CiTGTGT ATCAAAG CGATG CTTTTCCTT AGATCAG CTT AATCG TCT CCTGTT 

TCCAATTTGTGTTAGAAAGTTCTGTGTGAAATGTGAACTCTAGTTTATATA 

AGCrcrTGAGCTATCTTTGTGGCTGTCTTACCT 

ATTGGAGATGTCACTACCTGCT^CTTGGACACCAGTGTGTGCAT 
TAATTTTGTTTTTCAGAGGCAAATTTTGTGAGGCCA^

TCCATTTAGTGCCTACTCTGTGTCAGTGCTAGTCAAAA^ 

CCATGAATGC-GTACTAACAACACAACAGCCACTTGAATGGCCT 

AATTGGTCTGTAAAAGTCATCGCTGTACAAATGAATCCCAAACCAAACATTGAAAG 

GGGTOVTAGAGCATTTTGGGCAAAGAGCCCATACAAGGCCATAGAAGCATGAA^ 
TCAT AT AATGAG AGTTTGGGAACATGGAGAAGGT AGTGG AGG AAG AT CAGACT AAAACTGATTATG AAGG AC CTTACGT ACCATG C

CAAGGAGGCTGGCCTTCATCCTGTAGTCAGTGGGGAACAACTTCAG 

AGAAATATTCTCnTTGCAGCACTGTG<3AGGTACAG 

TCATGGGGCAGTCGCAGTGGGGAGATAAGGAGATGATGGTGCTGAGAGCCAGTTAAGAAATAGAATC 
TTGGTTATGAGTATTGAGACTACATGGGTGACTACATGAGTGATCTCAT 

GATCAAGGGGCGATATTAATGTAGGAAACAGAGAACAGAGAGAACCAATGATATAAGCCCT

CAAATAAAAAGAAGCTTTAGAAGGAATCATCTCAGGGTATGAGGAAAACCTG 
CTGCAGAAGGAGGGAGTAGTCAAGAATGTTGAATACTTCAGAG 
TAATCCTAGCAAGAGCAATTAGTGGGTGGTGAAAAATAAGAAATCAGATGGT^ 
TAAGGGAAGGAGAGCAAGAGTTCTCAAGGAACATTTTTTT 

AGAACACATGGCAAGATTAGCTTTGGGCTAGACAAAGGACAACT 
GGATATGGATAAGGGATTGTAGGTCTTGAAAGTTGGAAGTTTGAGGGGTGTAGTC 
AGAAGCGTAGTTGACTGATGGCCCAACTGCTGCCCCTCrrG^ 
CCCAGTCATGACCAAGCACAATAGGGGCTCCTATTTCAGGCrCATTCC 

CAAGAACTCCTATTGGCCTGAGCAGACCTTTCTrGGAACTGTGTTGGAATCCAA

CCATGTCCATAGGTGTGGTTTGAAGGACCTCCCACATTCTGTGGCT 
GGTACrTGCTTCTGGGTAGACCCAAACTAACATAAAGAATTTGGTCGAAATGTC 
AATAACTTCTCAGTTCAACAACATGTACATCATTCAGACAGAAAAT^ 
TGGACCCAATAGACATGTACAGAATTTTCCATCCAACAGCAGCAGAAT 

GATAGACC^TATGTTAGGCCACAAAATTTAAGATTG

AACTAGACATCAGTCACAGGAGGAATCTTGAAAAATTCACAAATATCT 
AAAGAAGAAATTAAAAGATAAATTTAAAAAATGCCTTGAGACAAATGACA^ 
GAGAAGCAGTTCTGAGAGGAAAGTTTATAGCAATATATGCCTAOVTTAGAAAAGAAGAAAC^T 
CACCTCAAC<5AACTAGAAAAAGAACAAACTAAAACCAGATTAGTAG 

GATAATAGAAAAGCCACAGGAAGAATCAATAAACTTAAGAGTTGGGTC

ACAAAGAAAAAAGAAAACTCAAGTAAAACCAAAAATGAAATTGAAGAAATTACAACAGA 
TCCCTATTACGAACAATTATATGCCAACAAATTGGGTGGCCTAGAGGAAACAGATAAATTCCT 
TGACTCAGGAAGAAATAGAAAGCCTGTAACAGACCAATCACAAATAG 
CCAGGAACAGATAGCTTCACAGCTGAGTTCTACCAGACATTCAAAGAA 

AACTAGAAGGAATACTTTCAAACACATTTTATGAGGCCAGGATCACCCTGATACCCAGCC

ATTTAAACCCAATTTCTCTGATGAATATTGGTGCAAW 
ATTACATGTCATtlACTAAGTGGGATTTATCCCTGGCATACAAAGCT 
TTAACAGAATGAAAGATAAAAATCACATGATAATCTTAATTGACACAGAAAAAGCATT^ 
AAAATGCTCAATAGTGTATGTATAGAAGGAAAGCTTCTCTATGTAATAAAGGCCATTTAT^ 

ACATGGGGGAAAACTGAAAGTTTTTCCATGAAGGTCTAGTAT

GTACTGGAAGTACCAGCAATAGTAATTAGATAAGGAAAAGAAATTAAAGGCATTGAGGTCAGG 

G CAGATGGTG TGAT C CT AT ATTG TAGGGGGGGAAAAAAG C C C CTG CAG ATT C CACACAAAAAAACTGTT AG AACAACG AATG AATT 

CAGTAAAGCTGTGGGGTGCAAAATCAACATGCAAAATCAGTTGCGTTT^ 

AAGAAAATGGTCCCACTTAACAAAGCATCAAAAAGAACAAAATA 

AAACrATAAAACATTAATGAAAGAAATTAAGC^CAAAAAAATGGAAA

GAAATGTCCATAGTACCCAAAGCAATATACAGATTCGGTGCAATCT 
AATGCTTCTTAAATTTGTACAGAACCATGAAAGACCCTGAATAGCCAAAACAATTA 
TACITCCrGATTTAAAAATTATATTACAAAGTTATAGTAATAAAGACA^ 
GAACAGAATAGAGAGTCCAGAAATAAATCCACGTATATATAGTCAACTAATTTC 

AGAATAGTCTCTTTGATAGGATGTTAAGAAAACTGGAm

AAAAATCAACTCAAAATTGATAAAAGACCTAAACATAAAAGCAGAAACTATAAAACTCCT 

TG GACIATTGGCX3TTGG T AG TC^CTTTTGGAT AT CACAACAAG AG CTCAGG CCAC AAAAG CAAAATT AAATAAATGGGGG CTGGGC 

GCAGTGGCTCACC^CTGTAATCCCGGCACTTTTGGGAGGCTGAC^TAG^ 

AACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCn'GGGTGTGGTC 

CTGAC^CAGGAGAATCGCTTGAACCCAGGAGGCCGAGATT

GTGAGACTCCATCTCAAAAATAAAATAAATAAATGGGACTACACCAAACTAGAAAG 

AGTGAAGTAGCAGCCTACAGATTGGGAAAAAATTCCAATCAGTATATCCCAATATTCCAA 

GTATTTGGTAAGGAGTTAGTAGCC^AAATTTATCAGGAACTCATAC 

AATGGG CAAAAGG C CTG AG T AG ACATTTTTCT AAAGAAGACACAAAGATGGC CACAGG CAT ATGAAAGGGT ACACAACATC AT AT C 
AGGGAAATGCAAATCAAAACCACK3TGATTTACCTCCTCACAACTC
TTCX3TGAGGATGTGGAGGACTCTTGTCCACTG

TAAAGAAATTAAAAAOUGAACTACCATATATGACCCAGCCATTCCTTTTCT 
ATAAAGATGTCTGCACTCCC^TGTTCACTGCAGCATGATTCACAATAACTAAGATATGG 
AAATAGAGAAAGAAGTGGCTGTAAGTATACCACATTTGGTTTATACAAAATGAATA^ 
GT CATTTGCCACAACATGGGAAAG CC CATTGTGCTAAGCGAAATACGCCAAAGAAAGACAT ATT AT ATG AT CTT ACTTATATGTGG

AACCAAAAAAATTTCAAATATGCAGAGAGAATACAACAGTGGTTACCAGGGATATC 
TCAGAGGATACAAAGTAGTATGAACAAGTATAGAGATCTAATGTGCAGTATG 
TTTTAGCTGCTCTTAACATAAAAACAAAAATAGGGTAACTG^ 
TATCTATATTAAGCCTATAACATCACGTTGTATACCATAAATATACGTA^ 
ACTTCTCAACATTTCACTGTTCTAACACAAGTTTCT

TTTT CCTCCTGGAAAT CCTTCAGG AATTCAG ATTCCTTC CATC^ 

TGCTTG TTTGCATCACAT CAAG AGAGGGGGAAAAACATGG CAGGTT ACACATGGAA^TTTTT ATGGAGTAGG CCT AGAAATT AGC 

ATAGAAATTAGGCCTAGAAATTAGCATAATCACTTTCACTTATATTTCG 

AGGAGGCTGGAAAACGTGGCCCAGCTGTGCACCAGGAAGAAGACACTATAAGTTTC^ 
CACAGCTAATGCTATGAGGCCTGGATTTATCACACTGTACGTTTTATGTGAGATTTT^

TCACTGGTGAAGTTGCTGAGAAAGCAGGTGGTTGAGTT^ 

ACTTGGGCAAGTCTCCGTTGGAAAATGTCTTTTCAGTGTTTCXK3TACA 

TGTCACTCTGTGTGTTGCTGGTTTTGGGTCCTGCACCTGGACAACCAA^ 

CTTCAGTCGGAGATAGTATGACTTACAGAGGGAGAGATGTTTTCAAACCCrcTG 
TTGCACTTTTCTCTAATACAAATGAATGAACCTOKrrGCTTCT

CTCATTACArr r r TT GCTrTTGCAAAri W 

CCTTAAATCATTGTCAATCCAAATCCTGGACAACCTGTCCTCTCAGGCCCAACT 

TACCACAGTTGAGTGCCTCCTGTCTTCCTGACACTCTTTGGGTCT 

CTGTATGAGTCTTTCACCTACTAGATTTAAAACTTGATGATAGAACT 

TACACACAGTACATTITTAGTAAACAATTGTTGAGTGAAAATGAATAAATATTC

TTGGAGAATCAGTCTTGTGTGGCCACAGATCACATGAGGAATGTAAAAAT^ 
AGGGGAGAAAGGCCTGTACCXIATTTTTTCCATCTCTAATGATCATA 
ATCTCTATTAGGGCTCTGAAGTTTGTGAAGGCTCCAGCCACCC^ 
TACAATCTTGACTTATTTTGGAGCGTTAATGGAGGCAAACTGAGAT^ 

CTCAGGGGAGTAGTTTTGAGTTTGAAACATTACAATAACT^

AATGAATTGACAAGAGACCTCTTTATACAGCTAGAATATATCAAGTTGTTACT^ 
TGGTACTGTTTTTATGGTTGTAGTTGTTTATTGTAGGATAAAAGTCCAAGAGGAAATGT 
ATATAATATATAGATACTATGTATAAGCTArrriTCTGCTCTAGGTCCT^ 
CAAACAGGTAGAGAGCAATTAATTTAAAAGTtK^CCTAAAACCCACCTTCCT 

TCCAGGATTTATATAGCAGATTATTTTATCACTACTCCCTTAGAGCAACCAATCA

TAC CCATTTT AAAAATGTTT ATTT AAT ATTTTT AAG T AATTTT TGAAAAGATCT CACAT ACTCATrr ri TCATGAAC 

AAATTATGCTGTATACrTAATTGAGTCTGTCTGTAGGGATCATTTATTTCTAT^ 
TTGAGATGATAGTCTATTGATTTTGTTTCTCTCTCTCATGT^ 
GATCATTCTTTTATGTCAGTTCTCAGCTAGTTGTGATTTGTTGCTAAATTAC 
TTTGAAAACTATTTCTTTTAGGATACTAAACAgTAAGACAAAGA

TTCTTTGTTAAAAGCAATTTGTGCCATGATAGTTTAAAAAAAATAGTTCX^CAA 
CATTTGAGCAGAGAGAAGTTTTTTATTATC 
GTGCTGTGTTTTCTCITTAAAATACACTTGGATTCTTCCAC^ 
ACCCTTTCATTCACTAGCTK^TTGGAGAAAATTAACTCTTCAA 

CTTAGTATTTAATTTCATGAGGGACATAGCACTTTTCCCCATGTAATTACAGATTGGTAGAG

AAGTTTAACACCCATCAAATGCAGATACCTCTTCTAAGGTATCCTAATTrCTATATCGA 
GCAAATCTGCCATAATAAAGTCTTTTTCAAAGTTATCTATGCCCAGTTCCCCTOUV 
TCTGATCTCACTCCTTTTTGCTTTCTCCAGTTTATCAGAAAGT^ 
TTTTITGAGGCAAGATTTCACTCCTGTCATCCAGGTTGGA 

TCAAGCAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGTCTACAGTTGCATACCACC^

AGAGACGAGTTTTGCCATGTTACCCAAGCTGGTCTTGAACTCCTTACCTCAAGTGGT 
GATTATAGGCGTGAGCCACTGCACCTGGCCCTAAAGCTACTCT^ 
AACAGTGTCTTTATTAATGCAGCCTAAGAGTACATTTTTGCAAACCCAT 
CCATGTACTGACCATACAGGACCTCACATTTGAGCCTAGTAAACTTGACTAT^ 

TTTTGTAATCCTGTTTCTGGCATCCCTGGTGTTGTGT^

TGGGCAAATACAGACTTTCTGTCCAGAGTGATAATCATATCC^ 

CCTGGGAAGTTGAATGATAAAGCCAGAAGAAAGCATGCTTTCTGATCATTGTAAGTTTTAC 
TTAGCAGGAAATTTAAATTTTATTAATATCATCTC^TATACTTTAGAAGATCTAAAACT 

GG AGCCTGGTT AT AACC TTTC TT TGG T AGCCCAT AG CTGTGGGGCCCAT AGCTTT GGGG CC CAAAG CT CTCTTTG ATTTGAGTCAG 

AGACCITCCCATGTGCCCAGGAATTTTGATTATGGCAGGTGATA

TGTCACCAAGTTGTGTGAATCTTTGTATTGCTAAAAAGGATACCCGGGCAGTCT^ 
AAATATTGTGATGTT CATGTTTGCTCAGGCrcGCTACTCCAGTTTTGTT ACAACATAGTGCT 
ATGCTAAAAAGCACAGTGTATTTGAGCATAGTCAATTCTAAAAAAAAT^ 
CTTTCTGTCTTGAAATCTGTGGATTGAACTITCCCACCCCTC 

TC CT TTTATCAGTGCAGGTrTATAGAAAGTGTTCTCATATCATCCTGAATCATTTCTA

TTACAATCCTAACAATTGTAATAAATGCTCCTATTAGAAGCTTTTACITCCATC 
CATCTTCATTAGTC^CCCATGACTTACTCCAGGTGTTACCCTCTAAGTTA 
GTTTTTTAAGGGAAAAAATTCTGAAAGCAAAAGGGTTCTCTCTTTTT^ 
AAGCAGGTGCTGGGTATGTGCTAAACTAATGGTTTTCAGAGTACTGAGTGGCTATTTTTAA^ 

GCTGCTGGATGAAAAGTACAAAAGGCTAAAAATATTTACAGTTTCAGTAATGT

ACACCACCCTGAACGTGCCTGATCTTGTCTAAAAGAAACCATCCTGTGTGA^ 
ATGGCTTACAGAGAAGTGGTTGAAAGCAGTTTGTTCAACACTGAGGTTTTTAGG^ 
GTTACAAATCATCAGGATTAAAAAAAAAAAGTTGATTATTAT^ 
AGTTCCAACTGAAAATAGTATTATCTTTGAAACTTTTTGT^ 
ATCTCAGCTC T ATCCCi TGGATATGTGATCCTGGATAAATTGTTT AACCTCTTTGTGCCTCAGTGTAC CCATCTGTAAAACAGAAT
AATAATAGTGCCTTCCTGCTAAGGTTGTTGTGATTAAATGAGATGATGCTCTGCCTAACATGG
ATGCTTATTTTTTTCATTTAAAAJUUITTTTTGGTT 

ATT AG CT AT ATTT ACATTGT AGTGT AACAAAT CTTT AGAATTTTTCAT CTGG CAAAACTGAAATGTTT ATT ATGTTTTAAAAGCTC 
ATTTTTCTTCAGAGTCAGTTACACAGCTGTGTGGACGT^ 
TTCACTGCTGTATCACAAGTGCCACAATCAGCATCT

TGAATGAACAGTGAGGCTTCATTTGACAAACTACATGTGTGTTTG^ 

TTT AAGT AT ACATGGAAAGTTGACTCCTTT CGGAGGGTGAAGTG ATTT CTAAAACAACTGT1TTGT AATT ATTT AG CAGTTCTGGC 

TTTGGTAGGTCAACl'lTTCTTAAACTGGAAAAAAAAATAGTATTCTTAGTGTAGATTGCAAAAG 

TTGCCCAAAATTTAAACAGGATTCGCTTTTTAAATTCAAGCCTTTT^ 
TAAGGCATATATGAATCATCTTGTTCTTTCATTCAGAAATCAAAAGTTCAGGCCTA

TACCTTTCCACTGAGAGTGCATTACCTr'r'l"l"i IGAGl'Cl'riGAAAAGAATGTCAGCTGAACATGGTCTGGAAGTGGCCAAAACTCT 

GTGGGTGGGGAGAAAGGAAGAAATAACCTATCTCCCATTTAAAAACCTACC^ 

TATTCATTTGGGTTATCTATTCTGATGAACAAATACAAATTCTTGGTAGGGTTT^ 

TTTTGTTGTTCAATCITAGGTTTTGTTTCATTTTGTTTTTCAG 
TGGTTCGTGGCAGCCTCAGCAGCTCCCTGGGCTCAGGTGATTCCCCCACGTCAGCCTCTCGAGTAGTTGAGACTATA

CACCACACCCTGCTAATTTTTGTGTTTGTAGTAGA^ 

GTGATTTGCCCACTTTGGCCTCCCAAAGTGCTGGCATTACAGGCATGAGCCACTGCGCCTGGC 
ATGCTCCAATAGCATTTTATTTTTATTTAAACATTGCTG 

CTACCAAGGAAAGGGGACTCTGTCTCCTGGACTTTGCAGTTATGGGAGGAAGATGACAGGAAGCTACATCATCTGCC 

TCTGCTCCCATGGCAAAGACAGTCACTCATCTTCCAAATTGTGGTATCTTGAGTCATT^ 
ATAAAGGAAAATGATCTTACCTAGTCATTTTTTCCCATTCTACAAGACAAAGT^ 

G CCTCT CTGAG AT AAT AAAAT ATTTTTAT ATTT AAATTTTCTGTT AAGAGGAAAATG T ATA CCC AAATTTTT AACT CCT 
TCTTTTGATTCTATACCTACATAG<5TTTTTATTTGGATTTGAAACCAAATAAAAGA 

GTAGTAGTGAAGCAGCC7VGTTCATTTATTCTTCATCTGCCCTGCCATGGCTTCCTGTTGCACTAAGAATCCAGCCTCC

GTTGGTGGGAGTCTGCrGACCTGGCCCCAGCCAGGCTCTCTGGCCrCATGCAGCCATTTT 
GTGACGAGGTGCACGAGGCTCTTCCTGAGGCTG 

CGTTCATCCTTCACTTGCATTTTAACATCAC^TCCTCTGAGAAGCCTTCCCTGATCT 

CTCTCCCATTAG T CT ATGAGCCCATTAGGG AT CACGG CT ATTTTGTCTGCTGT ATGT AAT ATCCACTT C CT AG CAAG CAGGACACA

CTCAAGGAAGAGTTATTGAATAATAATTTAAAAATATTTTTTATTAAATGAAAAAT 
CTGTCAGGAGTGTTAAGTACTGAAAAAACGACAGC^GATTAGTTAATTGGTTGATTTCA 
ACTATGATGTGCTGTTCTAGGTGCTGAGTACATGTGGACGGGTTC 
AGAGACACAATGAGGAATAAAATTAGTGCCITTCTCTCAAGGAGCATATCTATTCA 

TGATATACTCTAAGCTAGAAGACACITGAATGTGATGAGTATAAC7VTATAAGAGAGTA

GCTGTAGGGATGAAGATTAAGAAATGCTGTCTGATTAGGGACATCA^ 

AAGAATGGGTTCAACTTTGACAGCTGAAGCTATGAGGGAGGAAATTCrrAGGCATGGGAAC 

AAGACATTCCAGGXSATAGCTATTAACCCTTTTTAACTGAAGC^ 

TAGGTAGAGGCTATACTITTAGGGGAGTGAAGAATTCCTCTAAGTTTAGAAAAAGTTGATTTTAATTACCn'TAAATA 

GGTACTAATATTGCTGTTTGGTAGCCCITAGGAAAATGATACTTGTGACTTGGT

GGGAGATTGAGGTGGGAGGATTGCTTGAGGCCAGGAGTTCAAGACCAGCCTGGGCAACATAGCAAGACCCCA 
ATAACTGTACATGGTGGTGCATGCCTGTAGTCCCAGCT^^ 

TTGCAATGAGCTATGATGACCACTG CACTC CAGCCTGGGT AAGACACCATCT CTACAAAAAAAAAAAAAAGAAG AAGAAGAAAGAA 
ATTGATACTTGTTTAAATATATTACATGTCAAATGTACTACAACAGTCCTCTCCAAATGAG^ 

ACTGCAGTCATGTTCTCGTTATAGTGGTAGTTAGTATGCAGGTCAATAATTTATATGTGGTCTTTAGCTTT

CAGTAAAAACGGTAACCTGACATTTGAAGGTTTTTTTTTCA 

TACAAGTATGTTTTTCATGC!ATATTTTAAGAAAATTTTCATA 

CCTAAGAATTTTAGTGTAAAGTTTCATTTTCCTTATCA 

TATGGAGTATCTGTGTTCCTAGAAAATTTCTTTTGAAATATGATTTTGTAAATCAGATTGTATTGTAAT^ 

GCTTCAAATCACCTTAAAATAAATTCATTTATATTATCTAATTCATATTTTATAAAT

GAATTGGGGGTAATTCTACAAACAATAAACAAGGTAAATGATTTTTTC 
GGGTGAATGTATGTATCITAATACACTGTACTA(IATTAGAC^ 
TGTTGAAACCTATCTTAAAGAGTTTTGAGC^AAATTGAAGGATGAGCCT 
TAGAGAACAGTAAGAATAAAAGTGGAAAAATTGTATTTAAAATTC 

ATTATACTTATAAAAGTACAGTAATCAACTCTGCTTTATAACTGTTTTGTGAAGATTATTTAAAG^

CACAGAAAAACAOAATAAAATTGGAATCCTAGAGTTTATGTGCATTTAGGTTAA 

CTGGGTTGAGTGAAGGACCCACrGTTCATTGTGTTCCAGCAGTAGTCACTCAGACCCAGCT^ 

TTCTGCCACGTCCCACTCCTCCTCCCTTGATAGCTTCTCTTGCT 

CCAGCCTCCTGTCATGGAATGAGGGTCTGGGCCAGGAGACCTTTGGAGAAAATTCCTGGAGAGCAAGTTTTCATA 

AATTTTTTCTGTTTCCACCCTGTTACAAATATTTTAATCCAAATATGAA

GACTTCTCTTGTTTCAAATTCAGATTATTTCTGATCIACCrCATAGCTAGTTCT 
AACTACTGAATGATGTAAATGAACATATAAAGGCTATGTTTACCTACTTAAGAAATTGCTTTTA^ 
ACTCTTTTGTTTTTAAATTTAAAAAATTCnTCAGAGATGGGATCT 
TTCACTGGCTACCTCAGCCTCTGGAGTAGCTGGGACTACAGGCACGCACCACAAC^ 

GAAACAGAGTCTCTCTATGTTGCCCTGACTGGTCTTGAACTCCTGGCCT

GATTACAAGTGTC^GCCACTGAGCCTGGTTGAAATGTCCACTTACTTTTAAAAATGAATTGTATTTTAAGGAA 
TTAAATAATTTTCATCAGTTTACnTAAAAAGTACTAAAGCCTTTTTTTTTT^ 
ATTGCAGTGGCATGATCTCAGCTGACTGTAACCTCTGCCTCCCAGGTTCAAGCGATATTCTCCC^ 
GGATTACAGGCACCTGCCACCACACCCGGCTAATTTTTTGTGTGTTTTTAGTA 

CTCGAACTCCTGACCTCAGGTGATCTGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACC^
GCCTATTTTTAAAGCATAATTTATATTAGGTATTGTTAATCCTTCACGGTTCTTGTTTGG 
ACTTGGAGCTAGAAAGGCCCATGAGGGTGATGATGGCTCAGAGGGGGCAATAGAAGC^ 
GTAAACATGGGATGTGTCAGATATGATACCAGCCAGTGGTCCCCATAGCACAGGCAGCAAGTGTGAGGC^ 
TACACGAGTATTTGTGTGCGTGTGTGCGCATGTGTGCGCGTGTGAAGGGGX5GAGGGGGTCGTTGCTGT 

AAATGGAGACCCCAGAAGCAGCTGATACTAATCCATCCAAGTAGG
CCAAGGATTCTAGACCAGATTCTGGAGAG^

GGAAGCAGTAATAGTAATTACCTCAAGGGGTTATTTTGATGGCTAATTGAAAACA 
CATAGTAAGTATTTAGTAAATGTTAGCTATTACTGTATTACATGAAATTGGGAAG 
ACCGGGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCCAGGCAGGCG^ 
GCCTGGCCAACATGGTGAAACCCTGTCCCTACTAAAATTGCAAAAATTAGCCGGGTGTGC
TCAGGAGGCTGAGGCATGAGAATCATTTGAACCCAGGAGATGGAGGTTGCAGTGAGCT 
GCAACAGAGCGAGACTCTGTCTCCCAAAAATAAAAAATAAAAAATGAATGTCCTATGACTGAGGTTTAT^ 
ATTCCAATCAATGAAATTTTACTACATAAAGGTAGATGCCTTCTAATGGTTTTC^ 

CTGTT CAGGT AGTAGACCTGTTCATG AGGAAAAAAACT ATTTAAATGCTGAGAAAAGG ATGCTAG CTTTTTTGG ATC CATCATCAT 
TTTTGCCAAATTCATAGAATTTTTGGCCAGTGGTGTCAA

TTAGGGAGGGTGTGGAAATAAGGAATGAACGTTATGTAAAGACCATAATGCTAGTCAGAGAATGTTCCAAT 

CCTTCTGCCACCAGGACAAAACTTTTGTGATCTAAA 

CCCAAGTGGTCTTCACTGTCATGATTATTAAGCATTTT 

TAAAACAAATTTGCTTAAAAGTAAAATGGAGTTATTGGCATCACAAAGATCATAGTCACCTT 
GACTGTAAAGGATTTTTAAAGACGATCCTTGCGACTAGTTAT^

GGCACATTGGCTCACGCCTGTAATCCCAGCATTTTGGGAGGCTGAGGCGGGCGG^ 
GACCAACATGGAGAAACCTTGTCTCTACTAAAAAAAAAAAAAAAAAAA^ 

TCCAAGCGACTCGGGAGGCTGAGACAGGAGAATCGCTTGAACTCGGGAGGCGGAGGTTGCAGTGAGCCAAGATCACGCC^ 
TCCAGTCTGGGCAACAAGAGCTAAACTCCGCCTCAAAAAAAAAAAGAAATCAGATAATTTGCC 
CAACTCCTGAATTTTTTGATCACATTATCTTTCAAGTTTAGTTAGAAC

TAATGCACTGGTCACATTGTGTAGATATTAAGTTACACTTTGGTAGATGGATTTTTAGATATTC 

GCTTTGAAAGTTTGGCAGCTTCTTATAAAGCTCAAGATAGGCTTAACGTACAATTCAGCAACTACGC 

TGAGTTGAAAATTTATGTCTGCAGAACAACCTGC7VTGCGAATCTTGATAGCAGCTTTATTTAT 

T AAGGG ATC CTTCATTAGATG AATTGAT AAACTGTGGT ACATC CAG ATAATGGAAT ATT ATTCAG CACTAAAAAG AAATG AGCAAT 

GAAGCCACAGAAAGACACAAAGGAACCTT AAATG CATATTGCTAAGTGAAAGAAGCCAGTCT

TAAACTATAGAAAAGGCAAAAGTCAAAACCATAGAGACAGTAAAAAGATCACTGGT^ 
ACAGGTGGAGCACAGTGGATTTTAAGGGCTGTGACACTATTCTGTATGATATTGTAATGGTGGATAG 
TACAACTGTGTAACATAAAGAGTGAACTCTTATGTAAACTATATTGACTGTTATGACTAGTTAATCCT 
TAATTATAACAGATGTACCATACTTGTGCAAGATGGTTAATAACAGGGGAAACGGGGAAGAGGGAAC^ 

TAAACCTAAAACTGCTCTAAAAAATAGTCTACTAATAAAAAAAAAAAAAGAAAATCAGGATAGATTGTAGC^

CAATACTCCAAGACAAAACAAAACAAAAGCCTTGTATAATGTCTTTCATATAATATTCCCCACCTTATTTCT 
ATTTTTTTTTCATTT ATCTTATGTGTT AT CTCAT CAAC CTTT AG T AACGGG CAG TTAAAAAAAT AAGCAG TTTGG CAACTT CCTGT 
GGCATACATTTCAGATTGTGAGCAGAGATGCCCACCATOTGACTGGCAG^ 

CACCT AC C CT AG CATT AGTTGT CT CCTAC CTTGCTTTTG ACAACAG AGAGGGTG AAAATGTGGG CTTGTGGCCATT^ 

TTTT ATTT CT ATG TATTTTTTAATC CTT AC AG CAGTTC C CATTTTGT AG ATG AGGAAATGAG AG CACTGG AAG AG AGG AAATT CCT

ACCCTTGCAGTCTGACTCCATTCTGATCCTCTATCAGGCTAATGTGAACACACGTGACT 
TAGGTGGGAGGGGGAGCCATTTGGAGGATTTGGTCCCAGTCCAC^TGCTATTTA^ 
CACCACCCTACAGGCTAGTGTCTGCCATTGAGAAGTATCATCTCATCATTTTCAATCTAT^ 
TATACATATATATTATTATATATTATCTGGAAACCATATAACTATAATTTTAGTCLAACT 

CTCGCTCTGTCACCCAGGCTGGAGTGCAGTGGGGCGATCTCGGCTCACT

CT CAGCCTC CCAAGTAGCTGGAGACTACAGGCGCACGCCACCACGCCrGGCTAATTT^ 
CTAGCCTCATATTGACAAGTGATAGGGTTOVAAAGTAATATCTGTTGGCTTTG^ 
GAAGTTAGCAGATTTAGCTTCTTAATAATGTTTGGCTGGAGGC!AGTATTAACCT 
TAATGCAAGAGATAGGAATTTGGTCCCTAGGAGGGTAAAGCTAAGGTTCACTCT 

GAGGATTAAATGTTGATTTTACGTGGGTCATCATTTTTAAAGTCC

GTACATCTCCTCTTATTTGCTTTGTAGTTCTTATGAGGAAGAATTCCCA.TTATAGTGGGGCT 

T ATGC TT GTTTT CTT AAT ATCGT AAAGCAG ACAAT AG CAGG AT ATTT AATG AAAT ACAT CATGAATTT AAT C CT AG ACCAAAG CAA 
AG T ATTTT CAAGT ATTTTTCATTTT CAGAT AAATT ATATT AAAATT CAT ATTT AAGTTAAAATTTTTGAGACAGTTTACTTGAAAC 
ATAGTAGC^TTCTTCACTGATGTTAGAATTACTTTATA^ 

5 0 ACATG TG TT AAACTTTTTG AT ATTTCATT AAAAAAG AT AGTAT TGG AAAAATG CAG T ATTTTTT CATTT ACTT CCAAATG TTTTGT 

ATTAAACTTTTTTTGACCTAGTAGATGACTAGTACAAATAGAACTTT^ 

TTTTAATGCTTAATTCAAATAATAAAATATATCTGTTTTTGTATTTTCTGTAACAATATAGAAGTAGTTCTCAG 
ATAAGATGAATTTTTAGATCTTTATTTTACTTGGCAAGTCTAAAATAGTG 
ACTGTTTGGCTAATTCTTGTATGTGGTTTTAATTTTTTTAATAGCAAAT^ 
55 GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTATTCCCCCCCCCTTACTATGGAGATCATGCATG 
GAAAACAGAAAAACAAACGTGACCTTTGATCCCTTCAGAGATAACATAATCTCTT 
C^TATAGACGAGATTTATATCTTAGTCATGTATGGCTATATACATAAATATGTGCATATTGCCA^ 
GCATACATAAATAATTTTTAAAGTGAAAAGAGAATCCTCCTATAACACCTTACAC^GTGGCTATGTTCT 
ACCTTGCTGTAATTTTAGAGCCAGCACCTGGCTCAGGTTCACCCATGACAT^ 

6 0 TTCAAACAGAAAGTTCATAGCCTCTTCTCAGCTGTTAATGTGATTTACTTCTTATGTCTGGGA^ 

TTGTTTTAAGGGGATTGAGGTCCTTC^TTAATGCCAGGACATTCCCAGCTGACAGA 
GATTACCAC^TTTGTGGCTCXjTGGACATCITAGATCCTAGGTCGAG 

GTTGTGTATTGATTTTAATTCAAGTATCATTTTAGCATTGATTGGCAAACACCCTAGTGGCTTTCAGTC 
CAGCTAACCTGCTGCTGCTCCTGCCTTAAAATCAGTATTTGAGTTAGTGG 

6 5 GTATACTTTTTCTTAATTTTCAAAATAGGAAGTGAGGTGTGCTCTTCAG 

ATGAAAGGAGACACTAGATGAGCTCCACACAAGGCAGTATCACAACTCACT^ 
CGTGAGAAACAGCCATCTTGTCACAGGAAAATGAGATTGAAGAATTAGTCTGATTAA 
AGGTGGCCTGCAGAATATTAGCAGCTGTGGTTGGAAAGCCATCAGGAGTT^ 
CTGTCTTGTGTCCCAGCCTCTTTTATGCCTTCACCACCCT 

7 0 AGTTTTTAAATTCATTCTTATTAGGAGTGAGTTTTTTTCTTTGTAGATATTAGTAATCTAAAAGTAGAA 

TACAGCATATAATTCAATTGCACATCAGCAGTTGCGCTATCTGTATATGCAAATACTCT 
ACAACTCCAAAAATGAAATCTAATGACTTGTTTCTATGGTAACCACT^ 
TTTCTCTAAGAGTTTCAGGGAGTTCATTAAATTTGCTTTCCCAA 
ATTTTTCCCCCTAATC-GGGTTATGAGACTATTATAAAATTCTCITTAGCT 
75 TATTTGGCTAAGGGACAAGCCTATTGTTT^ATAAAT'fTAGTTTAAACCAGT 
ATTAATCTTGCTTCATAAATATTTTCTGGTATTTATA
CCTTTTAAAATTGAAATTCATTTGCTGTTCCTTTGAG^ 
CCAGGGCGTTGAGTGTCTGTCCACCnTGATGGAGTTCTGACCTGG 
GGTTGTGTGCTGCAAATACAATGAAAGTCACAACTTATTrATTTAGAGA^ 
5 ACTACAGTGTTTCC1AGAGGTACAGAAGAGATTCCATTGCATTTTACATTTA 

CCAGTCTCAAGGCCTGTCATGTCACAAAACAAACAGATGTGGTCAGATATTCT 

ACATT ATGGCAGGGTT CATGGAATT CTCTGGTTACATTCAAAGG T AAAGT AAAGTGG CAGGAAAT AGAT ACTG AG 
GAAATAAAACTACACTGGGTCTGACAACCTGGAACTTTCATTT 
TTAGATATGGTTGGTCATAATAACATTTAATTGTTTGTTGAGGAGGGATGACT 
1 0 AGCTCTCCCAGATTTCTAAAACAAGAGCACTGGGCTGGGCGCAGTGG 

GATGGATCGTGAGGTCAGGAGATCGAGACCATCCTGGCTAACACAGTGAAACC 

GCGTGGTGGCAGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAG^ 

AGCTGAGATTGCGCCACTGCACTCCAGCCTGGGCGACAGAGAGAGACTCT 

CCTGGAGAATGGGTTTGT ACAATTTCACAG TGGAATCATGGACAT AGT AT AAGTG CATGTAG TTTTCAATTTT AATTTGTAGTCT C 
1 5 AGTTCTTTCAAGAAGTAGCTGAACAGTGACAGGCACAGTTTTCT 

TTTCATATCCTGAAATCCCAGAGACTCTCTTTTTTGCTCCCTCATCT 

AAACACAGGAAACTCTTTACCTGCCTCCTGCTTTGCACATATGGCCAGCC 

TTTCTAAAGGCAAAACTAAACATCATGGCATGACTTTGCT^ 

CAGAATGTCTCAGACCTATGTATCTTTCCAGTCTACTCTTTCT 
2 0 TTGTGTGAACTCTTTCTGGGTCCCACCATGCAGGCCTGAGGTCCCT 

ATCGTTAGTTGACGTGCCTTCCTCACTAAACTCTAAAAAGCATTTT^ 

GCCCAGCAC CTGG CACCTAGGTGGTG CT CAAT AAATGTTTCTT ATATGATTG AATAAATGAATGAAGG AATG AGTT AG TG ATAT CA 

CACAGGAAGCAGTGGTGAAGGAGACACTGCATTTAGTTAGTAAAACAGGCTTGAG 

CTGGTTGGTTGCTGGGTGGCCCCCATAAGGGCATAGCCCTGGTGAATAGCCACAGGACCAA 

2 5 GAGATGTGACTTGGAAGAATTGACTTATAGTTAAATTTATGA 

AAAGAGGAAGGCCAGGCAGC^CAACTGACATGGGTCATTTGCTCAGGC 

ATATTAGAACTTACTAAATTCTGAGTAAATGCATCCTTTCTAATTGTCTCTAAG 

AGGATTATTTTCTTTTCCTATTTAAAAATGTTTTTAAATT^ 

AAACTCCTTGGCTGAAAGGATCCTCTTTGCCTCCACCTCCCAGGTAGCTGGGAACT 

3 0 T AGTTG AAGGG CATTTCTCT AT AGATCACTG ACTTIT ACAACATCCTTAACAGT AAT CAAAACAGTT CT ACCTTTGT AAT AAT AAT 

GG CTAATT CTTGAATGTTT ACrGTGTTGGGCACTGTTCTAAATGCTT^ 
TTACTATTATTAGCTCTGTTTTTACAGACTrGGGAAA 
ATTTGAACCCATGCAGATGGACTCAATACCTTTGCTTTTAACCACAC^ 
TGTTTGCTTTGTGTGACTTCGTTTAATGCTCCTAGCAAC^ 

3 5 GT(nTGGTTTTTTGGTTACAAGCAACGGACTAAAGCAAA 

CTGACTATGATGATGGGTGCCCCAGTTGGTATGGGGAGGGGCAACTTTTCACAAAGGAA 
CACATGAAGAAATGGAAGITTAAATGACTTGCCCAAGGTCACCCAGCTAGTGAGTC 
CTAGCACCATCTGGGTTCACTCTGATTTGAGTGGACCCTTTGCAAACTG 
TGCTATCATAACCATAACATGTATCATTTTCTTTATTCCATTCTG^ 

4 0 GTTTCCCAAATGTAGTGCACATTGTTTTATTTAATTCTAAACTATCTTT^ 

TTTAACTAAAATTTGTACn^CTTrCCACTTCTGAAAGTGACC 
AGCTCTGTGAGCAGACCAGAGCTTTTTCAGGCTAAGGTTTATACAAAACA 
GGTAAGCAAGTAAGTTCATGGGAATGCCCATCTATTCACTGTCAACTC 
CTCTGGAAAGTGGCTTATACTTTGCACTTCTGAATCTGTTTGTGAGGAAATGAC^ 

4 5 GOSCTGATCCTrGATGTGTTTAGTTTTTACCTGTGCTCATCTCTACA 

TTCCAGGAAACTTAACTACATCATCGGCTCCCCAATCCGCACAGCCCAGAAG 
TGGTGATAAGGGGCCGAGAGTGTTCTCATCCTGTGGCTCTGCCAGTGAAAGCCTAGAAG 
AAAACCCTGAGTCTGGTTCTATAAGCACTTAGTCTCTGAACATGCCGACAGTGAGGGTGAAGTC 
TAGTGGTCCTCTTTCTTTCTGCTCAGGTGTGCTATCTTGTCCCTGGTCCATAGGGCCG 

5 0 ATTG AAATG ATCCTGGGAAAATTGT CACATG AG CCTGTGG ATGGGG T CAGGAG AG AT AAAGACCAT AGGATTT ACACTGG CCIACCA 

GCAGTTCCTCAAAGAATTATAATAACATAATTGTAATACCAATCGCTGATATTTATTGAGCATTGAC^ 

TGTCG AGGACCCTATGAGCATTAATGTAATCCTCACAT AAAC CCCAAAAGGGACTCTTTATTATTCC CAATTGTCAGGCGAGGTAC 
CTGAG ACT CAT AG AGTTTG AT AC CTTG TCCAGGGT CAT AT AG CCATT AAAGGATG CAAAACTTGAAC CT AAATCTGTGT CATTCT A 
AGCCAGCACTATACATAGTCAAGTAGGTTTTACTGGAGGGTGGAGTGGGGCTG^ 

5 5 AAGATCCATCCAAGGCAGGOTGAAGACTTATACCCIAGAGAAG^ 

AGGATTGGGTGATAGTCCAATAAATACTTTACCACAGGGAACITATGTC 

CCAAATAAAAAAAAGX5TGGCTTCTCACCTCCrCATGATCCC!AGCT 

AAATGTGATTGAAACAAGTATGAGAAATTGATGTTGCTTCAGCTTCT 

CCTGCCTCATGGATCAAAG AGGG ATGGT CTGGAAG AGAAGC CCCCAT AAATATTTGAGG<nTAATG AAGGCT ATCTGTTGAATAAT 

6 0 ACACCTGTATTATTGTAAAAAGTTTTAATTTGGCCTCA 

TAGGAATTAAT AAAAG ATG TCAAGAAAGAC CATTT AAGG ACTT CI^ AG AG ACAGG CCAT AAATGTGGTTTTGAACTTTTTGG CAG C 
TATTATGAAGACGGAAAGAGAAGTGTCATGGGAATCAATTTTCAAAACTTAAAAGTTCT 

GCAG CTCT ACACAGGT ATTGG CTAG C ATT AG TTT AAAGTGGTTG ACATG CAG CCCT ACTGCACAC CT CTTT AT ATTCCCTTTGTGT 
TTAGGATTAGAAAAACTCACATAGTATGATCTATCATATGTCTACCT 

6 5 TCTTXK3TCTTCATTTCTCTTTCCTTCTC 

AACAGAGAAAAGAAAAATAAATACACAATTCCAGCTATTTGCAG 
AGATCTTTCTACATCAGTCTACTCATAATAGAACTCCTGGCTATCriTGGGTTGAGC 
AGATTAAGGCAGGTAAGGGAGAAACACCTGCAGAGGTAGGTTTTGACCCCATTATACCT 
ATATAGAATGTGAAAAGGGTAGAAACTGCTGATTAAAACAAAGTGTT^TGATTATCTGGATTTCAG 

7 0 TCCAGTTATAAAGAGTTGAAAACCTTGCAAGGCACTCTTGTATAATTC 

TCTCTCTCTCTCTCTCTCTCrCTGTCTCTCTCTCTCAGACCAAATATATCATGT 

CCTGTTTGGAAAATAATAATGCCTAC^TTTCACTAGAATT 

ACTAAGGAAGTCTAGCTAAATAGCCTCACTTTGAGGACCAGCCCA 

AAG CAACT AG AAATCTTTT ATT ATTGG CT AT AAT AG CAAGTGG CAT CT CTT ATT ATGTTTGGCAG CTGACATTTCAAATTG CATTT 
75 TGGTTGGTTG TT AT ACATG AAA7 AG TAGGAT ATG AGT AG ACT AAAAAT G AG AA TTtu CACiC L'CAGGG ATG ACATtiAAAT ATCT AAC 
ATCTAGGGGTTCCCATCACAACTCATCAGAA^
TGGAGAACGCTreGGTAAGAGGATGTGAATGTTGTACCTT^ 
AAGTAGAAATGACATTGAGGGAATTTTTAAATTCCCGCAGAACCT 
AGTCTGAGACTTAGCAACTGAAAGACAGGATTTCCTGTCCTTGATAGATAAAA 
5 CTAAACAGTTGTATGTAATGAAAATTTGGGAAArTAAACCCTTTTCATAATTC 
AAGAAAACAGTTTGGCCACAAACAAACCACAGCTGCTTT 

TGGAATTTGGACCAGGGAAGGGAATTCGGCAGAGAGAGGAAGGAAG CAATTGGG AAACAGGGTACCACATT CCTCATCCCCCACAT 
TGGCCTTGACCACAAGGACAGCATCTTCCACATGATAATGCACCTCTCAGGCAT 

CTGG AAACT AAAGAG CTCTGTTCCT AAGT CTG ATCCCAGTT CTGCCACT CACACTGTG ATCTTG AG C AAAACACATTTC AACTCTG 
1 0 GGTTCCCCTAGGAAATGAGGGGGCTGAGTTTCCTCTGGCTGTCAAGTTCrc 

CACATCGCTTTTCTTTGCCATTAAGCCCACATTCCCATCATAACTTTGCAGTA 

TAATTAAAAGAAAGAGAAAAAATGATTTATTCTTATCCCCATACCTTT^ 

TGAGGAAAGGATGAAATTCCAGACAGAAGACAGTCAGTTGCAATAAGTTGTCCT 

TATTTACTGAGTATTTTCTATATGGAGACAGTGCTAAGCCCTTTATATGCATCAAACT 
1 5 ACACATGTATAGGTATGATGCATGTGATTAAAATTGAACATTTG CTATGTACCAGG TGTTATG C CAAATCCG T ACAATAATCCACT 

GAGATCAATAAATGGATTCAGTCAGTCAAGCCTCACAACAAAGGAGGTGTTATCAGTTATCTCCGTT^^ 

AGCCATAGCTAGTAACTAGTGGAGCAGGGATTTGAACCCACGTAGTATGGGCTCAGC^ 

TTCTCCTCACTCATATGACATAACCACACCTG 

TCGGGGATTGCGTCCCAGACCCAGGTTTGCTAATATG CAGCCAGGG CTGGGAACCACTGCCTCAGATG^AATACCAGGGTTATGTT 
2 0 GTTGGAGAGTTG C CTGTAGCT CT ACAG C CAGCAGATGACTGACCTAGG ACTTG CATC CAGGCTG TAGG CTCT CAAGCCiUUVTATCT 

CGACGGCACAATGGGCAACAATTCAGTTGTTATCIGGGCT 

AAGCCGCAAGTTCATTTAGGGCTGTTTAAAGACTGTATGCTACACCAATTATGA^ 
AGCAGTGATTTCCTACTTAAAAGTTTOK3AGGTAAC 

TCATTGAGCAAAATCTTGTACTGACAGAGTCTAGGACAGTGGTTCTAAAAC^ 

2 5 GAGCTACAGATGGCCTGGCTTCCACCCCAGACATT^ 

GAGTGCAGTGGCATGATCTCTGCTCGCCGCAACCTCCGCCTCC^^ 
GATTACAGGCGCTGGCCACTGCACCCGGCTAATTTTTTGTATTTTTACT 
GAACTCCCGACCTCAGGTCATCCGCCTGCCTC 
TATTAGTATGGGATGTGACCTCGGCAATGGGAGTT^ 

3 0 TAAGAACITCGTCTGTGAGTTCACAGATGfrGTTGACCT 

CTGACAGTGTCTCTTGCGTTTCAGCTGGCAGGATTAATTTGAGTTTA^ 
ACTGTCAAATGATGXSTTTCTCCCCAAAGGTTGTTCTTATACTTAG^ 
TGTAAAGTTCTCAGCCTCATTTGACTCTCTTATTTCCACTATAGAATATTTC l ' T ^ 
TCTTGAGACAGTGTCTTGCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCGGCT 

3 5 CAATTCTCCTGCCTC&GCCTCCAGAATAGCTTGGTGTGC 

TTTC^CTGTGCTGCCCAGGCTGGTCTTGAACTCCTGGTCTCAAGCAATCCGCCTG 
GGCGTGAGCCACCACGCCCAACCTAAAAACATTTTTATGATT^ 
TCTTCCACATAATGCATTATAGTAATAATCCACCTATGGTTCTTTTC 
OlGKriGATGTGCCTGTAGTGTTTAACGGCTGATAGCGTCTTACAGATAGATA 

4 0 AGAATTGACTCAGACTTTATCCTCAGAGATTTTAGTAAAGTCAGAT^ 

AT ATT AAAGCAG AAGTGT AAATTTGTTGGCAAGTTTG CAAATG T ACTGG ATT AAT CTAG AAATCATTTTTG CAGT AAGTCTTAAAT 
AGATTITTAAACAGAGATGTTATTTTGTGGATAAAAAGAGACAGGTAATTTACA^ 

GCAAATGG AGTGTTGCAGG CCACTAGGCCAGCTAGCTAGGCTGGTATGAAAAGTATAAACTGGGTGTAG CAGTTAATGAGGAAGGA 
TTGCTGAGGTGGATAGAGCTAGCTATTGCATGAGATTTAATACCAA 

4 5 TCAGGAGTTTCTGTTTTCCTTTATGTGGTTGTTGAAGTCTAGTTGATTTCTAGTCTGTAAT^ 

CAAGGCTGGTAAGATGGGTTAGTTATGCTTCTCTTGAGGATAAAGCTTTTGAAATTAACT 
AATCCTGTCTGTCTACATCTTGCTGCTGTAAT 

GAGCAGGCAGGGGGCTGTCACrGTTGTTTCTCTTTGTTCAGAAATTAG^ 
CTGTTTTCCTCTGATATTTCCrAATTTri'CCCATGA 

5 0 TTTGACAGG AGTGCAGG CTTGTGCACATTTG CTGATCTGGCTGCCCGGGGTCTGAGTGTGGGTGGTAT ACCGCCCGAGGG ATCATG 

CAGACGGGAGAATGCGTGGAGC^GACTCCTGGTAGTTCCAAACACTTTATGTGTA 
AGAAGATGCTGCACTTTGAAAATGTTATTCACAAACTGAA^ 
TCTTAATGGCAAATCAAATGGATGAAGGCCTTATTCAAACTTC^ 
CGCAC^TTCCrTGATGGCCTGTCTTGGATTTATGTCAG 

5 5 TGCATACTTTTTGAGAGTGCAAGATGTTAATTGTAGTAAGAGAAGAGAAATTCAATAACAAATGTTTTTGAATAAAGGA^ 

ATTTAATACTAATGGTGTTTTCTCTAAATCAAGTTAATATATTATTGTTGTATTAAC 
GTGATTGCACCATTGCCCTCCAGCCTGGATGACAGAGTGAGACCCTGCCrCTTAAAAAAAAAA 
GTTTGATTGACAGTAATAGACCAATAGAAGACATTGC1ATTTATATAGAC 
GAATTCTCCCTTATTTrrAAATGTTCTCAGTGGTAACAGTGACGTTGGGACTA 

6 0 TTCACAACTCCATGTTGAGTGAGTGATACrrGGCAGCTTGAAAT 

AAATTGGTGCTCCTCACCCTCAG AGTCTGTTTT^AAACATTTAG CAGCAT ACCACTGCCCAAAATGGAGTTG AGCAGT CAGGGAGT 

CTCCTCTTGAGGCAAGTCATGCAGTGCTTCCACTCAAC 

AGAAAAGTCCTTCATAGCTCCACATTTTTCriGAGCCAAAGCCTTGCT 

GTCCTCCIGG CTCTCACCTCCAG ATGCCTGGTTTC^ CCT ATTTTAGGGCCTTTT 

6 5 CTGTAGGGCAGTGGTTCTGAAACTTTACAGCACATCAGAATCACCTGGA 

ATTTCTGAGCTAGTGACAAGTTTCAGGGTAATGCTGGTGGCCTTAGG 

TGGTTTACCTCTCCCCTCCAGATCTCTGATCCTCrTTTTCTTTCT 

CCTTTCTCCTCCACCACAGTTCTAATTCCTGACTCCAATTTT^ 

GTTACATAGGTAAATGTGTGGCATGGTGGTTTGCTGCACCTATCAACCTATCACCTAGGTATTAAGTCCC^ 
70 TATCCTGATGCTCTCCCTCCCCACCCCCCTCTAACAGG 
CAGCTCCCACGTATAAGTGTGAACATGTGGTGTTTGC 
AGTTTCTTATTTGTGATTATCGTCTGTCTCTCCCCACTAACTCTA 
AAACAGGGTATC^CTCCAATTATCCAGACTGGAGTGCAGTGGCATGATCACAGCT 
TGGGTGATTCTCCCCCATCAGCCTCCCGAGTAGCTGGGACTACAGGCATGCACCACCA 

7 5 GAGAGTTTTGCCATGTTGCCCAAGCTGGTCTTGA^ 
ACAGGTGTGAGCCAGCATGCCCAACCTTGTTTGTTGTTGTTGTT^
CAGTGCCTCGTGGCAGGTGCTCAGTAAATTGTTGAATGAATTAATGGATGAACA^ 

TGGT CC CAG CCTTTT AAATTCT AACTCACCAG AG AATCTGAAAACACTTTCT^ CAGTTTTGTTTCT CAGCTTCACTGCTCTT AGGA 
TAATGAGG AATGGTTTCTGC CCTGGG CTGAAG TTTGGGAATGACTTG ATGAGC CTT AAAAT AG CAGGTGCACCAAATTGTTGTTCT 
5 TCCATTTACATTTTAACCTCAAAGTCCCTAAGCCAGTGATCTTCAAACTC^ 
TCTGCAAACATCTAATTTGCATTTCATTTTAATATAT 

CTACCTACCAAATTTTAACACATTAGGACCACCAGCACATTGAATTGGACTGC 

TTGG CTT ACATATTTG AACTT CTT ATAAAT AAATGAAT CATTG ATT AGGAACTT C CATCC CAGTATTTTTCATATTCCAG CCTGAT 
CATAAATTGAACAGGCAAATTCCZTCTTGTCACACATAT^ 

ATCATGTCTACCCACTCTACATTTTGGATACTC^ 

ATCTTGACTGCTGCATTTTCTCCAAGCTTTTAGATTAGTT^ 

ATAATTTCCAATACAACTGTCCCITACAATTCAAATCT 

TGCAAAGGAATTTGAACTTGTTATGTTCATGTCAGCTTCGTTT 
1 5 GATTTGTCTGAAAATTTGAGTTTTGAGAGCAGAGATTG^ 

GAGTGCAACCAAATACAGACACAGAAGTAAATCATATTCTG^ATCA 

CTGTAATCCCAGC!ATTTTGGGAGCTCAGGGCAGGTGGATTGCTTC 

CCrTATCTCTACTTAAAAAAACAAAAAAAAAATTAGCTGGGTGTAGTCGTGTGCACCTG 

GAGAGAATCACCTGAGCCCAGGAGGTCGAGGCTGCAGTGAGCTGAGATCA^ 
2 0 CCCTGTCTCAAAAAAGTCAGCTGATAGAAGCCTTACTATCTGAAAGCC 

AAACAGTTCTAAGTTATACATTATGGAACATTTTGTAGGCCT 

TTGTTTTGTCTCCTACACTGTATTTGAGCTCTTTGAAAGTAAGA^ 

ACATGTGGTGGG TACTCAG CAGA.TGTTTATT AAGTGTTGGAAGT AAATG CT CGGTGC CACCAAGTG AAAAT AGCACC CAGG CAAAA 
GTTTTCTCAGCAAGGCAATTTGCTTCTATAGAAGGGTGCGTCT 

2 5 AAGTGGGTCTTATCCTTAACGCAGCTAGTCCCTACTG<^ 

TTCCX3ATTGGCTGTTTTAAAGAGAGCAGGGGTGCGCGCCAGAGTGGCGGGTTGAGTAGTTTCC^CAC<5AAGGACT 

GGTG ACAGAGGATGACT AAGG ACAGTG CAGGTGACT AAGGATGACT AAGGACAG AG CAGTTG ACT AAGGACAGAG CAGGTG ACTAA 

GGATGLACTAAGAGCAGATGATAGAGGCTAAGAGGGGGTTGTTTACTGAAACTAG 

AAACTTTAAAATGGAGAACAAAGAACAGGAGAGCTGAACATGCTC 

3 0 CTACATAAGTGACACTGCATTCTACTGCAGAAC^AAGCAA^ 

GCTTTCATTCTAAGCACAAATTTCAGCATGAAAAAAATGTAATC^ 
AAAGTTTTCTGAATTCCTTTTACAAGTATCTTCTGACTCTGA 
TGTCCCTTGCATGGACTGTGGAAGCCACCrOUSAAGCCrTCrC 
CATGAAATAAAACCAAGCATATCCTTTGGTCTGCACTGCCTGCAAA 

3 5 AGCCACAGGTGGCTGTTGAGCTCTTAAAATGTGGCITGTCCAAATTGAGAT 

GACrCAGTATGAACAAAATTTGTAAAATAGCACATTAATACTTTTTATAT^ 
TTAAGTAAATTTCTCTCTTTTTATTTGTTTAATGTGGTTATTAAAAAGT^ 
GGCCAGCACTGGTCTAAGCTATATCATGACATTTAGAGTTTCTTATAACCT^ 

CTTC CTGTC CCTTG C AG AATTT CT CAT AACCC CCAGACTCAC CAAC C CTGTTT AT ACTT ACGGGTTTT AATTTTTTTCACATT CAG 

4 0 TTTCTTCTTTCTGAAATGTCCCTCCTTTTTTAGGTGTCAAAAATCCTGCTCACACTTTTAGGA 

GTGATGTC^CGCCCTATTTCCTCAAGCAGAATTAGCCACTCCTTCCCATATAGTGTCT 
TGCTGTTCTGCCATTAGGTGTO}TCTTAAGTAGACTCCTCAAGGGAAAGGC^ 
CTCGTGTGTAGTAAGCACTTAATGGATGTTTATTACATTAACTTATGAACTA^ 
ATACATCAGACCTTGATCATCTTAACCATTGGCCATGGAAAAAATTAGATGC^ 

CTCAATTTCAC^TTTTGTGTTCAATAAAATGGCCTCCTTTTTC

AAAAATCCATGAATATTGTACAGACTAAAAGTATAAAAGGACAGTAT^ 
ATATAAGATTTTGTTTCATTCAAGATTTCACTTCAGAACAAGT 
ATACGAAAGGTTTCATTTTTATTTTGATGAGATGGTTTC 
CACAAAATTGGGAAGCTTTT'CAAAAATACCTCCCCACT 

5 0 TATTTATGAAAAGCACAAATACCTGTAATAAAATGATTCTGCAGTGCTTAGAAAACCAGGTAGTTAACTATGAAAATACT 

TTAAATTGCTAATTTTTCTTTCTTrTTATTAACTATGCTC 
TTATGX3TTTTGAGACCTCAAACAGCATTTAATAGACGC 

TTTTCrGTCACTCTCTAAAAACCTAC^TGCAGCAG AATTTTAATGT CTATTTCAG
ATCATGGAAATGAGGACTGACAAGGGCTGTGGGTTTAAGAAGTTTACATAGGTAGTG^ 

5 5 AAACAGTCCTCCCCTGAO^AATCCTCTGTAGCTCTGCCACATTGATA 

TTTCTTTTGTTCTTGATATGTTAGAAACCAGCCCTTATTTTTTTCAGTGTGT 

AAATGGGCT GCT AATGCAG AC CTT ATTCTT AGGGGGACAC^ AAG AAAAG TGAC CT AATTTCTAT AT AT CAAAT ACAT C TTC CTTCA 
TACATTTAAAAATGT ATAGAGTTATAAAGGAC CTTAAATCCTGTAGCGG AGAAACT CTGGGGT AAATACTTTGGAAACATTGTGAA 
AAAATTCTATAAATTCTGATAGTAGGTTCCATTTCATTGCTGTGTACTATTAAAAAGTGCAGATG 

6 0 GGTTGTCAATTTGTTACAAGCGGCTGCATGTTCTTATT CTTCTGAAGTGTCATCTCTTTCAATGTGGG CTGTTT AGGCGGAAAAAA 

AAAAACTCAGAAAAATACACACTTGGGATCAAACCCTTACT 

TGTGTGTGTATGTGTTTTAAAGCCCCACTCCCAGAATATGAAGTGAGTATGAAGGGC^ 
TGTACCTGTTTTTAACATGCTGGATCAAGTGTCAGCACTCAGTA 

6 5 TGGTAAATGGCCCTCTGGCGTGTTTACAATACAGCTGTGCATT CTTAATGGCTCTG CCGCTGCCATGTCAGGT AAAGTTTTCTTTG 

TACGTGATGAATGGCAGAGAATTCTITAGCAGAAGGAACCAAAGGAGCTGGGAGG 
GAGAAAAAGAAACACACAGCTCCAGAGGGACATAGCCTGTGCTTTGCAGTTGCITG 
CACCCGCTTGGCTATCTGAGGAGCauUVCATATGGTTAATGGTCGGGAGAAA 
ATATTACAGATTGTTACAGTGTGTCAITACATTGCCAACCT^ 

7 0 AAGACTTTTAGCTCTTTGCTGGCTCrAAACAGAGAAGTGACATCATAGGGA 

GATAATTTTGTAGTGGAAACAACAATTTTTTTTCTTTTTTTTAO 
TTAATACTAGACTGGTTCTCTCACTCTCTCAATCTCTCTTGCGTGCACTCT 
ACACACACACACACGGAAAGGAAACCGAAATTCAC^^ 
TGGAAGCCACATCACTGACTCAGAGGCCCTGCATTTGAAGTTTCT 
75 GGAACTGCTGAGCCCATTCATCTGCrrTTCACGGCCCOT 
TCGAGCCAGAGAAAGGGCAGAGCTCCAGGTCCTTTC

CTTCAGG CAG ATGGATTTGTTGAGTCATGTGAGAT CATTT ACACAAAAACACAGTG AGG CAG TGGAAATT AAAAAAAAAAAAAGAA 

AGAAAAGAAAAAG AGGAGGG TGGGGGAAG AGG AAAGAGGGGAAG CCAGGGGAATGTG CGCTGCCCTGCTGCTG AGGAGCATAGGAA 

TTTTGCAGTGAGGCAGCTAGACTGAGGAATGCCTAGAGTCATTTGTGTATAAGGT 

GCGAGGACGGCGCCTCCAGCAGCTTGCCTCTGTTTAAAAGAAGAAGGTGTTGG 

CCAAACAAAGACCACACTGTTGACAGCGTTGGGTTCCAGTAGCGGTAAACTACCA^ 

TGTGATGTCATACAAGTTTGCTTTGTCCTGACGCCAACATGACACTGAGGCAGCAGAGGAGGGA^ 

CT CTT AGTTAATGGTTTCC AGAGATT ATTTAGCAGG ACTTG AACAGAGGGAGGA^ TGAAGG ATTG CTG CG AGGTG T ACC 

ATAATTGCGGTAGGTAAAACAAAGTCATTACTATGGTTTTCTTGCTCAAGAAGGAT^ 

TATTATTTTTCCCTACCACAGTATTTATTTTCCTTTAGCTTCTGGGGC^ 

GT ATT AATGTT AGTGAAT CAATTTTGTTTACTGTTT AT AC CACAAG CCTTT AAAACATTCCATT AAAAAATTGTCT C CCG CTCATT 
CATGAAATT C I T ITTTTCCTTCAAATTTGCTTACATG^ 

AGAATGCATAAAAGAATAGTGAGTAGATTATCTAAATGTACATTCTTACAACAAATGTAATGAAA^ 

G AATATTTGAAAACAGAAAGAAAAACAGAAATGGGGAGCATTTCAAAAT AGTCACCCTG AT^ CTTTGTTATGATG CATTTCTTC 
AATTCTGCATTTATTTAGAACCCTATTGTGCAGCTATTTCTTTTAGCCACCCTAGATAGAATGTAGGTCAA 

TCAT ATTTTGTTGG T ATTTT AAAAGTTG AAACATTTTG CAGTITTTTC CAG AGCT AG AT ATT CTT AGGTTATTT AGACGGTTTT AG 
TGACCATAGAACTTCACGTATAAACTCCTTCTGTTAATAGCT 

TACTCTGTGTAAGTTACAGCTTTTACTTCGCT 

CCCAAGTTGCATGGTGCCACArTATTATAATGTGATTATGAGTTGGCATGATTT^ 

CCGTCAGGATTTCCCAATCTCT^GCACTATTGACATTTTGGACCAG 

TTCAGTAGCATCATTGGTCTCACCCACrAGATGCCZAGTAGCACCCT 

AGATGTCCCCTGGGAGACAAAACCACTCTGGTT CAGAACTTCTG CTCCACATTTCTT ACTCTGGTTT AGAAGCTCACGCTTTACGT 

CATGAGCTGGCTGCTCAGTTACTTCTAAGATTGAATTAAAGGATATGAAATAAT 

TTTTGAAGGTGCAGGCAAGAGTACTTTTGAGTATGGTTATGCA.GAACAACCTCTCT 

CCTCAGGACTCCCITACAGTGAGTTTCCCAGTGCTGTTTAAGAGGAAA 

AGAAAGGGAACTTCCCCTCAGTCACCTGAGCAGACAGCCrAGGCC^ 

AGGCAGCAGCCACAGTTCTTTCTTGGGAGCACAGCTGACCTC CTCCCCATTCCAG CGGCTGTGGTCTCAGAAGCTGAAACTCTCGT 

TAATAAGGAATGTGGAGCTTTCCACAGCTAAGGCAAGCATAAATTTCCT 

TGAGATCAGAGAGGACCCCAGGGTCA.GCTGCTGGAACTGCTTTCTGTGGCT 

ACTCCCTGCAGACCTCCCTGATGTGTTAAGGTGGTTGTGCCCTTGTTGCATT^ 

CATCAGTGTGCAGTCATCCAGTTCTAAATTATGGA 

TGTGGAAAATGCAGCCTTGGGAAGGAAJGGTGACCTGTTTCCTTGGG^ 

TCTC^GGCCTGGGCTACCCGTAAGCACTTTCTACCCCCAATGGCACTCTGTAATGA 

TAGCTTAGGCTACAAAGAACATAATTCTAGTTGTAGCCTAAATGACAGCTAG 

TTGCATTTAGGGAACCCAATTCTGGCAGAATGGTATACTGAGCTCATGCAGC 

CTAAGCCTGGCCTACTTCTGACCACCCACCCCAAAAAGTCCT 

AAAGCAGGACAGGGAAATTCTTAGGTGCCAGAAGACATTGATGCCACTGGCCT 

CTCTAGGGGCTATGGTGTCTATACTCACACAGGAATTGCTTGTGAAAGG 

TACCTGCATGGTATAACTTCAACTGTTTTGGCAGAAACTGGTTTGAGA 
AGATCCCTGTGGAGAGGCTTAACAACCTGGGACATGGTACrCCCT 
ACAGAATATGGCCAGATACAGTGGCTCATGCCTGTAATCCC^GCAATTT 
TTTGAAACCAGCCTATGCAACATAGCAAAACCCTGTCTCTACAAA^ 

TACTTGGGAAGCTGAGGTGGGAGAATCACTTGAGCCTGGGAGGTTGAGGCTTCAGTGAGCCAAGATGGC^ 
TGGGCGACAGAGTGAGACCCTGTCTCAGAAAAAAAAAAAAAAAAAAA^ 
TTAACAGTGAACCTTCAGGCCAACTAGAGTTTTCTTCTGCAGATTAATGAAAGTTT 
TTCTGGC1TGCTAAGCAGTTTCAGTTGTAAGCACTAG 

GAGAATATTTTAGTTCCTTAATTAGAAACITTTTATTGGAGAGATAGCTGGCCCC 
AGTGGATCACTCTGAAGTAACCCATCAGGCTGTCTGTTATTTCAGCT^ 
CAGCAACCCCCTCCCCCCGCCCCGCCAAGGGAGAGGGAAGTTGAAATTATTTGGAGGGGT 
AGAAACCCAGGTAAATAAGACATGGTTAGTAAACATTTGTTCTAGATGA 

GAT CAAGAG TTGC CTGG AAT AG CCACTGTGT ACTCC CTTTG CAG TG ACGTG CACT ATTGG CAAACATG AGTTG AGAATGCT AAACT 
TAATGCATGATTCCTTAATTTATTAATAGGACAATGAAATCTCTTTTGTTTAGTTT^ 

AC CAGTAGT ACCCAAATAGTGCCTTGCTCACCTAAAAGTGTG CCATG C1TCTTAAGAAAAAACITCAGTATTTATTCTTAATTTGG 
ACTT CTATAAAGTTTAATTTCAAAATATTCTGACCAAG CGAG AAGT AGATATGAATATTGACATTAAAGCATCAGAAGAGG AGAGA 
ATG AAAGGAAATAGGTTCAACTGGTGCCAGACTTACAGGAGATCTTC CCAAAC^TT AT AATTAAATTTGGCAT ATATTGAGTG CCT 
ACTGTATGTATGTAAGGCAGCGGGATAGGTGCTGGGTATTTGGGTACTAGTACAAAATAG 
AAAACAATCCTCAAAAGAATCTCCAGAGATATAAAGAATTTCCC 
TAACTTGCCAAAAATCACATTCCGAGAGGTGGAGTTGATATTCAGA 

GGCTTTCTGTGGGCAGTGAAGCCCTAGCAGTACAACCCAGAGGGGCCAAGAGGCAGGTAGAG^ 
GGAACAAAGAATCCCCTGACTGACATGGAAGGTGTGCAGTAGAAGAGGCAAACT 
AAAGTATTAGCATGAGTTTCCAACATTAAAACAGCAAATTATTTTACACAG 
TGGCACATGGGGCTTGTTCCTCCTTTGCTGGGGCTAGAGCT 

AC CTG AC CTTT C ACTCAT CT CATT CTTTCTTG AC CCTTG CAG TCATCTGAATTTGTGAC CCCTGG TTT AAAT AG T CACTT AGGCAA 

G<^AAGTGGGCCAGGCCAAGTAACCTCTTGGGGG CTCCT CTGTGTTGTGCCTTTAAATTGAGATAGTCCT AAATCACTGGAG ACTG 

CGATTTACATTTTACAGTGATAGCCCTGAATCTAAAGATTAGATATCCTGA 

TGTGTGCCAGAAGTGGGTGTATGAAAGTGATGGACCCTCTACCACTTGTCACT^ 

TGGCTCTCAGACTATTTGTGTGCGTGTGTTACCAACAATGTGTCTCTCTTTTATAATTTA 

TACCATTTCCTATTGAACCATAACTCATATATTAGTTAAAATTTCAGAACATTCATTTTCAAATTACCT 



CGCACAGGGGAGCAAAGCAGAATGGCAAACGTCTGGTCTCTTGGGGC^ 

TGGCGAGGACACTGAAGTCATGGAGAACCACAGCCACACATCT 

GGCAAGCTCCACTCTGTTCCTCTTGCCTC1TCTTTAGGAAGCAGG 

GGGAAGGGGGCAGGAGTCTAGrrrC"rri\iCC~ri"TGGAT C CTT CAAT C CCAGCAC C ATGC C C CCT ATGGT AGAAGGGAGGG ATT AG C 




JTTTTCCATGGTATGCTGTGAAGGCAG 
TTTCTGTAATAAAATGACCTTATATTTTTTCAGTTGAAAGTCTCCAGTTGAAGATT

TACTCCACCCCTTAAAATAGAGTAGAGGAAATTTGGGATATTGCTCTCACCTG 

TGCAAGAGCATTTTTT T CTAGACTTCAGA 

ATAGGCCCTCATGCCCCAGCCCrCTGCCTAAGCTGCTGGGCGTGGT^ 
5 CCTTCATGGTACGITATGTCTAAAAGCTAGTGAGACCTTTAACACTGCGA^ 

CCACAGGAGTGCCTGAGCTACCCAGCTTTGACATATATTACTGAAGGCTTCT 
GTGGGGAGAAAAGCCCAACTCACCATATTCCATTCAGTATTTG^ 
ACACA(^CACACACACACACACACAC7VC^CACACAGCCCTTC 
TTATGTTATTCTCAGCATACCTCTATACCACATTAA 
1 0 GTAAATAAATAGTGGTG AATCAGGTACGCCTGGTGTTTTCTCCAACTCXCAT^ CTGACACC CAAGACATCAGAGTAT AACCCTC 

AGAGAAGCCATAGGATGTGGAATTGTCACCAGGAGCCCAAATCTCTGGGTCTCTTC 
TTGGCCATGTTACCAAACCTATCTCTGAGTGTCrrTTGT^ 

TCITCTCATTTCTCT C 1TCTTCT T AG ACTTACT TT T 1 TT 1 lTrrrrr , I , 'l , lTCAGATGGAGTCTCCCTCTG 

TCACCCAGGCTACAGTGCCTCTGCCTCCTGGTTCAGGCTATTCTCCTGCCT 
1 5 CACCATGCCCAGCTAATTTTTGTATTTTTA^ 

CACCCGTCTCAGCCTCACAAAGTGCTGGGATTCTGTAATTCTGGACCACCTTT 

CAAATTATGTTTCATTAACCTTTTAGAAGTCTAAGATTGACCATTGAAC 

TTATGGGATTAACTTTCTCAAGGCACCCTTTAAAGACGTCTGTAGTTTTCCTT^ 

ACAAT AAAAAG AAACATTGAAT AAT AAACT AAAAAAGAAAACATT CTTG ATTC CAC CATGGTAACTCAGG T AT CTTTTTC CCTAAT 

2 0 GCITT ATATTTAAGATGCAGTCTC C T riT TTAAGCTGTAG ATTTTGCTAAG CTTCTTCAT AAACTG AATTGGCTC CAAAACJyACCA 

GGTTCCCrCGATTTGAAAGGAGATGGAAAGTACTTGATAGAACAAGGCCAG^ 

GTTT CT AAAATTGTCAT CTGTT CTCTG AGGCGTTTT CTCTT AAATG GTGGTCTTGGTGTG TG CATGTGCGTGTGTGTGTG TGTGTG 
TGTGTGTGTGTGTGTGTGTGTTOTTTTGGGGTGTGTGTTGTGTTTATTTTTT 
GTCTTGAAATGAAGTCACTTCTAGTCCAGGGTCTA 
GTTTCCAAATTAGGAAAA

HUMAN SEQUENCE - mRNA 

CTCCCCCTTCCCGGCGCACTCGGGGGGCTGGGAACGAGCTGCCATGTGATGCGCGT 

CTGCCCACCTCGCCGGCTCCCGGGAGGGGGCTGCGAGCCGGGAAGACGCGGGGAAGAG

GC G AGCGCATTCATT CACAT AG CTCC CAG TTTT AACATTT CGC CAC CT ACTG AAG AC AT CATTTGGGACCAAG CTGATGAGCCCTT 

GAAGCACAGCAGATAAGAGTGTTGCTGTTGATCATCTTTGCCTGGGAAGTTGAATGATAAA 

CATTTGCAGCTGTCTGCTTCAGAAAGTGAGGGCrCC^^ 

CCCTTGCTCCC!ATCCCTGCAGCATGAACAAGGTGGAC7VCTCAC^ 

3 5 CTGAATGGATTGCCAGAGCCTTCTCATCTCTCCCTT03CCCAGTTCC 

AATTACATGGTGTGAACGGCATGTCTGTGGATGAGAAGCCTGACrCCCCCATGTATC 
ATCCTCCTC^KX:CTCAATGACCAGCC^AAAAAGGATATTCTCTGTGACGTGACTTTC 
CCGGGCTGTGCTGGCCGC^TGCAGTGAATATTTTTGGCAGGCGCT 
AGGAGGTCACAGCCAGGGGCTTTC-GGCCGCTGTTACAGTTTGC 

4 0 GTCATCCGCTGIGCTGAGTTCCTGCGCATGCACAACCTGGAGGACrCCT 

GGATGGCCTGTTTGTGTGCCGGAAGGATGCTGCGTGCCAGCGCCC^ 
AAGAGGAGGAGACXaATGGATTCAGAGACGGCCAAGATGGCTTC 

GCCGCCATCCCCGTAGCAGAGAAGGAAGAAGCCCTTGCTGCCCGAGCCTGACGTGCCCACAGAC^ 
CGCXnTAACGCIAGTACCCCAGATACAAGAAATACCAGCITGCATGTACCAA 

4 5 GTTTTGCAAGCACATTCCGGGAAGATAACTCTAGCAACAGCCTCAAGC 

AGTGAAGAGAATGAGGAAGAGAGCATCACGCTCTGCCTGTCTGGAGATGAGCCT^ 
GGACCC^AAACAGCCCAGCCCTGCCCCTACCCCCACGGCCCCAGCTC-GC^CCGCCTGCCTC^ 

C CrrCCTGCTTAAGGTCTCTGTTCAG C AT AACGAAAAGTGTGGAGCTGTCTGG CCTGCCCAGTACATCTCAGCIAGCACrTTGCCAGG 
AGTCCAGCCTGCCCITTTGACAAGGGGATCACTCAGG^ 

5 0 CCACGTGGGCCAGJUVGGAGGTGTCCAACTTCACCATGGGGTCGCCCCTCAG^ 

GAGAGCTGGACCGGAGGAGCGTCATCnTCTCCTCCAGCGCTTGTGACCAAGTGAG 
AGTTTGGACAAAGACCTCTCTGAGCCGGTGCCAAAGGGTCTGTGGGTGGG^ 

C CA CXXjTC^GCTGATGG c CG ACCACTTG CCAGG aagg ATG cggc CCAACAC CAGCTG c c CGGTG ccaat CAAAGT CTG CCCTC GCT 
CACCCCCCTTGGAGACCAC^ACCAC^ACTTCCAGCTCCTC 

5 5 AGCCTCCCTCTCTGTGAGTTCTCCTCCTCGCCCTGTTCCCAGGGAGCCAGATTCCTTGCCAC^ 

GGGAGATGGAATGTACAACCAAGTGCGGCCCCAAATTAAATGTGAGCAGTCTTATG^ 

TCTCGGAAGCAGAC^GTGAGTCGTGTCCTGTGCAGGACAGGGGCCAGGAGGTA^ 

CTTCCAAGGAACGATTTCCAGATGATGATTAAAATGCACA 

CAG CAAG AAC CGCAT CGCGG CCCAGCGCTG CCGCAAAAGGAAACTGG ACT G T ATTCAGAATTT AGAATGTG AAATCCGCAAATTGG 

6 0 TGTGTGAGAAAGAGAAACTGTTGTCAGAGAGGAATCAACTGAAAGCATC 

CAGGAAGTTTGCCGAGACATCCAGAGCCCCGAGCAGATCCAGGCCCTGGATCGGTATT^ 
CACGGCCTCCAGTATTAACCCTGCGCCCTTGGGTGCTGAGCAGA^ 
GCTGCTTGGAGCCAGGCGCGG CTCC CCCCGGACCCCCCTGGG CAC C CAG CAACACCT 
GAAGGCACTGACCCGGGAACCTTCTC^GAGAGAGGACCTCCTCTTG 

6 5 AATGACTGATAAGTGTACAACTGACGAACAGC CCAGGAAAGATT AT ACCTAGTG ACTCGGCTCTGCCT CCCAGTCCGCIACACCTCT 

CCCATCCAGGCGTTCTTCAGTCAGCCTGTKX3CACTGTT 
CGGTCTTAGCAGCAATACTGTTCCGAAGTATCCTCTCCTCTTCTC^^ 
GCCCAGGCAAOIAAAGACAGCAGTGATGACACTCTCTGTCTGTGGCTCAATTT 
C^CCCCAACCACGGATTTTTTTTCTCAGTGGCCCATGTCACAA^ 

7 0 TGAAGACAGCITCTATACACArCAAAGTTrT^ 

TATATATCCATATGCAAAAGTCCTGCATGCCTCAACTTTCTCATCCT 
ACATTCCTCTTCrrTTTGTCTCTGATGCTAGAACTAGTTTGGTAACT 
AATTCGTACTTATTTATGGACAAAATTCAGTGTTGGAAGCTTTTTCCCA^ 
TTTGGCACCTCCAAGTGGTGTCATTTGAGCATTGTA 
75 TTGTTirCCTrGCAGATACTGTACAGTAATGGTCAACl^ 
TTGTAACTTCGGTATAACrCAAGTATACTGTATATTCTTTGCTTTT

CAGGTGAT AATT ATGT AGGAAAAACAAT CTTGC CAAAT AATG AATTCATC CT AGGATGTGT AGACAAT AAT CTG CTTGAAT ATTTT 
T AT ATTTCACCT CCT CC C CACCTTTC C CTAAGCAAAGTTT AAACG CAGAT AGAGAGTT CAGAGTTGATGCTGG ATGTTCAGATTCC 
TAAGTGGGGAGAGAGTTTGGACATCTCACTCAAAAGTACATCAGAAAAACAG 



GAGATCTGGCCTGAGGGGATTTGAATCACAGCCCCGAAGGTCCTGCCTTCACCC 
CGATCCATCCCTGACACAGGCTGACAC^TTCTre 

TGTCTrGTCTGAAGGCAGACTTCATTCTGAGGCTTTGGACAAGCTATTCACCGGGAACCCT 

1 0 CATACCCTACCCTCACCK3ATGATAATTTTCTCTT 

CTCTGGGGGCCGTGTTGGCCCCAGCCACGTTTACAGGAAGGTGTGCCCCAGAGGAGGAGGAA 

TGGGTCCGGGTCTCACTGAGCAGCCCGAGGGCCACTCCAGCCCGGCTG 

TGGTGGGCAGTGGGGAATAGATGGTTGACTTTGTTTCTTTATT^ 

AAATT AGT AT ATG ATGT AGGCT AAAAGTGAAAT CAT AG T AACAT AT GTTTT AGT AT^ ATTAACTTTTTTCTG T ACAAAT ATT AGCA 
1 5 CTAAATGTTTAAATATGTATGAATGCCAGAAATTTGTCAGTTCATGCA 
TTTTTAAACAGTTCCACTTTTAAAACCTGCCTCTGGGTT 

GGCCTACTAGGACAGTTCTGTGCTTCACCATGAGGTTTGCCT^^ 
GAATCACGTGATACTTCGGCAGGCGTGTGTTACAGTGGAGTCAGCTGACAGTATTTTC 
2 0 GTGACCTCTCCTCTTCTTTTAAAAAAAGAACACTTTCTGCTCAT 

CCTGGTAACAACTCATGTGCAACTGGTAGTCTTGACCACATTCCATCCATTTCCTCAGGTT^ 
TTTGGCAGCCATTTCTAGCAGGGGCGGGGCCTCTTTATTTCTCTCCACCCTA^ 
CTTGCTTTTCTTCCTCTTCCCCCAACCTAACTTCTGCCATGGGAA 
TAGAGATTTCTCACAAGTTATTTCATTCATAATCCACCIATGAACAGTGACTAGCTT 

2 5 CTTITCCTATTCAGAACTATGTGCTTGTCAAAATTATTTCTGGGTTGATTCAAA 

GCCTCAAGTGGAATTTTAAAACCTAGCCTGTCTCTTTTCCCTGGGA 
GAGTGAACAGCAAGGAAGAGTGTTTGCACAGTACAGTAACATTTGGTTGTTCTTAAGGCT 
CCAACCACGGGCTGTTTAGGJVGGATGCCTGCTTGGGTCTCC^ 
TGAGTGACCCrGAGACTAATGAACATCCCACCTAAATCCAGTC 

TAAAATCGAACTTTATTACATACTGATTCTGGAACAAAACACT 
ACGAGGCCATCCTGCTATTTATCTTCTGAGTTCCCAGCAATGACT^ 
TGTGTCTGCAACTGAATGTTGTGCGAGTAATTTATTAACTGTCTTTCTAAAGGTTTGCT 
TGTAATCCITACATTGCTTTTCCAAGGAAGGGAACAAAAGTCTAGTGATTA 

3 5 AGGACCAGCGACAAGACTCATGAGAGGACTGGCTAAAGTGAAGTGTGCACAGTGTGAA^ 

C C CACATTTT CTCTTTTAAT ATTTTATG ATTGC CAT CAAAG AAG AAG AAAAAG AAGGAAC AG ACAAAGG*ri"IXJAAAATG AT AAG CC 

TGTT*AAGACACCAAAAACTCCTGTCCCX3TGAAGCTGCTTGA 

CTGCAAAGCTTTCTCAAGTCCCTATTTGGCTACCTACTTCT 

TCAAACTGCTTCACTATTCCrrACAGTTTAGGAGGAGTCTC^ 

4 0 AAATAGACTAAAGAACAAGGTG'rriTTTGTTTTTGTTTTTGTTTTC 

TTCCCCAAACTGAAGACTGTGTTGGAAGTGCAATTTCTGGTGAGTC 
CAGGAAATCTGTGTGCACGAGGCATTGTGTGTTGAAAGTGTATGTTTATAGTACTGC 
AACCGATGCTGTAGAAGAGAACATATCTGTCACAAATAGGTGTC 
TTGAGCATAAATTATTCAACTGACGGTGCAAAAACATTACTTGCAAAGAAAAGT^ 

4 5 ATGATATTTCTATCAAATGAATATCAGTGCATTTTAAATGTAATATGAAAACGATGCTGCC^ 

TGCAGAGGCCAACTTTCATAGCTTTGATTTAATGTTGTGACACGGTGTATGCATTTT 
CTTTCATTCTCATTCCAGTTTATTGACCTCAGATAAAACACT^ 
AGGTAGACTCACATTCAGTGCCAAGTGCTCCCZATGGGAATAATCIAGACGCA 
GGGGTTTTCCTACAGATGGATGCTCAGTCTTCTACC!AAAACATGTTTG 

5 0 TATTTTGTGTGTGCAAATCCTGGGATGCCCGTTTCACG CTCT 

ATATGCATTTGGGAGCTTCAGGTGCTTGTTAGACACAGTGAGCCATTCAAGGCAA^ 

CTG TG ACAATTT G AC AATTTGTT C CAG AAC CAGT CTG ATG CAAG TG C AC CT CTAAT ATATG CCTT ACAAACT CCAG AGG C CAT ATT 

CAAAACAGGGTCTTCTCAGTGTATGCAAGGGGCTGCAGCCCCTCTTCTCTTCCTCCCC^ 

CATATCTACCTGTATAACCCTCTGTACCTOTCATAACTGGTCAACGACTGTAACAGGTTACATCA 

5 5 TACACAGATTCTATGCGATTAATGTAATTTAATTCAATGCATCATTTTAT 

AGTGATTGTGGTTTTTCTCGTGGTTTTTATTGTAAAAAATGAAA 
TGGGGGAAAAAAGCATTGCTATGAACTAATGGAATTGGAACTTCATTTACTCAT^ 
ACTCAATTGAATTGCTAGTGTTAACTGAATTTTGTCTAGACACCATTTCTGTTGATGAAAT 
ACTG 


HUMAN SEQUENCE - CODING 

ATGAGAAGCCTGACTCCCCCATGTATGTGTATGAGTCCACAGT 

AAGGATATTCrcrGTGACGTGACTTTGATCGTGGAGAGGAAGGAGTTCCG 

TTTTTGGCAGGCGCTGGTTGGACAGACAAAAAATGATTTGGTGGTCAG 

tGTTACAGTTTGCCTACACTGCGAAGCTGTT^

CACAACCTGGAGGACTCCTGCTTCAGCTTCCTGCAGACCCAGCTCCTGAACAGTG^ 

TGCGTG CCAGCGCCCACACGAGGACTG CGAG AACTCTGCAGGAGAGG AGGAGGATGAAGAGG AG£ AG ACGATC 

CCAAGATGGCTTGCCCCAGGGACCAGATGCTTCCAGAGCCCATCAGCTTTGAGGCCGCCGCCATCCCCGTA 

GCCCTGCTGCCCGAGCCTGACGTGCCCACAGACACCAAG^IAGAGCTCAGAAAAGGACGCGTTAACGCA^ 

ATACCAGCTTGCATGTACCAAGAATGTCTATAATGCATCATCAC^

CTAGCAACAGCCTCAAGCCGGGGCTTGCCAGGGGGCAGATTAAAAGTGAGCCGCCCAGTGAAG^ 
CTCTGCCTGTCTGGAGATGAGCCTGACGCCAAGGACy^SAGCGGGGGATGTCGAGATGGAC 
CCCCACGGCCCCAGCTGGGGCCGCCTGCCTGGAGAGATCCAGGAGCGTGGCCTCGCCCTCCTGCTTAAGGTC 
CGAAAAGTGTGGAGCTGTCTGGCCrcCCCAGTACATCTCAGCAGCACTTTGCC^ 
ACTCTVGGGTGACCTTAAAACTGACTACACCCCTTTCACAGGGA^
CACCATGGGGTCGCCCCTCAGGGGGCCTGGGTTGGAGGCT
CCTCCAGCGCTTGTGACCAAGTGAGCACCTCGGTGCATTCTTATTCTGGGG 
CCAAAGGGTCTGTGGGTGGGAGCCGGCCAGTCCCTCCCCAG^ 
AGGAAGGATGCGGCCCAACACC^GCTGCCCGGTGCCAATCAAAGTCT 
CCAGCTCCTGCTCTTCCTATTCCTACGCGGAGGACGGGAGCGGGGGCTCAC
CCCrGTTCCCAGGGAGCCAGATTCCTTGCCACAGAACATCAGGAACCAGG^ 
CCAAATTAAATGTGAGCAGTCITATGGAACCAACTCCAGTG^ 
TGCAGGACAGGG<3CCAGGAGGTAAAACTTCCTTTTCCTGTAGATCAAATCACAG 
AAAATGCACAAGCTAACCTCAGAACAGTTAGAGTTTATTCATGATGTCCGACG^ 
CCGCAAAAGGAAACTGGACTGTATTCAGAATTTAGAATGTGAAATCCGCAAATTGGTGTG
GX3AATCAACTGAAAGCATGCATGGGGGAACTGTTGGACAAOT 
GJVSCAGATCCAGGCCCTGCATCGGTATTGCCCTGTCCT 
GGGTGCTGAGCAGAACATTGCGGCCTCCCAATGCGCAGTGGGGGAAA^ 

GACCCCCCTGGGCACCC^GCAACACCTCCGAGAATTGTACCTCTGGGAGGAGACTAGAAGGCACTGACC 
AGAGGACCTCCTCTTGAACCCAGGAGCCAAACAGTGACCGTGGACTTCTGCCAGG
GCCCAGGAAAGATTATACCTAG 
Table 15

MOUSE NOMENCLATURE 
ICSGNM Wnt1
Celera mCG18420 


HUMAN NOMENCLATURE 
HGNC WNT1 
Celera hCG41496 

MOUSE SEQUENCE - GENOMIC

TGAATCCTT<3AGATAACCCAAATGCATG<rrCCACGCTCG 

TGAGCCCAACTTCAG<JATAGCGGGGGGGGGGGGGGGTCCnX5CTT^ 

CATAAACAAAAAGCTACAAAGTGAACCTTTAATTTTT^ 

AAAAAAAAAAAAG AAAGGCCTCCCCAAAACAAAG AAGACC CAG CAG ATG AAAG AAAGTG TTTGCTG ACTGG GG CGTGGTGGCACAC 
CCCTTTAATCCCAGCACTTGGGAGGNNNNNNNNNNNNNN^

GGACCCTTCTAATCTAGCTTGCCTGGTATTGCCAGCACGAGGAGGCAACACCCCT 

TCCCTCTAACCCTGTAAGGGCGGGCAGATCAGTTGTGGGGCCCACAGAGAGGGAGCGGTTGG 

GAACCGGTCTCAAATGGGTATGTGAGTGTCCCAGGACCCTCCGTCGTGTGTGAGCTC 

CCCCAAGCACAGCATGACTGGCCCTGCGCACCCAAGTCACATAGATCCAGAGCCTGACC 
TACCCACAGCCACAACCACCTCTTCATTGGTGGTGGCTAGCAGCAAGGCT

TGCCTTGTGTTGAAGAGGCATCCCTGGCTAGGTGTCTAGTACAAAGGAAGTTAGGTATCT 

TGAGGGTGCTGGAGCCCTAAGCTAGACCACAGCGGGAGGTGCT 

AGAGCTC CTTCCCATTTAGGGGAAGAAGGCCAAACCATGTATAACTTCTGAGATGGGCCAG CACTGTCTCAGTAACTTCTACCCTA 
CTCCAGAACCCAGGATCCTCACCAGTGCCCCATCCTCTCTCCTGCCTGTCAGCCT 

2 5 GCAGATCCTTCATCAGTATCTCGAGGATATTCACGTTTCTCCCGTGGCTGCAGTTCCATTCACACGTT 

ATTTGTCCCTAATGCTATCTTTACCTCCTAGTTCTATCATTC^GTCAGATGTCAAGAAGGGA 

GCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCGCGCGCGCGCCATGCTTTTACT 

AGAGGCAGGTGGATCTCCTTGAATTCAAGGCCAGCCTG^ 

CTCTCTCTAAAACCGATGATGATGATGATGACGACGACAGCGGCAACGACAATAATAATAATA^ 

3 0 GAAGGGGCTAAGGAAGATTATAGCGCCCAGAGGTTCAACTTCI^ 

GAACTCATACCGCrGTTCrcrAACCTCTCTGAAGTCTTGCrCITT 
CGAGGGTGCCCTCTTGCCGACCGTCTGGCCTGAGCACTGACCCACAGGCAGC^ 
CCTGCGGGTGCACACCCATTGCCTAATACCCCAAGCACGCAGCCAGCTCCAAGAAAGAGGAGGGGCA 
CAAGGGTTGAGACCTAGATGAGAGGGGGAAAAAAGGGGGGCTGCOn > 

3 5 CTTTTGATGAGAAAAGTGGAAGGCCCCTCCGGCCCTTGCGGGGATGG 

CGAGGCCCTGAAGGGCGGGGAGTGTCAAATAAAAGAGTGGGGACCTCTGG 

AGCCGCTGATAGGGGCGGCCAGCCCAGGGAACCCAAA1TATAGGCCCTGGAGGGATGGATGC 
ACGCCTCCCTCCACCTCACCGGTCTTCCACTGCAGCACCGAAGGCCACCACCCAG 

4 0 GACTCCAGAGAAGCCCCCACATTCAGTAGTAGCCCCAAGGAGTCTTCTGACTCACCCAAGACAGAATGT^ 

TAGGTTGATGGATGGGTT ACACCCTCCCAGCCCGAGGGGGTAGCGCCTGCCTGGG CTTCAAAGG CTTG CCCAAGAACTCAGAGCTC 

TGCTCCAATGTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATACACACAC^ 

AGCTTTCACCCTCCCCCACCGCCTCCCTCCTAGATCTATTCCAAGGCCTGTGTT^ 

TAACTGT AGCCAAAAGTGACAGCAAGCCGCC CCCTGTC CCAT ATACCTAGAAT AAGG AAGTTGAGGGAGGG AGATCAAGAATTCCA 

4 5 GGAGAGCAGATCTACAGATCAGCATCGAAGAGCTCTTGTTGC 

GCTCACAACCATTCATAACCTTGGTTCCTTGGGAATCAA 
CTACAAAGAAAAACAAATCTAATAAATAATNNNNNNNNNNNN^ 
NNNNNNNNNIJNNNNNNNNNNNNNNNNNN^ 
I^NNNNNNNNNNNNNhlNNNN^ 

5 0 GTAACCCAGGCTAGCTTCAAACTCAATATGCAGTTAAATATGACATTGAACT 

TGTCAGTGAACATGGCTTTATCTTCTCTTTATTTTTACCATACCTATGGTAAAGCTTAGGACCT 
TGTGTGTGCGTGTGTGCGTGTGTGTGTGCATGTACACATACGTGTGTGAGCACATGC^ 
CTCTATTTCTCTTTGCCCTATATATTnTTAGTTTTCAT 
AGTGTGCATCGTGGTGACAGCCATTCTCTCCCCTCTGTCCCC^ 

5 5 GATTGCAITCCACCACCIAGAGATCAAGAAAGGTACATGTGTGAAAGGTCTCA 

erCAAGGCAAACTCrTTTTTTTTT r TTT TTT T I 1 1' r iTT ' rr i Tr TTGGTTTrTCGAGACAGOGTTTCTCrGTATAGC 
CTGGAACTGACTTTGTAGACGAGGCTGGCCTCGAACT 

GCCACCACACCCGGCTTCCAAGGCAAACTCTTATAAGGACAACATTTTTTAAAAGAT^ 
CTGTCCCTGTCTTCAGACGCATCAGAAGAGGGCATCAGATCGCATTACAGATGGTTGTGAGCCACCATGT 

6 0 ACTCAGGACCCCTGGAAGAGCAGTCAGTGCTCTTAACCACGAAGCCATCTCT 

TTACAGGTTC^VGAGGTTCAGTCCATTATCATCAAGGCAGGAACATGTTAGGCATGGTTCA 
TTCCXSAAGGCAAACAGGAGAAGGCTGGCTTCCAGGGAGCrTAGGATGAGGGTCTTAAAGCCCA 
AACAAGGCCACACCCACTCCAACAAGGCCACACCCCGTAATAGCGGTACTCCTTGGACC^ 
CAGGCCTCAGTGATGAGGAATTTCTCAGGGACAAAGTCAGGCGACCCATCTATAGTGCT 

6 5 ACTATGTGGAGAAATTCCTAGCTTACGGACATGTGTAGX3AAGCT 

AGCAGAGTTCTAGGCTAAGAGAGAGCCTGGGAAAGACATTTCCATTAACAATAAAATAAAATAA 
TAAAATAAAATATCTCAGCAGCTAGAGGCATGGCTCATCGGTTAA 

AG CAACCACATGG CAG CTCACAGCTATCT ATAACTCCAGTTCCAAAGGATCCJ^GCACTCTCACACAGATATC CAAAAC 
ACCAATGCACATAAAATGAAAATAAATTTATCATTTTTAAGACTC 

7 0 ACATTCACACCTATTGGGATGACTCAGAAAGTTATGGGACCTGTGACCAAATCT 

ACTAAAAAGAGAGAATCAATTCTTGTAAGTTGTTCTCTGATTATGCATGCT 
TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCACACACACACA 

ATGTTTAAATTAAGGGCTGGAG AGATGATTCAG CTGTTGAG AGAGCACTG ACTGCTCTTCC^GGGTCCTAAGTTCAATTCCCAG CA 
ACCACATGGTGGC7CACAACCATCTATAGTGGGATCCGATGCCCTCTTCTGG 
AATAAATCTTTAAACATTTTAAATTAAA^ 

CTTGGCGGGAAAGGCTGATCCATTCTGAGAGTCAAGCTCTT^ 

GCAGACGAGATGGAAACAAAAGGTGCOV^^ 

ACCCCAGCCTGCTCTGCTGCATATCAGGCAGCCACTGAGCCACTGAC^ 
5 GCTGTCAGX3TCTGGGGGAGGGAACATGCCCCTAATCTCCACTGGAGGACTGG 
ACCGTATCTTTAATTTGAAGCCAGCCTGAGCTACATAAGACCTTATA^ 
AGAAAGATGGCTC^CACTTAAGAGCGCCTCCAGAG 
GCCCTCTTCTGGCCACTGCGTGAACTGCATGCGCATGATACATGTGCATA 

AAGT CT AG CGGTGGTGGCACACGCT AGCACTCAC^GAGGCAGA^ CC 
1 0 AAGTT CC^GGACAGCCAGGGCT ACTCT ATCTCAG AAAACCAAATAAT CAAACCAATAAG ACTC CTCT AC C CC CAAAAAACAAAAAA 

GCAAAAAATAACAACTGGGTTTGGGTTTAGCTTTGAGATAAG^ 

ATGT AT CCCAGGCTGGCT AT ACT AG AACTGGTT ACTATGTATC CCAGGCTGGCT AT ACT AGAACTGGTT ACT G TGT ATCC CAGG CT 

GGCTATACTAGAACTGGTTACTGTCTATCCCAGGCTC^CTATACTAOAACT 

TGGTTACGTAGACCAAGATAGCCTCAGACTCACAGATGTGACT 
1 5 CACCAAGCCTGGTGTCCCCAAGGCrrTTTAATCCTCCTGCCTC^ 

G AGTTC^ CCTGGGC^TTGTTT ATTCTATTTTTCATTAGTT ATT AT^ TTGTTGTTTG AG ATGGGAT CTTT CT ACAT AT CTCAGG 

GTTGTCTCATG1TTGGAGCAATCTTCTTATATTTTATTGTATGTCT 

GCGTATGTGTGTGTGTATGTCTGTtnXSTGCATCTGTGTC^ 

GATCAAAAGACAAGTGITTAGAATCAGGTCTCTGCTTTCATCAC^ 
2 0 AAACACTTGTTTGTTTGTTTGGTTG^TTGGTTGGTTG^ 

TCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCG 

GTGCCACC7VCGCCCAGCTCAGCAAACACTTTCTTTCATCTGCT 

TTTGTTTTGTTTTGTTTGCTGTTGTTGTTTTGTTCA 

TATGCGTATTCACATTTGTGTGGGAACGTCCTTGGAGAAAGCAGGAG 

2 5 CTGGGAAACACCTCAGGTCCTCIXK3AAGAGCAGTAAGTCCCCITAACCAATGAA 

TTTCTTATTTACTTTGTCTGCATGTG^ATGTGTGTGCATGGGAC^ 
CATCCAGCGACGCTCGGGGACTCATATCCTCAGGCTTGGC^GCAAGTGCIT 
CTCGCTGCCCroCAACTCTCAATCTAGACCAGGCTAGCCTCGAAC^ 
TGAAGGCTTGC!ACCACCACAGGCCTTGCTTTGGGTTTTTG 

3 0 TGATCTT C CTG C CTC CCAAGTG CTGGGGTTACAGT CATG CG CCACCAATT CCTGG CTTTTG CAGTG CTGGTGATC CAAACT AGGGC 

TTCATGCAAGCGA£X3CAAAC^CGGTATCCCAGTTCCT 

TGAAGACACTGCCTTACCAAGCATCTCCTCTGTC^AATAGAAGAGAGGAGAAATATTAAGAT 
GTGACAGTGGTGCGTGATAACCAGGCTGGCAGTGCCCrCnX* 

ACCCTGAAGATCACTGGAATCAGGAGGAAAGTAGAATCCACAATGGAAGAGTTAAGGCT 

3 5 GTGGGGGGTGGGGACACTGOVGAAACCTGGGAGAAAAAAAATCCAACT 

TTTCTTTAAAAAACTGTTTCAGGAGATTTATT1TATGTATC 

GTGCACCTTGTGTGTGCTTGGTGTCAGTGGGGCTCAGACATCACCTGATTCCCTGGAACTG^ 

CTTGGGTGCTGAGAACAGAGTCCGGGGCCTCTGGC^AGAGCAGTCAGTGCTTTTAGCCA 

TGTTGATCTTGAGTTG^XXTAGGTACGGTGGCGGAATAG^ 

4 0 ACCTTT ATGTTAG<3TAGGGTCACAC!AGCAAGATCCGGTCACAAAACCAG CAACAACAAAACCAAAAGG AGCCAGCTTCTTCC CACA. 

CAAGCCCGGTTCATCTTCATGACTAGCACATCTAATGATA^ 

AGCGACAGAATGAC^CCTATGAGTGCACX3TCX3TTAATCACAAACACACACACACAC^ 
ACCCACCTGCAAACACAATTGCAGCCTTCTGGACGTCTCCTGTCA 

4 5 TGTAACAAAATGACTTCATGCrCrCCCrGTCCTGAGCCAAATTACAC^ 

TTT CTGG AT ACAC CAAT AC ACAGGAGCGTGCACCCTCAGAACACAT GT ACACT^ CT C ACGGGTG ACACAC CG ACG CT 

TACACTCCCCCTAGCCCACIAGAGGCAAACTGCTGC^CGCTTCTGAGTTTCTCACT 
ACACCCCACGCCCGC^AATCCCTCACCACAGCTCCACCCA^ 
TGGGTGAGGAAGTGTCTCCACGGAGTCGCrTGGCrAGAACCACIAACTr^ 

5 0 GCGTAGGGGGGACAGAGGAGACGGACTTCGAGAGGACAGCCCCACCGGC 

TCCCC^CGCATTGTCTCCGCGCCCCCTGGCGGATCC^ 

GGGAGGCCATCCCAAGGGGAGGGGTCGGCGGCCAGTGCAGACCn'GGAGGCGGGGGCC^ 
CGGTTAGCCTGTCAGCTCTTTGCTCAGACCGGCAAGAGCCAC^ 

CTGGTG CTTTT AGTG CCGCCCGGGCCCGGAGGGGC AG CCTCTT CTCACTG C AGT CAG CGCCG CAACT AT AAGAGGC CT AT AAGAGG 
5 5 CGGTX3CCTCCCGCAGTGGCTGCTTCAGCCCAGCAGCCA 

(^GCCTCGGAACTCGCATCACTGCCCTCACCGCTGTGTCCAGTCCCACCGTCGCG^ 
CAGAACCAGCAAGGCCAGGCAGGCCATGGGGCrCTGGGCGCT 

CrCTGCCCGCAGCCCTGGCTGCCAACAGTAGTGGCCGATGGTGGTAAGTGAGCTAGTACGGG^ 

GAGCCAGGCACGGGCCTTACCCy^CTCCCACGCTG 
S 0 GGGTGCTCACACCTAGAACTAGCTCTGCTGAAGTGGGGGCACATCA^ 

TTCCCATTTAATACGACCCCGTTTCTGCTGAGCAACAGGT^ 

CCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCAGATATT 

AGGTGG^CAAGGCTGCAGGTGGGGTTTCTCCTCGGGGGCTGACTTC 

CTCACTTAGACTCTCGAGCTCAGGGCCIAGGCAAGGATAGGGTGGTACAGCCT 
65 AACCCTTATGCATCCCGCCAGGGGCATCtaTGAACATAGCCTCCTCCACGAACCTC 

TCGAGCCCAGTCTGCAGCTGCTGAGCCGCAAGCAGCC^CGACTGATCCGACAGA^ 

CTCCAGAGCGCTGTGCGAGAGTGCAAATGGCAATTCCGAAACCGCCGCrGGAACTGCC 

CAAGATCGTCAACCGAGCTGGGTCCCCAGGAAAGCGACGCTTCCGGGATTAAGGGAAA 

GCGAAG<3CAGGGAAGAC^TCCCACX3GTTATATGTGATCAAACTGA 
7 0 TTCrTTCTAGCCTTGCGTTGTG AGCATC ATCTTTAACGTTC CCGGATCGTGGGCG CT 

GGGCGACAG CTGTTTTTCCCTAGCCTTCCTCAAAGGT ACCTGGCAAGCTG ATCTCTGAGGG CTAGCT AGGGTTGTGCTTCG CACCC 

AGCAAAGTTTGCACTGC CAATACTAGT AGCGATCTTGGCTATC 

TAAGGCTCTGGAGTCTCAGTAAAGCTTAGAGAGGAGGGCATTCCATGCITCGCA 

ACCAAGTCTTCCAAACAGGGTGCTGAGTTGGCCCCACGCCTTCTCrrCAACTGATG 
75 ACAGCGTTCATCTTCGCAATCACCTCCGCCGGGGTCACACATTCCGTGGCG05CTCCT 
CTGCGACTACCGGCGGCGCGGCCCTGGGGGCCCCGACTGGCACTGGGGGGGCTG 
GCCGAGAGTTCGTGGACTCCGGGGAGAAGGGGCGGGACCTACGCTTC 
CGTCGGTGTGTCCGGAACCAATGGCAGGGGAGATCT 
TGGGACTCTAGGAGGGAAGACAGAGAAGAGGTGGTGGTTGAGG^ 
5 AGGCT ATCAACACGTGGGATGT ATTGAGATGG CTCC ATGG CAGACTTTTG AAAGAT AAAAG TGACTTG CTGGCGTGGAGCAGAGTC 

TGGCCGAATGTCCCTATCTCAGCGGGCCATTTTGCACTTCCTCTCTCCCGAGCT^ 
CAGCATCGACGTGACCCGGGTGGGGTGGGGGTGGGGAAGTATGGG 

TGCGCACGTGTTGGATGCGGCTGCCCACGCTGCGCGCTGTGGGCGACGTGCT 
1 0 TACXXSCAACCGAGGCAGCAACCGCGCCTCGCGGGCGGAGCTGCTGCGCCTGGAGCCCGAAGACCCCGCGCAC^ 
TCACGACCTCGTCTACTTCGAGAAATCGCCCAACITCTGCACGTACAGTGGCCX3CCTG 
GCAACAGCTCGTCTCCCGCGCTGGACGGCTGTGAGCTGCTGTGCTGTGGCCGAGGCCACCGC^ 
CGCTGCAACTGCACCTTCCACTGGTGCTGCCACGTC^ 

TGCCGCGCCTCCX5GGAACGGGAACGCTCTCTTCCAGTTCTCAGACACACTCGCTGGTCCT 
1 5 GCCACAGTCCCAGGGTTCATAGCGATCCATCTCTCCCACCT 

CCTTTTGCCATCCTGAGGGCCCTGACCCAGCCTACCTCCCTCCCTCTTTG 
AGAGGGTGAGAGAAAGATTCTTCTTCTGGGGTGGGGGTGGGGAGGTCAACTCTTGAA 

TGTGACCTCTTTGGGT ATT ATCACCTTTCCTTGTCTCT CGGGTCC CTAT AGGTC CCTTGAGTTCTCTAACCAGCACCTCTGGGCTT 
CAAGGCCTTTCCCCTCCCACCTGTAGCTGAAGACTTTC 
2 0 CCAGCAGCAAAACCCTACATTCTCCTTGTCTCTGCCTCGGAGCCATTGAA 

CCCTTCCTGTCCTGCCTCCTCATCACTGTGTAAATAATTTGCACCGAAATGTGGCCGCAGA^ 

AACTATTTATTGTGCTGGGTTCCAGCCTGGGTTGCAGAGACCACCCT 

TTTTGTTATCCGACCTTTTTTCTCTTTTACCGAGKITT 

ATTAGTGGCTCCTCGCCCCCACCAATGTAGTATCTTCCTCTGAGGAATAAAATATCTATTTTTATC 

2 5 CCAGAACACAGCATGGCTTCCAACGTCCTCrrTCCCTTCCAATGGACTTGCTTCTCTTC 

TTGAAGATCTCTTTTCCAGGGCCTGAGCAAGGACCCTGAGATCCrGAC 

GTCTCCTGAGAGAAAAGCCTGCTCAAAAGCTCCTGCAGGGCTCTTCCTCCT 

TGTGTCACCTGATGGGGACTCAGCACACCAACAGTCCAGCTCrCCTAGTG 

ATAGCCATGACTACATCCTGAGAGGGGGGGGGGGGAAGCTCAAGCAGAAAGGGTGTCATTGCCCC^ 

3 0 TG CAGTCTGAGCTTTGAGCAGTGGAAGGCCCACCTCTT ACACT CTT C CTGG AAATTC C CT AACCT ACAG 

GTAAGGGACATCCAGGAACAAGCCATGGGTTAGAGCTGGAAAGATGGCTCAGAGGTTAAGAG 

TG AGTTCAATTC CCAG CAAC CACATGG TG ACTCACAAC CATCTGTT AATGGG AT CTGGTG CACTCTTCTGGTG TGTCTGAAGACAG 

CTATGGTCTGCTCATATAAATAATAATAAAGAAATGATATTAAAGAAAAGC 

AAACTTAAAGATGCTCATAGGGAAGAAGAAAAGAGCAGAAAGCTTAGCTTCTAGACA^ 

3 5 GAACCCCAAGAGTGATTGGTCCACAACCAAAGCAGTAAGCTAACCACACCCACAAAGAGCGAGAATG 

AGGCTACCTCTGTGCTTGCAGTC^CCAGTCAGCAGGAAAGGAATCCTTCTCAGAGTACTCAAC 
CCTTCCTTGTCCAACCAGAGCCTTAGCCCTGACTCAGGGAACACT 

CCAGAGAGGACCCGAAAGGCATTGACAGAAGCTACAGAGGACCCCCGCCTTCCCAAGAGCCATAACCAGTCTC 
AGTTTCTTCTTCrTCCCAGGATCCTTTGCTGCAGGCTATCACCTCTG^ 

4 0 CTCTGACCCCTGTTGGTTTTTTGACTCTGTGA^ 

CCACTGGCAAAGCTAGCACTGGGTCAGTTCTGATGAGTGGATCCAACCTGGGATA 

TCTCCAGTGATGATCATCCAGAAGAGCCTACTCAGTCCTAACCCAAGCTGAGCTATCAGCCCCTCAGCGCTCCC 
CATTGGGCTGTCCCAGGGCACAAATGTGTGTGTGTCTGTGTGTGTGTCT 

TGCCTGTGTGTATGTGTGCATGCACGCATGTGCACATGCACGCTTGTGTGTGTGTGTATGTGTGTATGTGTGTGTGCATGTC 

4 5 TGTGTGCACGCATGTGCGAGAGTGTGTACGCGTGGGCTGCTGTAACTCIAATGTATCAGGGAGAA^ 

CCATCGCCGACCAATCTCACCCAAATCCTAACCCAGGAACCTGGAAACTAGTGTGCTGAGCAGAGGTGTCTTTCAGGAGTTG^ 
GAGAGCACCACCACCCCCTTTTGCCCAGGTATTTGACACAT 

T AATTGAG AATATTCAAATATCCAT CTTTG TTGT CG CTT AAGG AGGGG AT AGGCTT CAAATGACAT AG TG T CTGTGACAGCTTGGG 
TTGTGTGGACGGCTACTGATACCTTTTCTGTACTAGCCTCTGTCAAATGCTGGCAGTG 

5 0 AGGACTCAACCAGGCCAAC!AAACACTAACAGGCATAGAGGGAAAGACCCTCGGTGTGCAGGAAAGGCT^ 

AGGGGCTGGAACTTGGAAGAAGTGTGCCATGGGAAAATGCAGAACAAAGCCAGG 

ACTTCAGATAAAAGTGCTGGAAATGTGTGGCGTGATGGTAGAGCACGTGTAAGGACGAGATGGGTTGGATCCCCA 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAAGGGGCTGTGGGGTCTACAAGTGTTGCCTAGGAGCTGAGGACTCT^ 
GTCCCTATTACCTGTGTTATCTGCGCTGCCCTCTCAACTGGATCAGTAGATCAGTAAGGAAGGGGGGGGGGAGGTGCGTACCTAG 
55 AAGAGTCTCAGGGCTCAGAACACAAGGCGCTGTCCATAAGATAATGAATGTCATCAGAAATATGCCTGACGCACCTTC 
TCCACCCACrCGTGATCTTTAGATGCCATGCAAATACAC^CCACCCCGGGGA 
AGCTGTTGCGGGGACTGAGTCTTGAAATATCCCAGGGGTGGCGGAAAAGCTGAGAGGAC 
CCTGGGGGGTCAGCACCGAACACCCTGACACTGGCATCTCGGAAAAACCCTGTCAGTGGT 

CC CCCTTCCTTACGGCCCCAGG CAGGCTCAGT CCCTTGGCTCC AGGG AAG AAAATAAGCTGTCTGGCTGTGGTGTG AGCAGGAACA 

6 0 TCCGTGCAGAGCTGCCCACCAACTCACCCTTACTCAGTCTTTCCACACCATGCCA 

CCCCTCCCCCGTCTGGGCTAAAAGCAGACACACTGGGTTGGTGAGAGAGCAGCAG 
CTCGCTCCCCTGTCATCCCCTGGGAGGGGGTAGGGTGGGGTGACACCTA 

AG AGAAGTGGAG C CT CCAAC CAG CTGCTG AAATATTT ATTG ATTTTTTTT C CTTT ACCTTT CC CAGGTG AATGTG CCACCTTTG AC 
GTGTGCCCTGGACTTTTGTCTACCCCACCCTCCCTGGTATATGGCCTTATGTCCAGCGTGAAAGGGCT 
6 5 CCTTACACCCTCAAGAAAATAGCAGACTTTTTAATTCACTCACTGAGTGGACATAGTGAGGTGC^ 

GAAGGGAAGGGAAGGGAAGAGAAGGGAAGGGAAGGGAAGGGAAGGGAAGGGAAGGGAAG 
AAGGGANNNNNNNNNNNNNNNNNNN^^ 

CCCTTCTCCATCCTCCCTATTAATCCTCGGAAGTGAAGGGGCCGTCCCTGGCGGCCCrTCAGCCCT 
70 TCGGCTGGCGTCTAATTAACTCCTCCTGCCCCTGGTTTTGTGTGCCCCCCTTACrrC 
CCGCTCCCCCCGACGGACATTCCAGTCAGGCCGGGCCAGTGTGCC^ 
CCGGCCGGGCTCAGGGAGCCCCACTCACCTCCCGGCTCT 

AAGGCGCAGCCCCCAGGGGG CTAACTCAGCCTTCACGCCCAGGTTGTGCGG CATTTGGGGAGTTT AATG AATTGTCCATCACGCCT 
TTCAGGGCCCGCCCGTCAGCCTGGATTAATCTTCGGAGCCCT 
75 GTCACTAATTGAGGTAATTATCTGTGACTCTGGCCTGGCCAACAATGCTGCATTCACT 
CTCTCAGCACCAACCCTCCCTCCCTTCCTCTCTAAAACGCGCCA^ 

CGTTCCAOATGTTCCACGTTCACGTAGTGTCTCCCAAAGCCTCCTCCACTATGGAGC^ 

AAGTTGGTGAGAGAAGAGGGCACACAGGCTTGACCO^^ 

TGGCACTAAAGTCTCACGGGGCCTTrCACATGTCCCTGTAGCCTACAGGACTCTG^ 
5 AGATGAGAGGTTCTGGAAGCAGAAGCTACCAGAGTTGCCAGAGCGGCAAAGTTG^ 

CAGGGAGCAGAAGGGCAGGATGCAGAAGCATCCTAGGAAGAAGC^CCCAAAG 

CCATCAATTAGTGCCATGCAGCAAGCTGGAGAAAA 

TATTCCTGGAGAACXTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

GTGCATGTGTTGTATATACATGTATGTGTGTGAATTC^ 
1 0 ATAAAGAAGAGTAGACTTGGATGAGACTTGATTTTGAGATTCTTGATG^ 

ATCCTAATACTTTACCTAGAATCCCAGGCTATATGAAGCCCGTACTTCACAACCGTTGCG^ 

TGTCCTCCGGCrrCTTCATTTACCTAGTTCTTGGTCAAAAGAGATGTCTAGAGCA 

GACCAGCTGAAGCACTAAGGATCCTGGTAGAACTCAGCGCCAAAGTTCGAACAGCGC 

TGGAGACACTAGCTAAGAGGATAGAGCCAGCGGCTGCAGGGGCT 
1 5 TCITCCCGGCTTTTATTTTAAATTGTCCCTCCATTATTTGAAGTAATG 

CTGAAGATAACAGAGAATACCCGGTCCTTTTGCCTGTACCCCCTGACT 

AGTTTTGGGGATCAACTGTGGGACCTIGTGAATATTAGATCAGTTTCTATCA^ 

CTTACACAAACCTTTTTGGAGGTGTTTTC 

ATTCTAGTAACAAATTTCAAAGGGTAACCAAGTCAACAAAGATGCCAGGCATC 
2 0 GCAGAGGAGGGTGGATCTCTGGGAGTTTCCAGGCAGCTTGGTTTAAA 

CCATCTCCGATACAGTCCCACCATCACTAACCCACACCAAACCAACCAAC 
GCCAGGGAAGTATTCAGAACTGAAGTACAATAAGATACTCCCAGCTCAAAACTCT 

AG ATCCTGTGTTTCTTTAGCCAGAGG CAGAGCCTCATTG CTTTTCCCATGTTGCCTTGGAATTCCTGGGTTCAAGGGATTCCCCAC 
CCACAACCCGCCATGTAGCCGGGACTACAGGTGTATGCGTCTGCCCCATGTGAAGGTCAGATGC^ 

2 5 AACCTCTTCGTGACCAAGGCCTCTTTCTTCCCAGGGCCTTGTTCCCTGTGGAA 

TCCTCATCATCTTTGTGAGAACTACCTCAAACATCCCTTCTTCAGC^ 

AATGCTTTGCTGGCCATCCGTGTATGCAGTAGACCAGGCT^ 

CTTTGCTAGCACCIAGATAGGGAGATTATATGCCCTCATCA 

GATGGATATGGAGAGGCACTTGAATGGATGATAGATGAGATGTGGATAGATCAGTAAGTGCATAGGTAGGATGC^ 

3 0 ATGCATAGGAAGGATAGATAGGTGAGTGGGTGGATGGGTGGATGGATGGAAGGATGACTGGATGGGTGGATGGGTGATGATTGGAT 

GGATAATTGGGTGGATTGATGGGTGGGTGGATGGATGGATGGATGGATGAT^ 
AATGGATGGACTGATGGAGCATTTATCCACAGATGACTTGTATATCCrAG 
CATGGCATGCCATGAAGAGGCAGGAACACCCCTAGTTTAAATTTTTCTCT 
ATTTGTGGCTATATCCCATTCCCTATGAGTCCAAGTCAAGGAGGTGTCCTCCT 

3 5 CAAATTTG AATGTTGGC CTTGCAG AGGCACTGCGCTAATCTCCTCTCAG CTG CT CCAAATTTAGTGT AAAAGTGATCTCCCCAAAG 

GTAGAAGGTGTGCCCGAGTCTCCTCGATGTCCCTGGAACACATGGTTAGTGGGCAAGTGGACAACT 

CTACCCTGTCCTCAACGAACACTCTGTGCACACATCCCAGGACCCTGCAT^ 

ACTGATCTCTGTTTTGTTTTGTTGTTTTTTGGTTTTTCTGAGACAG^ 

TAGACCACGCTGGCCTCGAACTCAGAAATCTGCCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGCT 

4 0 CTGATCTCTTTGATGTCCCAGTTATAGCTCTTGGGTTCCCCACCCATTTGTAGGGGG 

CTAAAAAGGGCAGGGTTCCTGGCTCCCTTAATGCCTTATCAAGCACAACAGAACT 
AGCTGTGGGGAGAGTCATGGGGTCCCCTGCTAACCTAAGATGTGTTGCCCCCAAAAGAGTAG 

AAGTCATTTTCACAGCAGGGAGTCGCTAACCTGCCACCCACAGGGCCCCATGGGGCCTAAGATACT 

4 5 GAAACCTGCAGACACTCCTCTCCTTCTCTCCCAGAACTTTGGC 

CGTGTTG CCCTGACACCCTAAATACCCAAATGTTCCCCCACACAAAATGGGTGGTTGTTACCT CTAGG AT ATGCCTTTTTCAA 
TGCTGGGGACCAAACTCAGGACTGTGTACATTCTAGCCACACATGCATACTACCATT^ 
GACTTCTTTTTTTTrriTGGAGACAGAATCTTGCTGGGCTGAACT 
AACTCATGACCATTCTCCTGACTCCAGTTTTCAAGAAAGTGCGCCATC 
50 CTATCrTATCTATCTATCTATATATATATATATATATGTTTTATTTTGCCTGGATGGCATGCAGCTCATTATGTAGA 

CTTG AACACC CAG AG ACCCAC CAG CCTCTG C CTTCTGAGTG CTG AG ATT AATG ATGG AG AGTGC CACCAATGG AG AGTGCCACCAA 
CCGGGCnTCTTTCCCTAATTTTTATCTACAAGTCACCTC 

TGACTCTTCCTAGAACTTAACACAACCGTCCTCACTGTCAGACAGGCCTTGCCATTTCTG 
TCCTAGGAAGTGTTCAGTAAATGCTAGGTTTCGCTTGGCAGCAGAAAGGACTGCATTGGACGTGGTAA 

5 5 T CTTTG CC CAGGGTGACC CGG C 

MOUSE SEQUENCE - mRNA 

AGTGG CHX3CTTCAGCCCAGC AGCCAGGACAGCG AACCATGCTG CCTGCGG CCCGCCTCCAGACTTATT AG AGCCAGCCTGGGAACT 
CX3CATCACTGCCCTCACCGCTGTGTCCAGTCCCACCGTCGCGGACAGC^ 

6 0 GCCAGGCAGGCCATGGGGCTCTGGGCGCTGCTGCCCAGCTGGGT 

CCTGGCTGCC^CAGTAGTGGCCGATGGTGGGGC^TCGTGAACATAGCCTCCTCCACGAACCTGTTGACGGATTCC^ 

AGCTGGTGCTCGAGCCCAGTCTGCAGCTGCTGAGC05CAAGCAGCGGCGACTGATCCGACAGAAC 

AGTGGAGGGCTCCAGAGCGCTGTGCGAGAGTGCAAATGGCAATTCCGAAACCGCCGCT 

CCTCTTCGG CAAGATCGTCAACCG AGGCTGCCG AGAAACAGCGTTCAT CTTCGCAATCACCTCCGCCGGGGTCACACATTCCGTGG 
6 5 CX3CGCTCCTGCTCCGAAGGCTCCATCGAGTCCTGCACCTGCX5ACT 
GGCTGCAGTGACAACATCGATTTTGGTCGCCTCTTTGGC^ 

CCTGCACGGTGCGCACGTGTTGGATGCGGCTGCCCACGCT^ 

CGCGTCCTTTACGGCAACCGAGGCAGCAACCGCGCCTCGCGGGCGGAGCTGCTGCG 
70 TCCCTCCCCTCACGACCTCGTCTACTTCGAGAAATCX3CCCAACTTCTG 

G ACGAGCTTG CAACAGCTCGTCTCCCGCGCTGGACGGCTGTGAG CTGCTGTGCTGTGGCCGAGGCCACCGCACG CGCACGCAG CGC 

GTCACGGAGCGCTGCAACTGCACCTTCCACTGGTGCTGCCACGTCAGCTGC 

TCTATGAGGTGCCGCGCCTCCGGGAACGGGAACGCTCTCTTCCAGTTCTCAGACACACT 

CCGCGTCCAGCCACAGTCCCAGGGTTCATAGCGATCCATCTCTCCCACCTCCTACCTGGGGACTCCT 
75 GGCTCGAACCCTTTTGCCATCCTGAGGGCCCTGACCCAGCCTACCTCCCT 
AATTTGGCCAGAGGGTCLAflAXIAA 

ATTTTGCGCTGTGACCTCTTTGGGTATTATCACCTTTCCTTGTCT 

TCTGGGCTTCAAGGCCTTTCCCCTCCCACCTGTAGCTGAAGAGT^ 

TTGCTGGACCCAGCAGCAAAACCCTACATTCTCCT^ 

TGTAAATAAAACTATTTATTGTGCTGGGTTCCAGCCTGGGTTGCAGAGACCACCCTCACCCC^ 
CGCCAGTCCTTTTGTTATCCGACCTTTTTTCTCTTTTA 

ACTGTAGCTATTAGTGGCTCCTCGCCCCCACCAATGTAGTATCTTCCTCTGAGGAATAAAA 

MOUSE SEQUENCE - CODING 

ATGGGGCTCTGGGCGCTGCTGCCCAGCTGGGTTTCTACT 

CAGTAGTGGCCGATGGTGGGGCATCGTGAACATAGCCTCCTCCACGAACCTGTTG^ 
AGCCCAGTCTGCAGCTGCTGAGCCGCAAGCAGCGGCGACTGATCCGACAGAACCCGGGGATCCTGC^ 
CAGAGCGCTGTGCGAGAGTGCAAATGGCAATTCCGAAACCGCCGCTGGAACTGCC 
1 5 GATCGTCAACCGAGGCTGCCGAGAAACAGCGTTCATCTTCGCAATCACCTCCGCCGGGGTCACACATTCCG 
CCGAAGGCTCaVTCGAGTCCTCCACCTGCGACTACCGGCGGC^ 

AACAT CG ATTTTGGTCGC CT CTTTGGCCG AGAGTT CGTGGACT CCGGGG AGAAGGGGCGGG ACCT ACGCTTCCTCATG AAC CTTCA 
GAACAACGAGGCAGGG CG AACX1AC CGTG TTCTCTGA^ 

GCACGTGTTGGATGCGGCTGCCCACGCTG CGCG CTGTGGGCGACGTGCTGCGCGACCGCTTCG ACGGCGCCTCCCGCGT CCTTTAC 

2 0 GGCAACCGAGGCAGCAACCGCGCCTCX3CGGGCX3GAGCTGCTGCX3CCTGGAGCCCGAAGACCCCGC^ 

CGACCTCGTCTACTTCGAGAAATCGCCCAACTTCTGCACGTACAGTGGCCGCCTGG^ 
ACAGCTCGTCTCCCGCGCTGGACGGCTGTGAGCTGCIT5TGCTC 

TGCAACTGCACCTTCCACTGGTGCTGCCACGTCAGCTGCCGCAACTGCACGCACACGCGCGT^ 

HUMAN SEQUENCE - GENOMIC 

CCCTGGAGTCGGGAGATGACTAAGAGGGAGTCCCTGAGGTCTTTGAAGATGTGTACAA 

ATTCCCCCAACCCTGCAGGGAGCTTGGGTGGGTGAGGTGGGAGCAGAGCCTT^ 

TTCACTTCAGGGTTAATCAGTCTTTTAACCAACCTACCTACCACCATCCTAGCACCCCCA^ 

T C CGCTTC CCAAGATGT AGTTTT AT ATTTG C C CTTCAAG ACTGCCCTTTTT ATGTTT ATC CT AACCTGCTATT AC C CT AT AAAACA 

3 0 CTGAGAGGAGGAGAGAGGTTGGGGAGTGATTTTCCCTAAGGCCCTCTTAAAAGG 

AAGCTCAGGCTGCCAAGCCTTTCCCAGACTTAAATAAACAT^ 

GGGGAAGGAGGGGGACCCTCACITCATrTT AGGCAGGGG CTCTCCGG AGCC CTTAG AGACC 

AGGCCrCIGCTACCCCAGGATCTAGGGCTCCTGGGCCAGCCTTCTG^ 

ACAAGT CTTGGGAATTCCCCrGCAGATGGCAGCTGCCTATT AACC CCATAGTCCCCAGAGCACTGCCCCCATCTCTGGGCTGAGCC 

3 5 CCCATGGGTCTTTACACCTCAGGGAGTGAACTCCAGTGGG AAACCAGG AGAGG CAGCCATG CCTCCCAGACCAGGCCAGCAGTGAC 

CTGTTCTCTGCCTCTCCAGCCGTTCTTCACCTGCTCAGGTGAGCAACACCT 
CCAGCTTCAAAGTCATCAGGGTCTTTGTTCCCAGCTTTGGCTCCTGGGAACC 
GGTTGGGGGCGTCACCCCACCCCCGCTGAGACACTGGGGACACCAGCCTGGGCTTG 
GGAGGGAATAGGGGGGCGTTTGAAGCTTCCAGGACTGGGGGCA 

4 0 AGGAAAGGGGGGCCCTAGGGTAGGAGAGGAGGGACCCAGCTGCCT 

CCTTCCCGCCTCCGCGCGCCTCGCCCTGCCGCCCACCCCCTAGCCTCCGCGGCAGCGCCGACCCGCCCAGCCAGGCT 

TGAGGATGGCGCTGTGGTGCGGCAGGCXjGCCGCCGCCCTCAAGCGCGGAGCAGTTCC^ 

TCGTGGACCGCGATGTGCAGACCCTGAAGCGCGGACGCCGTCACGTCGGGGTTC 

4 5 GCCXSGACAGCGTCAAGCACACGGTGTTGGCCGTCAGCGGCGGCTC^ 

TGGGAACCAAGAAGGCGTCTGGGGTCTGCGGT CCAGACCCCTCCAACTCTCCCCACCCCGGGTCGGTGTTTCTATGG CCTGGGAGA 
CAAGGGGAACTGCTCACCGACTGCACAACTGCCAGGAACAGGAGACCCGCGAGGCCCGAGGGCGGAGGCCGCGGCCG^ 
AGCATGTCGAAGCCCGGAGGCTCCGGTGGGCAGATCGATCAGGACCTGCGGGACGGGAGACrTG 
GCTCGCrTGGGGJUVCCAAGTGACGCCCTTCITCGGGCTCCTAACCCACCGGTGCCACCC 

5 0 CAAAACAAGAAG7VAACACCCCTTG ATTCCCAGAGTCCCTGAGG CTTGG AGGTCTTAAAGAAG AAG AGTCAAGG CGTTGT CCCAGCT 

. CAGGACTCAAGCGAGCIACCCCTGGAACCrTGCCCAATCIAGACACAACGCCC 
ACACAGACTCACAAACACITGCTCC^C^CCrCGCCrACTGCTC 
G<HttGGGAGAGGGATAGGGAGGAGGTTCCCCACCACCrAAGG 

5 5 GGGGAGGAGAGCCTGGCTCCCTCCTAAAAG^SACCGTGGCACAGACTGAGTCCTCTTCC 

CTCCCCTCCTCCATAGTraCCCGTCTGGTCCTCACCCCTTGGCTCTCA 
CTCTGTGTCTTTATCTCTAGCAATATCTCIGGCTCTCTATGCGTCTCTGTGCTTCT 
GACTACACACGTGCCTGTCTCTTCCrCGTGTGCACGTCCCTGCCCTCCCTGCTTTCCCAGGTCT 
GCGGGGTCCGAGACAGGGACGGTTGGGTTGCCTGGGCTTCACTAGTG 

6 0 CTTGTCAGAGCCGCATTCTTCCAGGGCCGGGCCACTCCrClTCACCGCTTCCC 

GAGGAAGGGAGGGAGTGATCGAGAGAGCTAGCAAGCCGGCGAGCAAAGGACTGG 
TGGCTCIGCCGGGCTGACGGAGCTGCAGCCCCTCCCCGAGCCTGGGGCTTCCCCACCCT 
GAATGCAGCCTCCCCCTCCTCCCAGCGCGCGCGCGCGCG^GCGCAC^ 
TCACAC^CCCTCTCCCCCGACGCCTGGGTTC^GGCTT^ 

6 5 GTCGCCAAGGCTGCAGGGAGGGTGTAGTAACCTTCCTAGGX3AGCCAGAATGGX3GCCCAGATG 

TGCGTCAGTTTCTCCACCAAGATCAGGGTGGCTTGGAAGGTGCCTTTGACAAGATG 

GGTGTGGGACAGTCACATGTGG CACCCACAGCCTGAGCGCACCCGGGGT ACTCAGAG CGGGCAGGAG AAGCACACGTCTGGGCAGC 

TTGGTTCCCACCCTCCTCCAAGCCCTGCGCCTGCCCCTTTGGAGGG^ 

AGTAGCACCAAAGATTGTCAGCCTGCCCCGATGCTTCCTTGCCTG^ 

7 0 ATTCCCCCGGTCTGCACTTTGGGTGGCCTGGATTCCTGCn'CCAAGCTGTAGACTTGC 

CCAAAATTGAGCCAAG CCCAAGGGCAG CCGGG AAAGAG CCTGG TC TTGGGGTG CACAGGCAAAGGCAAACCG CCTT AGGG AGACCC 
AGTGGCAGCGAACGGGTTAGGCGCCACCGTGCTCCCGCCTGACTCACCCTCGAAGCTCrAACCACT 
AGGGGCCCOVGGCTTTCCAGCCGCCCCCCGCCCCItSCa^CCGCAGCCOVAGGCA 
ACTCAGACTGCCCCCAAGCAAGCAGAGGAACAGGGATGATACCCACAACCACAGTCACCTCTTC 
75 TGCTGGGCAGCCAGCCTCCAGTCTGTGCTGCTGGAGGTXCT 
GAGACAGGAGATCGAGCAAACTTACATCTXrTTGGGATATT^ 

AAAGTGCCATGGTCCCTGACACTGGACAGCCTGGACCAG ACTG CTTTTCTCTGATTCCCTACCTCTGGGATGGGGAAGTGGGAG CA 
CTAGAGTTCCTGACCCTCCrTCCCATTTAGAGGACAAGAGCCAAACCATGTG 
TCTCATAGTTTTCATCCTTACACCTGGATTC^TGATCCCCACCAGTGTC 
5 CAGGTCCTTGGAAGAGGGGAATGAGGTTGCCTAAGCCTCTCTTCTCC^ 
CTGGGTCCCCCTTAGGTTTGGGATTATTTTTTGCCTTCT 

GGCGAGGGGCAGAACTACTACAATAGGTTTCAACTTCCTGGTGACCCTAGGTCrC 
CAJGAGTCCTGCTCCTTCTCTCTACTGAAGC^ 

CAGTGCCCTCAGCCCGACCGGCTGGCCTGAGCACTGACCCGCAGGCGGCrAGAGGGCGG 
10 GTGGGTGCACACCCATTGCCTAATACCCCAAGCACGCGGCCAGCTCCAAGAAAGGGGAGGGACG 
GGGTCAGGGCCAATGACAGGAGGGGGAAAGTGGGGGGCTTTCATCACT 
TCAATGAGAAGAGTGGAGGGCTGCCCTTCCCATCCT^ 

GAGGGCGGGGGTCAAAGAAAAGAGTGGGGGCCCCTGGAGATTGGGGGATGTCGAGCAAC^ 
CrCCTCCTCCTCCTCCCCCCATGTTTGCCTTGGCCCGGGGTCCCGG^ 
1 5 ACCCGGGGAACCCAAATTATAGGCCCAGGAGGGATGGATGCGCCCGTGGCGGTGAGCTCAGCTGCGCT 
GCGCCTTCTGCrrcCAGCACCGTAGGCCACt^CCTGGAGGCACCAAA 
CCCACTTTCAGCAGTCCACTCCAACAAATCCATGGGAT^ 

GGCTGTGCCTTCCCAGCCCAAGGGGGC^GTGCTGCCTGCGGGTGCTTCAAAGGAGGG 
CTGGAGCCCTGCTTCCTCCCCAGATCCCAAAGTCAAGGCAAAGCCCCT 
2 0 GGGTGGTTTGCTACTGATTTTCAACTTCAAGCCTTTAAAGTCAT^ 

GGAGATTAGTAGTGGGATTCTATTTTATAAAGGGGGAGGGAAAACAAACTGAAGGAACAAATACA 
ATTTTAGAAGACAGAAAAGCAGGTGCACAGACCCTTAGAAAGGAGCATGCTT 
AGTGGGGAGAAATTCGCAAGGAAGAGAGTGGTAGGCAATGAAGCTGGAGGGCTA^ 
GGTCTCGCTGGAAAATTTTGAGCAGAGAAATGAACTAA^ 

2 5 CATGCCTGTAATCCCACTACTTTGGGAGGCCGACGTGGGCA 

AACCCCACCTCTACTAAAAATACAAAAATTAGGTGGGAGTGTTGGTGGGCGCCTC 
AGAATCGCTTGAAACCGGAAGGTGGAGGTTACAGTGAGCCGAGATCATACCACTGCACT 
CGTCTCAAAAAAAAAAAAAAAGAAAGAAAGAAAGAAAGAAATTGGCTCT 
TGGAGCACTGAAATGGAGCAATCTTGGGCAGTAATGTGGGGTCT 
30 GTCTGCATG CTTATGTGT ATGTT CCAGG AGAGAGAT AAGTGGATGATG CAGGAAAGAAAG AGGGGACAGTTGAT AT CATG ATT ATT 

TGATCTAAATAGAAAGTTGGGTGCTTGTTTTGGCAGCACATAT^ 
AGGATGACATGCAAATTTGTGAAGCATTTCATATTTTGA 

GCCTTACCCTCTTGCCCCCTACTCCrrGGTGTGTGGAATGTTGGAACAAAGCACAGTGGCT 
GACAA.CAGTCGTCAAAGACAGGGCTTCCATATTTTCCAGCCAGCCTCCCA 

3 5 ATGTGACCCAGGCCCCAGTGGGAAGAAACTCATAAGGGATAAAGGACAGGCTCCCCGTGATACATT 

TATGCTGGGTGCTCTCTGCAGGGACTACTGGCTTTTGGATCT 

CCTTGGAAGGGCCAGATATACTAGGCTGGGCAAAAGGGAAGAAAAAAGGGAAAGAAGGACA 

G G CTTCATTTGGG TT AAT ATG C ATCTCATTT AAAACA.CAAGTG C CC GGGAAT ATT AATG AAACTT AC CTGG CATTT ATTCCTT AGA 
GTGATTTCCCTCCCTTAGAAGGGAATCCTAGTCATTTCT 

4 0 AGGCATCCTAAAAGACAAGGGAGATGAAAAAAATCAGAGCTAGAACTCAAAAGGGAGG 

CATCAATTTTGAGAAGGGTTCCCAGCCTGTAATTGCT 

AAATTGGTCTGGGTTTTAATCTGGCTCTACCATTGATTCATTTATTAGACGT 

CTGCAAAATGGGGAGAGAGAAGAGATCTTCCCTCCTTCCAGGGGCCATGTGTGTGGTGGT 

CCCCCTATTCCCCATATAGGGAAAAGCAGCCACTTTTTTTTTTTTTTT 

4 5 ATGGCGTGATCTCGGCTCACTGCAATCTCCACCTCCCGGGTTCAAGCCATTCTCCTGC 

GG TGTG CGC CACCACACTCAGCT AATTTTTGT ATTTTT ATT AG A 

GACCTCATGATCCACCTGCCTCGGCCTCCCAAAGTGCGGGGATTATAGGCATGAGCCACCGCGCCCAG 

GGAGGCCTAAATGGCCTTGAAGCTGAGTAX5GAGTCCCTGGGAGAGAAGAGAAAAGTGTACAATGGATGAGATGG 

TGGGTATCCCAGTGTGGTGGGAACTAGAGCTTTAGGGAAAGACAGAAACTTGGCAGAAACATCCAAAGA^ 

5 0 GCACAAGTTTCCTCATCTAGGTTCAATGTAGCCAGCAACCCTTGTCTTCCCAG^ 

G CAC CATTT CC CATCCTTTCTG AG C CT AGG CCTCAG AG ACTT AG CCACTC C AGG CTGGGTTCACCTC AAT ACCAT CTTGGTTGT AG 
GCTCGGCTCTCTCCCCCAATGACATGCACTGGTTGACACATACCACAGTGTGACAC<SC 

ACAGAATG ACG CAT ACACACAT ATTT AAT CTTCCCATG CACATG CT CATCCACCCACTCCACACACAGT C CAGACACTCTG CAT CC 
CTCAATCATG CTTCTG AGTCTC CTGTCGACAGTTGCCACCTCCTTCCTG ACACACTGCCCCAGGCGGTGACTGTGACAAGGTGACT 

5 5 CCATGACCTTTTCTG ACTTG AG CTAAATTCCAAAATT CTTTGGAAAGTTTCCT AACATCCTT CGTCAGAACAAGGAGTTTCTGGCA 

CGTACCAACACACAGGGAGGATGCACCCTCAGAACACAGCACATTCTCACTC 

CACACACACACACAGCCACTTGTGCGCTTCTTCTGGCGCACATC 

TCCTTTTC*GTTAGCTCACCCCCACAACTCATCCCCGGGATTTCCCT 

TCTCTGTCCAGCCATCGGGGGTTCCTGGG CGGTTAAGCATCTCCCCGGAGTCGCTGCC CAGAACCACAGCTTTCCT^CCGACACTC 

6 0 AGGATGGGGGAGAGAGGGGACGTCGGAGGGGCCCGGGGTGACGTCGAGGGGACAACCCC^^ 

CCTGGCGGGCTCTCCCCGCAGCACACTCTCGCCGCGCCCCCT^ 
AGX3CGGGCGCGCGTGGGAGGGTGTCCCAAGGGGAGGGGTCC<3C<3^ 

GGGGTGAG CCCCGACGGCCAACCCGTCAG CTCTCGGCTCAGACGGGCGGGAACCACAG CCCCG CTCGCTG CCCATTGTCTG CGCCC 
CTAACCC^TGCGCCCTC^TGCCACAGTGCGGCCCGGAGGGGCA^ 
65 GGTGCCGCCCGCCGTGGCCGCCTCA^CCCACCMCCGGGA^ 

AGACTGCGAACTCTCGCCACTCCCGCCACCGCCGCGTCCCGTCCCACCGTCGCGGGCAACAAC 

TCTGCCCGCAGCCCTGGCTGCCAACAGCAGTC^CCGATGGT^ 

GCCAGGGGCCAACCCTACCCAGCTCCCACGCTCTGGGATCCGTCTGCCGACAGGCTCCCTCCCCGCT 

7 0 CCGAAGGGCGATCTGGCATGAAACTGCCCCAGACTCCAGCTCTGTACAAGTGGGGCGAATGATCCG^ 

CCCAGGCttGGGAGCCCACTCTCATCTAGCACCGCC 
TTCCCGACCTCCK3GAAGTGAAGGGCCAGGAGTTCGCCTAGAAAGGAGGG 

G GTGGG C AAG AACTG CAGGCCATGATT AT CTCGCTCAGGCTG AC CGGAAGAGG CT CGG AGAT C CAAGG T AG ACACT CGGTCTCCGG 
GTACCTCCTCTGTCCAGTCTCCGGACCTAGGGCTCAGGCGAGCAGCCCTG 
/ 5 CTTTCAAATTAGTGAGCCTCGGAGAGCGGCTATTATTAA 
TGGCACAGTTTTT ATGGTT AGGGTG ATC CCCTT CCTGAG CCTGAGCT ATCAT ACGTCCGACCAGGGG TATTGTG AACGTAGCC 
TCCTCCACGAACCTGCTTACAGACTCCAAGAGTCTCCA^ 

TCTGAT ACG CCAAAATCCGGGGATCCTGCACAGCGTG AGTGGGGGG CTGCAGAGTG C CGTGCG CGAGTGCAAGTGG CAGTT CCGG A 
ATCGCCGCTGGAACTGTCCCACTGCTCCAGGGCCCCACCT(nTCGGCAAGAT^^ 
5 TTCCGGGAGCAGGGGAAACGCGGGGTCACCCCCAGGGCATGGGCGGGCGAGTTCA^ 
ACAATCAACCTTGCCAAGTGCCTCGTGCCCAGCGCCAGCTCGGGGCCAGACTTCT 
ACGAAGCTTAACTTTTCTGAGCTACTGCCCCAGATAAAGAAAGTTTCGGGTCGC 
TCTCTCAAAAGCGCCTGGGAAGCTGCrCTCTC 

AAAGTTTTGGGCTGCGAAGAAGTCTTGGGGATGTATGGTTCTTCCGCTCCCCTCT 
10 TCCGCTATCGAGCCAAAATGCGCCCTAGAATCTCCCAGTAAGGTGTGATTACGCCCGTGGACGTG^ 
TTTCTCTACTAGCCCTAGAGACCAGCTITCCAGCACrrGCCGGCCCTGGCT 

CGGT CCCAAGCCCTT CATGAGG<3TGCTGGCCGCGCCC CTGTCGAG 

AAACGGCGTTTATCTTCGCTATCACCTCCGCCGGGGTCACCCATTCGGTGGCGCGCTCCT 

ACGTGTGACTACCGGCGGCGCGGCCCCGGGGGCXrCCGACTGGCACTGGG 
1 5 CGGC CGGG AGTTCGTGG ACT CCGGGGAG AAGGGG CGGG ACCTGCGCTTC CT CATGAACCTTC^C AACAACGAGGCAGG CCGT ACGG 

TGAGCTTTGAGAGGCTCCGCACCCTAAGCGGAGCGGCAGGGGCC 

GGGCACCAGGAGAGGGCGTCCTGGGAGAGCCGGAGGCTTGGAACGAAGAC^ 

TGTCTGGCCCGCGGACAGGTAGAAGAGGTTGCAAATCAAGCACAGTCTCTTCGCT 

GAGACTGACTCGCCGCGGCGGAGCAGGGTTGGGCAGGGTTTCCAAATCTCAGCGG 
2 0 GCTAGGCCTGGGCCTTTGCTGAGGTCCGGCCCCCGTGGCGTCCGGGAGAGGGCAGTG 

TGGGACACTCrrTCTrCCCCTATCCCCGCAGACCGTATTCTCCGAGATGCGCCAGGAGTGCAAGTGC 

GCACGGTG CG CACGTG CTGGATG CGGCTGCCCACXJ CTGCGCGCCGTGGG CG ATGTG CTG CGCG ACCG CTTCG ACGG CGCCT CGCG C 

GTCCTGTACGGCAACCGCGGCAGCAACCGCGCTTCGCGGGCX3GAGCTGCT 

CTCCCCCCACGACCTCGTCTACTTCGAGAAATCGCCCAACTTCTGCACGTACAGCGGA^^ 

2 5 GCGCCTGTAACAGCTCGTCGCCCGCGCTGGACGGCTGCGAGCTGCTCTGCTGCGGCAGGGGCCACCGC^ 

ACCGAGCGCTGCAACTGCACCTTCCACTGGTGCTGCCACGTCAGCTGCCGCAA 
GTGAGGCGCTGCGCGGACTCGCCCCCAGGAACGCTCTCCTCGAGCCCTCC^ 

AACCACTTGCCTGGGGCGGCATGAACCCTCTTGCCATCCTGATXXIACCT 

3 0 CCrTGTTGCACTGCCCCCTGCTTGGCCAGGAGGTGAGA^ 

CTGCCTGCTCCATCGCGCCAGCGACCTCTCTGCCTCTCTTCTTCCCCTTTGTCCTC 
CTATTCTCCTGCCATGGGTGCAGACCCTGAACCCACACCTGGGCATCAGGGCCTT^ 

TTACAGGGCAAAAGGGCAGCTGTGATG ATGTCK3AAATGAGGTTGGGGGAA CAG CAG AAATG CC CCCATTCTCCCAGTCTCTGTCG 
TGGAGCCATTGAACAGCrGTGAGCCATGCCTCCCTGGGCCACCTCCTACCCCTTCCTG 

3 5 TTTGCACTGAAACGTGGATACAGAGCCACGAGTTTGGAT 

AAGACCACCTCC^CCCTACCCAATCCCTCTCCACTCTTCTCTCCTTTCTCCCT 
TTTCTCAAAGATGCGTTTGCCTCCTGGAATCAGTATTTCCTTCCACTGTAGCTATTA^ 
CTTCCTCTGCAGAATAAAATCTCTATTTTTATCGATGACrTGGTGGCTTTTCCT^ 
CCCTATCCTCCCCrrTTTACCACTCCCAGGCTTGGAAGCTTACCTCTCTCAAACAAA 

4 0 AACCTTGCCAGAGGACCCTGAGATTCTGACCCTTGGATGACCCTAACGAGGCCACTGGGAATCCAGCT 

GGTCAGTAGCCAAACAACCCTGC^GGGTTCTTCTTCCTCGTGGAACAGCAACTGG 
TGAATGGGGGTTAAACAACCTCAGTGCCTGCTTTTCCTGGCTGGAAC^ 

ACCATCTTCCAAAAAGTAGGAAGAGAAACT AAAGCAGAGTG CTGCTGCCCCTAATGTGG AG CTATTGGTC CCTGAAG CTTGG CTTC 
TTCAGCTGTGGTCAGGGCATTGGACAGGGGTGAAGC CCTTCCACCAACAGAAAAGTAAA CAGTGGCT CATGCC 

4 5 TCTAATCCCAACACTTTGGGAGACCGAGGTGGGTAAATCGATT 

CACCTCTACAAAAGAAAAAAAAAAAAAAAAATAATATATATATATATATATATATATATGTGTGTGTGTGTGTC 

TGTGTGTGTGTATATACACACACTATTAGGTATTATATATGTATATCTATCTATCTATCTAATAAATACCTTATCAGGAG 

TC CCAAT CTT GGGCT CAGGG AG ATC CAGAT AAT AGG TGATGAG C CTGG TG CTGGAC CGAG AAGCTCAGAGATG CTGG TGGGAG AG A 

AGGGAGCAGG T AGG CTAG AAT C CAAATG CAGGCTAGGCTT AAAC CGATGT AACCAAAGGAAACC CT AATGTGGTCCAT AAC CAAAC 

5 0 TACTAAGCTAGCCATTCCCATAAAGAACCAGAATGAGGATTT^ 

CAGAGTCAGTGGGAGTTCTAATCCTACCCACGGCCCTAAACCTATCTCTAA 
ACCTTAGCCTTAACCCTAACTTCAGACAAACACTAACCCCTT'CT 

GGAGGATCAGAAGGGCAAGGGGAGCTGTGAAACACCCAACCTATCCCCTAAGAGCAACA^ 
CCCTTCTCCCCTATTCCCTCCTCrCCTGTTaVGGCCTCTTCTGCTCTT 

5 5 GGGAATGGAGCCAGGAACrGCTCAGTCCTGCAACGACATCCCTTTCCCTT 

GATTCCTCACCCT AGGGGAAGAAATAAAAACCTATTGATCCTGGGGAAGAGGCACCTG AAAA^ CTTTGGGTTTCCATAAAGAG CAC 
CCTGCAGAGAATGCCCAGTCTATAAGCCAAAGGGCTATTAGATCCCTCAACT 
GGAGGGGGGGCTTGGGCCCCTGCTGGACAGTGCCCAGAGTTGAGATGAAGCAGCAGGGTAG^ 
AACTACTTTTGCCTAAATTCAAACTCTAAGAATTGGAGTCAGTATGAGCTTTCAC^ 

6 0 GACAAACJAGGAGCTGGAGGAAGAGCAAGGGAAAGTGGTGAATTCrrT^ 

TCAGACTCAGTTGCCTTGCCTATCAAACAGGGATGGTTATTGTT^ 
GTACATGAAAGCCTGGGTCATTTGAATACTCACTGACACCTCTTCTATACCAGGCAT 
TAAACCATAGTGCCTGCTTGGAGGAGGTTGACAGTCTGGCC!AGGGGATCAAGCACATC 
TGCAATGAAGGCTCAATGAAGTGTAAAGCCTTCTGCAGA^ 

6 5 AGGCTCTTCTCTTCCAAGGGCTGGAATTTCAAAGGTGATGGTGCTGGGAAGTGCA 

AAAACACAAACACACATACACAC^CACACIAAACAAGTAAAAGGGCAGAGG^ 

TTATTGTGATCATCnATGCTATCCTCAGACAGGGCTGGAGCGGGT^ 

CACAAAGGTCTCAGGAGCCACAGTAGGCTCAGGGTGGTGGCCATA^ 

CCACCTCTG ATCTT CAGGTG CCAGGGC GCTGGAT ACACGTTGG C CTGAG CAAGTG AGGGAAAGACT ACT C CTG AAAT ATCCTGGGG 

7 0 GGAGTGGAAAAGCTGAG AAG AAAGGCAGGCTGTG CACAGTCCCCAGGGGGTCAGCAAG CCAGCACCAC ACACACTCTGCCACACGC 

ACCCTGGAAAAACCCTGTCAGTGGGX3CTCCAAAT 

AGTCCTCCTCTCCAGCGCAGAGGGAGAAAAATAACAAAACGAGGAGAAAATAAGCTGTT^GG 
TGOIGGGCTGCCCACC^CTCACCCTTACTCACTCTTTCCACACCC^^ 

CTCCCCCGTCTGGTCT AAAAGCAGACGCTGGTG AG CGAG CAGCAGGTTATAAATATG CCCAGGGAACTGCCGCCAACTCGCTCCCT 

7 5 TGTCATCCCCTG^AGGGGCAGGGX6G<^^^^ 
GCTTCCTAGAGGCTACCATATTTTCCCCCCTTTTCAA 
TTCAGATATCTGGTCCCTACCCTCCCTGGGCCCTGGCTG 
CCGCTGCTACTTCTCTTCTTCAGATGCATTTGCT 
AGCTGAATGCCAGACCCTGTGCAC^GGCCCACCTGTGTGC^ 
5 CCATGTGCTCACATCAC CTCCAGCCCC CACT AAGGACTT CCTGGCTTCAACCTGC CAGC CTCATCAGGAAAG CAAAG CTTCAAGCT 

CCTGGTTGTGCCTCAGAGTAGGTTAGGGGGCATGGGGTAGGGGGTGGCTGA^ 
CCCCTGTAAATGAGGGGGTCCTTTCTCTCCTTCCTGCCCAATCAT^ 
TTGGGATGCTGAGGGTACCTGTCCAGGCCACCAGAGCCTCCC 

GGGGCACTTCCACCCTGTGAAGCTCCCAGCTCTCCTCCCCCAGGGCTGAACCCCTCCC^ 
1 0 AGTTAAGGGGCCGTCCCTGGCGGCCCCTCAGCCCTATCGTGGCCCCT 
CTGGTTTTGTATGCCCCCCTTACCCCACCTTC 

CGGGCCAGTGTGCCAGGGCAACC^GGGCCGGACCGAGGCCTAATCCTCCCGCCGCCT 
CCCACTCTCTGGGGGCTGCCCGGCTCTGCTGTGGCX3GAGACACACAAT 
TTCACGCCCAGGTTGTGCGGCATTrGGGGAGTTTAATGAATTGT 
1 5 TTCGGAGCCCTGGTGGGGTAGGAGACACTAAGCCTGTAT^ 

GGCCTGGCCAACAATGAGGCATTCACTCCTGGGACCTCGATGGGCCTGGCTCCCACTCTCAG 
TCCAAACCCGCCAAGGACTACGATGGGCCCTGGAGCATC^ 

TGGCTGGCAGAGCCCTCAGGCCAGAGAGGGACCTGTTTCTGAGGACTTTGGTTTT 
2 0 ATGACACCAGGGTCTCACAAGGTGCTrTTGCAGATCCATCCTCATGTCCCTATAGCCrCTAG^ 
CCAGGCCTACTCTGGGGAGGTCCCTAAACCTACCAGTC 

GGAAAGGTTCTGTCAGGGGAAGAGCAAAAGGAGCCAAGTCAAGAAGGAGCAGTCA^ 
TGTGGCCATGCAGCAACTTCAGGGAGAAACTTGX3ATTC 

TGGGT CAAGGTGGTTGGC C CTGGGGAGCAGGTATGTGTGTAC AAG AAAGACAG AG AG AG AG AG AG AG AGAGAG AG AGAT AAG AAAG 

2 5 GAAGATTTAGGCAGAACTAGATTGTGAACTCCTTGAGGGCA 

TTAACATTGTACCTTGCACX3TAATAGGCACCGTCAAGTGAATGGATGAATACAGA 
TGTGAGCTATTCTCTTCCACAGCTGCACTCAGGCTCCTCATT 
GGTCAGCGGTGGGCACTTCACCAGCTGTTGACACGACAGACCCTGGCrc 
TGACTCTCGGGAGAGCTCCTGGAGAGACGCTAGCCAAGAAGGCAGG^ 

3 0 ATCCTTCTTCCTGTATGTCTGAACTGCATTGTTATGCCAGCGGTCCTGTGGCACAGTTTT 

ATTTTCTGAGAGCCTGCTCCAC^CCAAACITGTCGGGTATACAGAGCT 
AGGGGATTCCACAAGGTAACAAGACC!AACGAGGTTACGATCAAGCCGCAGGAAAAAAAA 
ACATTCCCAGCTCAAAAACTCCATCATTCACCTCTCCCTCAATTCCTCT 
ACAGTCTGGGCATGGTGGCTCACAGCTGTAATCTCAGCACTTTGGGAGGCCX5 

3 5 ACCAGCCTGGCCAACATGGTGAAACCCGTCTCTACTAAAAATACAAATAAAAAATTAGCCAAGAGTTATC 

CCAGCTACTCTGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAACTGGAGCAGAGATCACACCACTGCA 
ACAGAGTGAGACTCCATATCAAAAAAAAAAAAAAAAAAAAACCCAAAAAGCAAAAAAGACA^ 
ACACATCTTTCCAGGACTCACTCITCTGCCCCACTGCATTCCCTAATGCCTCTG 
TCTGCATTTTCCCACCTCTCTGCATTTTCCCACATCCTAGAATGTCCCCCTTCCCT 

4 0 TTCCrrCCTCCATGAAGACTCTTCTGATTTCCCTTGGTAGAGTGGTCTTGC 

CTGACCTGGGCATATCAT ATG C CrfTGTT AAATGGTGAAT AG ACG G AT ACATGAAT AAATGGATGCACGG ACAG AGGAAT AAAT CG 

ATGGAAGTATAAATGGGGCCAGGCGCGGTGGCTCATGCCTGTAATCCCAG 

TCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCTC 

4 5 CCTGTAATCCCAGCTACTCAAGAGGCTGAGGCATGAGAATCGCTAGAACCCTO 

AATGCACTCCAGCCTGAGTGACAGAGTGAGACTCTGTCTTAAAGAAGAAAAAAAGTATAAATGGCCAGATAGATGACA 
GGTGGATGGGTGGATAAGTGGATGTGGAATGTATAGATGGATGGGTGGGTG^ 
CATAAAAGGACTCACAAAAGGCTATGTAGGCTGAGTGCAGTGGCTTATACCCGTAATCC 
AATCGTTTGAGCCCAGGAGTTTGAGATCAGCCTGGGCAACATAGGGAGACCCCX3TCTCCACCAAA 

5 0 GCCCGGCATGGTGAGACGTGCCTGTGGTTCCTACTATTCAGGAGGCT 

CAGTAAGTTGTAATCGTGCCACTGCACTCCAGCCTGGGTGACATAGCAAGACCCTGTCTCAGAAAAAGAAAAAAGAAAA 

CCAGATGTCGTGGCCC^TXSCCTGTAATC^ 

CCTGX3CCAACATGGTCAAACCCTGTCTCTACTAAA 

TGGGAGGCTGAGGCAGAAGAATTGTTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCTGAGATCGCACCACT 

5 5 TGGCAGGAAAGAAAAGAAAATTTTAAAAAAGCCCAGGTAGCACCAG^ 

AGAGGCAGAGGCAGTATCCAGGCCAAACTTTCCTTCACTAACATCTGCACCCT^ 

CCACTCCCCATAAGAAGTCATGGTCTGGGAGCTG CTCCTCTITTCATCCTCCCTGGTC CCTCAGG AATGCTTTCATTAACAT ATAT 
TATGCCATCTTCCTGCACTGCAATGACTGTGGGTTTCCrrCTTCCACATCTCTTTCCCTATATCATACACACC 
CCCTGAAGGGAGGAAGCATGCCTGAGTCTCTTTTGTGTCCCTAGCACCTGGCATATATG 
60 ACTGAGTGAGTGCTGCCTCTTCCCCATAGACACATTTTATACACTTCCTGCGTCCTCTTGTTCAGTTATCTCTCCT 
TCCTTACACACATCACACTCACACTCCTGACCTTGACACTCTGAAATTCCAGAGTTATATGCT 

TGTGGCTGATCTAGGGAAGTCCCAGCCCTCCCCTGAGAGTCTGAAAGAGACGGCTCTTGACTCCTCTAAAGCCATATTAACT 
CCCATGGAGGGCAGAAAGGGGCTTCCCTAGGAAGAAGCTCTCTATTGGGAGGGTAGCAGGATCCTGCACT 

6 5 TGTCTCAAGGCTGGCTGGCAGACACAGCCAGATTTAGAAGTCATTTTCGCAGCAGGGAGCC 

GCCCAGAATTTTGGCAGCTGGTGCTGACTCTATCCTTAGGCAAAGATGGAGTAAGTTCTGGGG 
TCTAACCGCCCAAGCACTGCCAGGGCAC 

HUMAN SEQUENCE - mRNA 

ATGGGGCTCTGGGCGCTGTTGCCTCGCTGGGTT^ 

CAGCAGTGGCCGATGGTGGGGTATTGTGAACGTAGCCTCCTCCACGAACCTGCTTACAGACTCCAAGAGTCT^ 

AGCCCA.GTCTCCAGCTGTTGAGCCGCAAACAGCGGCGCCTGATACGCCAAAATCCGGGGATCCIGCA 

CAGAGTGCCGTGCGCGAGTGCAAGTGGCAGTTCCGGAATCGCCGCTGGAACTGTCCCACTG 

7 5 GATC^TCAACCGAGGCTGTCGAGAAACGGCGTTTA 
CAGAAGGTTCCATCGJVATCCTGCACGTGTGACTACCGGCGGCGCG 

AACATTGACTTCGGCCGCCTCTTCGGCCGGGAGTTCGTGGACTCCGGGGAGAAGGG^ 

CAACAACGAGGCAGGCCGTACGACCXSTATTCTCCGAGATGC^ 







C^IACCTCGTCTACTTCGAGAAATCGCCCAACTTCTGCACG 

ACAGCTCGTCGCCCGCXSCTGGACGGCTGCGAGCTGCTCTGC^ 

TGCAACTGCACCTTCC^CTGGTGCTGCCACGTCAGCTGCC^ 



HUMAN SEQUENCE - CODING 

ATGGGGCTCTGGGCGCTGTTGCCTGGCTGGGTTTCTGCTACGCrGCTC 

CAG CAGTGG CCG ATGGTGGGGT ATT GTG AACGT AGC CT CCT CCACG AACCTGCTT ACAG ACT CCAAG AGTCTG CAACTGGT ACTCG 

AGCCCAGTCTGCAGCTGTTGAGCCGCAAACAGCGGCGCCTGATACGCCJ^ 

CAGAGTGCCGTGCX3CGAGTGCAAGTGGCAGTTCCGGAAT<XCCX3CT 

GATCGTCAACCGAGGCTGTCG^GAAACGGCGTTTATCTTCGCTATCACCTCCGCCGGGGTCA 

CAGAAGGTTCCATCGAATCCTGC^CGTGTGACTACCGGCGGCGCGGCCCCGGGGGCCCCGACrGGCACrG^ 

AACATTGACTTCGGCCGCCrrCITCGGCCGGGAGTTCGTGGACTCra 

CAACAACGAGGCAGGCCGTACGACCGTATTCrCCGAGATGCGCCAGGAGTGCA^ 

GGACGTGCTGGATGCGGCTGCCCACGCTX5CGCGCCGT^ 

GGCAACCGCGGCAGCAACCGCGCTTCGCGAGCGGAGCTGCTGC^ 

CGAC CTCGTCT ACTT CGAGAAATCG C CCAACTT CTGCACGT ACAGCGGACG CCTGGGCACAG CAGGCACGG CAGGG CGCGC CTG T A 
ACAGCTCGTCGCCCGCGCTGGACGGCTGCGAGCTGCTCTGCT 

TG CAACTG CACCTTC CACTGGTGCTG CCACG TCAGCTGC CG CAACTGCACG CACACGCG CGT ACTGCACGAGTGTCTGTG A 
Table 16

MOUSE NOMENCLATURE
ICSGNM Rasgrp1
Celera mCG14557 


HUMAN NOMENCLATURE 
HGNC RASGRP1 
Celera hC638304 

MOUSE SEQUENCE - GENOMIC

CACTTTTCAAATAACATGAAAGTGAAAACCTG 

ATTCTTAGATCTTGAGACAGAGCAAAGAACTGCr^ 

GCTTACAAGACACTTCAGAGGTTAGCACCGACAACCATCT 

ATGACTGATGGCTGAGAAGCTCAAATGTTTTTGGAT^^ 
GCTTCTCAC^GCTGTGTTCGGACAAGTGACTCGACCCCTCTAAGGCTTAA

GAAAACCAACACAGTGTGTTGTGAGGGTGAATGAGGTC^ 

CrCAGAAGGAGGCTGGAGAATCGGAAAACAAC^AGGTACAAATAAATATTC 

AATATTCTGCTCCATCCCAGCACCCAGATCAATGCAAATTAGCATG^ 

TCCTACTAATCTTGCCTGATGGTGTAACTGACTTTrCTCAGGGATCC^ 
2 0 ACAGCTCGAAGTGTG AC CTGGGACAT AGGGG ACATGTT ATG AAC CAGGACAAGC CCTTGGGAGTTGCT ATG C CAGCAGCT AGTT AT 

GGGGTATGTATGGCTCTTTGGGCCTCTTTAAATCtAGAGTCTAGTT^ 

GAGTTCTTTGTTTACACGTTGAGGTCAGGACTGTACTATGTACCCCAAGCT 

TTCAAGTTAGGGTTTTTATATTTTATTTTTAAGAGCA^ 

(^TGTGCACAGTACCCCTCTCCAGAGTGACACATGTGTGAGA 

TTGGAGTTCTATAAATTTGAATGACCATTCACTGTCTAGTGTCCCCACACACAT^

GAAAAGTCTGCGCTACAGCTTTACITTAAAAAAACCAAATACCAGG 

TGGCAGAAAGGAGACAAGTTTATGATCTATATTGAATCCGAGGCCAGCCTTGGCTCATGAGACCCTGTC 

TAAAAATAAAAAGAATGTGTATTTAAGCCCCCTTTCCACCTCATATACCACT 

AGGGACC^GGAAATGGCTCATGTCTAAATCCACCCATGACAACT 

GCCAGTGGGTCCCAGGCTCGGGACTCATAGCAATCACAATATCCACAAAGC^

ACACAT ATTT AGGG TTGGAT AG ATGACTTGGTGAAT AAAC CACTTG CTGT CTGAGC AT AAGGACCTG AGTT CAAGTCCC C AG AACC 
CTTAGGAAGATGGTAGCGTTTATCTTCAGTCCCAACTCCTT 
GATC^GGAGCCATCTAGCTCAATTTACACAATAGAGAGCAGCAATGAGAACT 
CCAAGGTTTTCCTCTGACCTCAGATACATGTAGGACACCTGTACTCA 

GTACACACACACACACACACATGCACATGCACATC

AAGAATCCTCTAGATTATATTCACCTTTTTGCC^CTGCAGTTACAAAATGT 

TGCTGCATTTAAGAGCAGGGTCTTAAGACACGCAAATCGGTGACGT^ CTACTTTG CCTCTACTCTTCCGATTTCTTCCTCT 

TCCATTACCAGGCAAAACTCACAGCAACACTGGGTTCTATGGGCT 

AAACTCC^GCAACAATCACACCGAACTrGTTTATTTCATGTGCCTATCAT 

4 0 TACAAACTAACAATGTCACAGCAACTAACAGCACAGTGGGGTCA.CACCTGCA 

CCTCTGGGAGCGGCTCTCGCTTCAGAGGCTCTTGTGAATCACACACAGCAGCACCATCA 

TTCACTGACT AACG AG CAGG ATCAATGACTGAACACTT CACT AACAC CTGGT CGAGGG CTGAT CT CCAT CCAC CCT GGGAAG CTG A 

AATGAAGTCTTCTCTAGCATTTCCTATCTAATTCAATACAGAGGAGGGGAAAAAACA 

TGACAGTATTTCTATTACATTGAATTTTGTGTGTGGGTTGATACTGAGAGATTATTTCCACCAA 

4 5 TTAGCAAACTAGGCAGTCATGCAAACTAATCTAGATCCATCCTTGCAACAAACTGTAGCTCCCATCCG 

CAATGCAGTTX5TGATTGGCACAATCCAGCTCACATTGTCTCTATTCTGT 
ACTCTCTGCTAACTGCCTGCCTTAGTCTCACCTAGAGTTTTCTCT 
AAATGATCACTCAAACCCTATGTGGATTTTCATGGGAATATTGTCTGCCCTTAAGAA 
CAAACTGAATTGAATGAGAAAACCTTAAGCCTGTAAAGGATAGGTAC^ 

5 0 AAGGTCAAAATCAAGTCATTTGTGGTTGTGTCTGAAGTTGATTCCTTTAGGTTTCA 

ATGATCCCCACCACTGAAGTAAAAGCTGGGTGAAGGCACATAGGTACCTGCGTCCCCCGTGCTTTAGGGTGG 
AGGACGTCTCTGGCCAGCCAGCCAGGCCAAAAGAGAAACATCAGGGTTAGTGAGAGACCCTGGATTGAG^ 

ATCCT CTT CTG AT CACAAC CGCATGTGAG CACATGGG AACACACACATG TG T AT AGT AC AAACACAG CAAAC CT ACACAG AGACAG 
ACAGAOVGACACAGAGACAGAGACAGAGAGAGACTTCCCTT^ 

5 5 AGTCTTTGAGAAATCTCTCCCCACATTGATTTGCCATTATTTTCTCTGCTACT 

TTTTGTTTCAATGCTGTGTTTATCACAGCACTTTTTTGAATGCATATT 
GGTTGGCTTTGTTTTGTTTCTTTGTTTTTACCCAAAAAAAAT^ 

TGGATCTTTCT AT AAACTG CACCCCAC C CC AG CATTT AC C CATG CACCTTTG CAG GT G AT AAGTG ATT AAG C AG AATGAGTTT ACT 
GAGGTATACAATACGAACTTTATAAATAGTGTTTGTGTGGCCAA 

6 0 CGGCCTCTGG AAAG AGGCC CAGTATTGAATTTT AATCTCTTCAGCCCTCTTTTT AGACCTACT^ CC CAGG C ACCCACTGG CAAA 

CCTGAGGGTGTGTGCTAATTGGATCACACCCAAAGGGGAGAGCAGGTAGAAGGAGCT 
CCGCCCTGCCAAGACAATTGAGAGGCAAACAC^CTGATACACACCAGTGGGTGAGGGGAGGGTC 

TGCTCT AACTCTCCCGAACCCTGCATATCCCTTCTCGTTAGCATCCCTC CTTTGTTCTCCTCTCCC^l CTGGTCCAGCT CTGTT 
CCTGTGATCCTTTGTATCCATAGCCTGGCCCTTATGTCCTCTG 
6 5 G TGTTCAC ACAGGATGG AGCAT C CACCCTGAG CAT AG AGGAGG CAGT CAG AG CATT AAG AGT C CTG CG TCTGGGGCAAAGTTCAAG 

CirCAGGCTCTGATTCTTTAAAACTCTTCTGAGATAGGCATTC 

CCTTGAACAAAGTAGACTTACCCAGTTAGCATCAAGACC'G^ 

TGCCTGCTAAAGCAAAAGTCAGGTCCCAGCCCTCCTCTGTTATCTCCCTTCAGGAAGCT 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNTTCTGTCTCC!AAGAATAAACATC!ATGAGA 
TGGGGTGAGATGATATGGTATG<3TGTCATGCTGTGTCATT 
TTATATTTAGTACAGTTTATTACATTAtCA^ 
TTAGAGACACTGTTTCTGCCTATGTCCTCTATAGGGTAAAT^
ACAGCAATTTCOUlGCAGCrAACAACAAGAATC 

TTACCTTCATAAAAATCCCCrrTGAAGACTTGGGGATAGATCAGTTAGTAAA^ 

CAGCGCTGCATAACCCAGGTATATCAGTAGGTACCTGCAATGCCAGCAGCACTCT 
5 GGTCAAGGCCATCCTCAGTGACATAGAAAATTTGAGGCCAGTTGGAG1TACACGAGATTTA 

TTCTCTGAAGATACCTCAGAGCAGTGGAGCAATGACAAGCCCTCCT 

GACAGAGTCCAATTCTAACCCCAGGTCTGAAAGAOVGAAG 

AGGTGAGTCACAGCAAAGACCATATTTACAAGCCTATGGGTTTGCAGAT 

GGCCTAAATTCAATGTTATTGTCATTGTTGTTGTTGTTGTTGTTGTTAGTCAAA^ 
1 0 AGAAGAAGTGGAAAGACTCTAAGAGCCAGGAACACTGATGACTCCAGAG 

GCACGTGTGAACTTAAGAGACTAGGACAGTACACATAGACCTGTACAAGTTCAAGCCAG 

GTAGCTACAAGGACCCATTTCTAACACAGAAGCTATTTGCTGTTAATACCT 

ATCCACTGGGTATATTAACCACGCTCCAGGGCAGGTCTCATGCTCAGGAGCAGGTG^ 

TGTGGGTCACTGAA T TT ITT GGTTTGTTTTGTCTTTCTGGTT^ 
1 5 GGTGGGTGTAGTAAGATCTAGAATCAGTTAGGGGAGGG^ 

TAAATAAAACCAAAACCCACGGTTCAGGGGCTGGAGCAAAGGCACA^ 

GT CITTTTTTG A TTTTTTTA TTTTAATTAGC^ATT TT C T TCATTTAC^ 

CACTTACTACTCTrrGCAAAGGACCTGTGTTTGGTT^ 

AT C CAAC AC C CTCTCCTG AC CTCTGGGGG CATCAAGCATG CACAAGCT ACATACACAT ACATG CAAGCAAACAAGG AAAACTGTGG 

2 0 AGCAACCAGCCACAGTCACATACATAATGAGGCCACGTACCCTTGCCT 

GACCCCGTTTCACCCTCTGTCTCTTCACTCTCCTCTAGAGCCCAGTGCCCCATGG 
TGACTTGGATACAGACCATGGGGGGGGGAGCTCATCTTGGACATTTCAA^ 
CITAGATGGGAGAGATTITTAAAGTCACAACATATTTATATGT^ 
GAGCTCATGAACACAAGGACCTTGCCTGTCTTGTTCTCTCCAAGACT 
25 HNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNGCACGGCAC<3AGTGA 

GAGGGTCCATCCATGAGAGTACACZAATGCTATGATATCTAACTCT 

TGGAAACAGAGAAAGAACACAGTAGTGCAACCATGCAGTGGTGT^^ 

GGGAATGCAGTCTACCCCATAGATCCAGGTGGGGGATGATGGGAATACAC^ 

3 0 AATACAAGGGAATGATGGAGAGAAATGTCTCTACTAGATGCCAAGTGTAGATACTCAGCG 

TGAAAGAAAAACGAAACCACCTCACTTAAGCCTGGATCATATAGAAAAGCCAA 
ATATCCMAAAGTAGGGCTGCXK5TTGCAAAGGAGTCCCTCGGGTGAGCAAAAG 

AGAGG AAG AAAGACTG AGTGGAT CCTGG CTG C CACAGAGACATGGG CTGT ACCAGTG AG AAGGG AAG AAG CAG C AGG AAAGGGGGC 
AAGATGGGGTGGGAAGAAGGAGACAGTTGGTGCCCAGTTGGAAGTGCTTCATATGC^ 

3 5 AGAGGGAAAGGAAGCTGACATGAGTTTTAGAAGGTTA 

CAGGTGAGTACTATCTGGATCTATCAC!AAGTATTCCATCATTTTGGGAA 

CAG TG ACG CCACC CTG CAT ACCGTCTTCAT CT CT ATGTTTGC CTGT ATCTGTT CTG C CCTCATCCTGCATG AC C C CG AGG TG ACAT 

CACTAATGCTCXSGGAGTATCCTCAGGAGAAACGTCTTTTCAG^ 

rn ' CT TTTTGTTTTCTrC CrinTn GGGGGGTGGGGGTGGGGGTGTCTrT 

4 0 TGGGCTTGGCTGAC C CCTTTG AAG GAAGT GTGT AGT AG ATTCTG TTTTC C AAATGTGT AAGCAT ACATTGG T ATGTAATTTT AG AC 

TGCTCTCCACTTCTGGCCCCATTTTATCG^CCTTGTTTTGTTGCTGTTGTTTCTCATACTT 
AAGTGATCCTGAATTCTTTTCTAGGACAACATTGCACATAA^ 

CAG AACGTTG TGG CTGGG AAG AT CACGGG ATG TGAT AG CAAT AACGAGGGTGTT ATT AGGCCAGTCCAG C AGGGACTT AG AGACG C 
CAGGCAGAGCTCAGTCTGGGCCACCTCTCTTTTCGGTTTACT 

4 5 TTTGCGTTTGGAGGGGTGGCTTAACGAGGTAACATGTAGGAA^ 

TTCCAGACAAGTAGCAGGTAGGTCTCCCCTCTTGGCCAGGTCTC 
CACTCCCACTTGGCTCTCTTGGTCTACCACJICCCACAAACTCCCAAGCA 
GCCTCCTCCGTAGTGCAGTGCACACAGCTAAGTCTTCTCTTTTT^ 
GGTTCrcCCTTCCTTGCCCAGACCCGACTGTGTACCCTCACT 

5 0 GAGCCCCCACCCCCACCCCCCTAGGGGACTCGTAGCCTTGGGTGGAGGCAACTCITCGCA^ 

CAACTCCTCGGGGCGCTACCAAGTAATGGGGGCAGGATTCGAAGAAGAACTGAGGGCGC 
CATCAACGCCTGTCAGGTt^TTGGTGCGAGAGAGTGTGTGCGCGCGCGGGAGCGAGCGTAGGGAGCGC 
CCTGCGGGTGAGCCTGGGTCGGCTCGCCGGGCTCCAAGAGTGGTCGGCTCCGGGCT 
5 5 GGCGCCCGGCCGAGCCGATCCCACCGCTCTAGGTGAGACGG 

CGAGAG AGGCTCCGCGGTGAGTAGCTGCGTCT CCTCGG AG AG CG CTG CACCTCTACTGGGGAAGG AT AGCTG AGCGGTGTGCCCTG 

TGGTCCTCTCCAGTGGGTACCAGGAGGGACTAGACGACCAGGGCGCCTCCCTCGGAGGAGAGTTGC 

CG<KX3ACCTGGGAGCCAGGTCAAGATAGGCGTGATACAGAC^ 

TGCTGGGGTGGCGAGGCCACAGGTCACCTGGGCAGATCTGGAGTGGTCTGAGCCCAG^ 
60 ACAGTCGACCTCCCTGGATGCTCTCGGTCTGGGCACCCCCCGGG 

GGGCGCGGGTCTAGGACAGCGCCACCTTGCGGACTCTCCTGGCCTTTCAGGAGCGACT 
AGTGCTTGGCTGCCCTTTGAGGAAAAAGAAGTTTCCAAATCAATGCT 
GTTTGGGAGACGGAAGGTGAAGGAAGAATGGCTATGAACAGTCTATCTCTTC^ 
GGATGTGAAGAAATTTAGTGTTTGTCCTCAAAACACCAGGACCCCGTA^ 

TGTTGGGCCATATCATCCCCTGAGTGTTCT^ 

ATTGGCGGTGTCGGGCAGGACTGGAAGCTACACTCAGCAGAAATGG^ 
GTCCCGGCCCTCCTCTGTTTCCTCCITTGCCCTGTGGATAACTG 
CCTCTCCCCAGGCTCTGTGGGGTGGGGAAGGCAGACGCTAACTACTGCAGGGCTGAG 
7 0 CAAGCTTTTGTTTTTGCTGTTATTGTTGTTATTTTGTTTTATT^ 

AAATCATCACTTTGTAGAATTAGAGGCTGTTCACTCTTCAGATGGTTAAAGCACTCT 
AATCCTTACCTACAACCATCTCACCAAGAGGTGGAGGGAGGGGAG^ 

TGTTTGCCTGGCATGCAACAATCCTGACTTTGGTCCCTAGCATCATGTAAGAAAGAAATTAAGAT 
TGCATCITGGTGGAGTGATAGGATTGTTCGTTTT^ 
7 5 CTAATACaGAGTTTCAAGCC^CaTAGtA^ 
CT AAAAAAAATT CTTTTT AAAAAAAATGT AAAGAAAG AAAGAAAAAG AGAATGTGAC CTGAG AGTTTTTCCAGGG CAAGCAGTCAA
GAGACTAGACAAAGCTTACATGTGCACAGAGTGCTGTTGGCT* 

TCCTCACTGGGCTAGGTTTAGGTAGTTGTC'l I'TITrCTAGGAAG GGAAGGG AG CCAAGCT AGGAAGTGTGG CTTGGTGOT AAGGAA 
GCTGGCTGCTAACCTGCTGTGC^TCTGTGGCCAATATATCTCT 
5 ACTAGCCAGCTCTTGGGTCCTGGTGACCATTTCAGGAACCTACTAAACAAW 
TCTTTCTGAAGCACACGGTTGTTATACATTGAAT^ 

ACACACAGTTT CATGTAAAAAAAAAT AATAATAAAG C CT ACTAGATGAAT AAAAG CAAGAGGGACCCCAG AT ACTAGT CCCTGAGA 

TAGCCCATATGAACAGCCTCAGTATGTGTAGGACTCAGTTTCATATCA 

TGCAAATTCCTGCACAGTATCTTTGGAACATt^ 
10 TGATCTGACAGTCTGkGGGTCTGGATAAAAGACGGTGG^ 

AGTCGTTGGCTTGGGTTAGCATGTTCCAAGTCACCAGG^ 

CACAGCCCGATGCTCTGAAGTTGGAGCAGAACTGGAW 

GGGTACTGTCTTTGTGCAGGCTGATTTCTACAAACCCAGG 

TGCTTGCTTGCTTGCTTGCTTGCTTGCTTGCT^ 
1 5 GAGGAGCTCTTCCACCCAGCCCCTGGAAGTGCTTTGCCCTCTGCTTTGCCCTCGACAATC^ 

GGTTACACTCGCTGTCTTGGTCAGATTCCCCTACTAAAGCTTTGT^ 

ACACGCTATGGACATGTATAATTTACAGAGAAAGTTACAGAAGCCATCTCACCACGGTCT 

CTCATGGGAATAAGTATGCGGCGAATCGATGCAGACTTGTGT^ 

TTTGACAGTTAAACT ATACATT AG CCAAGGGTGCAATG AGAAAAGAACT AAGAACATGCAAAAAAAGCTACCAG CAAAAATGCCTA 
2 0 TCATTATTAAACCTTCAAAGATTATAGACGTGGGTCTCTGTGTGGTTTTTAATAAA 
TTTGCATGGTGACCTGTATCAGCCAGATCACGCCAAGCTAAGGATGAG 
TCCAAAATGGTTACTTTATATTCGGTTCTAGAGCAGGCA^ 

AATGTAAAGACCCGATGGAATACTGTTTACTGGTCTAAGTGATTCATAGTGGrrCCTTCT 
AAACCAACCAAGATGTGTAAGAAATGCAGAGATGACTGAAAAATCTAAA 
2 5 TTCAGGGGACCTCCTGCCCTTCTGTATCTTTGGCATTGCTTAAAT 






AGACGGCAGCATAJGAGCAGTCCTTACACCAATGGGGAGAAGGCTATTGCTGCCCT 
GTCTCTTTAACAGCCAACCAGTCTGTTCAGTGCTTITGCCAAGTGGGAGG 



CCCAGCCCAGCTTGGCCCAGATCACCCAGTTCCGAATGATGGTGTCCCTGGGACAT^ 
ATTGACAGCTCCATTCAATCTTTTGGTGAGTTGGGCAACTGCCC^ 

AAAG TG TCTT AT ACAAGAGCAACCT AGATTGT ATTTTG CCTTTAA^ CITTAG AG CTTTTGAAATGTTGGAG ACAGTCTGTA 
GCGCTTACGGCACATGACTATGTGGGTGGAATAGGTCGTGATGAGAAGTACAATTTGAAGTGAGA 

3 5 TTTTCCTCGCATGGTGGTTTGTCTCATGGTGAGGGGAAAAAAATCA 

TCGAAAAATCCITAAGCCTGTCTCACACCGCCAAATACAATT^ 
CCCTTACAATCTGGACTTGTGCrGTGTTTGTGACTAGAA 

ATGGGCACATAGGCCGGCAGCGGTGTATGGTCCAGGTGCTCCCCTCTGTACTGCCAACA 

ATCCAAACA'mTCl f TTG ATGGTTGCTTATG AAATGGT GACTTG AAAGT AGT ACT AAAT ATTGGGTTTGCTTC CAAAAGGGCAGG 

4 0 ATTAAAACCGACAGATTTCAGTGAAACCCATTAATTCAGAGATGTTTCAGTGTC 

TGTGCTCAGCTCTCAGAAGTGGGGACTGTTGGTGTTTAGGCCC^^ 

TGGGGAGATTCTTTTTGTGCAACTGCCTC^TTTTGGTTTAAA^ 

TGATTTTTTTTTTTTTTTTTTTAGTAATC 

4 5 GTCCAAAAATTTGCATTTCTAACGAGGTCCCAGGTGAGGCCTTGACAGCTGCT 

AGTTCTGTTGGCTCrrAGACTTCTTGACnTTGAAAGAAAGCAGATTC 
GCTTGAGTGCAGGCTCCATCCCTGAAAAAGAAAAGGCCATTCATTCTGCTCCCTG 
TGAATTTTCTACTGCTAAGCTACATCT/TCACCTCCAGATTGAATAA 
AAGGAGACAGCTAGAAGCTGGTGGAGACCAGTGGCCATGTGGTAGTCATT 

5 0 TTCAGTAGACCTGCAGAGAGC^TAGTGTAACTATTGGATGAA 

AT(ZACCGTGTTTATCATGTGAACGCT(nTCTTTTTCrACCAATCA 
AATACCTGTAGCTTCAACCTCAAAAATAC^TGGCC^ 

CACTTGAGACTAAAGCTCCTCAGGCCACTACCAAGAAGCATTTATTAAGTCTGTAAT^ 

5 5 AGAAAGTCACA.GTGACCAAGTGCCTGTTTCATTAGCA.CrrTATGA 

GACTC^CACATGCACATACCTGTGATAATAAGTTAAGGTGTGGTGTTTGGAAAACACA 

CACACATACACTCACACACACATACACTCACACACACATG 

TC^CACACACACACACACACACACACACACAC^CACAC^CACA 

TAGGTAGKK5TTTGAATCAGGGCACTTATTGATAAGGACCAGTCATCACCTCACATACCCT 

6 0 CCCCGAGCTTTCTGTATTATATGTTACAGAATTTCCCATCACAGAGTTTACATGAAAGAC^ 

TAGAAGGCTGCCTTTGTGAGTGGCCTCCTCTTGTTATCAGCTGACCTAGCTCTTGT 
CCGGTTTGAAGATTAAGGCAACTTGACTAATTCCTTTCTTTACT 
AGTATTTCAGGAGGCTGTGTACACAGCACCCACAGGGCCCCCCCCCCCCC^ 
TTAAAACAAAACCAAGTGTAAAGTTTAGTTGATGCCACAGCAGCCAGC 

6 5 GAAACCGAGTAAGCAGACAATGCCTGTGGTCACAGAGTCTGACTrcGGTACT^ 

GGTAGGTAGATTTTTCTGGAAACTGAGGTTTTATTCM 
GGGCATTGAGACGTGTATACTCAAAGCTGTCT 

AAGGCTGGAATAGAAGGCCTTTAAGGATACGAATGCAAATGAAGGTACT 
TTACAGATCTGTGATTAGAAACACCGAGCCTCCATTCCGTGGCrT^ACCCCTGCA 

7 0 AGGGCATGCTACCCATCCCGACTGTGGCTGCrAGTTAACGCTATCTGAGGT^ 

ACAGGTTAAAGGCCTATGAGAAGCACTTAGGCTTTTC^ 

AAAATGGAATACCCCAAGTTACAAGGTGCTTTCATGTCTCCTCTCTGGGCAAGAGAA 
CGTACCTCAGAAAGCA.CGGTAGCAAGCAGTTCCTCTGTGTCCAGATCTGGATGTCAGGG 
TATTATATGGAGTGTTACTTTTTTTTCTTGGTGTTCTGTATGTATTAATTTTTCT^ 
7 5 AGTATTATTATCTCCATTGGGTGAACAAGAAGAAAAG7GGGA 
AGAAAACAGGCTGC Cm - r rC I TCTGCCTCTACCACCAGCAGCTATTTGTACCCTGAGCTCGTGCAGATGTGCCTCACTTCCCAAG
ACAGGCCTCATAGCCTCAGAAGAGATTTCAGATTTTGTGTTGCACAAGGTAGGCT 

GTCT CCCTCAGTT AG AAACCACATT CCX3 TTCAGTG CAAGAAAT AGT CAG ACTTCAC CTTT AT ACCATC AACTGTCTGGTTGAAGTT 
AATCGAGTCAAACTTATATTTGTAAGTTTCTATGTGTATGCATTGTAOT 
5 GAGTAGTTTCTGTACTATGGGCTAGTTGAATACTAAGCTCCTTATAGACCO 

CACTGAGGCAAGGTGTACAGGAGACAGGCACCACACCACGCTCTATGAC^ 

AGCCACATATATAAGGTCACATATACAAAGCCTTACCTAGGGCTGATAACAACCCCCGAGGGTAAG 

TCCTCATGGAATTCCCCAGATTTACAAGGTTTCTCGT^ 
1 0 GGGCGTGTCAGAAGTGCCCAGTATGTGGAGAGAACTGGC 

CTCTGCTGGGGTGATATCAACCAGTGCCCCTGCAATATCTTCCCTTTCATGTC 

GAGATGGTTTAACCCTCTTCACCTCAAACTTTCCTGTGT^ 

CTTCAGACGATGATGCCTAAGAGCCACAATTTCTCAGGAGTTCTGGGTAL'r 

ACTGCTGACTGAAAGACAGAGGGTGTGGAGTAGCrrGAGCACTTATGGCT 
1 5 GACCCTGCTGGGCTTCTGAACAGCACTGTGCCTTGTTAGTTTCCTCTCTT 

ACACATGCCTGGCCCAGCTCArrACTGGCCCAGCTGCTGAGACCAAAGCTCAGGT 

GAATAGAGTCTCATCCTCTGTGAGTTTGTCAGACATGCAGCCCCTCA 

CTCCITGGACATCCCAGAACATGTTCTTAGGCTTTCTAGGAAAACAA^ 

AGGCTTTT CAATTT ATAAACCG AT AAAG CTCTCGGGG^ CATGAAAG CGGT CCAGCAAGGTTTAATTAGAG ACACT C 

2 0 TTTGCCTGGGAGCATCTTGGAAATCCCAGCACAAGCTA 

GG GG CAGT ACTTCATTG CTCTC CAAGAC T TrrCTT CAGGCTT ACATTTG CTGCTGTGGGG AGATGGTCAG AT CCATTCCTGATG CT 
ACAGTAAGGTCTCCATTCTTGTCTCCAGCCAGCCTCCTGGACTCAGAGGC 
AAGCCCTGAACAACCAGCCAGATGAACCTGTTTGCCAGTAGAT^ 
TCTGTTTTGTTATCAGAACAGGGGCTGGCCCTCTGAGAGCCTCGGAGAAGGGCATCT^ 

2 5 ATTGTTGCCTCATGCCTGAAGCACTTGGTGAAACTAGGTACTTCAATGCC^ 

TAATT AATT ATTAAT AATTT AATG ATCCTTG CTG ATT CAACT AT AATGTTGAAAGGGGGG CAT CTTTTCAT ACCT ATTC-G ATCTT A 

TGGGCCTGTAACATATTATATTTTTTATGAACATGGAAGTGTGAGTATGAGAGTGGAGGGAGATG 

GG CAT ACTGGATGATT ATCCAGCACACAAGATCTAACCTGTT CTATTTTGT CT AATTT^ CAAGC CAGTTGTT AAACAT AGC CAC C 

3 0 ATTAAAGAGTCTGTTATTTACATGTGTGCTGAGTAAGTTCATAGAAGACA 

TGTTTACTGTTCTGTGTGCTTAGGAGGGTGTGTCCATCATGCCT^ 
GCAGTCCAGTGCAGAGCAGGGCAGCGCCCCCACTGGTCCA 

ACCTTGGAAATAAGCAACATTACTGCCAGCCCAGCCTCCTCAAACCCAACCCCCT 
ACACCATTGTCAGTACTAAACCCCGGGGCGCTGTCAGAGCAGAGTCTAGAG^ 

3 5 ATGACTCAGTTAGGAGTAAGGCTAACATAGTTTTTGGGTCT 

ACAAC^CAAGATGAAGAAATTAAACATCAACAAAGCTGGGATTTTCTTA 
TTGATTTGTAATTGAACCAATTACATATATATATATATATATATAT^ 
ATTTTTTTTCCCCCTTAACTCACCTCCACACATTCTGCAGGAT^ 
ACATGTATGACCCATCTCTATCTC-CTTAC-GGCTCCCTCTGCAGT^ 

4 0 GGAATGAACACACGAGCTTTGCTAAGCAGGTGTTTITT 

TCCTGATCATGAACAGTCTCrrCATACCTCTTTGTTATCTCCrc 

GCT AGCATTTG TTGATG CCTTTCTCCT ATTTT CATCCCTTCATTATTTTTACTCTTAOVATACGTAGAAATTCC^T 
AGTTAAATTTGTCTTAGCCTTTTAACCCTTTATTAATTCTAAA 

ATTTT AATTAAAATGAAAATTATTTT AATT CATTCAGTGTATATAATATGC^TTATACAGTCT^ 

4 5 AAGTTATTGCTGGAGGAGAGAAAGCCTTTGGCTTTATTTCTG 

CTGCACACAGGTCATTTTGGCACTCAGAACCCTGCTCTCTCTGACTCTGTCACAGATC 
GTCTGGAGGAAGAAATAGCGTGATCTCAAATATGATATTGAATGTGAACTTGAAGGAAACCCTTCAGAACT 
TTTAACTGTTTATAGAACCAATAGCCTTAGAAGACTAAAACTCTCG 
TGAACCTATCTTGCTCCTAAGTAGGCCGTTTTGTGACGAGATACATAAAACAT^ 

5 0 ATTTT AGAAATAACCAGCCTTTGGATAATTATTGTGAACAGTACCGTGTTTATGTTAA 

CTTATGTTCCTATAACTACAAATATTTAACrrAATACTCAAAATTCATATTTTTAAAAAATT^ 
TTCACTCTG^GAAACTTCAGAAATACAGTTTAAAAATGAAAAGCATTGGGGCT 
GTTCTTCC^GAGGATCCTGGTTCAATTCCCAGCACTTACATGGTGACTCACAACGAT 
CCCTCACAAACACACATGCAGGCAAAACACGAATGCATGTAAAACAAAATTAAA^ 

5 5 ATTACAGTTCTGTATCCTACCCCCACAAACAGTGTTAGCTATGAACATTCTTGGAT^ 

TGCGATGAACATTACTACX5TGCCAAGAAGCX3TGCTAGCATACAAGGTTTCT 
TGGCATAATTCCATCAAATATGGGACATAAAATTGAGGGTTGCTTGGCAAC 
AGAGAAGAAGAATACTCATTTTTCCAATGTAAAGTGCTAATTATA^ 

6 0 CAATACTTAAACTAACTGAAAATCCAAGAGGCAAGCATTCTTCT^ 

AACGAAGAATAAAATCTCTTGCCTTCATTATAACTTCTCTAAGTTATAATGAAGGAGCAGTGAGGTGGG 
GAAGAGGCTGAAGTCTGCTCAGTCCAGCTGCTTTTGCAGTGACTGTGAGGTGGC^^ 
TCCTTTTTAGTGCAGAACAGGCCAGGGCTCAGGCACTGAGCATCTTG 
TGGAGCTTGAACAAGGGGCACACCAGTGATTGACTGAAATGAACAAAAGAACA 

6 5 AGTACTAATTTTATCCTGAAGGAGAACACGGATTCATAGGAGCAAGATGGGT 

AAGTGAGAATTCCGAAGTAGAAGAGCAGAGAGTTCTTCTGTCTCCT 

GATCTTTTGAACAGCCCCATTTTAAAGAGGJ^CTGAAGCACTCAGGAAACGGG 

CTTGTTAACATATAGTCACACCCTACAGAGCACCCTTGAGTCTGTTGAGCAGTTGGCT 

7 0 CAGGCCTGGGCCACTCCAACCACACAATAAAGCAGT^ 

TGGCTACCCAGCTATCTTCAGTTCCTCCTAAGGATGAGTCCTCACTCTGTAATAACT 

TC CCCTT ACTT ATTG TG C CTAT AGTCT CC CG CACTG CCTG CGTGC C CAATGGCTT AG AGT CCT ATT AC CTTGT CAC CAATCACAGG 
TAGATTTAGCTAACTGAGGAAGTGGACCTTCATAGTGTCACAC^GAAAATATGAA 

MOUSE SEQUENCE - mRNA
C<;ATCCGCGGCCATGGGAACCCTGGGCAAGGCGAGAGAGGCT^

ACTAGAGGCCAAATCAACGAACAGCCCTCTCCCTGCCCAGCCC^ 

GACATTTGGCCAAAGGAGCCAGCCTCGATGATCTTATTGACAGCT 

AACCAACTGTTACAAGTCATGCTAACCATGCACCGGATCATCATCTCCTCAGCCGAGCTC 
5 GGATGCCCTGGAAAAGAATTCTCCAGGAGTTTGCCn'GAAGATCTG 
TCAAGATGGATGCCAGCTTGACCAGCACCATGGAAGA^ 
ATCGACACGACCCAAATTAATTCTCGAGACTGGTCCAGG 

GTC CCTGCTGTTTGACCAT CIGGAACCTG AAGAACTGTCTGAACAC CTCAC CTACCTTGAGTTCAAG TC CTTCCGACGGAT ATCTT 
TCTCTGATTACCAAAATTACCTTGTAAATAGCTGCGTAAAAGAGAACCCCACCATGGAGA 
1 0 TCCCAGTGGGTACAACTGATGGTTCTCAGCCGTCCCACCCCGCAGCTCCGGGCAGAGGTCTTCA 
GAAACTCCACCAGCTACAGAACTTCAACACGCTGATGGCTG 
CAAGTTCACATGTCCCACATGAGATCAATAAGGTTCTGGGCGAGATGACTC 
AGGCGAGCCTATGGGGAGTGCACCCACTTCAAAATCCCCATACTGGGTGTC 

GCCCGACTATCTGGAAGATGGGAAGGTGAATGTCCAAAAGCTC CTGGCCCTTT ACAATCATATCAATGAATTGGTCCAGCTG CAGG 
1 5 AGATGGCCCCACCATTGGATGCCAACAAGGACTTGGTGCATCTGCTGACGTTATCCCTG 
TATGAGCTTTCCTATGCCCGGGAGCCGAGGAACCACAGAGCr^ 

CrCTGGAGTGTCTCCCAAACCTGACCCAAAGACCATCAGCAAACACGTCCAAAGGATG^ 
TCGACCAGGATGGCTATATCTCTCAGGAGGAGTTTGAGAAGAT 
AGGGAAGGCCTCATCAGCAGAGACGAGATCACGGCCTACTTCATGAGGG 
2 0 ACACAACTTTCAAGAAACCACTTACCTGAAGCCCACCTTCTGTGACAACTC 

ATCGCTGTAAAG ACIGCGGGATGAACTGCCACAAACAGTG CAAAGACCTGGTGGTGTTCG AGTGCAAGAAGCG AATCAAGAG CCCA 
G CAATATCCACAGAGAACATCAGCTCTGTGGTACCAATGTCCACCCTTTGT CCACTGGGAACCAAAG ATCTG CTCCATGCACCCGA 
AGAAGGATCTTTCATTTTCCAAAACGGAGAGATTGTGGACCACAGTGAGGAGAGCAAGGATAGGACC^ 
CCTCCCAGAAAATTTCAGTTCGGCTGAAGAGGACnGTTGCCCACAAGAGCA 

2 5 ACGACCCCTGGTCACTTTGTGCTGTCTTCTCCAAGGAAGTCGGCX3CAGGGCGCT 

GGCTCCGGACCTACCAAGAACTGGAACAGGAAATAAATACCCT 

CAGAAGAAAATAGAATCCCTGCAGCTTGGGAAAAGCAATCATGTCTTAGCCCAGAT 

TTCAAGGACCAGAATCTGCAGACGGGTTTACTGGGATGTCGAC 


MOUSE SEQUENCE - CODING 

GGATCCX3CGGCCATGGGAAC CCTGGGCIAAGGCG AGAGAGGCTCCGCGGAAACCTTGCCATGGCT CAAAG CAAG 

ACTAGAGGCCAAATCAACCAACAGCCCTCTCCCTGCCCAGCCCAGCTTGGCC 

GACATTTGG CCAAAGGAGCCAGCCTGG ATG ATCTTATTGACAGCTG CATTC AATCTTTTGATGCGGATGGAAACCTGTGTCGGAAT 

3 5 AACCAACTGTTACAAGTCATGCTAACCATGCACCGGATCATCATCTC 

GGATGCCCTGGAAAAGAATTCTCCAGGAGTTTGCCTGAAGATCTGCTATTTTC 

T CAAGATGGATGC CAGCTTGAC CAGCAC CATGG AAG AGTT CCAAG AC CTGG TG AAAG CGAATGGTG AGGAG ACCCACTG C CAC CT C 
ATCGACZACGACCCAAATTAATTCTCGAGACTGGTCCAGGAAACTGACTCAGAGGATAAAATCGAATACT 

GTCCCTGCTGTTTGACCATCTGG AACCTGAAG AACTGTCTG AACACCTCAC CTACCTTG AGTTCAAGTCCTT CCG ACGG ATATCTT 

4 0 TCTCTGATTACCAAAATTACCTTGTAAATAGCTGCGTAAAAGAGAACCCCACCATGGAGAGGTCCA^ 

TCCCAGTGGGTACAACTGATGGTTCTCAGCCGTCCCACCCCGCAGCTCCGGGCAGAGGTCTTCA^ 

GAAACTCCACCAGCTACAGAACTTCAACACGCTGATGGCTGTGATCGGGGGACTGTGT 

CZAAGTTCACATGTCCCACATGAGATCAATAAGGTTCTGGGCGAGATGACTGAACTGCTGTCCT 

AGG CG AG C CT ATGGGG AGTG CACCCACTTCAAAATC C CCAT ACTGGGTGTG CAC CTC AAGGACCTCAT AT CC CTGT ATG AGG CT AT 

4 5 GCCCGACTATCTGGAAGATGGGAAGGTGAATGTCCAAAAGCTCCTGGCCCTTTACAATCATAT 

AGATGGCCCCACCATTGGATGCCAACAAGGACTTGGTGCATCTGCTGACGTTATCCCTC^ATCT 
TATGAGCHTTCCTATGCCCGGGAGCCGAGGAACCACAGAGCTCCGCCACT 
CTCTGGAGTGTCTCCCAAACCTGACCCAAAGACCATCAGCAAACACGTCC^ 
TCGACCAGGATGGCTATATCTCTCAGGAGGAGTTTGAGAAGATCGCTGCAAGCTTTCCATTT^ 

5 0 AGGGAAGGCCTCATCAG CAG AGACGAGATC^CGG CCTACTTCATGAGGGCCAGCTCCATCTATTCCAAGCTGGGC CTGGGCTTTCC 

ACACAACITTCAAGAAACCACTTACCTGAAGCCCACCTTCTGTGACAACTGT 
ATCGCTGTAAAGACTGCGGGATGAACTGCCACAAACAGTGCAAAGACCTGGTGGTGTTC 
GCAATATCCACAGAGAACATCAGCTCTGTGGTACCAATGTCCACCCTTTGTCCACTC 
AGAAGGATCTTTCATTTTCCAAAACGGAGAGATTGT^ 

5 5 CCTCCCAGAAAATTTCAGTTCGGCTGAAGAGGACTGTTGCCCACAAGAGCACCCAAACAGAATCGTTCCCG 

ACGACCCCTGGTCACTTTGTGCTGTCTTCTCCAAGGAAGTCXXSCGCAGGGCGCTCTT 
CAGCCCAGCACTGGTCCGAAAGCGGGCATTCGTO^TGGGAGAACAAA 
GGCTCaX5ACCTACCAAGAACTGGAACAGGAAATAAATACCCTGAAAGCAGATA^ 
CAGAAGAAAATAGAATCCCTGCAGCTTGGGAAAAGCAATCATGTCT^ 
TTCAAGGACCAGAATCTGCAGACGGGTTTACTGGGATGTCGAC

HUMAN SEQUENCE - GENOMIC 

GGTGCTTGGGAGGGCATGGCTGGAGAGGTAAGATCCAGTCATGCAGGTACATGATGTTAAAAAGTTT 
ATTGGAGACTCCTAGGCAGTTTAGAAGCAAAAGAATTATGTGACCATGTACATGAAAG 

6 5 AGGAGGTTATGTAATTAGGGACGCCAGTTACAAAGCTATTACACATGTCCAGGTAAGAGATGATG^ 

CTTTAAAGATGGAGAGGAAGCCCTTGGGAACAATGAAAGGAGGTGG 

GATATTCGGACAGAAGAGTCATATAGGTCCTGCTGAGGCCATGGGAGAAAATTAAAAC^ 

AAATCTGAAGAAAGCTAACAACCCTTCCCCCAGAAAAAACATACACAAAAAT^ 

AG T C CCTTCT CCC CAGT T AATCTC CT CATGGACATT AATT AAG AGT CCATG ATTG AG AGAAGGAAGAAT AACAAAGAAAG CAT C CA 

7 0 GAGGAAGGGCCTC!ACCC!AAGGGGTGAGAAGAGGAAGACTCCCCAACCAGACIA 

ACAAGATAACACACTTGGGTGGCTGGCAAGATGGCCAAATAG^ 
CACATGATTTCCGCATTTCCAACTGAGGAAACTGGCT 

CCAAAGCAGGGTGGGGCATCACCTCACCAGGGAAGTGCAAG CCTAG CCAAGGG AAGCTGTGAGGG ATT 

GTGCCATGAGGAATGGTACACTCTGGCCC^GATACTACACTTTTCCCACAGTCTTTGCAACT 
7 5 TGCCrrATGCCACCAGGGCCCTGGGTTTCAAGCACAAAACTGG 
TTTCATACCCCAGTGGCACATGGAA.TGCCAGTGAGACAGAACCATTCACT

GTCTAGCTCAGCGTATCCCACCCCCACAGAGCCCAGCAAGCTAAGATCCACTGGCTTC 

AAGTCAACCTGGCACACTCAAGCTTGGTGCGGGGAGGAGCATCTGCC^ 

TAAATGAAGCCGCTAGGAAGTTCGAACnKMCAGAACCCACTGCAGCTC^ 
5 CTCCTCTCTGGGCAGGGCATCTCTGAAAGAAAGGCAGCAGCCCCAGTCAGGGG 

AGCACCTGGGGGAAGGGGTGGCTGTGGGCCTCCTCCTGGCTCTGAAGAGAGCA^ 

AGGGACAGACTGCCTCCTCAAGTGGGnrCCCTGACCCCTATGCCTC^ 

CATACAGAAGAGCTATGGCTGGCATCTGGCAGGTGCCCCTCTG^ 

TTCTGCAGCCTCTGCTGGTGATACGCAGGCAAACAGGGTCTGGAGCG^ 
1 0 CTGACTCTTGGAAGGAAAACTAACAAAGAGAAAGGAATAACATCAACATC 

GGTCACCAGCATCAAAGACCyAAGGTAGATAAATCCAT^ 

AATGCCTCTTCTCCTCCAAAAGATCACAACTCCTCACCAGCAAGGGAACAAA^ 

AGTAGGTTTCAGAAGGTGGGAAATAACAAACTACTCCAAGATAAAGGGGCATGTTCT 

AAAAAAGGTTAGATGAATTGCTAACTAGAATAAACAGTGTAGAGAAGAATGTAAATGACCT 
1 5 GAACTTCGCX3AAGCATACACAAGTATCAATAGCTGAATCAATCAAGTGGAAGAAA 

AATAAAGCATGAAGACAAGATTAGAJGAAAAAAGAATGAAAA^ 

CAAACCTACATTTGATTGGTGTACCTGAAAGTGACGGGGAGAATGGAACC^ 

AACTTCCCCAACCTAGCAAGACAGGCCAACATTC^CATT 

CCCAAGACACATAATCATCAGATTCACCAAGGTTGAA^ 
2 0 T CAAAGGG AAG CC CATCAG ACT AACAGTGGAT CTCT CTGCAG AAACTCT ACAAG C CAG AAG AGAGTGGGGGC CAAT ATT CAACATT 

CTT AAAGAAAAGAATTTTCAACG CAG AAT CTCAT AT CCACACAAACTAAG CTTCAT AAG TGAAGGAGAAAT AAAAT CCTTTACAGA 

CAAGCAAATGCTGAGAGATTTTGTCACCACCAGGCCTGACTTACAAGAGCTCCTGGA^ 

AATATACCAAATTGTAAAGACCATCGACACTATGAAGAAACTGCATAAACTAATGGGCAAAATA^ 

GGATCAAAITCACAATAACAATATTAACCTTACATGTAAACGGGCTAAATGCCCCAATTAAAAGATACAC^ 

2 5 AAGAGTCAAGACTCATTGGTGTGCTGTATTCAGGAGACCCATCrCACATC 

GAATATTTACAAAGCAAATGGAAAGCATAAAAAAGCAGGGCTTG 

T CAAAAAAG ACAAAG AAAG ACATT ACAT AATGGTAAAGGG ATCAATG CAACAAGAAG AT CT AAAT CGT AAAT ACAT ATGCACCAAA 
T ACAGGAG CACCCAG ATTCAT AAAG CAAGTTCTTAG AGAC CT ACAAAG AG ACTT AGATT CC C ACAC AATAAT ACTGGGAGACTTT A 
ACACCCCACTGTCAATATTAGACCGATCAACGAGACAGAAAATGAACA^ 

3 0 GACCTAATAGACATCTACAGAACTCTCCACCCCAAATCAACAGAATATACATTCTTCT 

TGACCACATAATTAGAAGTAAAACACTCCTCACCAAATGCAAAATAATGGAAATCATA^ 
TCAAATTAGAACTCAGGATTAAGAAAGTCACTCAAAACCTGCAC1AA 
GTAAATAAAAAAATGAAGGCAGAAATAAATAAGTTCTTCGAAACCAATAAGJ^ 
AGCTAAAGCAATGTTAAGAGGAAAATTTATAGCACTAAATGCCCACAT 

3 5 CACAATTAAAAGAACTAGAAAAGCAAGAGTAAACAAATTAAAAAGCT 

AAGGAGATACAGACACACACAAAAAAAACCTTCAAAAAATCAATGAATC 

AGCCCACTAGCCAGACTAATAAAGAAGAAAAAGAGAAGAATCAAACAGACACAATAAAAATC 

CCACZAGAAATACAAACTACCATCAGAGAATACTATAAATACrTCTA 

T ACACCCT CCCAAGACT AAAC CAGG AAG AAGTCAAATC C CTGAAT AGACCAAT AACAAG TTCTG AAATTG AGG CAGT AATT AAT AA 

4 0 GCT ACCAATCAAAACAAAAG CC CAGG ACCAGTTGGATT CACAG C CAAATTCT AC CAGAGG T ACAAAG AGG AG CTGG T ACCATT CCT 

TCTGAAACTATTTCAAACAATAGAAAAAGAGGGAATCCTCCCTATCTCATTTTATAA 
GCAGAGACACAACAAAAAAAGAAAATTTCAGGCCAATATCCCTGATGAACATCGAT 
TGAATCCAGCAGCACATCAAAAAGCTTATCCACCATGATCAAGTCAGCTTCATCCTAGGGA 
GTCAATAAATGTAATCCATCACATAAACAGAACCAATGACAAAAACCACACGATTAT 

4 5 AAATTGAACACCCCTTCATGCTAAAAACTCTCAATAAACTAGGTA 

AAACCCACAGCCAATATCATACTGAATGGGCAAAAGCTGGAAGCATTCCCTTTGAAAATCAGC^ 
ACCACTCCTATTCAACATAGTATTGGAAGCTCTGGCCAGGGCAATCATGCAAGAG 
AGGAAGTTGAATTGTCTCTGTTTGCAGATGACATAATTGTATATTTAGAAAACCCCATC^ 
ATAAGCAACTTCAGCAAAGTCTCAGCATACAAAATCAAT^ 

5 0 CAAATCATGAGTAAACTCCCATTCACAATTGCTACTAAGAGAATAAAATACCTAGGAATAC 

CTTCAAGGAGAACTACAAACCACTGCTCAAGGAAATAAGACAGGACAO 

GAATCAATATTGTGAAAATGGCCATACTGCCCAAAGTAATTTATAGATTCAATGCTATCCCCAT 
ACAGAATTAGAAAAACCTGCTATAAATTTCATATGGAAGCAAAAAAGAGCCTT^ 

AG CTGG AGG<2ATCACAC!TACCTGACrrTCAAACTATACTACAAGG CTACZAGTAAAAAAAAAACAGCATGGTACCAGTACCAAAACAG 

5 5 AGATATAGACCAATGGAACAGAACAGAGCCCTCAGAAATAATGCTACACATCTACAACCATCTGATCT 

ACAAACAATGGGGAAAGG ACTCCCTATTTAAT AAATGGTGTTGGGAAAACTGGCT AG CCATATGAAG AAAACTGAAACTGGACCC C 
TTTCITATACCTTGTACAAAAATCAACTCGAGATC 

C CT AGG CAAT AGCATTCAGG ACAT AGACATGGG CAAAGACTTCATG ACT AAAACACCAAAAG CAATGG CAACAAAAG CCAAAATTG 
ACAAATGGGATCTAACTAAACTAAAGAGCTTCTGCAC^GCAAAAGAAAATA 

6 0 AAAATGTTTGCAATCTACCCATCTGACAAAGGGCTAATATC CAG AAT CT ACAAAG A^ 

AACCCCATCAAAAAGTGGGCGAAGGATATGAACAGACACTTCTCAAAAGACA 
ATCATCACTAGTCATTAGAGAAATGCAAATCAAAACTACAATGAGATACCATCTCATGCCAGT^ 
CAGGAAACAACAGATGCTGGAGAGAATGTGGAGAAATGGGAACACTTTTACACTGTTGGT 
TGGAAGAC^GTGTGGCAACTCCTCAAGGATCTAGAACCAGAAATACCATTT^ 

6 5 (K5GATTATAAATCATTCTACTA 

CCCGAATCCCCATCAATGATAGACTGGATAAAGAAAACGTGGCACATATACACCACGGAATACT 

GTTCATGTCCTTTGCAGGGATATGGATGAAGCTGGAAACCATCATrCTCA^ 

TGTTCTCACTCATAAGTGGGAGTTGAACGATGAGAACACATGGATAC^ 

GGGAGCTAGGGGAGGGATAGCATTAGGAGAAATACCTAATGTAGATGACGGGTTGATGGGTGCAGCAAACCAC 

7 0 TACCTATATTACAAACCTGCACGTTCTGCACATGTATCCCAGAACTTAAAGTATAATC 

(^TTAGCTACCTTATAAGAATGTTATATTAAATGATGTATTTTGTGAAAAAAAAAAAA^ 

AT AT AACAC CAGG CAGACT AATCAC ATGG CCGATGATT AAAAG AGACATAG ACAAACTTCT CATTTTTTAG ATGTTT© 
TT ATTG CT C AGGT AGGAGT AGCCAG AAAACCTCAAAAAAG AG CTCGAGGGGG C CCAG AAG CAG AATCAG ATGCAAGTG AAGTG CAG 
AGTACTCAGAAAGTCCAAGAGAAAAACACTCGCTGGGCATATGTAGGCCTCTTTCTTTAGAA 
7 5 ATGAATTAATAACCTTGGTTTCAACCATTTGTATATATGTGGACTGTAGGTGTGCAGATGTGTGTAACT 
GT ATT ATG CAT AT ATGT AT ATTT ACAT ATAAAT AT AATG ATGCATTT ACAT AAAGGCAT C CATTTGGCAT ACAT ACCT ATGCAT AT

GCAGTATAAGGCTGGATTATTCCCCTGCCACACACAAAGATTTTGCTG 

CTATTTTCAAGGGTGAAGCAATAGAGAAATACCGAAGGCCTCGGATAACCTG 

GCCAGTAAGCTTATCACTCCTGCCATAGA(^GCGGACTTCAAAGAAAAACTGAA^ 
5 TATTTTCTTCTTTGAGGTOIGCACATTTTC 

AAGTTGGCAGCTGTGGAAGCTAAAGAAGCATAAGACCTGATTCTTGACCT 

GGACACAGAATATTATGTGGCAATTGAAAGTGTAAATATTATAGATTTAAGTAGGOT 

GT^AAAAGTAATTGTTCACCTGGTAATAAGAGAAGATCTCTGTGAACAAAGTCT 

TCAGATAGATTCATATATAGATACAAATCITTACCATCCTCA^ 
1 0 GAAGGGTGTATAATAGATAACTCTAAAAATGTTTGAGCCT^^ 

ACATTCACCTGCCCCTTCTGCATCTGTGGCTGTGACrGAGGA^ 

AAATTTCTTGCTGGTCTCAGTATCAAGAAAGGGGX^TCTrCTCTTGA 

CITTAGACAAGGCCTCTCITTAAACTTGACTGCTAGAAATG 

TATGAGATGCATTTATTTCAAATGATGTTCATTTTTGGCTCC^T 
1 5 GTTATGTGCTCTTGTAAGCTATCTCGAATTCTGTTTTAA 

GAAGGCTTTAAAGTCCAGAACACTATGGCCAGGAAGACAACTATGACTGTG^ 

TGTTATTAGGGCAGTCCAGCAGTGATTACAGGGAGTCAGACAGACCTC^ 

ATGATCTAAGGCATAGCATTCTCTAAACCITCATTTTCT 

GAGGTAATGT AT AGAAAGCACTTAG CACAACCG CACTCAAT CATT AGCT ATTG AG ATT AT CAG CACTAT CAC CTGGACAAGCAG CG 
2 0 GATGGGCCGCCCCTCTTGGCGAGGTCTTCTGTGC 

AGGCGGCCGCGTCTGGGCCAGATATGGATCACGCCTGTGGAGCGCACCTCCACACAGGCAGC 

GGCACAGGGCACATG TCTACGGC CCAGTCCTGCAG AC GGCT ACCTAGAG AC C C CAGGG<^CAGGTT AAGG CAACTTCACAACTGTC 
TTTCCGTTCCTTTCCCCTGCCGGACTCCGCGCTCT 

TCCGAGCAGCGGCCGC^U3GAAGAGGCGTGGAAGCTGCGGGTCGGCTGCGGGAAAGGGGGTCC 

2 5 TTTCCCGAGCTCAGGTGGAGGTCGAAACGC^C^GGTGCCCTC 

CCCTGAGACTTAACCCTTTGGCCAAGAGAGGTCGAA^ 
AGGGGGCAGGTTTCGGC^LAAGAGCGGGGCGCCGCACGTTTCCT 

TGAGCGCGCGCGCGAGGCAGAGGGGGCGCGCTCGCACAAAGTTTGTGCCTGCGGTGC^ 
CTGGCGGGCGCCGGGGCTAOSAGGCCCGGGGGGGAGAGAGCCGGCAGG 

3 0 CGCGCTAGGTGAGCCTGGCACCGGGAGCCGGCCGCGGCCATGGGCACCCTGG^ 

AGTGGCCCTCGGGAGCGCCTGTGTCCTTTCrrTGGGAAAGAGGCCCCrGCCra 

GGGGGGCCGGGGTCGGCCGGAGTCCCGGGACACCAGCGCGACGGGCGGACGTGTCCAGAACGCCCGCGCGCCCGGG^ 
NNNNNNNNNNNNNR^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNTO^ 

NNNNNNNNNNNNNNNNNNNNNWINNb^^ 

NNKNNNNNNNNNIWNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNKNNNNNNNNNNNNNNNNNNNNN^ 

ITONNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNIINNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNRfW 

NNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNKNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNOTJNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNHNNNNNNNNNNNNNNNNNNN^^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNIWNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 
65 NNNNNMNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN1WNNNNNNNNNNNOT 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN^^ 
7 5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNM

NNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNl^^ 

NNNNNIWNNNNNNNNNNNNNNNNNNNNN^^ 
5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNMONNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNMNNNNN^^ 
10 NKNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNHNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNIC^ 

HNNNRJNNNNNNNNNNNNNNNNNNN^ 
15 mnJNNNNNNNNOTJNNNNNNNNN^ 

mJNNNNNNNNNOTWNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNJWNNNNNNNKNNNNNNNKNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNIWIWNNNNNNNNNNNNNNNNNNI^ 

NNNNNNNNNNNNWJNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNOTJITONNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNKNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNHN^ 

NNNNNNNNNNNNNNNNNNNNNTO 

ACTGCAAAGTTCCITACACCATTTCAGCAGTCAGATTTACT 

3 5 CTTCTTTCTGTTTCACTATTGAAAAAAGAGGGACGAGAAACATGTTTATCACACCACCTTCCTT 

TGCCGGCCGAGTTCAAGACTGTTTTTCTAATCTTCAAAGAATCATGTTATCAACCTGAA 
CCCGCCAAATAACCAAGCCTAATTC^TCCCCTTAACAGACCTTCCAAAGGCAACGA 
CTGAAAACAATGGCTTAAAATCCCAAAGCTGTAAAAGTCCCGTT^ 
AGCCCTTAATTGTATCTGACACTGTGCTAGGCAGTGCTCAAGAAGA^ 

4 0 CACCTGAGAAACATACTGTTCAGGAAATATATATATATATATA 

CAGGGTGATAAATACCG AAGGCATG CAGG AAGAGTAAGTTTTTAT AC CATGTAG AGTTTG AGTAAGTGCACCTCATGGGTGAGGAC 
ATAGGTC^AGGAGAACAGCCACCATTTGTTCAAGTCCCTCACGGGCC^GGGAA 
GGTTTTGTGTATTATGTACTGCAGAGTTTCTCATCACAGTACATGAAAGAGGCAAAA 
CAGAAGCCAGCACCTTTTGTCTGTCTCrCITCTTC 

4 5 CTACAGAGGAAATGGAGCCAGGAATCAGCAGCAGACTTTATGAGGGAGAAAAGCCCATAGCAAAATATATG 

GTCTCTTTAAGAGGTCATCTGACAGTAGAACAGAGTGAGGCTAATGCTTCTCCTGAAGTTTACCTTACT 
TGTAGAGAAAAGTGGTGCTTTAGATTAAAGCAGTCTAGCTAGCTTTTTGTTAATGAGTTTAGAT 
AAAAACTGTACAGAAGTGTGGAGGGGAAATTATTATCATTATTATTATTGGGACTCTTTTGCCT 
CACCCAGCAACTTGAATAAAAGAGGGAAAGAAACTGTTTTAAAGAGAAA 

5 0 AACCTCACCTTCTTGCTTTCAGGAGGTTTGAATTGGCTACAGCTGTGCTGTCCAATGCACT 

CATTTCAATTAAAGTTAAATGTAAGATTTCATTTCTAAGCCTCACCAACCATAG^ 
CTGTTGGACAGAGGAGAGAGCACTTCCATTGTTGCAAAAAAGTCAGTTGGACAATG 
AAAAAGAATAGATTTTCTACAAACTGTGGGATTTGCATTT^ 
GGGCATTCAGAAGTCTCACTGATGGCTATCTCTTTAGCATTATTAACC^ 

5 5 AT ATTGCTTTT AAGGG CAGTGG CT AGT ACATAT AAATT AAG AG CACTTT CAAAGG C CT CT AGCAAGCCTCTCAGTG TCACAT ACAA 

GGTTACAGATCTGTGACTAGAAACTTACGGCCTTGAGTCATAGTTCATTGCTTTAACCATGAAACAAC 
CTAACCAGAGATCATGCTTTACATTCTCAGCATATTATTGCTAGTTAATCCTCTCT 
AACCTGGTTACACTTATGTCAAATATATGTGAATCACACCATCTGTTTCAGAGGTAGAAAAGACCT 
ATTTTTTTTCCCrTTACAAGAGGGCTGCCAGCATTTGGGGC 

6 0 ATCTCCCTCTCAATGCAAGACAACTTCTGACAGCATCGGAAAGAAAACTGCACACCTGCCT 

ACACAACTCCACCCCAGCAAGCAGACCTCCTACTCCCAGTGAGACGTGGGAAGAAACT 
AGGGAAATGCTTCCTAATATCTTCTATTTATGG^ 

GAACAGCTCTGAAATAGGCAGGTCAGATATTACTCTCTCCATTGAACAAACAGGAAAATAAACCTTTGAGA^ 
CTTGTGCCCCCAGTCTCAAGAGAGGATATGCTGCTCCTGTTCTCCTTCCCCTACCCCT 

6 5 AAGCCTGGATCTCTCTCCTAAGACAGGCCTTCCTACACT 

ACTCTTCTGGAATCCTGGTGTCTCCCCAGTTAGAAATGAAACTCCATTCAATGTAA 

CAAACTAGAAATACCCCAAAACTAATAAGTATGTTTTCrrCCACCATGGGAGCAGTTGGATATTAAAAGCA 
TTATTGAAAGTAAAACATTCCGTCTCTTTTCAAAATGGGAGCCGCAAAAGGCAAACACACACT 

7 0 CAAAGGAAAGATTAGGT ATTTCTIT ACCACTCTG CAACTTGGT CTGTG CACCAAAGCAAGG TG CCCAG AGGGC AACAGG AG CAT AAG 

ACGAGAAGCCAAGGGCCATGAGATCATTGGGGATGCAGACCTAATAAGAG^ 
AAGGGGCCGAAGCCITAGC(^CTGTTGATGGCAGGCCCrGTGTG 

AGT CGCTAGTAGTCGTGG CTACAAG ATTTCACGTGTAAAGGATGGTCCTCGGGGGCACTTGCATCCTG AGGCT/TCT 
GCAGCTCCCAGGCCGCTCTTAGCCCTCTAGAATGAATTAGCATTATTATAAGAC^ 
7 5 GCAT CTTTG ATG AU CAGG CATCACCAGTT CrCCACGCTGGTCTG C CTT CTCATGAG CATGGGTTTCAGTGACTGCCAGGCCTGAG a 
AAGGAGGAGATGGGAAATCCCCTTCATTCATATCTGTGCTCrCCT^
GGCAATGCTTATGCGTAAGTTAGACAGTGTCTCAGGAATTCTGAGCAG 
TTCTGAGAGAGCTGGAGACAGACAGTAGTTTAACACTTATTGTGGACT 
AATCCTCCCAACAACCCTATGAGGTGGTTGTTATTTTGCTCCTTTAA 
5 TGGAAAGTGGTTCAGCTCATTAGTGAGAGAGCTC 

ACTCCAGAGATTCTCCAACTTTGGCGTGAGGAATAATCAGGGAGTGTGT^ 
GCCATTCAGCCAGCAGTGACTGGTTTGGATGG 

TGAACATCCCAGATCATTTTTCTCAGACATCCTAGGAAAACCTGAAGGGCACAA^ 

TTCAGCTATGTGAACTAGAAAAGCACATGGTGAAGCCAGCCTCTTGGA 
1 0 TGCCTAAGAGCAGGTCTGAAACCCTGGGAGCACTTTTATCAGATTTTTCTC 

ATAAAATGTACAGCACGCCTGTCTGTTGGCTGTGTAAATAGAATGTGGATGTCTGTTACA 

GGCCATATTCCTAGTGTACATTTCCACCTTTATTCAACACTCCAGTTTCCrATGGTGCTGGGAG 

CGTCCCCATGGGTCAGAGGACTACAGTAGTGGCTTAGCTGGGGAGGAACTGGGG 

TACTTTGGCAACAGATTATGTAAGGCAATATGTT^ 
1 5 AGCAAAAACAGTCTTTTCAGTAATTGACAGAAACAGCTGTGTTGGACCCTGAGAC 

CAGCCGCAGCCCACCTGCTGATGTTAGAACTTGTCCACTCCTGCCCGGAACAATT 

TGAGCTTTGGCTACAACCTAGTTGGG CTTGCCCT AG AGTGG CTGGAAGG AAAAGAACGGGG AAAGAGTGAGAATGT AAGCAGAGCC 
AGCTTCCTGGGCATGCC^CCTCTGAAGGCCCACAGGTTCCCATGCTTGGTTTAATC 
TTGAATAAGGGACCATCATTTTTAATTGCACTGAGTCCCACCAACTATGTAG 
2 0 GCAAAAATAATTTGCCATATATTGAAGACACATAGAACGAT 

ACAAAAATTATTTCATTATCTCCT AGCAGTGGTGTG CTGGGCTGGCTCATACCAGCTCACAAGAG CAGATTGTTAAATTTTCAGGA 

ATTTTGCAAGCTAGTTATTAAACACAGCCATGATTGAAAATTAAGTTATATATGCTTAC 

AGAAATACTCAAAACTCACCACTTCCTAATTCATGACTACATTT^ 

CTGGTACAAACGCTATATAATGGTATGCTAGCATGCATCTTTTCCCAGCTCCGTGTTCAGTGAA^ 

2 5 AGTCTGTTGGGAGTATTTACACCGTGGAAATCAACAAACATCACTAATCAGGGCTTCCTTTTCAATCCCT 

TGTTAAACATTTACCAGCACACCACTGTTTCCCAGATACAGAGGGT^ 
AGCTCAGACTTCTATTTTTTATCTCTTAATTAAGAGAAAGTAATA^ 

TTAAAGAAGTGACAATGGAACATGAAAACAGCATGAGAGATTCTCATTC1X3AAGTCAA 

3 0 TCACCTCTAAAATATCTCTTAAGATTAACCAGCCTCGTTTCTGACCAACC 

TGCTATCTCTATTTTGATTTCTGATTCCATCTGCTTAATCATTTGCCT 
TATAACCAAGCTTTTGTTGAGTATGTTTTCATAAGTTTGTTTGCC^ 

TGCCTATTTATCACTTGCTGCTCAATGCTTTTATTATTAGAGGCGTGCAAAAGCTTTTTATATAAAA^ 
ATCATTTACTATAGGCA TTTTTTTT C C r GTT CTCA TTTr CA TTTTT ATC 

3 5 TTTGTGTTT AAATTTGCCCATTTCTATTT AG CTTTTT AAA^ CTT AAAAAAATCC CT ATT C CAAAGATTT AATT AATAG 

TCCATTGTCATTTTAATCAGTTTTACTTAACCCAGTGTATAATTCCATATCATACATTGTCGTGGATG 
ATAAGTTATTGCTGGCAAAAAAGAAGCCTTCAGTTTTGTTrCTGAGAT^ 

TTTGT AACTCAG AG CCTCCGTTTCTCG CTGTGATAC CATT AAAATG ATTTCTGAGACTT AAAGTGTCTGG AAGAAGGTGTAATGGG 
GTATCTCCATCACGATATTAAACTTGAAGAAAGTCCTTCAGGAATTTGCATATTTTAAGAATGTG 

4 0 TCAATATTCTTAGGATGGAGTCTTAGAAACCCAACAAATTTTTAACT 

TTACATAGACTTTGTGGTAAGATATACATAAGATCAACCACTAGCATAGACTTTTAAAATAIT 
AGCATTTACTATGAATAACACAGTGTTTATGTTAATATTTTACATATGTAGTCTCATTTAGTTCCCACTAT 

ATTTGCAAAGAT CT AACTGTC C CT CAG CATTT AT ACTT ACTTT AT AAATGTT AAT AACTTTT AAAATT ATTCATGT^ CAC CAT AG A 
GAACGTAGAAAATACAGACTGGCAAAAAGAAAAAAATTACCGTCACTCATTTTACCAT 

4 5 CATGGGTGTGCCTTTCTTGATTCTCTTCTATGGATATTTACATGTATTATAATCACAGTAAC^ 

CTTTTTGCTTCCCATAATTTATATCTTAAACAATTTTGATCTTC^ 

GGTTGCTCTTCTATTTTAAAATTGATTTTTTAAAAATAATTCTGCTTAGT 

GGCAACTGAAATGATGGTAATGAGGAATTAAAGACATTAGAAGAA 

CAGTAACTATGGCTGATTACAGTTGTAATAAATGAAGGAAACACAAAACTTCCATTAAAAATTCTTGAT 

5 0 TTGTATTACAGTGAAAGTGAGATGGGGTTTGCTGAAGAGAAA 

TTTTT T T CTTTTCTI 1 rCI ITTT TT 1TTT7 T1 T T TGGCGACAGAGTCTCACCCAGGCTGGAGTATGGTGGCATGATCTGGGCTCAC 
TGCAACCTCCACCTCCTGGGTTCAAGTGATTCTCGTGCCTCAGCCACCCGAGTAGCTGGG 
TTCTATTTTTAGTAGAGATGGGGTTTC^CCATGTTGCCCAGGCTXW 
TCTCAAAGTGCTGGGATTACAGGCATGAGCCACTGTGCCCAGCCTATTAACCT 

5 5 ATACGTAGCAGAAG(!ACAAAGATTTGGTACATCTGTGATTCTT 

ATGTTAACTACTTGTTTTTTTTTCCCCAAAGAGTAGTCAAGGGCCCT 

AAG ATGCTT CCT AGG ATGG AGATGT ACTT AG AAGGACAC CAGTGATT CATTG AAAT GGACAAAGGG AGTAG AGACT AGAGGGGGAA 

GAGGGAAGCAGAGGGAAAGAGGGAAGCAGAGGGGAAGAGGGAAGCJ^jGGATTGCC^ 

AAGTAAGGCAGTTTGGGATGAAGCCATCCAAGAACTTATCAAAGGTATATCTTTGCCAGAAAGTC 

6 0 AAATGGAGCTATGTCTGTCTCCTCTTCACTGATCTATCTGAATCATGGTATA^ 

TTTAAGGGGGAACCCACAATATTCAGGAAGCCAAGAAATGTCCCTGTTAA 

GACTATGGAACATCCTTAAATCTGATGAG CAATTAATCACTGGTTAATTTTGGATTCACTTATT'AAATG CCAAATCCAT AG 

TTTTCACACTCTCrrGCCCTTCCTCCTTCCCAGTTCATGAGCACCACAAT^ 

GGGTTCACCATTTACCTCCAGTCCTGGGGAGTTCAGTCCAGACTGTGAA 

6 5 GCCATAGGACCCTCGCCTCTGGCCCTGGACTCAGTCTTCGTTTAT 

CCAGTGCCTAAGAGTCTCATTACCTTGTCAGTAACTACCAGTX5TCTACCTGAGATGGACT 

CCCTCAAAATTGCAAGGAGAAATGTTGGAGGAGGACCCTCTGGATTAAAAGAGACTTAAA^ 

GAACCTTTTTTCTAAAAATTATGTGTGTGTACACATACACACTCATAGGAGTATAG 

7 0 TGTTGCAGCTGGCTGAGTGCATGGAGGTCCCTTAra 

TTGAGACAGGGTCTTGCTGTGTCACACAGGATGGAACACAGTGATGCCATCTTGGCTCA 
TGATCCTTTCACCTCAGCCTTCTGAGTAGCTGGGACTATAAGCATGTC 
TTTTGTAGAGACAGGGTCTCACTGTGTTGCTCAGGCTGGCCCCAAACTC(rTGGG 
GTATTGGGATTACAGGCATGAGCC^CTGCACCCGACCTCTAACAATATATTAAA^ 
7 5 ATG ATTTAGG ATT ACAAAAATr CAAACGG AT't"!"!' AGGAAATTTCTCT AAA CTT AGGTGGCAAAAT AG AGTGG CTGTGTCTGT 
TTTAAAGAAGGG AG AGG AAAAT AAAG AATGAAAATGGTT ACTTTGGTT ACACAT CATG AAAAT AATT CAC C CGTGTCATGT ACTTG

ACATCTCGTCTCACAGCX5CATTCCCCAATCAGCACTGAGATGCT 

CTGGAAACTTCTAATCTTTAATGCTACTGTCAGAC^ 

CCATCTCACACACACACACACGCGCGCXSCGCACACACACACAGTTTTGAGC^ 
5 GCAAGCGGTGAAAAGATT ATGGAGG CCACGTCACATGGCAGT CTTTGGAAT CTGAAG CTGAGTCACTGAGTCTGCCATTTAC CACA 
TCTGGGGCTTGTTTGCCCGTTTTTAAAATGGAGAAT^ 
GATTGTCTTAGTTTGAGTTTTCCCAGGAGCAGATCCTGAGACA^ 
G<3AAAAACTGGTAGGGGAGTAGGGAAG<3CAG<rrCTTGGAG 
TTTCAGACCTATCTTATCTGAGGGAAAGCAACTGGAGCATTGATC 
1 0 GGGTGTTAAGTTTGCAGCACATTTGGCCTGCCCTGCACAC^ 

GTTGJU3TTGGCATGCATTACAATGGGAAGGTCTGAGAGTATGTGTGTGGG 

GTTAGATGGGCAAATATAGGCTACATGAAAATCCA(^TAAACAGTTTCAAAGCT 

AGATATGAGTATTGAAGGATGTCCTTCTGCAGGTGAACCATTTATGAGTGGCTGTGG^ 

TGTTTTTGGTT AGACACACTG TG<K1AAAT AGCAT AATT AG CAG AATT C AGG ATGTG AAT AACAT AGCACAAAGAACTT AGACAAAC 
1 5 AGGACTTCAATAAAACGTAAAGTCTCGCACTTAAGTTTAAAAATAGTGCT^ 

TGTAGTTCTTATGACCAATACCAAGAGATTTTAGTAGATTGTAAATTCA^ 

TTAACTTTCCAAAAGTTAAAATAATTAGAGATTCCCTCAACAGAAATM 

CTGAGGTCCAGTCCTAATGCTGACACATAGTAACTTTATTCCCTTAGCCGGTGTCTTAATC 

CTGAAAGrrAGTAGATACTTCCCCTGGATTGTCGTGAAGTAAGCATCTTACCAG 
2 0 CArcCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGCAGG^ 

TG AAACT CCGT CT CT ACT AAAAATACAAAAATTAGCTGGGTGTGG TGGTGCAC ACCTGT AG CCC CAG CCGCAACAGAATACTCTGG 

AGGCTGAGG CAAGAG AAT CACTTGAAC C CAGG AGGCAGAGG TTG CAGTGAG C CAAGATTG CGCCACCG CACTCCAG CCTGGTGACA 

GAGCXK3GACTCCATCTCAAAAATAAATAAATAAATAAATAAAGTT 

CATACTCAAAAACAGTAGCAATCATTATTAGTATAGACTTGTTAAGTGTTAGTGTAT 

2 5 C^GACXS ATCCAAAATCACATCATGGAGATACAAG ATCX3TT AT AATCX5 CT CAGG ATGGTGT CAT AATAGAC 

ACGAACAGCTATGAGACTAGAAGAGAGTGTTATAATTTAAATTTTCCCAT 
AGACCTGAAAAGTCTCTCATTTTTCIAAAATATTAGGACAATGATGGAAA^ 

AT ATGG ACT ATTTCAT AT ATCT AAAACACAGG AAGAG T ACAGGATGC CTG CAGG CCTT AGG AGTG CCTGAAATT AGGT ACTAT AAT 
ACTTTTAGGAGACTATTGCATATGTTCTGTGTATCTATGTGTCTCTTCTCT 

3 0 rriT'LTlT'CACTGAAGATrrGTTlTCTACATC^CACTAACAGCTGTAA^ 

tccctcaagttcagtgaaaaaaaaaaaactgtcagggaagaac^^ 
actgtggccac&gggttggagtcatttaataatccagcatttct 

aaac4gaac4gctattgtgctggg<^tatgaagcagtatctctcataaccaagatctgaccaaacacta 
attcaagtttagtctctggattttaagatc^aatggaaacx;ct 

3 5 aaataacatctgtciaggatataagatggcagaacctgaaaagaagac^uvataagtc^ 

aacatgtatgtctcaatagtaataaccatagaagagctagtatc^ttatttacctccactgggacata 

taaatctacciaaagaactgggacctcccataaaaaaattaaacatc^^ 

atattagaattattttcctgctaccaacagctataaaacrcgacaaaatatct 

CAGACATGTGATTGTGGATAAAATAAAAAGATGTGAGGTTAACTCAAAACTCCT 

4 0 CATGGTACAGGAAACTGGAGCCCJVGTGAGCAGC^TCTTG 

aciactgggtaatttataaagagatttatttggcct 
tcataacatggcagaaggcatcacacagacagagagagtgcitgcaagacagag 
cctttccc^gatatggagttaatccattcatgagggtgaacccctcgtgacccagtcacctcttaaa 
tgttacaatgacaatactaaatttcaacaaattttc^^ 

4 5 ctgggatggctggcatttgtggagtggcatacaagagagaaggt^^ 

cgacccctaatacttcctgc<k^tctttgagttaaac4gack3tctcatggtct 
gagacataacactctgc^cattc^ggtgggatgagaaaaagactacaatct 
aatctt<3actcaccctaatgaagcctaaatctgagcttcaacag^ 
acatattaataattcaaattatcaatattgcacttaciaatgccc^ 

5 0 atgtgactcataaccagaacaaaaaaoatc^tagaaatgaactcagagatgacac^wvtgt^ 

aagcagctattataaataaatatgttggaggatttaaagtaagatgaatagatgtaagtaaa^ 
tgcaacaattaaaggaaccaaatc^aattctagaactgaaaaataaaatataccaaaaa^ 

TATAGTC4GATGCAAAAAAAAAAAAAGAAAAAAAGAAAAAAAGAAAC4GTCAGTAAAGTCAATGACAG<3TC^ 
ATGCTGAAAAAAAAAAAAAAAAAAAAGACTAGAGTTTCAGATGATGTAACCAGAGTTTAAATGG 

5 5 CAGAATCTTGAAAAAGAATGAAGTGGAAAATTTACACTAC^ 

ATATACATATAAGGATAGACAAATATATCAGTGCTACAGAATAGCACGTTCTt^AAGAAATCT 
AGCAAAGGCACCAGGGCAGGTTAACGAAGAAAGGACTATCCTCTCAACA^ 

GAACTGTGACCTGTGTCTCACACCATACACAAAAAATGAGATGAATCAAATGATGTAACTCTAAATAATGT^ 
GTTAAAACGATAAGTCTTCTAGAAGGAAATTTAGGACAATGTTTTTGCAAACTTAGAGT 

6 0 ACCATAAATAATATATTTAGCAAATTGC^CTTTATAAAA^ 

AAAATACTGGGGAATATATATATGTNNNNNNNNIOJIOINNNNN^ 
ATATACACACATATATATGTGTATATATATACACATATATGTC^ 
TTTATGCTAGTATACAAAGGAAAATTCCCTGTGGCCCAGATGAATTGCCAT^ 
TGATCATGCCTTTGAAATTAGAACTAACCAAGCCCCATCAAGGAGTCTGTAATATAG^ 
6 5 TTCCCTTGCATGTTGCCCTGAGAACAGGCGAAAGACITC^

ATAGCTTGTAGATCTACTCCAG<X3CAAGGATGAACTCAATGGTCTAAA 

GTTGGTGGTTTTTTGTTTCTTGTTTTGTTTAGGC 
TGTGATGAATTAGTTGTAAACACCACTTCAGTCAGAC 

7 0 GTTTGAGACAGGGCCrTGCCTCTGTGTCJ^CCCAAGCTGGAT^ 

AAGTGATCCTCC CATCTCGGCCTCCCAG AGTGCTGGGATT ATAGGCACCTAATGGG CTGCCAC CCCAGGCCC AGCCTTAGTT 
TTGTTGGTAAGCTAATCAGGAGGTCTTGAAAAGTCTCACCTGCCTCTTAGTCTATCAGAATCTGACC 
CAAATGCTACCATCTCTATGAAGCTTTAGCCATTTCTTCCAGTCAAATCT 
CCCTCTAGGCCCTGACTGTACCATGTACTGTAGTATTGCCTGTAGCITCTTCA 
75 CCCACAGTaCCTaTCaCaGTGCTCTTT-GAGCAaaGAACT 
CTTGATGGTATTATCCGTGGGTTAGTGATTTATCCAG

AATAAATGACAGGGTCTTGCTCTATCACCCAGTCTGGAGTGCAGTGGCACAATCAC 
AAATGATCCTCTTACCTCAGCCTCCCAAATAGCTGGGACCATA^ 
GAGATGAGGTCCTGCTGCGTTGCCCAGGCTGGTCTCAAACrCCTGAGCTCA 
5 GATTACACGTCACCACACCTGGCCTTCAAAATATTTAACCAGCATGGC^ 

GGATTAGCCCTGTCTTTATGCTTTTATTGTCGGTAAAGTGGATACAGTCCCT 
AGATAATGCATGTGAAAGTACTCTGTAAAGTACAAAGTGCTATA 
TGTGTGGGGTTGGTTGCCCAGCCCTCAACATGGTCTCATGGTCTGCTCTC 
AGTTGTTGAGTAATTCAGCAGGACACTGCAAATCTGA 
1 0 CTCCTTCAAGGATGGCCCCTTTTT AGG CCTGAAAAAT AT CAATGTTTCAGT AGAT AT ATTTTGGAAAAG CT AAAT AAAACAAAAAC 
CT AAAAAGG CTTTTTGAATTTT AACCTAAGTTTT CCTT ATTT^ 
CCTTAAAGATATCTCGGCATCTCAGACTCATCrniATTGTCACCATATG^ 

AATG CCAGAGACATT AACAT AG ATT AAAGGGACAGTCT AAATCTT AGTCACTG CTTTGT CTTT AT AGTTTCTTG CCTCCTCTTTTT 

ACCCTTTTACTTCCTTAAAAACTGTCTTAATTTCCTCACCATCGCCTTCCCA 
1 5 ATCCTTAGCCACTTTTCTCTTCCAAATGCATTTCACT 

GCCAGTCAATGTTAATGTTTGTTCTTATATGCTTTGATTTAAATGAAGGA 

AGACTGTAGGGAACAGACAGAAAATAGACACAACCCTTTTACTTACTCTT^ 

CACAGTGACTCATGCCTATAATCCCAGAACTTTGGCCGGCCGAGGTGGGAGGATCGCT^ 

GCAACACAGCAGGACCTC^TCTCTACAAAATATAATAATAATAAAAAAGAATTAGCCTGG 
2 0 GATACTTAGGAGGCTGAGGTGGGAAGGTTACTTGAGCCCAGGAAGTCAAGTGTGCAGTGAGCC^ 

CCTGGGCAATAGAGTGAGACCTTGTCTCAAAAAATAAAAATAAAAA^ 

GATGAAATTGGAAATCATCATTCTCAGTAAACTATCGCAAGAACAAAAAACCAAACACCGCATATTCTCACT 
GAACAATGAGAACACATGGACACAGGAAGGGGAACATCACACTCTGGGGACTGTTGTGGGGTG 

TGGGAG ATATACCT AATGCTAGATG ACG AGTTAGTGGGTGCGG CACACCAGCATGG CACATGTAT ACATATGTAACTAACCTGCAC 

2 5 ATTGTGGACATGTACCCTAAAACTTAAAGTATAATAATAAT AAAT AAAT AAAT AAAAA 

GGTAGATATT AATAACTTTAAAACAT AATGATAAAAT AACCAGGGTGAT AATTTGTAGC CC CTTCTTGCTACTTCCT AGCTGTGGG 
ACATTGACTAGATATTTTTACCTCTCTTGACTCTGTTTCTTCATGTGTG 

ATATTGAGAG CATGTCTGTAACATGGCCTAGTTCACTC CTGGCACATGGTGGATGGTTCAGTATATGGTGGCATTTATTAT ATTCT 
CATTTAGCATGCAGGCCAGCTCTGGGAAACCCTCAATACCC^ 

3 0 GCAGTGGTATCTGGTGGAAAGGGCAGGGTTTACCTCAGAAGACTGGAACTCAAGTCACAGCACT 

TTAGGCAAGTCATACCACCTTACTCTCTGTTTCCTCACCTATAAAATAATGCAAATAAT 
GATGAGGCAACGTGTGAGGAGAGTGX5TCATTCATAAAACGCTAATAAATAGAGGTTATTAGTAC 
CATGTGCCTACGTGACAACTGTCAGGCACCGTAACTATTTGGGACCCIACAAGTTAACCAGA 
AACCTTGTTCTCCTCAAAAGATATTAACACrTTTTTCTTATCTTGA^ 

3 5 GTAAGACATTATGAAG CTCT CATTCTG CAAAGAG C AAATT G AGGCTCAGAAACAAT CAGGG CCAT CC CAG AGCCAGG CAGCCTGGG 

TTTGAATCT<2AGGTCTACTAC!TTACTAGCTGTGTAAACTTGGACAAATTAC 
TGGCACCTGTCTCACA.GGGTTAATATACGTAAAGGCTGTCT 

AAAAGAGAAGACGGGGGTGGCTAAACCTTGAATGTAGTTCATTTCTTCTGGTTAGATC 
TTCTCCTGGTCTCCCCCATCTTCCAATAAGTAACAACATTGTTAGTTTCAGTGTTC 

4 0 C AACACAT CAATTCATG ACTT AGGATG TTCTG AAG CACATT CTTTTTTGTTTC ATTTTG TTTATGG AGACAGAG T CTTGCTCT ATC 

CCCCAGGCTAGAGTGCAGTGGTGCAATCTTGGCTCACTGCAACCTCCGCCTCCCAGGT^ 
GAGTAGCTGAGATTACAGGCGCCCACCACCACACCGAGCTAATTTTTGTA 
GCTGGTCTTGAACTCCTGACCTCGGGTAATCTGCCTGCCTCAGCCTCCCAA^^ 
GCCCCTTGAAGCACATTCCTATTCAGCTCTTGACTACTGAATAGAGACGAGTT^ 

4 5 ACCGGTGCAGTTCTCACTTCAGAGTGGCTCTCCAGGTCCTCCTGCTCACTTCrCTCT 

GrcTTGGGGGAGTGCGCTTAAAACCTAGGATTGAAGGGAAGCTTGGA^ 
CTCATGTTGAAATGTAATGTTGGAGATGGGGCCTGGTGGGAGATGCCTGGAT(ZATGGG 
CCTCCTCTTGGTACTGTCCCCATGATAGTGAGTTCTCATGAGATATATCT 
GCTTCCCTTGCCTGGCTCATGTCTATAGTGCTGCATGGGAATGGTCAATGTCTATGAG 

5 0 CAAAGAATCAAACTGTAGCATAACTACCCAGCCAGATGTGTGGCACTTCCTC 

GTGCCTGCTTTTTCACCCTCTACCATGATTGTAAGCTTCCTC^ 
GCAGAACCTTGAACCAATTAAATTTCTTTTC TT TATAAATTATATTTTCTT^ 
ACTAACTGCGAAGGAGGACCCGATGCTTGAGGGGTCCAGTCTCGCTGGCAC^ 
AGTGGTC^TAGTTAAGGGACAAAAAAATCATGGTTCCAGTCTGGAATAGCCAAACAAAGAAAA 

5 5 AAAAACAAAATAGTGCTTAAATCTAGCAAGAGAAATAAGAATAATTCATCACTGTT^ 

ATGTATTTAATCAAACATCATCTAATCCTAACCACAACCAAAGAAGGGGTGAGGAAACTTTCAGA 
AAGACTCTGTTCAGTTACTTITTACCCTACATGGTGATGTGATGTTTTTTT 

TT AAAGAACAGTTTCTGTGGTGAAAAGCAT ATTTTACAATAAG ACTATGGT CAGTGGATTTCTATTTTATGTCAG CATACACTGTT 
TCCC^CAATTCCATCATGCACTGGAATGAGGTTATCTATTATCTAGGCTTGT 

6 0 ATTGTCTGAGGAAGGCTGAGGTCATGGCTAACCAGTAATTCTCAGATGGGCCTTATATTATTTTGT^ 

TCTCCAAGTTGCTTCTTCGTGCCTATTGGGCATGTCTGAAAATATGCAAAAA^ 
ACAAATATGCTCCAGTGAGGAAGGTAAGAGCCTGACCCAGATTTTCTTCCCAGTTTGT^ 
CCTTTGCCAGCCCTCCTCCCAGCAAGCTGCTGGCAGCTCTCTGTGTCC 
CCCTTGCCTGGCTCATCTCTATAGTGCTGCATGGGAATGX3TCAATGTCT 

6 5 GAATCAAACTGTAGCATAACTACCCAGCCAGATAATTCAATTGTTCC^ 

TATATTCAGTCGGCACTGACrTCAACAAATGCTCrrAGAGTTTGAAGTCATAGCGTO 

CAGAACTGAAGAGGACCJ^CTTCCCTCCTG ACTTCTGAGACT CATGATT CTTCAGACATGACAAGTCCAG ATCG AGACTGCAATCC 
CCAAGGAAGGCCAGCTGTCTTGCTCAGCAGAGACAATAGA.GCTAAAAGCTCTTTCTCA 

7 0 CATTTATAGACTTCAACTGCTCATCCAAAAAATCTAGGTCATTCT 

GGAGGAGCGTTTCTGTTGATGTCTGGTTTGGCTGAGGCAGTAGAGAGTGOT 
GGCACTGTGGGCCCTTAGGGGACGTGAATTTTCACCCCCTGGCCACTGACT 
TG CCTTGTGTTGATGTCATGAAAAGTGG TGGGTTTAAGGATT AGG CTTT CTA^ 
GAGAAGGGCTGGGTGGGTCTGTTGGGGGTGTAGGGCTTCCTGGGGAGGAAGA^ 
7 5 TAACATGGATGGCCAAAATGGTAGAGCTCTTCCAAGTATATTCCTACCTGC^ 
AGAGCCTTGGGCTGGGJVGTGAGAGGGCCCTTCT

TTTTTAAATTTCAGCTCATGGTTCTGCTGCTGCATTGAGCTGCTGT^ 

CCTAAGTGCCACAAAAACGTAACATATCATTATCAGGTCTTCTCACCCAGAGAGAATAGTC 

5 CTCCTTCCTGCTCTACTACTCTGGCCCCAGCAGGACAGTGAATATCAATAGAA 
GTCTrrrCCACGTTGGCTTCTCrrGCAACAGGAGTrGAG^ 

GAT T 11TCCCTCTTCTGTGTCTGTGGCCITGGTTGTGGCACCAGCTGTCATCACTGCC 
AGCCTGGCCAGCCCCTCTCCTGCTTTAGTAAGACTCCGCTTAAAGGCCTGTTG 
AAAGCTTGTGTATTGTCCATGTTATGGTGTCCCCCAGCCATTACAAGTAAATAATTAAAAGC 
1 0 CTCCATAAATTCCTGGACACCATAATATGACAAGAAATAGACACTAGTCTAACT 
TGACATACCAGGTTATCCCAGAAAGAAGAACAGGAa\AGCTTT 
TAACATTTTTGTTCrrTGGTTTAGAGCACCAACCACTTCAAGA 

AAGTCCTTGTCTCCTGCAGAGGAATTTGTTG ATCTGG ATTAACTGTCCAGT CATTGGGCAAGGCCCAGAGGG CATGAGATTAGGAG 

TGAGATGAGGAGTGGAATGGGTTCAGGAACACCTGGGGAGCAAGGAAACCAGACA 
1 5 CCAGCCCTCTTGAGGAGGCTTGC^GCTGCTGTGGGTGCATGCT 

CTTCCTAGGTGCATGTTCAGTGACTTCATGTTGGTAGCTTGAAACTGGCCATA^ 

ACTACAAACCAGGGTTCCTCTCACTTCTCTCCCAGGAGAGCTTTAC^ 

GGATTCTCACTTTCCCTGAAAAGCAGGACTTAATCCTTAGCAGATCACTTC 

GGGGCTGGGGACTAACAGAGACCAAAGTGGCCTTGGTGATATGTATGAAGTGGCT 
2 0 TGTATCACCACATTGTTTCTTCACATGCACTTCCTGCCTAAGTTGCTGC 

AACAAACTCTACTGATCTTATTTTCAGCTTGCCAATTT^ 

AAATAGGTTTTACTTCTTTGGAATATTTATCACATAGTTATTTATTTAGAATTCTGTC 

CAGCCT AACCATGTTTCTGG AG AT ACTGACCT C CATT AT CAACCT CAGCCTT AC CACAGG CT CTG CTCT AACCTTGCCC CTAGGTT 
TTCTGATCCTAGCTTTCCCATTACCACCCTCTGGCCTCTGGTTGTGGCACACCTGTTAT^ 

2 5 CTTGCCTATCCTTCATATGCTGATCTGTCTCT 

AGGAGAAGAAGTGACrrGTTCCGGAGTTGCIX3CCTGGTGCrGGACGTTTG 

CCATCTCATCCCCCAGACAAGCCT/TCACTCTAGGAAAGCTGGCCCCCA 

TTTGCCAGTTTTCTCTCACGAGACTCTGATCATCAGGGAGCAGGTCrrCTCT 

GCAGITTI'CATATGGTT CTTCAG ATCTC CTTTAGTACCTAGGACAGTGC"1"I'TCCATAGGGTAGATTCATTAT CAATGTTGGTT AG T 

3 0 GGTGTCAAAAATTATTTCTACTTGAAGTAGAATGAACAGTTGCTATAT 

C CAAGT AGAGGC CAG ACAAC CACTTACCTGTGATGGTTTT AGGAGGGAAT AATGCATTTGG AT AGAAAGAA C 
TAGCCTTTTCCACCCTGATTGTGATCTCATAAACTATGGGATGCCAGCACTGAGTCTAACGGTGGA 
GACAGATGACTCTCCAGAATCCGCTTAGTTTTTCTGTTTTCTTC 
TTCTCTAGCAGCTTACATTACTCCCCTAGTCAAATGGCAGAGTATCTAAGATCAA 

3 5 TAAAGAAGTCTTTTCCTGTAAGTGTAAGGAGAAGCAGATXK3TAATGAAAA 

TATTAGAATTTTTAGTATGACACCCTGCATTTTAAAAAGCCTAAAGATGAAAACAA 
AGCTTGC^GCAATTATCTCAAAATATGAAAATAAAATCTAGCAGCCTCGAAGG 

AGGTTGTTCAAAAAGGAAGAAGAGCATGCTGTACGGAATGAGGGCAGGAAGGGAC^ 

AGAGGAGTTACCTCGGAGTTCTGGACTGGGTGTAGAGAGAGAGTGTTTCTGTCTGCTCTTTCCCT 
GGCTGAGTGAGTCATCCTCTGCCAGCCTTACCATTTGCCAGCACTCACTCCCACAG^ 

TGGG AACGT CACTGAAACTGTGG AAAAGCIAGTG C AAGGT AAAGGTGAAGTGTG AAT ATG ATTG TG AAT CTTAAG CACAG AAC C CTG 
AGTT CTT AAAGG CAATTTTTAAGATT AGT ATC CATCAAC CAAAGTTCACAAG AG TTTTT CAGAAATCACCAGAGT ACAAAAG CT AT 

4 5 TTCACGAAATGTGCAGTGGCTGTGACCTTTAGTGAGATT^ 

CCCACTTTATTCTTGAATTCCTTATCCTGACGAATGGTGATTCAGCATGCA 
CAGACAGACCTTGGGCCAGGTCCCAGCTCCrCCACCCATTGTCCCTTTGACCTAGAAC?^ 

GGGCTTCAATAAATGATAGCTGCTATGCTAATTACAATAATGCT^ 

5 0 ATGGCAATGCTTCCATCATTAGACAGTGCTGATGGCTAGAATTTTCTGCCACA 

TCC^ATCTACTCrTGGTGCTGCCTTTTGTGGCCCCATAGAACAAGTCT 
AATTTTAATCTCTTCCCCAAGTTTTCTTCATGGTAAACCCTTTAGT^ 
CCAGGACACGTCTAAGTCCCTTTTCCTCIACATATGAGTATCTAT^ 
AGTTTTAGATGCCATCCATACTGAATAAGCTGGCTTTTGTCC 

5 5 CACAGTT CCTTCTTTTCACTGT AGTTTCACCAG AGTGTTCAGGAAT ATG AACACTG CT ACCTGTGAGG AAGNKNNNNNNKNNNNNN 

NNNNNTTGCAGTTCCTGCCCTTTGACTTTATATTCAGTAGTGTGACT^ 
CAGCTCCAAACCTAGGCTTCAGAGGCCCTGGGTATTTCCAGTTACT 

CT AG CCTACTGGCCCAAGGAAG CAGGG AGGCATGT AGGCT AGGGCCCCCAG C AG CACCCAGCCTGCACTAGCTGATTGTAACCAAC 
TCACAGAAACTTGAGCAAAAGTAAGTGATTACTGTTTTAAGGCACTGAGTTTTGGGGTG^ 

6 0 G AAAT AATTATT ATGGAAATT ATGTGGGGCCAGAAAACAT AAAG T AAT ACTAAC CCTAC CTGG ACAG AGTACAAGTTT AGAAG AAT 

TAATGAAAGATTTACAAAACTATAACCAAGCTAGATGAACACTTTTTAAAAATGATTTACCAA^ 
CAAACTGTGACCAGGAAGAAGATAAATGTGTATATTACATTTCACGAATGAAATAGGTAATGAC^ 
TATCTGTGAAAAGTTAACGTGTTTCCTATTCGATATAGTTCACT 
TTCTAGGCCCTGGAAATATAGTTCTGGAAAAAGTGGACAAGGCCCAGGTCT 

6 5 AAAAAATTAAATAAATAAAGCAGATAACTTCAGATAATGGTAAGTACCATGAGAAAACTCAGGC^ 

GTTTAGAAAAGACTTCTCTGAGGATCTGACATTGAAACTAAGACCAAAATGAC^ 
AGAAAATAGCAAATGCAAAGGCCCTGAGGTAGGAATAAGCTGGGAGTGTTT^ 
GTATGAAATGAAGTGGATGATATTTAAAGAGGCCAGGACATGTAGTGCTTTTTTGGCCACAGTAA 
GGAATGGAATCCACTGAAGTGTACTAAGTGCATGACTTGATTTATAGCTTT^ 

7 0 TTGTAGGTGGACAAGAGTGGAAGCAAGGAGGTCGATTAAGTCATTGCATCATGGTGGCT 

AGAAATGAACAGGTTTGAGATATATATTGGAGGTAGAATCAAGACTrCCTC 
ATAATGGCTAGGTTTTTAGTTGGGGTAACTCAGAGGATGGTTGGTGGTGCC^ 
ACTCAGGGGAGGGAGAGATCAGGAGTTTTGTTTGAATGAGTTAAGTCTGAACTTA 
TGGACATACTAATTGGGTAGCGAAGGCTGAAGATACCAGTTGGGCAGTCATCAGCATAT^ 
75 TGAGT _ TC^TC"TAGAAaATaGTAGTGAAAGTG ilTTAi in i AATTTAAAATTTAAGAAGii I T AAATTTTAAAAACrGTCAAAAAA 
TAAATAAAAGAAAAAGCGGGAGGACTCAACCACAGAGCATGCTGA
CCTGAGAGAGAGGCTGAGAGAGTGTGATGCTATGGAAGCCAAGAGAAGAAAGT 
GTTGGGTTGJUITGCTACTCAGTCATAAAGTAAGATAAGGATCAAGGA 
TTTGAAAATAAAAGATTTAGTGGAATAATGGACACAGAAGTCAAATGGAATGATTC 
5 CAGTCAGGTTTTGCTAAAAATGGGAATAGAGAAATAAGTAGCT 

TTGGAGATGCAGAGGCATATTTCAATGACAATGGAAATAAACC CA 

TGCAGGATCAACGTCCTTGAGAAAGGGGGAGACCCAGAG^ 

AAGGGCAACAGAGTACCAGTGAAATGGTGGATTTGGGGATGGGAAGA 

ATTTTAAT ATAT ACACTIT AATTATTGAG ATGGGCTCATGCTCTGTCACCC^ CAG TGATGTT ATCATAGCTCACTG C 

1 0 AGCCTCAAACTCCTGACCTCAAGCGATCCTCCCAC 

ACTTATATTTTCTCAATGAGATATGACTTrAGATCGTCGTTTGAGG 

CTTCATTTACCATTTGGTACATTTCAAGCAGAGTGAATATGATAACCCC^^ 

CAATTCATGTATATTCTTTCATTTGATACTCTTGCCTTGAAT^ 

AAGACATTGTAATGAAGGGACTATTTACAGAGATGAGAGTGGAATTAAGAGATOT^ 
1 5 ACCACCCTTGGGGCCAGGGGGCCAAAGGAGATTAATAGTATTAGATCCAACGAGAGCT 

TGATCATTCATTGAAAAGATATGTAGTCACTGTCAGGGGAGTGGTGT 

CCTCCACTCCCAGTCTCTTGCTGGTGCTTCC 

ATGGTGAAAGAATAGATCTGGCAGAGAATAACCAGCAGAGCTCCT 

CTCCAAGGGGAATCITTTTACCCAATGCCAGTGTATAAATC 
2 0 CCATGTTCTTTAATACAAATCATCATGAGTACTTGT 

TTCCATCCAGAGG CAT ATT AAC CATAAAACTGGAGAAG CATAT ATTTCAGGGCT CCTCACTTGCATGAGCCC CTTCCAAGGCCATG 

GGACGGCCCTAGAAATGTATTCATATGGTCATTTGTITTTGTAAAATTTGT 

CCAACTTTTCTCACrrCTGACCCCAGAGAArrCTCTCCCTGCCCACCCCCCAACC^ 

GATCTGCTTCTGCT'TTCAGTGAGACTCAAGTGTGCT 

2 5 GGTAGTTAATGAGCATCAAAATGGACATTCTGCAAGGGAAGAATGTAGCAGGAG^ 

GCAGTGGAAGAAAATCACTGTGCCTGAAGGATTGCACCTGAGTCA 
CAGGGCCTAACTCAGGTGCTGCTCTTTATATTTTCAGATC 
GACCATGCACCGAATTGTCATCTCCTCTGCAGAACTGCTCCAAA 
CAAAATGGGATCCACCCAAAAAGTAGCTTGAAATTTGAAGTATGCTTTT 

3 0 GGTTCATATGTCCTTTCAGGGGTATGGGTGCATGTGTTGGGGGTGTGrrCAT^ 

CTGATTG CTCTGGG^GGTGGTCAAAG AAATCAAT ATG CAAG CACAATCTTCAGG CT CTT AAT AGT AAAAAAT ATTTTTTCTGTAGA 
GTGTGCAAAATAGATATTCATCCTGTGCAAATTGACAAAGGTGCAAGCATGACTCAAGGATA 
AATTCTTCAACTTTCTCCAGGTTCTGAAGAAAGATTCCCAGTGCCACAGC^ 
AGGAGGACTGACTATGTGCCAAGCAAAACCTGCTGGACTTTAACCT^ 
3 5 GCTTGCTTTTTCTCACTGATGACACTGTAAGCAGCAGC

GGCGCAGGTCCCTGGTACTGTGTCATGACTTGTAGCTGCTCTAACAClTTr 
ACACAGAAAAAATAAAAACCATCAGGACTGAATCCCTTCTCTATACCTACTCATGAG 
TTCCAGTAGCACACAAAGTGGGGGAGCATTGCGAGAAGTGCAC^^ 
AGAATTTTAAACCATCATGAAACCAACTCAGTGTCTAC 

4 0 AATATTCCTCCCTAAAAACrrTATTTGTTGGCCTAGGGACTAAATT 

CCAGCACTTTGGAAGGCCAAAGTGGGCAGATCACTTAAGGTCAGGAGTTTG^ 
TACTAAAAATATGAAAATTAGCCGGGCATGGTGACATATGCCTGTAATTC 
AACCTGGGAAGTGGAGGTTGCAGTGAGCTGAGATCACACCACTGCACTCCAGC 
AAAAAAAAGAAAGAAAATTGTACCTCG<K3GTTCCTGCCCTTAGTGCACTACTGG 

4 5 TTTCCCATGCTCCCTCGTACTTCTCACAGAATCrT^ 

GCACCTGCCCCTCAGCCTAAAACACATTCAAGTCTC 

GGGTTAAAAATTTGATGTGAAAGCGTCTTAAAATAGTCTACACTCACTATCTCC 

GTTAAGTTGGCCCCCAGTATTCGAACTTTCCTCTGATGTO 

TCTTTCCATTGACCCCCAGAAGCATTAGGCATGACTGACAACCCCGTTCTCCTC 

5 0 CCCTCCTGGCTTTCCTACCTCTTGGGTTGGGCCTTCTCCAGCTTTTCATC 

ATGTCCTGGTTTTCC1V1TACTCAGCTTCCTTTCTGTGGGTGA 
GCTGATGGTGACTTTCAAGTCTGGATTTCTCTAAC^ 
TATATTGCCTAGTCTGGTCTCAAACTCCTGGCATCAAGTCATCCTCCCACCT 
CCACGGCACCCAGCCATCTAACCTAAACCATTTTTCT 

5 5 ATTAGCCCACTGTCCCCACCCACAGCCACTACCTCATTAATTCTGCTTCGACACCAGTACAGCCTTGTTTTGCCT 

TGAGAAATACATCATTGGGCGACTCTGTCATTGTGTGAACACCATAGAGTGTACTTAC^ 
CTTCTAGGCTAGACXXSTACAGCCTGTTGCTCCTAGGCTACAAACCTGTACAGCAG^ 
GCACATGGTAAGTATTTATATACCTAAACATAGAAAAAGTACAGTAAAAATATGGTATAAAAGATTAAA^ 
GGGCACTTACCATGAATCGAGCITGCAGGACTGGAAGTTGCTCTGGCT 

6 0 ACATTACTGTAOVTAACTGTAGGTTTTATAAACACTATATACTTAGGCTATACT 

AATAATTTAGCTTACTGTGACTTTATGTTATAAACTTAATTm 
TTCTTTATATTCTTCTGTAAGCTTCTTTCTGTTTTAAAAATGTTTAATGTTTATT 
ACACAAGTATACACAGTAGCCTAGGCCTATATTGGGTCAGGATTATGAAGACAGGATGAGGCAA 
TAATCTCATGGGACCACTGTCATATATGTGGTCCGTCATGGACCA 

6 5 CAGTTAATAGTGTTAGTATCTATTCrrTGCCTAAACCAGTAACATCAGAAGC^ 

TCTGTCACCTGAT CCT AGGGTTTCr ACTTCAGAAATG ACTCTTATCrGGTCTCT CCCAT ACCATGCCTTAGTTCAT

GTCTCCGTGGTCTCTCTCCTGAACCACTGTCCCACTCCAGTCTTTGTATCTACA 
CTTGAAATTCTTTCCTACAAAGCAAATTGGGCAGGCCATTGTTTCT 
TTGCCAGCAAAAGGGCCTCCTCAGTCTGTTCCCAGCGAGrTTTTCC 

7 0 TC^CTGAACTTCTCAATATACTGGATGTAATGGGGCCTTTTATCATC 

GCTCTTTCAAGACCCTTCTGGGAAGCTTTTCTAGACTTTTCCAGC^ 
TATTATTCATTGACAATCTATTTAATTCCTITGATrrTAGr^ 
GATGTAATCTGCATTGAAAAACTCACAGAAAGTGTGAAGTGTTTTTATTCTGT^ 
CCGTTTTGCAATGTAGTGCCTATGTACTCTGGTCTTTATAGATGACCGTTTCA^ 
7 5 T^ATTTCAATTTGGTTAATAATATAAAAGATTGCAAGGGTACTGAAAAA 
ATTTTAAAAGTATGTACACTGTAACTTACOTATTTGTGTGTCGTCTCCCCCGCTAGACT
TTATTTATCTTTGTATCTAGAACTTAGTTTAGTGGCTGTTTCATG^ 

TGGGAT ACTTTTT AT ATTTACAAG AACITTTAC CTG TGATAT CTGGGGT AGGGAAAGG 
GATrCTGAGAGTATGAAATAATACTCCTGTAATGAAA^ 
5 TTCCCTGTTCTCTGACCTGGAAGTAATAGAATTTTTAGTTAAATAAGCT 
GGGATAAATTTGAAAATGTGCTTTGAATAGTTTAGGTGCCTC 
TAGCTATGAAGAAAAGAAAAAGGTATGGGGTCTGAGGGTTCCAGCAAACACT 
TTACTATTTAGCCTGCATGTGCTGGTGTTTGCTTCAGCTCCTC 

CCTTTG CCATGGCATAGGGG CTCT AAGG ACCTCTGAATG ATGC C CAGGCT CAGCACTCACGTGT AG ACTGACTGGATTT ACAAC AC 
1 0 TGACGCTG CACCACATG CAG CCAACG TGCACTGT CAG AAT ATACTTTTG AAGGGAAGGAT AAGGAG CCACTC^ 

ACACGGTAAAAGTCACCATGTCGATTTAATGTATCTACTTAGCTTTAAAACAGA 

GAAGAATAAAAATTTCAACAAGTAGAACAAATGATAAAAATATAGAATCTGAG 

CTTTGCATTGAAGTGATGC^TTTTCTTAGTGATGTGGCATA 

AAGAAAGGCAGGCATAAAAACTTTCTACTTCATGA 
1 5 GTAGGGTTTTTTGGGTTTTTGTTTTGTTTTGGTGCTCT

TTTAAAACTAGAAGTCATTTTATTAAAATTACAAATTTT 

ACTGTTTCCATAAGTCTAGCATGTTTCAGCAAGCATTTTTAAGG 

AT ATT CCCAATGATTTTAAACTACAGTTCTGCATTTTAT AT CTTAGAT AATTTTTAAGTGCTT CAGAT AC CTCCCTGAATTCT AAA 
TAGTTTITTAT AACTCTAAT AGATTAAATTTAT AAACAG T AAAGCT AT CATT AT CTT AG ATGTTCCAAAGCT ATG CATAAACTT AA 
2 0 TAAGGTTTCAATTTAAAACCACAAGTGTAAAGCACTTAATTATATATTC 
TTAAAC^TTACAAGCATATACAATTTCAAAATGCACAGATCT 
TGCTTTTCITTATTTAACTCTAAATTTrCCAAAGGTTAAAATATATACACC 
AAATGTTTGCTGGGTTGTTCACTGGATTCCTCTCATGGGCC 

TTTrAGAAGGAAAACRAGGCATGGCCTTTACGGG CGGGCACAG AGCCCAC^T AGCCTG ATGCCTCAGTCTGTTG ATGGG AAATGG C 

2 5 aGGACCTATTCGGGCTCATCACTGCATTGGAGTTCTCATCT 

AAGACATGTGAACTTGATGAGATCTTGCTATTCTTTCATA^ 

TTTTGTTTrCGATTATTCACACGGTTTATCTGACT 

TTGTAACTTTCATATTATTGTTCTTCATTATCATGACA.TAT 

CATGTGTATCAGTTAAAAGAAACATTGTAAAAGATAAATGGGGCTGCrGl'lTlT'l'C 

3 0 GGGCTGC'rrrrrGAGCACAGGCAAGGAGTCATTTGCCCCITCTTCAC' 1 1 

TCATTTCTTAGTTCrGCAGTGCTGCrcCTTCCT^ CTGGAGAGAAAATTACCAT CATCGCCTTTC 

TAGGCAATAATATGCATACAGCTGTACACATCAAGTTGGTTTC 
AGAAAAAATATTTTGGGAGGTAGTTTCAGCAAATACTTCCAAAGATGAAAAGTCG 
CTTAACCACAGCCCTACCTATAAG£ATG<!TTTGGCAAAG 

3 5 TTTCTTTATGTATTTCAAGACTTTGTGGAGCAAGGTGAA^ 

GAACACCTTTGGACAGATGGAAAAGGGTAGTCCTTCATAACAGTGAATTTTCrCTGGTC 
C C1T1T1T CAGTCTTTGGTATCAAATGACATGAAGGAG 
TAATTATACAAGTAATATAGCTCATTATAGAACATTTTTAAAAATC 
GTTACCTATTTTGGTGTATTTCCTTTGTCTTGTTTTCATG 

4 0 GTTTTCACTTAGCTTTATAGCATGAGAATCTTCTCTAGTCATTAAAGACX^ 

CGAATATGATAAAGCATAATCTATGAATAGATTGTCTCATTTTC 
CAACCCACATGATTGTGCCTGAATCITTGTCATGGCTCCTTTGATTATGC^ 

CTTTTTCTGAGTTCrCTTTGTGGCAGTAAATTCCTTTTG^ 

4 5 TTTAACCCCCTCCTTGTAGGTATTGGATAACAGAATTCTGGGTCATGTTTAAAATC 

TTTCAGGAACTGIGTGAAAGCTAAGGGTGAGGAGTTACATTGCCGCCTGAT^ 
CATCTTTCOAAACCACCrrCCCrCCTGTAATATAATCCCAGTTGCTTGTTGGGCTTCTGACCC 
CTCCACTCTGAGAAGCCCCTCAAATAGACAGGTTCAAACTGCCTC^ 
CCTCATGGTGATAAAGCAGTAGGTGACATATCCAGGAGCAGAAAAAGTAAGTCCCAGTCA 

5 0 ACTAAAATTAATCCAGATAATGAAAAACTGAAAGTGCAAATGACTG 

ATGGGACAGGACTAAAAAATGTATTCAGGAGAAGGCTTATAACATATATGGATC 
ATGGGATTCCAGGAATGTTTTAAATTATTCCrrrGGAATTCrTATTTA 
ATTAGGGGGAGGGGAATGTATGCTTTGTATGCO^GTTTT 
ATAGAATTCTAGAACTGATATATATTCTAGAGATCATTTCATCCAACATCCTGAT^ 
55 AATGTGACTCATCTAGGGTCCAGCTAGTTAAGCTAGTGACAGAGCAGCAGCTAGCCrCT 
CACGTAGAAATGATTTCAAGATTACATTCCAGAGCCCCAAATAAGTCTGTTTATCAG 
ATTACACAACTTGAAAGATAAGAGATTATTCAAAGTAATA 
CCCCAGCTCTCCTTGTCCCAGTATTT'CCrAGCTGACTTTTCTTGACC 
GTGGGGATATTACTTGGCCAGATATAACTGAGGAATAATGAGTTAAACITTCCT 

6 0 CACTGGCCAAAGAGGCTACAGAATTTATAGCACTTCCCAAGCT 

AATACCAGTTTXjGAAGACAGGCAACCTGCTG^ 

AAATGCATGGGTTGATTACAAGAAAATAATGGGGGCAGGGCGCACTCT 

GGACATTTGAGGAAAATGACACAACTCTGCGAAGAGTGACCATATC 

GGTTCrTTGAGCTCrCTTTCTGGCATGAAAACTGTTTC 

6 5 AGAATGAAGGAACACACAAGCAGAATGGAGGGAGCAGACATACCAGATGATACAC 

ACCGAT ACCTGAGACTCCTTCCTTCCCACTG CCTTAAAGT AAACTCATTTATCCTGTACCCAACTGCACACTTAAAAGG CTCACTG 
AAAACCAGCAAGACCTCCTGGAGGCTGCCATGGAAAAAAACTGATTTGGAG 

G AAAG TT AG CT AATGG TGTTTTG T AT ATT CTTGTG AGAAATGTT AATGG ATTT ACTTGOVAAAAG CG ATTTATTC CTTTTCCCT^ 
CTTCTCCCTTTCACATTTACCCACAAAAGCAATGCCCGTG ACTGGT CCA CAA 

7 0 GAAACGGAAAGTCTCCCTGCTCTTTGACCATCTGGAACCAGAAGAGCT 

GGAGGATATCTGGTATGTATGCAAGGGGCAGGAGAAAAATCTTCCCrGGGATATTTCTGA 
CCGCTTCATAACCTTAGAGGCCCCCTCAAGTC^TGGCAGGCTTTCCT 
AATTCCTTTGATTGTGACTCCTGTGCTTCTCCCATCTCTCTTCTTCCATGCCCCCGCCC 
ACCATACHXjTGGGCACACATCACACAC^CACACACACACAC^ 
7 5 CCTCATACCCCTGATAATCCTCATACCCCTGATAATtlATTAGGCTrCAACA^ 
GGGGGGGTGCGGGGGCGGGGAGAGGATATG^

CAAAATAGATTCTACTGCCCTAGCCCAGAACATACCTAAATCCTTGAAGTGGTCCT^ 
AGCTGTGCTGACCCAAAACCTTTCAACTCGGGCAACrr 
GAGTACAGGTTCATCCTTTCCTCACTGCAATCCAT^^ 
5 AGATGCCAACAGACCCTCACTGGCTAATGTGCTTTCCTGAGAAATC 

ATGGAGGCCCAGAGGGCTGACTTGAACAAGGTCGACACAGCTGATTACA^ 

CACT CAG C^CCAATTTGTTGAG CTCAG TTGTGT CTGGAAGGA^ CAGGTT ACAAAGG TTATTTT ACATTTTCATT CA 
TAATATTGACAGCCCTTCTGTTTTCACGCAGCTGTTTGGTCACTC^ 
1 0 AAGCATTAACTCTCCATTCTCTGCACCCCTCAGTACCCCAGA 

C CTCC CCGT CACAGGGAAGG ACAGCAGTCAGAG CTG AG C CACTG AAAAAAG CCATTT ATCACACAGGAAAT CTG AG CT CTGCAGGG 
GT CAAACTGGCAGGCTGCTTGGTCAGGC CTGCAGGATGT CTG CTTG ATG C CAATGCTCAC AT CAGTGGAGG CAG AAGTGAAAAATC 
AGTGCAGAACGAGGAATCCCAGCCAGTGTCCCCAGGGGTTGAAC&GTC 

TT CTTTTG AGTGGCTTGGTCCCTT CAT AG AAGACT ATCT CAGGG TTTTTTTG TTTTTC CTGT ACAG TGCT AAACAGTTTT ACCAAA 
1 5 CACAAGTTITCT CTCCAACATG AGG TTGGAGGGAAATTT AT CTG TG CTGTG AATTG AAAGGCATTTGAT AT CTTCTGTTTATTACA 

AAACTTTAGGGGTAGTGAGGATGTTACTGGGATAATTGTTTTAAAAAAACAAG 
GTATTAAATGTATTCACTC^AATATATTAATTGAATGAATATATTCAAAATAT 
CAGTAAGATGTTTGCTTTATATATCTCTCTTAAGGAGC^ 

ACAGTAAAATGGGG CTCAGTT AATTT G CAG CTATTTT ATTTGGGTTGTGACTTCT CAT CT ATAT ATTT ACT ACTT AGGCAG AACTA 
2 0 AGGTGTACTGTAGTAATAACCCCCCACCCACCCACCTCCTGTACAATCTGAGTGGCTTAAT^ 
CACTGCATGTCTAACATGGGTCAGCAGAGGCTGATCAGAGTTCCCCAGGAACCC^ 
CGAAGATCTCTGCAGCAGTAGGAAGGCAACATGGCAGACCA^ 
CTCAACATTCTGCTC^CTGAACCATGTCATATGGTCACA^ 
GGGGAAAAGCACCAAAGACTTCCATAGGAAAGTTGAAGA 

2 5 AATAGCTACAAACATCACTTAGACTAGATCTTCAGGGGACATTTCCAGT^ 

CTATTGTCAACTTGGGAAACCATGATCAGCAAACATATAGCTA^ 
AGTAGAAGGAAATACAAACAGG CTTG T ATT AGGRTAAGGTTCATAACT 
AAAAGTTAAAGCTTTCACACATTATCTTCTTTGA 

TAAGGTAAGAAAACAGCTGGGGAGAGAGAGATTTGTTTTTGTCAGC^AATTA 

3 0 ATCTTTATTCTGTACTGTATTCTTTAGAATTTAGAATAGTC 

TCTACACATCCACTTATTCCTAATAACTTTCGATGTAGAGAAATCATC 

TTGTTGGCAAGTGG CCAAG ATGCAGAAG AG CACCAGGAG CTCCTCTGTTCCATGG CAGTTCCCAG CCATCAACTGAGCAGACTGGA 

CTTGTCCCTTAAATCTrCTGGAATACCAGAGTACAGTACTGATAGGlTTTTTTTC 

ATTATCAGAATTACCTTGTAAATAGCTGTGTGAAGGAAAACCCCACCATGG 

3 5 TGGGTACAACTGATGGTTCTCAGCCGCCCCACGCCGCAGCTCCGAGCAGAAGTCT^ 

TAACTGGTTTAAGACATTTGGCTTGGGCTATAAGGTAAACTTTCTATC 

TTTT CCCTCCTG AACCATGGCG AGGGGGTGG AG AGGTTTGTGTG C CTTCAAC CAG ATCT AGATATGG TG A CTTGTCTCTTCTTCCA 
TACAAGGACAGTGTGGAGGGGAGGGGGCCATACAGGCCATATCAGGGGTCAGGATC 

TTTTC AAAAGATG CTT ACCAGC CTT ACATT ATTTTC CCG AGT CTGGCCCACATGG C AAAGGAAAGGGCAATTCT AAG ATTT ACTGGA 

4 0 GCTGGACCATCCTGGATTGTCTTAGGTCTTTAAGCCCATGCAGTGCrcA^ 

GGAAAACAAGTACTATGAACCCATATTTGAAGTTCCCCAGAGATTCT 

CATTG CATGGGAAAAT AAGACAAAGTCCTTACCCTCAAAAAT AAAAGT CTGGT AAATTTGGAAT AAATAAAAGATTACACTTAAT A 
ATATCTTTCTCCAACTCTGGATGGTCTAGACCAGTGTTTCTCAAAGTCAA^ 

TAAAGGTATATTAAAAGTACAGATCTCCAAGCCTAAC CCCAG ATCCAGGCAAT CAGAATGTCTGGGTTAGGTCCAGAAACCATTGT 

4 5 GTTAAACAAATACTTGAGGTGATTCTAAGAAAGCACT 

AATGCCTCCCAGATGTGGAATCCACCAGTCTGAATAAAGCAGAAGGGGAGACIATGGACAGGAA^ 

TTTAACCAGCGTTAAGGGAGACTCACC^CACAGAGACAAGAGGCAGCCTCTAAGCTC 

GCTGTCATCTGTCTTCTGTGTCACAGAAGCTCCACCAACTACAGA^ 

AG CT CAATCT CG AGGCTCAAGG AG ACAAGTT CG CATGTC CCACATG AAATCAATAAGGTCAGTGCT ACTGTCTTCT CTGGGG C C CT 

5 0 TAGCAGCATTCTCAAGATGTGTGCCAACATTTTCTCCTTG 

CAGGCTAAGTC^CAGGGAGGCAAACCTTAAGGAAATCCATAGGATG 
CCTCAGACAGTGTCACTGCAAACATTCCTGCTGGACCAGTTGAAACAATGTG 

CAGTT ACCAT CTTTGACACTG ATT C CCT CG CT C CAG CTC CTTGCTCTAAAATGTGTTCTCAAAAC^AATTGTGGTTCGGTTGACCA 
TAGGTCAGTTTGCACATTGAGTGTAGTTATATAGTGATATATTAAAAACTTAGCTATCCA^ 

5 5 TAGTTGAATTTTACCAGATAATTGGATATGAGTTCTTCAGATACCATTTTGTAATACATAATGTTTTATTAATG 

AAATGTCCTGTACATTTTCCTACATTATTAGATTTTACAGAATATA 
CTTTTTTATAGAAACAGGGTCTTCCTCTGTCATCAAGGCTGGA 

TGGGCTCAAGCAATCCTCCCACCTCAGCCTACCAAGTAGCTAGGACCATGC^TGCACACC^ 
TTTGTAGAGATAAGC^TCTCCCTATGTTGTCCAGACTGGTCTCCAACTCT^ 

6 0 GTATTGGAATTACAGGCATGAGCCACTGAGTCCAGCTGGAGCTGTTGTTATAAATATAAACCACT 

TGTATGGCATGGGTTAGATAATGACTGCTCTTTGAGCACTTTCATGTAAGAGGACCT^ 
CTAGCATGGAGAGGAACTCTCTTTTTAC^GGAGAGATACCTAAGCTCAT^ 
GGGGAGTCTTTTATTATCTCTGTTATCACAATGCTGGGCCTCTC4GGGAGG 
ATCTTTAGTCACCTCTTCATTCCrTAACGGTTCTGCTGAAA 

6 5 GAGAGAAATAGTCACAGAGATTAAGAAGCCTTCATGTCATTTAGCTTATAAGAGAGAGCTTGCTCCTCTTCAGCT^ 

CACAGCAGTGCTTCCTGTCGGTAGCTACTTGACCCTCAGGGTGGTCn^CATAGTGAATTAGTAAGTC 
CCCAAGTTGTTGATTTCACCTTCGGAGGGGAATAAAAATATCTTT 
ATG CCTAGGG CTCACCTGCACCATTCGT AGAGGTTTATGIAAGAAGAA^ 
TCCAAGAACTCCAAGTTCTTCATTGTGGAGAAATGGTACTTCGTAGGCCACAATTTTATTA 

7 0 TTACAGTAGTCCCCCTTATCTGAGGTTTTGTCTCCTGTAGTTTCAGTTATCTGAG^ 

AAATCCCAGAAATAATTCATAAATTTTATTATTTATTTATTGATTGAGACAGTCT 
TCAGTCTCAGCTCACTGGAATCTCCGCCTCCCAGGTTCA^ 
GTGCCACC^CGCCGGGTTAATTTTGTATTTTTAGTAGAGACGGGGTTTCA 
AGGTGATCCGTCCACCTCGGTCTCCCAAAGTGCTGGGATTACAGGCATGAACC^ 
75 CATUCrGTCCTGAGTAGC^ 
GTCTTGGTCATCAGATTGACTGTCATGGTATTGCAGTGCTTGTGTTCAAGTAACTCT

AAGAGTAATGATGTTGG CAATTTGG ATATGCCAAAGAGCAGCCCTAAAGTGTTTCCT CTAAGTTCTTTACTTAATAAGGAAGG AAA 
AAAATCATATGCTGAGCTTGCTAAGATCTAAGTAAGAAGGAATTTCATCTATG 
TTCTCTTTTGTACTTCATACTGCAAAAAGTTATG^ 
AGAAATGTGTTCCACTTGACAGCAATCGGGTTTGTTACTATG

AAGGATAAGAGAGGACTCCTTATGTTCTATGATAACATGATTGGCAG^ 
TCACTCATGGGGAATGGATTCTGTGAACTGCTTGCreACTTGAA^ 
TCTGTGCTGGTGAGCAGCCGTTCCCACTTCCAGGTTTGAAG^ 
TTGATCTAGTCTATTCAGTCrrTTTTAGTTCAAGGTTTTAGTCT 
1 0 AGAATCTTGAAAGGCCCATCAAGCACCTCATA^ 

CCCATGCCAG CAACAT AGTCAGCATTTCCTGACAT CACATTTCAACTGTT AAT C AATG C AG TGTCTCTGGAAGCATTCAGAAGACA 
CCAAGGAAGGGAATCACATG<3CTTGGGGACAGGAGTTAGAGGGATAGTGTTCA 

CAGG T CCTTGTTTTTGAAAAGAAGGAAATTTT AACATAAAAATG AT AA C 
AAGCAT ATCTTGTGTTT AT AAATAT ATTCTGTGA 
ATGAATTACCCCCTGCTCAACAAATTCTGGATGGGGTGAGATCATC
GCAC^CTGCCTAGCGGGTACCTGTGATTGGTCTGTTCTGCTTTG 
AACTACGACAATTACC£X3CGAGCCTATGGAjGAGTGCA 

CCTGTATGAAGCCATGCCTGACTATCTGGAGGACGGGAAAGTGAACGTCCATAA^ 
TGGTCCAGCTGCAAGAGGTGGCCCCACCCTTGGAGGCTAAOU^GGACT 
2 0 AATTACTAAGCACTTTCTGTGTGCCAGGTGCC^TTCTATACCCCTT 

GCAG AT CCT ATTACT AT CT CGAACAT AT AGCTAGGGATACT AAAG CACAAAGAT AT ACCTTGCCAC^GAG 
AGCAGAGCCTTGATTCAGACCCAGGTGGTGGGACTCTAGAG^ 
AGCAGTCCTAAGCTCTCTGGGAGGAGCIX^TGGCAAATCAGGAT^ 
AAAAAGGGCAGGCAGTAATTTAAAAAAGAAACAAAAAACAC 

ATATTCAAGTGAAGATGAACACTTGTCAATGGTGTGGGAGACCAGGATTTGCATTTTGCATATGG

CCCTTCCAGACCTGCAATTCCGTAGTTTACCAGCTGTGTGACTATC 

GNNNNNNNNNNNNNNNNNNNNCCAGTTCTAAAGT^ 

CTTTCTGCCTCACAGCAGCATTTTGATAGAACCCTGGGC^ 

GTGACCTTGGCAGGTCTGTGTCTCAG ACATCATTTGTAAAAACATGGA CTTAACATTCTTTGG ATT ATAAAT CTTG TGAT 

3 0 GAAAGCTGTGAACCITCTTCCCAGAAAAATGTACATAAGCCATTTATTTTTAA^ 

AAGC CT ATCCATAGACTCAAGGTT AAG AAT AAGATT ATTCCT AAGGACTTCTCTC CT A 
CrCACTATTTTTTTTTTTTTTTTT^ 

CACTGCAACCrCTATCTTCCAGGTCCAAGCCATCCTCCCA 
CCGGTTAACTTTTGTATTTTTTGTAGAGACAGGGTTTTGCT 

3 5 CTGC C CrrC^CCTCCCAAAATGCTCGGAATACAGG CATGAGCCAC CATGCAC CCAG ACTCATT AACT CTT AAT AG CAATT AAT ATG T 

CTTACCATTTAAACTGGATTTTATTGCTGGACTCAAATTTA^ 
TTTGAGAGAGAACAAGTGATGTATTAGTCATCTGTCCAGAC^ 
TAGTTAAAAACIAGGGCATCCTGAGATATGTCCAAAATTAAAAACATTTCAATC 
GTTATACCTGTTTTGTGACTTTTGAAAACAAACTGTTT^ 

4 0 ATCAGCTGTC4GGAGAGGCAAACAGCATCTTAGCAATGAACCCTGT^ 

GAGGATG AAAT CT ATG AGCTTT CCT ATGCC CGGG AACCAAGG AAC CACAGAG CTC C AG TG AGTTTTT CATATTAAGACT C CACATG 

TAACAGGTGCTTTACAAGCTTCTCCTCGAGTTCATTCATCCT 

TTAGTGCTCAACCTTATGCT'AGGAACAGAGAAAGCTTT^ 

TAGTTG AAGGGATAAGCCTAATGTGTG AC CAGTGAAT AACT AGTGAACAGT AAAT AACTAT AT CATCAG AATGCCAGGTGTGGTGG 

4 5 AATCAGTCAGTC^TTTACAAAGT AGAG AAGK1AAG ATACTATTGTG CACCAG AGT 

ATAAATTTCTTTTCrATGATTCATTGATGATAATGAGAGTAATATGCGTA 
GGTAGATAGAAAAGCCCCATTTTCCn'GTAATGTTCTGGGTGA 
CCTCAAAATAGTACIACAGTCCrGGCACTGCCTGCAGTAGGAACTTCACAG^ 
ACTTCAGTCTCCTCGGACGTTTCCAAACCCCAAAACAAGTGA 

5 0 TCTATIX«KX3TGGCTAAATTGC^CTCCTCCACACAGAAAGAT 

TTTGGTAGATTCCATTTTCAGGTTAGAGGCATTTCAATGGAAAGGGTACATTCT 
TTTCAAATACAGTCATGCACCACATAATGACATTTTGGTCAGTGATAG 
AGCAGAAAAATTACTACTGCCTAGTGACAGAACAATTGTAACATTGTAGC^ 
TCTAAACAAACCTACTGAGCTGTAAAAAGTCTAGGTTATACACTTACGT^ 

5 5 ATGCACITACTATACAATACTTTTTATCATTATTTTAGAGTGCA 

AAGTCCTCCAGAAGGTGACAGCTCCATGCATGTTATTGCCCCTAAAGACCTTCCAATAGGACAAGATGT^ 
TATTGATGATCTTGACTCTATATAGACATAGGCCAATGAGTGTAATTGTACCTTAGTT 
TTTTAAAATAGAAAAAAGTTTATAGAATATGGCTATAAAGG AAATAT 
AAGTGTCATAACAAAAGAGTAAAAGTTAAAAAAATTAAAAGCTTTT^ 

AGAAACTTAGTGTAGTCTAAGTGTACAGTATTTATAAAGTATATAGGAGTATACAGTAATGTTAGG

TCACGACTTGCCACAGTAATTTCTAATCTTGAAAGCTCTATTC^ 

CTAT ACCAT ATTTTT ACCTTTTCT ATGTTT AGAT ATGTT AAG AT ACACAA^ CAT CATT ACAATTG C CT ACAGTACT GAGT AT A 

ATAACCTCCTGTTATAGGTATGTAGCCTAGGAGCAGTAGGCTATACCAAATAGCCTAGGTC 

TGTGTAAGTACACACTGTGGTGCTCATACAATGACAAAATCACCTCATGAC^ 

6 5 TG(ZATGACTGTATTTAGAAAAGAGCTTGTCCAGGCTTCCGATGATli'rr 

TTCACATTTCTACTGCAAAATGTGACTC^AGAATAGAGAGGAAACATAAATGATCCACTTTACT 
GGGTTTAATGGAGAGGGAATTGCAACTCAGGCAGATTGAGAATTTCAAATTATCTGTAGCT 
CTC^AAACCGTAAGCACAATGCCTTGTTAGCCAAAATTATTAGTAGAGATTTCAATTGCCT 
AACATAATTGAAGATTGTCCATAAATGTAACCGTCAGGTAGTGGGAAAAGAAAGAGCT 

7 0 GAGGATTTCTCTAAATATTTCATTTTAGCCACTAACACCTTCA^ 

AACCTGATCCAAAAACCATTAGCAAACACGTCCAGAGGATGGTGG^ 

GCCTGG CTCAGTTTTGT ATGTGGGGT AAAGGGAGAGGAAAG CACAATC^TGGAAAT<3TAAAGGCACG AAACTTC^CCACTTACTT 
CrCTTACTGTGGCTTTAGAACAATTCAAAAGTTACATTTTG 

AAGACCrAGCAAAAGAGCATATGCTTAAAGGTTGTCAGTTGATTTAGGAGTACAGAA^ 
75 C^TAATOl^TAAACCrAAAAC-TGAAAACer^ 
TCATACTTATGCATGTATCCTCGTGGACAACT

AGTGGGGACAGAGCACTTCCAACTGGGCTGTTTGTAGAAGCTTCT 

AGAGTGAAAGGAGAACATCCCTCTGCGCAGTGAACTAGGAGTAGCC^^ 

ATGGCCACCC CCCAC&TGCTGCT AC CAGCAAAGG CCTGAGATG CT AAAGGAGCAGGTTTCCC 
5 TGTTCCTCTCGTTCAGGTTGTTAAAAGGCTTTTATCCAACTGCCCC^ 
CTCTTTAGTAATCACCAAAAAAACTCCTTTTTCACAGTCTGTC^ 
AAGAATTTGAAAAGATTGCTGCGAGTTTTCCATTTTCCTT 

TGGTTATGAGCCCATTGGTGAGAATAGGTTTATAGGTATGTCACTAGAAGAATTTGAAAACT 

TT AAAGAACAGAC CATGAAAT ATGCATTCTTTT AAAAATATTCTGGG CCTGGGCAC AGTGGCT CACG CCTGT AAT CC CAACACTTT 
1 0 GGGAGGCCGAGGTGGGCAGATC!ACCTGAGCTCAGGAGTTCGAGACCAACCT 

ACAAAAATTAGCCAGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACT 

GCGGAGGTTGCAGTGAGCCGAGATCATGCCACTGCA 

AAAAACTGAATTGAACATTCCATAAATCTGCCACTTTATAAATAGTCA 

TGTCATTACAGATTTTGGGAAACAACTAATACTTATTCATACTTTGC^ 
TTCTCTATAGGAAACTAATCATATGGTTAGTGGGTTCCAGTTAGGTACTGTCT

CCCTATACACATGTACATGTTACATTGTTTCTTTTTC^ 

AAGAACACCTCTGTGGAAATGAGTAATGCTGCCTTACCTACTTACCACATGT 

AAGTAATCATGAGGTAGCACATACCATATTACCACTGAGGATAAAAAGAAAGGTGAGTTTCT 

CCTG C CT C CATGC C C CTTAT ACT CACATCCTTG C CTTGGTCATCTGC CGGCAGGGAAGG C CTCATCAGCAGGG ATGAGATCACAGC 
CTACTTCATGAGAGCCAGCTCAATCTATTCCAAGCTGGGCCTGGGCT
CTTTTTGTGAC!AACTGTGCTGGATTTGTAAGTTGTT(riTAGGGTG 
ACAGGCACAGACCTCACAATCCTTATGAACCCATCCCAGCTTC<3AG 
GGTTTTTACCTGCACTGATCATCTCTCGC^ 

TG CTG AATT AGGAT AT ACTTTT AAAGT AT AAT ATTCATG TGCTG TTTC^ 

2 5 TTAGGAGATAACCTTGGCn'ATGCCCCrCTTCCX^TTCCACTCCTGACCAA 

TATATCCTTCTCAGTTTGGAGTCCTGAGGAACCTGTTTGCCCGATGATTCATC 

TAGTGAGAGCCAGGTGTGGTGGCTCATGCCTGTAATCCCAGCACT^ 

CGAGACCAGCCCACCTAACATGGTGAAAACCCGTCTCTACTAAAAATACAAAAAAATTTA^ 

AGTCCCAGCTACTTGGGAAACTGAGACAGGAGAATTGCTTGAAC CCAGG AGGTGGAGGTTGCAGTGAGC CG AGATCGCACCACTGC 

3 0 ACTCCAGCCrGGGCAACTGGGCAAGACTGTCTCAAAAAAAACAAA 

ACTCTTCTCATGGATTTAAGGTCAGAAGGGACAAGACATGAGAGATAAATGAGGTT^ 
AAGGAAAATAAAGGAATGTCACCTTGTAGCCTCACACCAGATGGATGAATC 
TCC-GGAGTGATCRAACAAGGATATCXlATGTAAAGGTAAGACrrCAATA 
TGCTATAGAGAACTGCCTGAGACTGGGTAATTTATAAAGGCAAGAGGTTTAATTGACT 

AGGAAACTCACAATCATGGCAGAAGGGGAAGCAAACAC

GAAAAGCCCTTATGAAGCCAACAGATCTCATGAGCAGTCACTCACTATCATGAGAACA^ 

TTTACCTCCCATTGGGTCCCTCCCATGACACATGGGGATTATGGGCACT 

ATATCAGCTCGTCTTAGAGTCCCACCTCTAAAACATTTTTTACCC^ 

CAGAGTCTCATCCCCAAAGTGGTTTCCCTCGGAC1ACCCCCGGTGAATGACCGCCACACCGAAG 

TCATGGAAGGGTGGGTCCCCAGTGAAAAAGATAGAATCTCTTCTGCAACAGAAGATAAATGA 
CCTTTATCACTCAGGACATAACTGTCCTTCCrrGTGACTAG^ 

AAAATAAGACCTTAACAGCTGAGTGTTTCCGGAGGGTCTCCCTAGAGTAGGCAGAATGTATGGGATACCAGG 
TCCAAGATATCAGGATACTGCTTCTGATGCTGAGATTCACTGATA 

4 5 AGAAGAATGCTAGCITCATTAGGAATGTTAAGTGTC!TTAGCTCCAGAAGA 

ATTTGGTTGTG AGACACACTGGGGAAGAGAGAAAAGTG AAAG ACTTCTTGACC CAGGATGTC CGGGATGAAC 
TCTCACAAACAATGCAAAGATCTGGTTGTGTTTGAGTGTAAGAAGCG 
CTGTGGGGCCAGTGTCCAACCTTTGCTCATTGGGAGCCAAANNNNNI^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNKWNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNKNACT^ 

GATAGCCCCAATTTTTCTCCAGGAGAGGAACCCCAAGCCATTAATTTAAGACCAGGTCCT 
TAGCAGGTAATTGTCAAAATTCTAGAAACAGAAGGCTAAATGTAGAAAGGA^ 

5 5 GCAGATTAAAATTCCAAATCATACAAGGAAAATCCTAGGACTGAGAC^ 

TT ACATT AAAATG C!AAAGAAGA.GT AOU^T AAGG AAATG ACTTTG AGT ATGTTTGAG ATGC C C CAG AT CT AAAGTG C AAAGAAG CAG 
AATAGACTGTCTCTGTCTTCATGGAGTAGTTCTAGTTAGGGGGAAGATAATATACAAATAT^ 
GTATATGCTAAGAGCCAAATAOTTTGCAGTGACAGCAAATGACAGGAGGACATAGCAGAGAGATCGCT 
CTrTATGGAGAAGGTGGAATTTCAGCTAAGTCTTGAAGGAAGGGTAGAATTAAGTT 

GGATGTACAGTAGGAGCAAAAGGCCAAGGGCAGAACAGCTTTA

TTTAGGTAAGAGATGAAGTAGAAAAATGAGACAGTCTAGAGAGTCTTAAATGACAGGTGCCTGTCTGTATGATAAAATGA 
ATTTAAAAAAGGGTAACATGAAGGGCTTGTTGGGGACGGTTGGGGCACAAG^ 
CTGGCTGGTTAGCCAGCATGTCAAAACACn'GATCACCTTGACT 
TAGGTTAGTGCCAGTTTAATATAAGAAGCTGTGTGACTTCTAA 

6 5 AGCACTT AAAATCGGGCTT AACACGT ATGT ACCAT ATGTT AAT AT CATT ACT CTTTTG AAAT AAG CTC CCT AAAGTGG T AAGG CCA 

CTATCGTTAGAACAGATGCCTAGAGAAAAAGTCCATATAGTATACATTCTGC^ 
CCACATGCATTTATTGATCAATTAGATCAAAGCTTAAGTGCTTCACTTCCA^ 
CAAGTAAAAATTTTCTGTCTTCCTAAATCTTTCCCCTTACTATCTATCTAAACA 
TCTTAGGGGGAAAAAATCCTCTTAGGGTCTTTTCTAAGAAAGAATATGCT 

7 0 AATCACAAACACC^TTGTGAAAAATAGGCCTGTACCCTGTATTTCATTCATGCACTCTGTATTTTCAG 

G^CACCACAATTCTGTCACATAAAATGCAGATTTTCAAGTGGAAAAGACA 

ATC CCTGAAGGGGCAGTTGTT AGAG ATTTCTCCTCTC CAGGCCAT CTCTCCCCTTTTCCAAACC CTGCCCCACAGTGGT AAG AT AA 
GTACTTGTCAGGAAGTAGACCAATAGCTAGGATAAACATACATGTATCAT 

TTG CT AAAAGTCAAGGTCTATTTAT AATGG CAAAAATG ACT ATTGG AT ATCATCCATTCTTAGTTATTTCTATCTC ATT 
75 GGGATGGTTTCTGCTTATCAAATCAAAATTTCC^GTGATTTCTTTTAGCCCATTACT 
TCTTGAAACTTGTACCTCTCCTAATTTCTGTTAACACT

TTTGTTCrTACTG CT T T CT 1 ' CCTTTTGCTCCTCAAAGTGTATAGTG CCroflATTTCTGTCCCiU^CrCCCrGTATGTCTAGGAATT 
C^CCTAAATAATGGAGGGGATTATATTCATAATTTGACAATCrAGGTT^ 
GAAAATGGAGATATAAAAAATAATCACCCATAATACTCCTCCAAAAAAAAAATGTA^ 
TATTTGAGCACTTAAATAACTATGTAATTACAATTTTGAGCTTTATTG
GGCAAGAATT AAACT AAGATCTTTGT AAGATTTCT AGTTTC 
TTGTTTTTTAAAGCCATCTTGACTATACTAATCCAATGTCTT 
AAAAAGTAAACreAGATGATCTTCAGTTTATCTTAAAGCCT^ 
AGGAAATTACTCCTTATGGCTAACATTACTAGGCTGGATCCTGGGTAGTGGGTCTT 
X 0 TTCCTCTCAGTATCAAACAGGGGCAATACATTTTAAGGTACATTTCCAC^ 
ACTGTACAAACGGGTTCCCAGAGAGTACTTACCTCCTA^ 
TTCCAGGGTCCAAAAAAGTATTTATGGAGCATTTCTC 

AGGCAGGGGCTTCAACTTACCTCCAAAAACAGAAACCCTCTAAGCAACCTTTGTGCAAAC^ 

GTAGCTTGACACCA3TGGCCATGTCTTGGCCAAAGCCCCTGGATTTAGCCCTCACCCCCT 
CTTTGAACATATGGAACTCCAGTAACAATAAGAAAGGTCTCATTG

G AG T ACT AGCATGTCCTGT ATCTT CCT ACAGCACCTGAGG AAGG AC CTTIT A CTGTGG AACATGG TG A 

GGAGAGTAAGGATCGGACCATCATGCTGATGGGAGTGTCCTCACAGAAGATTTCT 

CCACCCAGACTGAATCACAGCCITGGATTGGC^ 

GATACTCTATATGTGCTTCCCAGTCCCACCTCTCCA^ 
2 0 TAAAGACTCCCTCATAAAATCAAAGGAGGAGCTCCX3TCACCTCAGACT^ 

CATTTTCCITTTTTTCAGACTATCTTAGAGAAGTGG 

ACAAGCAGTCTAAAACAAAAAAGGAGGAGGGGAGAAACAGGGAGGG 

GATTGATTCCAGCTTTCTCACAACCCACTGGGGGCCITGGTTC^ 

ACACAGGTGGTGG(ZATTTCAGAGTGCTGGTGCCAGTGCCCAAAGCCCrc 

GGTTAGCTTTACTGTGCTGATACCACTCCTTTTCT

TAACATGTTCATCAGGGTTATGGTTTAGGATCATCATGGTTACCTAAGOVTCrTCCT 
CAACAACAACAAGAGAAAGGTGAGTGTATTGTGGCTCACCATAGAAAACGCCT 

AACACATCAGGGAAT ATG CTGATCTTTT ACACATATT AG AAT AGTGAAAG CAACAGT AG CAGG CAAACACACAGTGGAAT AT AAT A 
T AGAGCAAT AT AATCT AT AAAATGTTAACAAT CATGACAAG AGG AGCTG CTTTT CAAT GTT AGGCTCACAATGT CCT AATTATCAG 

3 0 GTACCTAACAGAGACAATGGAAAAGAC!AAAATAAAGTTTACCAA 

AGAGAAGGGAATAATGACATTAAAAACAGGAAAATAGGGCTGGGCGTGGTGG 
GTGGATCACTTGAGGTCAGGAGTTCGAGAGCAGCCTGGCCAACGTGGTGAAACC 
GCATGATGGCAGGTGCCTGTAATCCCAGCTACTCAGGAGGCn^AGGCAGGAGAAT 
AGCTGAGACCAAGCCACGGCACrCTAGCCTAGGCAACAGACTCC^TCTTAAAAAAAAAA 

3 5 CTCACGCCTGTTATCCTAGCACTITGGGAGGCCAAGGTGGGCGGATC^ 

GAAACCCCATCTCTACTAAAAATATAAAAAATTAGCCAGGCGTGGTAATGGGCGCCTGTAGTCCCAGCT 
AQ1AGGATGGCATGAACCTGGGAGGCGGAGCTTGCAGTGAGCCGAAATGGCACCACTG 
TCCAACTCAAAAAAAAAAAAAGC^GGTGGGG AGGAGT AGGAG ACGGGAA 
ATACTCAATTCTGAAGGTTCAATCATTACTATACTTCIT^ 

4 0 ACCTATAATTGGTAAGAACCTACAAATGGTAAGAAACTACrGTCCCATGGCATAGTAG'l IT C I 1 AATT ATAG AT AACT AAGC CATT 

TTAGAGACAGGGAAG CAAGGATACGTTAATATCAACACTACTATTCACT^ CTTTTCCC 
AAAAACATCAAGGTTTGCACTGTGTTTTTGGAGACCT 

ATTCTACATTCTACAG ACTG AAAAACTGTCCACG AGGGCTTCAGGAT^ CTCAATGCTCT ACCTT CCCTGTCAGA 

CCCCCAGAAACTGAATTAGAGAAACATCTGAAGAAACAAGT^ 

4 5 ACATAATATTAAGAGAAACATACGAAAAAAGAACAAGAGATTTTCTGGATG 

GTGCTGTAAAAACCAAGTCAATGAGAAGGTGGCAAAACGTGGGTTAGACAAACCCCTTCCTCIT^ 
CAACTTCTAAAAATGATGAAAGAAACCTCX1AAAAATGAACTCGGTAG 
TTTCAGTCCCAGACATTTCTTACACCAGCTGTTAAACTCT^ 
CTGAAAGGTTTGCCTCAAATCTTGACATTTTACATGCCGATGAAAAOT 

5 0 GCAGAAAGATTAG AATTCCT ACTCAGCAACAT CAGAGCCAGTAAACC CTCTCACCATTTTGACCACCG CCTCTCITTACTTTT^ 

GCTTGT CAG AT AG AC CT AAGG TCCAG AATTCACTG TG TG TATTTTAG AGG AACAATT ATGTGT CCCACTTT CCCATT AAGT AT AAT 
CCTTACAATGACTAGTCTATGTGTAATTCAACAGTCCTATCCATAATAACTAGTCAAATTAAATCAG 
ATATTAATAAGAATAAATATTATGTATTATTTTAAGTATAATGTAC^ 
CTTTTATGTTTAGATATACCAAGGCTCAACTCTGCAGTTTTATACTATATGG 

CTATTTTCAGGGTGTTTGTGTAGGTTAAGGCAGTCCATTATGTCACACAG

GCTG AGACAGCTATGTAGATAATT CAGATTGAGAAACAACCCCAGTGT ATCCCAAATT AC C CAGAC ACTGGCTATAT ATAATATTC 

TCTTGGAGTAATCAGTATAGGATAACATANMNNNNNN^ 

NNNNNNNNNNNNNTTGGCTCTATAATGATACCACT 

TCATCAAAAAGTTAAATTTGTTTTCAATAATTTGTGGGGGGGAAACTTA 

6 0 AGTCATGCTTTCATGGCTCAGACCTGACQVGAGACTGTTTCT 

ATTATCTACCTATTGCTCGGAAATGGTTTATTCTGCCTGCTAACC^ 
TACAAATGCTCGTTTGATATTTAACTTGCTATTTTCTATTCAGGGCAA 
AGTGAATAGGAAATGCCAATCAAAAGTAGCTCTCAATT^ 
CCTTTCCCTTTTAGGAAATAAATACTCTGAAAGCAGATAATGA 

6 5 CCCTCCAGCTTGAAAAAAGCAATCATGTCTTAGCTCAAATGGAGCAGGGTGACTGTTCTTAGCCCAGAAACTA^ 

GTAGATGAGTATAGTGATCTCATTTCCTAAACTGTAATGCACAGACCTGAGGAACTTTACACTG 
AAAAGGAAAAGCCrGTTACTGTTTATTTACCTAAAAGATTCCTAATGTGCAGCA 
GGGAAAACTAAAGAATGCAAAACITTTGCTATTCAT^ 
CTGACAATGGGGCATCTCGGTGGCCrrGCCTGGCTGATTCTTCCTTAAAA 

TGTTTTCTGCAAACTGACTTACTTTGTTCCAGCCAAAGCTTG

GTATCCCAGCATACTTTGTGAACCTGGCTCCTTCTTCACTACCrTCAGATCT 
AGTC^TAACACATGAGCATATCTACCTAGAAGCCAATTTCTACTGATC 
ATCCAATTGCAGGTCAACTTATAGTTGTAGCCTTACAACTTACAGGCATCAGAAAATAAG 
AGCTATTACCATCTCATATTACTGTCTAATTAAAATAAACATAACGCAA^ 
75 TAGGAATTGATGGCTTAAAAAAAAAAAGTCTATGAAGAGTCTC 
TTTGAAAGTTCTCTTTTTGAAATGAATTTGTGCTTAACTGTCT^
CTTAAATAATAGCTGACTGTGTGTACATATGTATATAATATGTATATACAATATC^ 
TCCATAAAATGTGG AAGTGAATT AAACAAGTTTTAGT CATTT AT ACAAAOT 
TTGCrCAaVAGGTAAAATTGTATTGTTTGGOVAAA 
GGCTCATTTAGTTGCCTGGGCACCTATTCACAAATTCATTTC
CATTTTCCTTTTTGGGGGTACTTAGCTTCCATC 

TGCCAAGAAT ATGTTAACAATT AAAAGTTCCAAACT CAGT A 
TTGCrTACCTGGAGGATTCAGATGATTTAGGAATTCTC^ 
TAGTGATATAGAACTAGCTTGTAGTTAGTGTTTCATTACATTATAAAGAATA^ 
1 0 AACTCAGAAATACCCAAAGCAGGCCTGCTTAAGCCCACTACCTGGCATATAAAC^ TCTTTTT AAT 

AGGACAAGCATGAGTTAGGACAAACTCTAAAAATTCATATTC^ 
TGT ACTCT AATTTTTTAAAACAGT AATGG AACACAAT1' 1 ! 
TTTTATATAAATAATCTATAGGGATTCAAAAGGTGTCA.CAGTCCACTTAATT 

CTGGAGAATTT ATCACTG AAATGG AT AAG AGG AAATAG TTT AGT CACAGG TTTTT ACAG T CCAG CAAGGG CCAAAG AGGT AT AGTA 
15 • TACAAGTTAATAGTATTTGTGTTGAGCAACATGGGGCTAGTGGGATCAC^ 

CAAGCCTAGTGTAAATTTCTGCATCTCAC^CGACTTTA^ 
TAACAGCAGGGTCACTTCTCATTTTCTTTGCTGACITACCT^ 

AAATGCAAGAAAAAAACAAATGTACAGATTGTAAAGTTTTTTGATACCTAATGTAAGTT^ 
AGACATTAC^ATCCCTACAACACAAATGTCTTGTGTTTCTTCT 
TCCACAGCTAAGACCACTGCCCTATTCTATTAATCAAATTGAGTTTTCCTGGAAGGA
TTCTACATCITTTTACAACACGAAGAAGTAACACT^ 

AAATATTAAGTCTATAAATACTTAACATTAAAGGAGACAGACTTGAGGTTGTCCCTTCTCrGTGGCTTT 

CTAAGCAAACTAGCATGTAAAGAAGAAATCCTTTGAAGCTATGGATAAAATTGGTAACGA 

TCAGGGCAATTTATTTGTGGTCATGATTATTTAGAATGATCAGACATAGCCACAATTTTCCCT^ 

2 5 AACCACACCAATGAAGAAAGAGGAAAAATTTCAGTCACTCAAATTCCTCTGTGGA^ 

TTTATCATTTGGGCAATTCATAATACTTCATAGAAAGTAA 

TGCTCACCTCCCCCAATTTTCATCCCCACATTTCTCTAACCTGTGCT 

TTCTGGTCACAAATTTTTTTTAGGTTGTCATGCCT 

CTTTTTCTGTTT AATG AGGCAAGG TTTC CCTGATCTG C AAAACAAAAG TAAGTTCAAAG AAAACC CTTGTTT CCCTTTTG ATG TG T 

GCTTATGAGTAATAGTGTGGTGATTATGTTATAGACAGAATACATGCTCTTTATACGGTCGAGAGGAAAGATGACT

GAAACTCCCAGTAAGATCATGTCTATATTTGCTATTAGGTACCAGGCTGGACAAAAAG 

TAATTAAAAAATGAAATTGAGAACCAGGCAAAAAAGAATGAAAAGGAGGTATCAAAAAAAGTGTTGGATTTTAGGAAG 

TGCCATTCATCATACTCCATGTCGTATTGCACTCAATTTATAAAAC^TTTTCAC^ 

CCTTCCTGCTATTCCAGATACTGCTCAGCAAACTGTTCTACAAAAG 

TACAGGAAAGAGTAAAATATTAAGAGTCTACCTGATTGAGCTTTTCATTCAATTAAGGTTTATA

TTCAGCTCCAACACGATTCCAGGAGCAAGTGCTAAACTTTGATAGGAAATAAGTGAT^ 
TAAAGGTACAAAGAAATCTTATGGAAAATCTAATGAAACT 

AAAAAGTTGAAAGTAGTTGTAAATCTCCTTCATCAGCAACTAATGCTACCAAAAAAGAAGTGAGGCAG 
GAAGATTTTTAAACAACTCTTGAAATATTTTACTCAAGTAGTTTTTTTCT^ 

4 0 CTACTGATAAAGGTATTTTCTTAATTATGGAAAGCCTTTTTT^ 

ACACAAAAAATAAATTTACAAATAACCAAAAAATTTTATATTTTCCTACCT^ 

ACATGTCTAAGATTACAGAATATTCAATAGTTACrrTTTAGGGTTTTAACATTATTTTAGAT^ 

T AACAACAAAAAAACCTG CTAAAC ATCAAAT ACCACTTGTCCACTGGGAACAATT CAGCTTTCTATCAGTTC 

CATTAAGTTGTTATGTATAGGTATTGACCAGAAGCCTACAATGAGATACAGATTTTACACTAAGATTGAAAAAATTTATGGTA 

4 5 G CT CTTTTTCTCTT AACTT T CT AC ACTAAG AGAAAAATAC ATT AC TTGGCCT ACATCTT AAAGCATTCAAAAT AGGC C CT CTGT AA 

GTTTTTAACAAATTGTATTTGTATTTAGGTCTTTAAGAAACAACCTGAGATGATGGTAAG 

GG ACAAATCACTCCTTGAGAACTAAGGTCATTT CAATCCATAGCACCCCTG ACCTAGTGC CATGATTC AAATTCACAACTCTTG AT 
AGCTAAG ACTTTGACTGCTCCAACTAGC AAATCACAT ATAGC CATTCCCTAACTTTGTGTTTCCCAA CTGTGAATCAG AA 

TTACCAC^GGATGCCTTAGCCCTACCCTTGATCTACTGAAGCAGAAAATTACT 

TTTTAATATTTGGATATCTAAAGTGAGAGCATAAATGATGAAGAG

GGT ATGTGTCTTTGCTCCATTAAAGT ATTTCTGACCT CAATCTTAAACTTTGCTTTGT CTGATTTCAAATCC 
AATTAGAAAATAAGTATGTACAGTAAAAAGTAAACTTGGGCTTC^ 

CCT CCT AT AAG AG AAAAAAATGATT CACAACAACTG C CT ACT AAAT ACAACTGG AAGTG C CTTT C CCT ATTTTGTAAAAT CG AATG 
AAAAACCCACAAAACACATCCACAGCAATACAGGTGAGGCAAGGGTCTATATC 

TCAAACCTGAACAAAATACAGGTAACAATCTAAATTCAATCATGACAATAA

TATTTACATTTTAAAAAATAATTTTTGTTCTTTTTTTTGTTTTTTAAAAAGA 

AAGTCATCACTTCATTTGAGAGATCAACTGGTAATAAAAAACAATGTTCTTAC^TATTAA 

AGTCTAAGATGGTGTTTACTTCAAAGATGATATTCAAGTA^ 

CTGCTATCTAATTTTTATAGTTTTTAGTATCTTCTATATTCCACCTCCTCCACCACCACC 
60 ATGGATCTCCATAGCCTCTTCCACCATAGCCCCCTCTTCCACCATA^ 
CCTCTACCANNNNNNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TCTCTTTTGCCAAGGGGG CATTTCTGGAGGTGGTGGTTCAATTTCATTCGG CCGGCCTCC CCTGTCAGGCACCCTTCCCATCAGCA 
CCTGTGTGAAGAAACATTTTCTTTTAGTTCCTAATATTTCAAAAAAGTTTACATA 

6 5 AGAAGCCAATATTTAAGGACATGAAAGTACCAGACATGGCAGAACAAATTAATATTAAAAATATTAAGTGTG^ 

AAT CTGTT AAAGGAT ACAAAATCACGGCTAGACAG AAGT AAT AAATTCT AG TGTTCT TATGATG ACTGT AG TTAAT A 

TATAGTTTCAAATAGCTACAAGGAGGATATTGAATGTTTCCAACACAAAGAGATGATAAATGTT^ 
CCGTCTGATCACTATACATTGTATCTATCAAAACATCACTATC^ 
TAATTTTTTTACTTCCTGTTTACCAAGTCTGAAGAAAAAA^ 

7 0 TGTGTGATACATGACAAATTCCTTAGTGTATATATATATATTTTTTTTTCT 

TGCCCAGGCTGXaAGTGCAGTGGCATGATCTCAGCTCIACrGCAAGCTCTGCCT^ 

CGAGTAGTTGGGACTACAGGCACCCGCCACCACGCCCAGCTAATTTTTTGTATTTTTAGTGGAGACAG 
AGGATGGTCTTGATCTCCTGACCTCGTGATCCGCCTGCCTCGGCCTCCCAGAGTGCTGGGAT^ 
GCCTAGAATATATTATTTTTAAACAATCTGAGAGAGTATTTCTGAAACTTAAAGGC 
7 5 CAAATTCTGTAACAAATGTAGTACCCATTTCACTAAGMGTTCTTTTAGGAAAACATTT^ 
TTCATTAACTTAATATCCATTTATACAATAACA

TATATTCCTAAGCACATTT CATTTCAAAAATG T ATGAAGTACAT AT AACATGGG ACCTT AATG AGGTCAGAGATTGATAT AAAAAC 

GATGATACTAGTACCATATTATTAAAAGGAAAACACAGTTAATATAATTCCTTTACT 
5 AGGGAAGAGTTTAAGTATCCAACCTTCTATAAAAAGTAGCAGCTTTATATG^ 
TCATCTGCTCACCACAGACAAGGGGAAGCCAATAGCTTCCT 
CAAAATGACCAATTAATTATACACTTCCATrGATCCTCCAATCAAAACC^ 

TG CAATATCCTCAAAGTGAGACAAGGTT CT AAGT ATG CAA<^TTCTTTTAAAGATGGCAATGATGTCCT AACTAC CAAATC CAAAA 

AACAGCCTTTCTTGGTTATTATTCTCACTGXSTCTCTCTACCAAAC^ 
1 0 TAAGACTACCCTCAAAT AATGAGACCAT AAATTAT AAACTCAGAAGAGATGG C CAATGTT ACTT ACAGAAATAGAACTCTTGAT CC 

AGCAAGACrAAATTATCACCCAATGCAAGATGTTAGGTAATATATTTAAAGAGA 

TGGAAGCACAGAGGCAAACACTGGTGATGCCTAACTT 

TATTTTATCTCAAGTTACACTGAGTGAGAAGCGATGCTTrAAAATTATC 

AAAGTTTGCCATGGAGTGCTCTACCTAAATGAAGAGGACATGCCAGTTAAGCI^ 
1 5 GCTGTTGTAAAGTGAAAAATTCAGACTCCTTGAGCTTGT1TAAA 

AACAGAAGCTCAAAGAAGTT CAGT AGCTTGCTCAGGGTCATTTCTG T AAG TAAATG ACAGAG C CAGG ATGGG AACC CAGATTGGTC 

TGACTCTAAAGCACAAACTCTTCTTCCCTTTACACCAAGGATATAACGAATCT 

CTGACAAAAAATGATT ACTTTTTAGAAAG ATGG AACATAATTCTTGCT CTTTGACAAAAACAATT CAT AATT CCTCTAATTC 
AGAAAATACTTTGATTTATGAATACACAAATTCAAGTGAACATA 
2 0 CCCCTGCCACTCTTTTXTCTAAACATTTACCAGGACATGAAAGA 
CAATTCATTTTCCACTGTACTGAAAGAAACACCAACCTTATTA^ 
TAATGATCTTGGATAGATCTTCACGAGCAGCAAGTAGATGTC 

TG AT AAATTCTTG CT AT ATCAT CTGTTTTT AC CT AAATG ACAAG AGTT AAGGAAAAAAGGAACAAAAGT ACATTTT AATC 
CAAAAACAATCATAATCITTAGAAATAAAGAAAAGGAGATAATGCATAAGCAAGGAGTTAGAA 

TGAGAATTTATGAATAATTTAGAACAAATAAAATAGACCAACAGGTGTTAAAAATATGTAACT

TAATAACAAAAAAATTTCTATACATATTAACAAAAAATACCTAAAAAATAGT^ 
AAACATCAGTAAATCTTGCAAAAACAAAAGGAAAACTTC 
TATTGTCAGTTTTCCCCCAAATTAACTTATAATCCTATATAAATA 
AAACTCAATATCTAAGAATAACTAAAAGCATTTTGAAATATAATCCT 

3 0 CTT AT CAGTTAACATTTAATGACrAACTT ATTAAATGTGCCAGGATGCT CACTT ACT AAGG CATT ATTT CTTT AAT CCTCAAAA 

TATCCTTTTAGAGTGGTTGGTTTTTATTATCCCCTTTTGCAGATGG 
CAAAATATAGAGAGAATCCTAGCTCTGCCCACrTAGCAACAAAGGTCTA^ 
TTGCTATCCTACCTCGATCTAATTAACTAGAACTTGGGAGAAA^ 
ATAAATCTATGAATGCAATAAATGATAAAGGAGACATCACAGATCA^ 

3 5 TGACCTGGTGGCTATCTGTAATAATAACAATAATACAGCACACC^TATAACAAAGC^AA 

TTGAGGTTTAATTTTCAATTTAAAAACAACTACATTATAAA 
ACyU^CTGGGGAAAATATTTTCAACAAATAATTCAACCIAGCA 

ATT CT ACAGG AAAAAAAAATG ACIAAG ACAT AAAT AG AAGTT AAAAAG AGGAAATT CAAATGACT AAT AAG CTT ATGAAAACAGTT C 
AACTTTGTCAATAATAAGAACAGACAAAACAAGTTAGTCTTTTTA^ 
TCATTTACCTTACTGGAAGGAGTTTAAATTCATATAAA

HUMAN SEQUENCE - mRNA 

CCCX3GGGGGGAGAGAGCCG<5CAGGCXK3CG^CGGTGGTGGCGGG<K5CGATGCGCCGCGCCCGGCCGCG 
GAGCGCGGGCCGCGGCCATGGGCACCCTGC^CAAGGCGAGAGAGGCTC 

4 5 GCAAGACTAGAGGCAAAGCCAGCCAACAGCCCCTTCCCCTCCCATCCCAGCTTGGCCCAC^ 

TCTC^GACATTTAGCCAAAGGAGCCAGCCTGGACGATCTCATTGACAGCTGC^ 
GAAGTAACCAACTGTTGCAAGTCATGCTGACCATGCACCGAATTC 
TATAAGGATGCTTTGGCAAAGAATTCACCAGGACTTTGCCTGAAGATCT 
CATGTTTAAAATGGACGCCAGCTTGACAGACACTATGGAGGAGTTTCAGGAA 

5 0 GC CTG ATTG ACACAACTCAAATCAATG CC CX3TGACTGGT CCAGG AAACTT ACTCAAAGG ATAAAAT CAAAT ACCAG CAAGAAACGG 

AAAGTCTCCCTGCTCTTTGACCATCTGGAACCAGAAGAGCTATCCGAGCACCTCACCT 
ATCX3TTCTCTGATTATCAGAATTACCTTGTAAATAGCT 

GCATCTCCCAGTGGGTACAACTGATGGTTCTCAGCCGCCCC^CGCCGCAGCTCCGAGCAGAAGTCT 
GCTCAGAAGCTCCACCAACTACAGAACTTCAATACACTGATGGCTGTGATAG^ 

GGAGACAAGTTCGCATGTCCCACATGAAATCAATAAGGTTCTCGG

ATTACCGGCGAGCCTATGGAGAGTGCACCGACITCAAGATCCCCA 

GCCATGCCTGACTATCTOGGGGACGGGAAAGTGAACGTCCATAAGCTACTGGCCCrATACAA 

GCAAGAGGTGGCCCCACCCTTGGAGGCTAACAAGGACTTG^ 

AAATCTATGAGCTTTCCTATGCCCGGGAACCAAGGAACCACAGAGCT 

TGGGCTTCTGGAGTGTCTCCCAAACCTGATCCAAAAACCATTAGCAAACACGTCCAGA

TGATCACGACCAGGATGGATACATTTCTCAC^AAGAATTTGAAAAGATTGCTGCGAGTTTT 

AAGACAGGGAAGGCCTCATCAGCAGGGATGAGATCACAGCCTACTTCA 

TTTCCTCACAACTTCCAAGAGACCACCTACCTGAAGCCCACT 

AGGATATCGATGTAAAGACTGCGGGATGAACTGTCACAAACAATGCAAAGATCTGGT^ 

ACCCAGTAGCTCCCACAGAGAACAACACTTCTGTGGGGCCAGTC

CCTGAGGAAGGACCTTTTACATTCCCTAATG^GGAGGCTGTGGAACATGGTGAGGAGACT 
AGTGTCCTCACAGAAGATTTCTCTTCGGCrGAAGAGGGCTGTTGCCCACAAGGC(^ 
GTGAGGGCCCTTCAGGTCCCTTTGTGCTGTCTTCCCCAAGGAAGACAGCCCAGGATACT 
CCATGTCCTAGCCCAGTCTTGGTCAGAAAGCGGGCI'TITGTCAAGTGGGAGAATAAAGACTCCCTC^ 
70 CCGTCACCTCAGACTGCCrACCTACCAAGAACTGGAACAGGAA^ 

TGAAATATCCACAGAAGAAAATAGAATCCCTCCAGCITGAAAAAAGCAATCATGT 
TAGCCCAGAAACTAAGTAGCACAATCTGTAGATGAGTATAGTGATCTCATTTCCTAAACT 
CACTGACCAGCTTTAAAACAGTACTTTAAAAGGAAAAGCCT 
TTCTCTTTCAGTTAGTTGACTCAAAGGGGGAAAACTAAAGA^ 

7 5 TGCGATCTAAAACATGACT ATCATTTCCTGACAATGGGGC^^ 
TGGAAAATGG ATTTG CTT CTT ACCCTG TG TTTTCTG CAAACTGACTT ACTTTGTT C CAG CCAAAG CTTG CT AAT AATAGAAAACTA
CCCATATTCCAAAAGTAGATTTCCTCTGTATCCCAGCATACTTTGTGAACCTGGCT 
AGTCCATGTACCCTCCTTCCTCACTACAGTCATAACACATGAGCATATCTACCTAGAAGCC^ 
TTTTAGAGGCCTATTAAACATGACGTTATCCAATTGCAGGTCAACT 
5 GTAATCAAATTAGGTACXTIXKSAAACATAGCTATTACCATCT 

CCTTATATATTCTACAGTGGATAGAATTAGGAATTGATGGCTTAAAAAAAAAAGTCT 
CATCTTTCTCITCTGAAGTAAACTATTITGAAAGTTCTCT^ 

TAGAAATAAGCTAATTGGATCAGTGGCTTAAATAATAGCTGACTGTGTGTACATATG 
TGCATGTGGCTTGGAATTTTGTTTCCTCCATAAAATGTGGAAGT 
10 AAAGTTCAGTTTGTCACAAGATTAAATTGCTCACAAGGTAAAATTGTAT^ 
TCTATTATGAAGGTTAATAATAAATGGGCTCATTTAGTTC^CTGG^ 
TCTTAAAAAAAAAAAAATCATATGATCATTTTCCrrTTTTGGGGGTACT^ 

TGAAATTCAT AAACAAGTTGTCCAATTGC CAAG AATATGTT AACAATT AAAAGTTCCAAACTAAAG C CAAT AGCACCAAG TCTT CA 
TAAGAATACAAAGTATACATACAGTATTGCTTACCTGGAGGATTCAGATCATTTAGGAA 
TTTGAGTTCCTCCTTGCACTGAGTTTTAGTGATAT

CACGTATTTACCGTTTTCCAAATTTAAACTCAGAAATACCCAAAGCAGGCCT^ 

TAACACTTTGTTACTTTCTTTTTAATAGGACAAGCATGAGTTAGGACAAACT 

CTTTGATTGATATAGACCAAAGATGGTGTACTCTAATTTTTT^ 

ATT CG AAGT AAAG AT CCCCAGTT AG TTTTTAT AT AAAT AATCT AT AGGG ATT CAAAAGG TG TC ACAGT CCACTT AATT AG TCAAAT 

2 0 TAG CAATGG CTAAACAGT ATCAAGT ACTGCAGAATTTAT CA.CTG AAATGG AT AAGAGG AAAT AGTTTAGT CACAGGTTTTTACAGT 

CCAGC AAGGG CCAAAGAGGT AT AGT AT ACAAG TT AATAGT ATTTGTGTTGA 
AAAAAAAAAAAAAAGGCTTTGGCTTATCAAGCCTAGTGTAAATTTCTGC^ 

CCAAAACAAGGACAAATCTTGTTGT ATTAACAGCAG^ CTTTG CTG ACTT AC CTTTTT ACTG ACCGTTG TG A 

ATTTCTGTCT CAAAATGTAT AATAT AG AAATG CAAG AAAAAAACAAATGT ACAGATTG TAAAG TTTTTTGAT ACCT AATGT AAGTT 
25 TTCTTTGTGTAATATTTATATGATAAAAGACATTAGGATCCCTACAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGGCACCCTGGGCAAGGCGAGAGAGGCTCCGCGGAAACCTTCCCATGGCTGCAGAGCTGCCT 

AAGGAGCCAGCCTGGACGATCTCATTGACAGCTGCATTCAATCTTTTC 
CAAGTCATGCTGACCATGCACCGAATTGTCATCTCCTCTGCAGAACTGCTCC^^ 
AAAGAATTCACCAGGACTTTGCCTGAAGATCrGTTATTTTGTAA 
CCAGCITGACAGACACTATGGAGGAGTTTCAGGAACTGGTGAAAGCTAAGG 

3 5 CAAATCAATGCCCGTGACTGGTCCAGGAAACrTACTCAAA 

TGACCATCTGGAACCAGAAGAGCrATCCGAGCACCTCACCTACCrTGAGTTCAAGTC^ 
AGAATT ACCTTGT AAATAG CTKjTGT^AAGG AAAACCCCACCATTX3AGCG ATCTA 
CAACTGATGGTTCTC^GCCGCCCCACGCCGCAGCTCCGAGGAG^ 
ACTACAGAACrTCAATACACrGATGGCTGTGATAGGTGGGCTGTGTCACAGCT 

TCCCACATGAAATCAATAAGGTTCTCGGTGAGATGACTGAG

GGAGAGTCCACCGACTTCAAGATCCCCATTCTGGGTGTGCATCTCAAGGACCTCATCrCCCT 

GGGGGACGGGAAAGTGAACGTCCATAAGCTACTGGCCCTATACAATCATAT 

CCTTGGAGGCTAACAAGGACTTGGTACACTTGCTGACGTTATCCCTGGATCTT^ 

TATGCCCGGG AACCAAGGAAC CACAGAGCTCCACCACTAACACCTTCAAAG CCACCAGT AGTAGTGGACTGG^ 

4 5 T C CCAAAC CTGAT C CAAAAAC C ATT AG CAAACA.CGTC CAG AGGATGGT GGATTCTG TCTT CAAGAACTATGATCACGACCAGGATG 

GATACATTTCTCAGGAAGAATTTGAAAAGATTGCTGCGAGTTTTCCATTTT 
ATCAGCAGGGATGAGATCACAGCCTACTTCATGAGAGCCAGCTCAATCTATTC 
AGAGACCACCTACCTGAAGCCCACTTTTTGTGACAACTGTGCTGGATTTCTCTGGGGAGT 
ACTGCGGGATGAACTGTCACAAACAATGCAAAGATCTGGTTGTGTTTGAGTGTAAGAA^ 
50 . GAGAACAACACTTCreTGGGGCCAGTGTCCAACCTTTGCTC^ 

TACATTCCCTAATGGGGAGGCTGTGGAACATGGTGAGGAGAGTAAGGATCGGACCA 

TTTCTCTTCGGCTGAAGAGGG CTGTTGCCCACAAGG CCACCCAGACTG AATCACAG CCTTGGATTGGCAGTG AGGGCCCTT CAGGT 

CTTGGTCAGAAAGCGGGCTTTTGTG&AGTGGGAGAATAAAGACT^ 

5 5 CTACCTACCAAGAACTC^AACAGGAAATAAATACTCTGAAA 

AAAATAGAATCCCTCCAGCTTGAAAAAAGCAATCATGTCTTAGCTCAAATGGAGCAC^TGACTGTTCTTAG 
Table 17

MOUSE NOMENCLATURE 
ICSGNM Nmyc1
Celera mCG19753 

HUMAN NOMENCLATURE 
HGNC MYCN 
Celera hCG1783900 

MOUSE SEQUENCE - GENOMIC

T AATGAAATT CTGAAG CCT ACAGACTCTT C CT AGGGG AGAGAGG AAGTATT CAACTTGGT AGAAAAT AAATGATGGGTG A^ 
AGGAGGATGAAAGGGCCTGATGAACAGGTGAGCGCCTATGACTC^ 

CCT CAT AATG TTGTTC CGGG ACAATCG AATTCATCCAGAAGGATCrrTGCC CAAGGT AGGTAAAGGG G ATTCAG AGAAAACTTTTCT 

ATGCTTTX^TGCTATGAGATAGGAGAGCAAAGAAAAATCACGAAGACIT^ 
1 5 GTCCTCAACCCAATTCTAAAGGGACCCTTGTGTTACTGTGCCCAAGAAGT^^ 

AAAAGCCTTCAGTCCTGGTTCCAGCCAGAGAGGGAGCTTGCCTTC^ 

GATATTACAGCACCITGCAAGAGAAAATGGAGCTGAGACAATGCT 

AAATACCTGTTTCCTTTCATTTAGTCCTATACCTGTCTGGACATCGAAGA 

GTCATTGAACCCCTTTATATTTGTTTCTCTCTCTGATGTGATC^ 
2 0 CATCATTGGTCTATTGAGGACGAGTAGTCAACTCITC 

GTGTTACGGGGCTGGAGAGATGGCrCAGAGTTAAGAGTACTGACTGCTCTT 

GGTGGCTCACAACCATCTGTAATGGGATCTGTCTGGTGTGTCTGAAGACAGCT 

ATT AAAAAGAAAAAACTTT AAAAAT ACACATT AAAAAAAAAAAAG AT A^ TG TT AC CACCCTTT CC CAGTG CC CTTTTC CGA 

TGATGGTOCTGCATGGTGAAAAGCITGAGTTAGTACTTGTAGCTATGCTGTCCAAGGA 

2 5 CACAAG AAGACAGTCACTG CTGTGG CC CACTG CAAATG AGG AAACGGG CTCATCAACGTG AATGGATGAAG C CT CT AAGAAGG AGA 

TCAAAGATAGCCTCATTCAGTATGATCGGAC<^TTCTTGTAGCTGACCTCCAGTCGCTGCGAATCCAAA 
TGCCCGTGCTTGATACCAAAAATCCTACCAATAAGCCCATCC 
TTITTTTTTAAATGACATTACCGATACAACCCCTCAGCCACACACT 
CCTCCAGCCTCCAATGGCAAAATACATGC^CCTAGTTTGGT 

CTTGTGATTCAAATTCAACTACCACCTAATTTGTCTCATCTCAGC

AAGAGTAAC AGCAAAT CTGGG T AGG AAAACAT AG ACTTAAGGGTAGTTTAGGCCGGGCGTGGTGG CGCACAC CTTTAATCC CAG CA 

CTTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTCGAGGCCAGCCTGGTCT 

GAAATCCTGTCCCGAAAAACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAG 

AGAGTCRCCACCACCACCCAATCCCGAATGTTATTCTAATCAAAACAA^ 
3 5 ATTTTAGCTCA.CAAGCGCACGTTACAGTTCACTCTACGGCAGGGA 

GTCAGGAGCAGAGGGCGAATGAAGTATACTCGACATTCCGTTCACTffTTTCCAC^ 

CGGTACCACCCACAGTG^TCGAGGTCTTCCCACATCAATTGAGGGAGTCACT 

AATCTCTCATCAAGACTCTCTGCCCAGATGATTCTAGGTTGTGTCrAGTTGATAATTAATCT 

ATGTAAATANNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNN1JNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

T ATACAT ACACATAT AT ATATACACACACTT AAAAACAAAT AAG CAT AC CTCAAAAT CT AG CAAGCAG CATAGC CTG GCAG CACAG 
TGCAGTGTAGAGTATCTGGTGTTTATCCTCAAGACTGTAAGGCTTACTAGAGAGTATA 
AAGAGTACCATATGTTGTTGGTTCTGAGAAAGTATTATTATTGGAGGGGGTAGGGGTGT
GTCAAAAGACAATTTTGTGGGAGTCAGTTCTCCTCTTCCACC^ 

AAGTACCTITATCCAATAAGCCATCCTGCTGGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT^ 

TGTGTTTTGGTGGCATTGGTTGGGGGGGGGGGGTGTATCTTTGTTTT 

CTTCTATATGTAACCTAGGTTGGCCTGATTAGTTAGCTCAGGCTAATACCAAATT 

5 5 TAGGATCACAGGCAACIACACCCTGCTTGTTGATTGGCACT^ 

GTATTCTTGGTGATTTGTCTTAATCTCTCTCTCTCTCTCTCTTrCTCrCT 

CTCTGTCCCTCTCTCTGTCCCTCTCTCTGTCCCTCTCTCTGTCCCTCTCTCCCTCCCTCCTTTCTT 
TGACTTAGTCTGTAAACCACTCTAGTCrCAATTTATGTAAATCTACCTTCT 

AATGTCACTAATGGAGTCTCAGTTTGGTTGTTGTTGTGGTTATGGi TT iGGTGTGGTTTTG'i l riTGTTTGTTTGTTTGTTTGTTT 

6 0 GAGACAGGATCITGCTAAAGACACCACATTAGCCTCAATTTTGTAAATCTTCCCAATTC 

TATGCTCTGTTATCrTCAGGAGTTTTAAAAACTATTTTACATTGCACA 

CTTTAAGCAGATGATTCTCTCCTTCCACTAAATCAGACCCAGGGATCAAACTCAAGTTGCCAGGT 

CACTAAAGACATTTCAGCAGCTCTTGCTTTATTATCTCT i~i i iGGGTGATTCTAGAATGGAATTCTTTCTACi-i~i rGGGACAGAAG 
CAGCAGTGGTGTATGAAAGTGTTCGTATCTCTCACTCTCCAGCCTCACCACCGTTTTG 

6 5 CTCCCTGAGATTTAATTTGGATTGTCCCGAGTCAGGCGAAGTAGCCCATTTTCTTGTTGGTACT 

AAATTGCCTATTCATATCCTAGAGCCAGATTCCTCTCTGGCITT^ 
GAATATAACCTTTGCCTCTGAGGGCCCTCAGGAAGTGCCATCCCAAATATGACAT^ 
GAGCTGGAAACTGCAAGTGTAGGCTTAGCTGTACCCCTTATCTACATAAAGACATCCTC 
ACTCTCCAGTTTCTTATCTCCCAGTGGAGATTACCTCCGTGACACCCAGCCA 

7 0 TTCCAGACAGGGTTAATAATACCTGAGACTGTATCGGTCTGATAAAGTCAGACCT 

GCATCCCGGGACACTGCATGAGTCTCAGTCATTTGACCGTCCTTCAGACCT 
TCAGTGTATTTTCTACTATGCTAATCTGTCTGTTTTGATAGTTCTACTTTTIX3C^ 

TTTTACTTTTTTTGTGCAATG CAGAATATAAGTTTTCTTTT CTCCTCTTCCrCCTCTGTCTrTTTl H rcri"rCATACAATACTTCCT 
CATCGTCCTCCCCTATCCCCGAca iTiiAAAAAAATAAAGTCAATTAATCTGCAGGAi niunin lAGTATGTATATGTGTTG 
ACCTGTTTGCCTCAGAATGCATGCAGAGAT

GGCAGTTAGACATAGAGGAAAG<K5TCCTTACCACCrcAGCTAGCr^ 
AAAATTACAAGTCCTTTATTTTGTTGGCCCTGGGTCTTGTGG 
GGCTGGAGTACAGTACC^CAAGGGACTGTGCCTTTATTTGGGCAG^ 
AAAGCTGGATGAGGTTGTTGGTCTACTAACCATGATGGATTAAATCCTCCT
GATTTGGTAAAGAAAGANNNNNNKNNNNNNNNNN^ 
GTCACCTTCTCTCTGTCCCCTCTTTTCTGAAACATCTTATCTTT^ 
ATAACTAACCAGGCCATCTTAGCCGACrTTCCCAGTCCCCT 
TGTCCGCTGCTGAAGTTCTGAGGAAGATTCrAGAAGCCAGATGTG^ 

1 0 aggatcaattcaaggtcatc ct cctcaat aaaaat AGAG tt CAAGGTCATCTTGG act AT ATGAG ACCT CGG ctcaaaaaacaaac 

AGAAACACCTCTCCTTCCCCTCCCCCCCCCCCCAAAGCAACAACAAAAAGAGAAGGTTCTGGGAGC 
TGTGCC^CTAATAAAOUVAACTAACCTCCCA^ 

GGAACACTTACCATACAGCCCCTGCTGTGT ATGTATAATTACAAT CCGAGG CAAAGTTTT CAG AG AG CCAG CATCTTG AGGGAGAA 
TTCAGCGGGAGGTCTCX5GTTTGGAAGAGGCCAGGGGTGTGAAAATGAGGGTGTTAAAGGCVi^ 
1 5 ATGTGAGCCCAATAAGAAAGGCCTTGGAAGGGTGGCTCACCCTCATTCACGGCTACTTCCT 
ATGTAGGCATTCGGGGACAGATGGGAAGACTTTCACACATGCrrAATACACAT^ 

TCGCTGG AGGTGAATGAGGGGTGT CATTTT CAATTT AG AGGG AAATGG AG ACGCITC C CCACLAAACAAG CTT AAGGGTTTAAGG AT 
AAATCCCACCCTACTGAGAATGTGTGTTGAAAAAGGGCTTCACCCAGGGACCCAGGTTACTCCT^ 
CATTTATTTTTAGCCCC^CATTTAGGAAACAGACCTTTTAGCATGCTTGC 
GGGAAATGTGCCCACGGCTGTGAGGAAAGAATACACGCACCCTGCTCT

CCTCGTAAGTGAGGCAAATGGGACACAAAGAGGCCAAATTAATGATGGCTTTCCTTGATG 
ACAGAGCAGTGAGTGATTTGGGCAAGGAAAGCAGCGGTCCGTAATTC^ 

CGCGACTTTAGGGGACGTCCCTCCACGCGTTTCACAATCCCTTCCGCGGTAGCAAGGATTCGCTAATTGTCT 
CACTCGGGCTCCGAGGCTGGGCGGTGCGGCTCCGCITTCTG CTTCGTCT CCGCGAGGTGTCGCCTTCGGCCGAAGAAAGCGGTAAG 

GCGCCACCCTCGTAGCCCGCACTTATTTATTTTCAAACAAGGGGGGCGCCCCTTC

TCTCGTTTCTACGTGGGCGCGTCCTCGGCCTCCCAITTTGCACTCCTGTGTCGC 
GACCAGGCCTTGTTCCACTTTAGCCTGGAGCTCATCTTrCAC^ 

AACCTCTTTTCCCATTCCCAGAACACACCCAGCGACCTTTAAAAAAAAAAAAACACGG^ 
AAAAGGGAGGGGGAAGCACTCCTCGGGTTTCCCAGCTTTGCTGCTTGCAGTCTGAAA^ 

3 0 GGGGAGGAGGGGGAGAGGAGAAGAGG<^AGGAGGGAAGGGAGAGAGAGAGAGAGAGAGAG 

AAAAAAGAGAAAAGCAAATGGCTTTTGGCGCGAAAGGCTTTGGCGCCTCCCCTGATTTTT^ 

CCTCTTGCCTCTGCCTTTCTCCCTCCCCCTCGTCTGCGCCACAGCCCCCTTCTCTCCCCGCCCCCCGGGTGTGTC 

GTTAATAATATCCCCCGAGCTTCAAAGCXJGAGCCAGTGACAG 

NNNNCCCGCCGCCGCTCCCTCAGCCCACCCGGAGACCCCAGCCCCGAGTCGCCTCCGGATCCCCGGCAGTCTGCGGGAGG 
3 5 GCGGTACTTGCGAAGCTTCGAGCGCCCGGGGAACGGCGGAGCGCTGCGGCTAGGCAAGCCCCTGGACGCGGCTCCGCACC 
CCAAATATGCCGGGCTTTTCTGGGGGAGGGGGTAAGGGGGCAACACACCGCGTCTGCTTCCGAAACAAA^ 
TTTTTTTTTTTTTCCTGAAAAGCCTATTCAGCACCCGAAGGCACCCT 
GCCCAGGCAGAGAAATAGCTITAGCGAAAAGAAATTCGCCCG 

CGTCCCCACCCCAAGCTGGGGTTCTCCAAAAGGTGCCGCGGGGAAAGAAGGAAAAGGGGTTA 
40 AATGTAGGTCACGGCTGCTCCGGCrTAGGAGACGCGATTCTCGCCGGCGACCCTGCACCATTCCT 

CCCCCGCCTGTTGAAGAGGTTGAGGTCTCTGAAGGAAGGGGAATGGAAAGGGTATAGGTATAGGCAG 
CTGG CAAGATTGAGGCTCTAGGTCTAGGTGGAGAGGGTTGTATGTAATTGCCTC 
ATTAATAAAGTTTCCTGAACCCCGTTAGCGTTTGCGCTCCCGTGATCGCATTTCTGCAAAAGGGG 

ATGCCTGGTAGCTGGGCTAGAGAGCAAGGACTTCCTTTAAG CAACT AGTGCAAGTGGGGTAGTCG CGCTAGTAAGAGCTGAGATCT 
GCAGCTTGAACAGCCCCCCTCCCCGAGGCAGTGCCTTGTGTGAAT^

CCTGNNNNNNNNNNNNNNNNNNNNNNJ^^ 

NNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNWTONN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNTTCGAATTGGGCTACGGAGA 
GGATGCGTTCGGTCTCGGGGGCCTGGGTGGCCTCACTCCTAATC^ 

GCGAGAAGCTAGAGCGCGCAGTGAACGAAAAACTACAGCACGGCCACGGGCCCCCGGGCGTCAGCTGAGCCT

GTGGGTGCCAGCAGCCCCGGGGGCCGTGCCCTTGGTGGGTCGTCGAGTGCTAGCCACACCGGGGCCACCCTGCCT 
CCACCCGGCTGCCGAATGTGTGGACCCCGCCGTGGTCTTCCCCTTCCCG^ 
CCIACTAGCGCCCCGGCGACGAGCGCTGCGGTCACTAGTXJTGTCT 
GGCCGTCCTGCCAGCAGTGGGGAGGCCAAGGCCCTCAGCACCTCCGGAGAGGAT^ 

AGTTCTTGCTAGCTCCTTACGGACTGGGAAGGTGGGGGTGGTGCG

GTTCACCAGCCTCCCGGGACAGAAGAGGCTGGAAACACCAGCTACACCTTCTCTTAA 

GCGGTAGGCATGCCCTCACTGTGGCAAAAGTTGGTGGCCAGCTC!AGTCAG^ 

GITAGTCCGTTGGAAAGGAACCTTTAAGCCAGGTCAAGGAAGGGAAAAGACrrCCCrG 

GTTCAGCAGCTGGCAG CAAGTGCATT AGTXX5TTGTGTACGTTTT CAG AG AGCCTCTTTCCGGTTTTGATTGAGT ACTAGTCCCGTT 
GTGTTTCCTAGCTGGAATTTCTGACCTAATTGAGCTGAGTTGTGTTCCATAACCCATTGCCCTTGA^ 

CCACTGCCGCT CCTAAGGAATGCACGCACCCTTAGGATTCATTGGTTCTATTGTAAGCCT CTCGGAAAGAAA 
GGGTTAAAAAGAAGAAGAAGAAAGCTTTTTTTTTCC 

7 0 TGTCCTTCGAGCTGT AGGTCCAGCCCCCCTATTAG AC CTTGGAAGCCAGTGTTTTCAAAATG CCAAAGAATAAAAAGTCTAAATTT 

CCTCCrGGGCrGTGGAGGGATCGCTCTAAGGCTCCTGGGAGGAAACITGGT^ 

AATGTCTGTGCCTTGACAGCTTTCTGTTAGGAAGGAGTTCCT 

AGGGACXTTACCGGCrGGAGCAGCCTAACATTTCAGCCTTAGAAGAAGAAACCT 

TAAGCAGACGATGTCTTTTCGTTTTCGTGCTTGC'rrrTA'l' TTTCT rcrriTAAGCCATAATTTTCTCTCCTAGCCTGGCAGTAAAC 
GAA^CAGAATGAATACAC^
GTGGATTAGAGCTTTATAATTGAGATCTGATCT

GGTTGTTGTCGTAGTGAATGAGATTGTCCTAGAGTTTTAAGGATGGCAGAATC^ 
CTCCGGT<nX5AAGGGCGGCTGTGGAAGTGTGTAT 

CCGAGCATCTGTCGCCAAGTCTACATAGTGTCTGCCAAACTCAAACACT 
CCAGGGGTTGAA 

CCTCTTGTCCGCGTTGTGATCAGCAAACCTAGCTGACTTAAATAACAAGTGTATC 

CCCCTCAGATGATGAGGATGACGAGGAGGAAGATGAAGAGGAGGAAATCGATGTGGTCACCGTAGAGAAG 
ACAACAAGG(XGTAACCACTTTCACGATCACTGTGCGTCCCAAGACCT 
1 0 ATC CT CAAG CGC^TG TTCCCATCCATCAGCAG CACAACTATGCTG CACCCTCAC CCT ACGTGG AGAGCGAGGACG CGCCCC CG CA 

GAAAAAGATCAAGAGCGAGGCTTCTCCACGCCCCCTCAAAAGTGTTGTTCCAGCA 
ACTCCGAGGACAGCGAGCGCCGCCGGAACCACAACATCCTGGAG 
AGGGACCATGTGCCTGAGCTGGTGAAGAACGAGAAGGCCGCCAAGGTGGTCAT 

ACAGG CCAACGAGCACCAGCTCCTGCTGGAAAAGGAGAAACTGCAGG03AGGCAG CAGCAGTTGCTAAAGAAGATCX3AACACGCTC 
GGACTTGCTAAACGTTTCCCACACGGACAGTCACTGCCACTTTGCACAT^
CATTAAGAATGTTGGTTTACITTCAAATTGGTCCCCTGT^ 
CTTGGAAAAAAAACTGACTTATGGGATGCTGGGTGGCTTGTTTTCCTCCT 

ACT AAGG AGCTTCAGGAGG CTG TGAAGTCAC CTTGTTCCGGTCCAAGATTCCAAACAGAGTCATTCCTTt--rrrri ACAATGGTG CT 
TAAGTTCCAGCAAATX3CCACAGAAGGG05GGGGTTC 
TTlTGTATACGTCCTGGGTAATGAGAGGTGGCrcrrGCGGCCAGTA
GAATGTTTGAGGGGCATGTTTTGTATACAAATATATTGTO 
GTATGACXJCTGATACATAACTAAATTTGATACTTATATTTTCGTATG^ 
CACITTTTGAACTAAGAAACTTTTGTAAAGAAATTTTACT 
CTGTATTTGTTCATGTTTGGTGCATAGAACrGG^ 

TGCACCirAGAGCACTTTGAAATACCTCATGTTTA

TGGCTGtZAGTCTCTGCACACAGTCCATTCCTTACCCC 
TAAATGGGTTTGTGAATTCCACAGATTAGAACCIAACTACTTTCA^ 
TTTGGGTGTGAGTGTGCCCACAATGTCACATCCCCTGCAACTGGAGTT 
ACAAGTGACTCACCCCTGAGCCATCTGTCACCCCTTCATTTTTGAG^CCTCT 

CCCATATCTAGTTACCAAAACACTTTCACATACTCGAACT

AAGCAGTAAAATTGGCAAGTTCTGCCGAAATTTACATAAACACTTGTCCT 
AGGAAGAGCTAGGTGTG<3AAGCATATCTGATGCCrGCAGAGGTGACAAGAAAAATC 
TATATTGCAAGCCCAAGGCCGCAGAGTAACACTTAGTCCAAAAGAAATCT 
AATAAGAGGATGGAGTGACTGTTCATCATTGATCTCGGTCACCAGCCT 

3 5 AAGGAG AG CTGTCATTGGGTGGACTGTGGGGG TG CCT CAAGG AAG AATGA CCAGAACTAAACATGA 

AGGAGG CTTC CCTCTAT CT ACTCAGGGAAAATTTTCTGGAAGT CTGAAGACCATCTT ACCTTACTT AAGAACAGCACTTCT 
CITTGGGGTAACAGACTACACACIAGCATTTCCCrATGGTT^ 
TCATGGAAAAATGCAGCCCAAGGGTCCAGACAGCTTTGCITGTGTG^ 
TGGCCAGACAGGTAGAAGGTGAGCCATC^AGGTCCITATTTG<XK3TGACAGCT 

4 0 GTGGTACTGC CTGC^TTCTCrGGTGATTTGACAAAGGTTGGAAAGTGGGCAAAATC CCCTTT ACAAAATC CTATAACTC CCCAGTC 

CCAAACTAGCAAATAATAAGCCACTCTGAAGGTCCCATTCAAAAGAACT 
ATCACTGTATCAGTACATCTAGAAACCAAATGGAACTTAAAGATCTC^ 
GTCAACTGAATTGCTCTGTTTCrCTTCTCCCAAAGGGGACCrc 
AGAACAGAGACCACTGAACTGAAATTTCTACCACATTCTTATTT^ 

TCCAGGACCCACTTCTGCCTCGAAAGCTCTGTTACTCTCTTGACATCAAT

CATAAATAAAAGAACATATATTGCTCTTCCAGTXXSATCTG^ 

TAAGGCACCAGGGATTTGACATCTCTAGACTCCTG 

AATAAATAAAAACACACTAAAAACTGTTAACACTTTAG 

CTGTAACTCTGCAGTTCTCTGTACAATAATTGGAAAAGAGAAGAATCCAAGCAG 

AGAAAGAAACTCTTTCTTAAAAGAGTAAACTGGCACTTCTGTGGAGTGTTCCACCATCTCTCA^

GGTGGGGTGTCCCTGGACTAACTTGGGGCACAGGACrGACATTGAG^ 
AACCTATAGGCTGGCCTCCACCTCAATATGTAGCTGAGGATGGTCT 

TAACAAAGTGAGCCAATATACCTATGCATGTATTATGTAAGTGCTGGGGATAGAATCCAGGGTTTTGTGCA 
TCCTGAGCTATGTCACGTACCCTTGCTGATGGAATTCCCTGGCTCTTTCCTCTC 

5 5 GTCAGGTGTTTCTGGTGCTTGGGAGAAAT CAATGAAG AACAGCAACAACAACAAAAAAAAAACAAAATTG ccaactgcacccaatt 

CTTGTCTCCTTGTGGTTTATAATCTAGGAGACAATGGGAGTAAGAATATAAATAAjG 
GTAGTGAGTAATTTATAAAAGAAGTGGACTGGTATAGAGGAATTGAAGATT^ 
ATGCACCTGGTCCITGGACCrGTGGAGGAAAACGCAAGGTGACTGAC^ 
CATTTAGTCCGTCTCCCCTCCTCACACCCTG^ACATGCGTGTTCCCTACTATGTCCm 

6 0 ACTATGATGCAGCTGCAGTCTGATTCCCTGGGACTGAGTCCCACAGTTAACAC^ 

AAGCATTTTTGTTGCTACTTCATAACTATGATTTTGCTACTGTTA 

TGAAACCCCAGTGAAAGCGCTGTTCAACCCC^CAACTGTTTCCACCCACAGAC^ 

TGTGGGGGGGGGGGGGTTATTTGGTATCGACAGAATTGTTCCTTTGTTTCCT 

AAATGAGCAGTAAAGCAAATAT ATAAGCCTGAC CTCAATG AG AACTCTG C AGACTTTTGTTAC CCCCAAAAGGCTTT ATAAAACCT 

AAAAAACGGGCGCTTTGGAAAAGAGTAAAGGTAAGCTCTGAA

GGACCATCTATGCTTTCTACAATGGTTTATTAGTTCTCCTTACTTC 
GATCTCTCTTGATACTGACAGGTCATACTACAAGACATCC^ATCCTGA 
AAGCATG^CGACTO<»CTGGGACAGTGCATATTTGGGGAA<IACA 
TGATTGGCGGGAGCATGTGTGC^CTGAGCGGCAGCGTCCAGTGGACTT^ 

CTTAAGGGCTAACAGATCTTTTTTTAAGGATTATTTTTAT^

CAGAAGAAGGTATCAGATCTCCAGC!AACrrGAGTTAGAGGTAGTT 
CCAGGAAGAGCAACAAAGGCTCTTAACTGTTGAGCCCT 

TCACTCAGTGGGGGGGGGGC^X^TGCATGCCACAGTGCATGTAGGTATAGAGGT 
TrrCCAGACCAGCTGGGCCTTAGGAGAGATCTTTGGCCATCAGGCTCA^ 
ACCIATGTGAATGGCCCCTGTTAGAGAATCTTTGAGAGGCAGGA
GAGAGTCAGAGATAGACACACACACACACACA^
AGrrTGAACCCAGACTTTCACACATGCKXJACAAGTGTTGACT 

GGCCCTGTATAGCTGGGGTTGCAGACCTGACCCACCAGGTCTGTCT 
TGAGATGATAGGACGAGGTGACACTGAGCGTTGCCTGGGAGACAGACTTTGTGGGC

NNNNNRACCT CC CTGTTGGCTG TGAAT ATGTTT CCACTGACACCTTTGGGAGGTG ATTT CTCCAGGAGTGCGACCT 

TTCGCCTGGGTAACAAAGGTACCAGGCCTGTCTGTCACTGTTGCCCAACTTC 

CTGTACCCTCACCACCCCACOGTCAGCTGCATAGCrGCTG^ 

GAATCACAAAGCAAAACAAGGCTGGGGAAAGGAAGTAGTACTTAGTCCAAGTTC^^ 
ACAGAGAGCAGGCTAACTGAGGTAGAAAAC^CCAGAC^
AGCCAAGTATGGCTCCCAGCACACAC^CTCAAGGGGATGX3 

CTGG CTTTCAT AGGCAT CT ACATTCAAGTGCACACACACACACACACTT AAAAAT AAAATAG AACT 
AGAAACCCTGTCTCGAAAAACAAAACAAAACAAAACAAAACAAAACAAATCT 

GGG TGACTGACCAAC CCATCTTGGCTTT C AAACTAAAGG ACCTCG ATCCT AGGGAACATTGAGGT CC CCTT ACTCTGC CCTGC CAC 
1 5 TCCTCCCCCCCCCCCCCCCCGACAGTGTTTGCTTCTTCATTGTGCATCATAAATTGTAC^ 
TGCC1TCTTTGAGGGTCAGCCAAATGATAGGGAAAGGAGTGATTGGCC 

TGGATAGCCCATAGATAAGAGATCTGGAAACCTTCCAAGGGTAATAAAAGGAGAGAAATCTATTTTCCT 
GTGAAGCTGGTGGGTCTCACATAAATGTCAGGGACCAATTTAANNNN^^ 
TAGGACGAGGTGACACTGAGCGTTGCCTGGGAGACAGACTTTGTGGGCCAGCA^ 
2 0 TTACTCTCCGGCOVGGAAAGTTTGTATa^GTTCCCCATCTGTGAGC^ 

TTGCAGTCACTCATCCTGAGGAAGGGCTGCAGCCTTCAGCATCCCCACAAGATGC 

CACTG AGAAGGGAAAG C CT AG CCTGGT CTACAT AG CAAGTTTCAGGATAG CT AGGGCTACAT AT AG AG AACCTGTCTT AAAAATAA 

AAGATGACTAAAAGAGCTAGGGATGTAGATCAAGTGGTAATTTGTCTGGAGTC 

AGGAAATCCTACATGTAGGCAGAGGATCAGGCGTTCAAGGCCATCCTTCCCTA 

TTCAAAACAAAAACAAACAAAAACCAACTCAGATGTCGTCAGTGCTGGTAAACA^

CAGCTCTCTGCTGGTCACCTTCrCCAGAACTGTCGTCATTCAGTGCATCCATCCCT 

AATAAGAATCACAGCTGCAAGGTTCTTGCAAGGATCAAGTCATAA^ 

ATC CATGAGGT CTT ACTGCTGC CCG G ACTCAG ACCACACTTG AG CAGCCTC CCTCCAGT CT ACAG ACTGC CATCTAG ATTCGACCC 

TCAGGGTGTTTCCCCAGGCMCAAACTCCTTTCCTTGTGTAGCAG^

CTTGGCAGTCTAGATGTTATTITCATAGGGAAAACCTGG 

GGTACTCTGGCTGGTCTCTGCrTGAAACTAGAGAGAAGTATGATTAATCCATTGTT 

GGTAACAGACATTTCATACCTTGAGTCTTTGCTTGGGTTATGCTAGTCACT 

TAAACCAAGAGTAGATACAAGTTTGCTCTTTTCTTTTTTCCAGAGCA 

GGGAAAGCACGAAGGAAAGAACACCCCAGGTTTAACTGCATGGGTTGAAAAAA^

TACATTCCCAGCACTCGGAAGAAGCAGAGGATTGTTCAGACTGACTGGTCCAGGTACGGTCTAGG 
TGAGACTTTGTCTCCAACAAAACAAAATAAGAAAGGAAATCTTTCCCACCCTGT^ 
TTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTGCTTGCTTC 
CCATCCTTC CTTCCTTCCTGTCTNNNNNNNNNhnWNNNNNNW 
40 NNNNNNNNNNNNKNNNNNNNNNNNNNN^ 
NNNNWWNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNTC ClTITlTlTri ' T TTTTTT ^ 

ACAAGGCTGGTTGGAACTCACAGCGATTGTTCTACATCAGCTrCCCAAAG 

TTATTGCTCATGTTCTGCCCGCCTCCCTCCACCCTCCTTCTTTGGACAGGGTTTTT

ACACATGTCTTTGCCTCCATGTTTTGGAATGAGAGGCTTATGGAGGCACACT 

TCTCCTCCTTCTTTCAGTCrTTGTACGCTGCCAAAAGAAACGGTTGTCCGATGAAAGAGCACAAAGC 

GAAACACGTAAAATAAAATGCTCCCCCAGTrCCTAGGGAAAATTGAGCCCTACTTA^ 

ATGTGGGTCAGGACCTTGATGCTCTAGGATTATACCATCTCCCAGGAATCITCCT 

AGGATGAGACCACCTGGGCATAAGCCAGTTGTGGGTGGAGGGGCTGCATT^

AGAGGATGGAAGAAAGTAGACCTTCCAGGAGCCTTTAGCATGGGTATTCACATGGATTTAACTTGG 

TGGTTAGCAACTGCrTGTAAATGGGAGAATCCTTATTCCAGGGGCTGG 

CTGAAGGGAATGGTTCTATCATGTTAATGCTCTGCATCTTATC^ 

TTTGGAGTCTTTTAATATGGAGATGACTCTCCCTAAGGGGAAGGAAGAGGACCCTGGAG 
TGGTGGTGGGGTGTTGAGACTCACCACAGGTGCGGTGCCCCNNNNNW
ACTGATGTCTGTCTCTCTGCCTATCTCTTGGTGTT 



MOUSE SEQUENCE - mRNA 

GAATTCAGTGGGAGGTCCCGGTTTGGAAGAGGCTAGGGGTGTGAAAATGAGGGTGTTAAA^ 

TACATATGTGAGCCCAATAAGAAAGGCCTTGGAAGGGTGGCTACCCTCATT

TATGTAGGCATTCGGGGATAGATGGGAAGACTTTCACAC^TGCTAATAC^ 
CTTACTGGAGX^GAATGAGGGGTGTATTTAATCCAGAGGGAAAT 

TCCCACCCTACTGAGAATGTGTGTTGAAAAAGGGCTTCACCTAGGGACCTAGGTTACTCT^ 
TTATTTTTAGCCCCACATTTAGGAAACAGACCITTTAGCATGCTTGCCT 

T CGT AAGTG AGG CAAATGG G ACACAAAGAGG CCAAATT AATGACGG CTTTCCTTGATGT CAC CAT C CGT AG AACT CGAGGTAT ACA 

GAGCAGTCAGTGATTTGGjGCAAGGAAAGCAGCGGTCCGTAATTCAACCT 

GACTTTAGGGGACGTCCCTC(IACGC<3TTTCACAATCCCT^ 

TCGGGCTCCGAGGCTC^CGGTGCGGCTCCGCTTTCTGCTT^^ 

CACCCTCGTAGCCCGCACTTATTTATTTTCAAACAAGGGGGG^

TTTCTACGTGGGCGCGTCCTCC^CCTCCCATTTTGCACT^ 
GGCCTTGTTCCACTTTAGCCrAGAGCrCGTCCCCAC^CAACAC^ 
TTTTCCCATTCCCAGAACACACCCAGCGACCTTTGACCTTTAAAAAAAAAAA 
CAAAAG<XaAGGGGGAAGCGCTCCTCGGX3TTTCCCAGCTTT 
GGGACGGAGGGGGAGAGGAGAAGAG^GGAGGAGGGAGGGGAGAGA^
GAAAAGCAAATGGCTTTTGG CGCGAAAGGCTTTGGCGCCT CCCCTGATTTTTATGGAAATCAAGAGGGCX3GGGAAAAGCCTCTTGC

ATCCCCCGAGCTTCAAAGCGCAGCCAGTGACAGTCATCTGTCTGGAC^ 
CACACACTGCAGCCGCCGGAGGACAACCCCCTCCOSCCGCCGCTCCCTCAGCCCACCCGGW 
TCCCCGGCAGTCTGCGGGAGGTAAGAAGCGGTACTTGCGAA

CCTTGGACGCGGCTCCGCACCCTGCACCCAAATATGCCGGGC^ 

CCGAAAaU^CCATCTGGGTTGG Gl - rr i^r m 'r n ^ ^ ^ 

ACCCGCCCTAATCCCTTTTACAGCCCTAGCCCAACAGAGAA^^ 

TGAGTGTGAGGTGACTGGGAACCGGTGCGTCCCCACCCCAAGCTGGGGTTCTCCAAAAGGTGCCGCGGGO 
TTAGGCGAAGCTTAGGTAGGTGCCTGAGTGATAATGTAGGT
NNNTCrCCAAAGGTTGAGGTCTCTGAAGGAAGGGGAAT 

AGACTGAGGCTCTAGGTCTAGGTGGAGAGGGTTGATAGTAATTGCCTGTTGTTGGTCrrG 
AAAATTTCCTGAACCCCGTTAGCGTTTGCGCTCCCGTGA^ 
AGTAGCTGGGCTAGAGAGCAAGGACTTCCTTTAAGCAACTAGT^ 
TGAATAGCCCCCCCTCCCCTAGGCAGTGCCTTGTGTGAATGAA^

TCTGCAAGCCCCCTCCCACTCCCAGTCCTAGACAGCTTGTACACAAAAGGAGG 

CCAGAGCTGTGGGGAGCTTGCAGAAGAGATTGGGGGCTCCCACTGCCTGTCCCCACCAA 

GTTTGCCTATTGGTTGTAGAGTTGGAGGTTGGCGCGACTCT 

GGAGATATTAAAAAGAGAGG CG AACCCATGCCCAGCTGCAC CG CGTCCACCATGCCX5GGGATG ATCTGCAAGAACCCAG ACCTCGA 

GTTTGACTCACTGCAGCCCTGCCTCCACCCGGACGAAGATGACTTCCACTTCGGCGGTCCCGACT

TCTCGAAGAAGTTTGAGCTGCTGCCCACGCCCCCGTTGTCGCCCAGCCGCG^ 

GCTACGGAGATGCTGCTGCCGGAGK3CCGACCTGTGGGGCAACCCGGCCGAGGAGGATGCGTTCGG 

CACTCCTAATCCGGTCATCCTTCAGGACTGCATGTGGAGCGGCTT 

TACAGCACGGCCACGGGCCCCCGGGCGTCAGCTCAGCCTGCTCGGCTCCCGGAGTGGGTGC 
GGTGGGTCGTCGAGTGCTAGCCACACCGGGGCCACCCTGCCTACCGACCrrCTCCCACCCGGCTGCC

CTAGTGTGTCTGTTCCAGCTACTGCCCCGGTGGCTGCTCCTGCTCGTGCAGGCGGCCGT 

CTCAGCACCTCCGGAGAGGATACCTTGAGCGACTCAGGT 

GGGGGTGCTGCGTTCCCTTTGTTATCATAGATTTGTGCGGGTGATC 

AACACCAGCTACJVCCTTCTCTTAAATTTGCCATCCCCTTTT^

GTGGCCAGCTCAGTCAGTCTGTGGAGTCCAAAAAACTACCTGTACG 
TCAAGGAAGGGAAAAGACTCCCTGCAAGCGAGGGCTGCAGAATCCCACT^ 
CCCTCCCCTTGGTTCAGCAGCTGTCAGCAAGTGCATTAGTGGrrGTGTACGT^ 
CTAGTCCCGTTGTGTTTCCTAGCTGGAATTTCTGACCTAATTGAGCTGAGTTGTGTTCCATAACC 

3 5 GCGGGTCCCTACCACTGCCGCTCCTAAGGAATGCACGCACCCTTAGGATTCATTGGTTCTATTGTAAGCCTTTATCTGC 

TCGGAAGAAAGGGTTAAAAAGAAGAAGAAGAAAGCTTTTTTTTTCCATGGCCATGGGG 
AAATCCAGTCTGCCCTTGTCCTTCGAGCTGTAGGTCCAGCCCCCCTATTAGACCTTGGAAGC 
TAAAAAGTCTAAATTTCCTCCTGGACTGTGGAGGGATCGCTCAAG^ 
GGGCTCTGTCCCTTTAATGTCTGTGCCTTGACAGCTTT 

4 0 AAAACACTGGCTTAAAGGG ACCTACCGG CTGG AGCAGCCT AACATTTCATTCAGC CTT AGAAGAAGAAACCTCACAATTGTT CCGC 

TTTCCGGTCCTCCCTAGATTAAGCAGACGATGTGTTTTGGiTi "itKTGCTTGCT'IT'i'ATTlTClTC'TriTAAGCCATAATTTTCTC 
TCCTAGC CTAG CAGTAAACGAAGTCAGAATGAATACACATAGGACGATACAGGCTTTTTTTTTTTTT^ 
TTTGGGACTCTGAGCTACTGTGGATTAGAGCTTTGTGGTTGAGATCTGGTCTCTGCC 
CATGTTCGCACGAGGAAAGGTTGTTGTCGTAGTGAATGAGATTGTCCT 

TTTTACCAAAGGAGGACCTCCGGTCTGAAGGGCGGCTGTGGAA

GCTAAAAGCCCCTATTACCGAGCATCTGTCGCCAAGTCTACATAGTGTCTGCCAAACrCAAACACTGTGGTTATTA 
ATAGCTGAGGACACTGAAGAGTTTTTCCAGGATCATCCX5ACAACGGCTCGTTCTGACT 
CCACATCCTCAAGTTTGTCACCAGGGGTTGAATCCTGGGCCCTTCCACATCAGACAAGTGC 
AAGGAAAGGGGTGGGGCGTACCTCTTGTCCGCGTTGTGATCAGCAAACCTAGCrrc 

ATTAACCAGAAACTATTTTTCCCCTCAGATGATGAGGATGACGAGGAGGAAGATGAAGAGGAG^

G AGCACAGCTTGG CGAG CTGATCCTCAAGCG CTGTGTTCCCATCCATCAGCAGCACAACTATGCTGCACCCTCACCCTACGTGGAG 

AGC G AGGACX5CG C CCCCG CAGAAAAAG ATCAAG AG CG AGGCTTCTCCACGCC CCCTCAAAAGTGTTGTTCCAGCAAAAGCGAAGAG 

CCTGAGCCCCCGAAACTCAGACTCGGAGGACAGCGAGCGCCGCCGCAACCACAACATCCT 

GCTCCAGCTTCCTGACGCrCAGGGACC^TGTGCCTGAGCTGGTGAAGAACGAGAAGGCCGCCAAGG 

ACCGAGTACGTGCACGCCCTACAGGCCAACGAGCACCAGCTCCTGCTGGAAAAGGAGAAACTG 

AAAGAAGATCGAACACTGCTCGGACTTGCTAAACXnTTCCCACACG 

TAAACAAACArTGTGTTGACATTAAGAATGTTGGTTTACTTTCAAATTGGTCCCCTG 

CA.CGGGGTTCTGCCATGACCTTGGAAAAAAAACTGACTCACGGGATGCTGGGTGGCTTC 

AGCCGTGGAAGTTCGGGACACTAAGGAGCTTCAGGAGGCTGTGT^AGTCACCTTGTT 

TTCTTTTTACAATGGTGCTTAAGTTCCAGCAAATG CCACAGAAGGGGGGGTTGCCATTTGATGC CCTTGGG AACACTTGTGAAATA 
CCATTGATAC^CCCGCCTTTGTATACGTCTGGGGTAATGA 

ACTGTAAGAATACCTCAATGTTTGAGGGGOATGTTTTGTATACAAAATATATTGTTAATCTG 

GGCCTGTATACTTTTGTATGCGCTGATACATAACTAAATTTGATACTTATATTTTCGTATG 

TAGATATTACTTTACACTTTTTAACTATAAACTTTTGTAATAAATTT^ 

TTCCTTGTTTACTGTATTTGTTCATGTTTGGTGACATTAACTGTGTAAAATGTCAAA^ 

ATATTTAGTGCTGCACCTTATGAGCACTTTGAATACCTCATGTTTATAAAATAAATAGCAATTAAATAATG 

CCTTCCTAGTTTGGCTGCAGTCrCTGCACACAGTCCATTCCTTACCCGGACTTT^ 

TTGTGGAAGGATAGATGGGTTTGTGAATTC 

MOUSE SEQUENCE - CODING 

ATGCCCAGCTGCACCGCGTCCACCATGCCGGGGATGATCTGCAAGAACCCAGACCTCGAGTTTGAC^ 
CCCGGACGAAGATGACTTCTACTTCGGCGGTCCCGACrCGACCCCACCGGGGGAGGACATCTGGAW 
CGCCCCCGTTGTCG CCCAGCCGCGCCTTC CCAGAGCACIAGCCCGG AGCCTTCGAATTGGGCTACGGAG ATGCTGCTGCCG 

GACCTGTGGGGCAACCCGGCCGAGGAGGATGCGTTC^ 
CTGCATGTGGAGCCK3CTTCTCTGCCCGTGAGAAGCT

GGGGCCACCCTGCCTACCGACCTCTCCCACCCGGCTGCCGAATGTGTGGACCCCGCCGTGGTOT 
AGAGTCGGCGTCGGTGCCCX5CTGCCCCCACTAGCGCCCCGGCGACCAGCGCTGCGGT 
CGGTGGCTGCTCCTGCTCGTGCAGGCGGCCGTCCTGCCAGCAGTGGGGAGG
AGCGACTCAGATGATGAGGATGACGAGGAGGAAGATGAAGA^ 

T AACAACAAGG CGGT AACCACTTTCACGAT CACTGTGCGTC CCAAGAC CTC CGCT CTGGGTCTGGGGCGAGCACAGC CTGGCGAG C 
TGATCCTCAAGCGCTGTGTTCCCATCCATCAGCA^ 
CAGAAAAAGATCAAGAGCXIAGGCTTCTCCACGCCCCCTCAAAAGTGT^ 
1 0 AGACTCGGAGG ACAGCG AG CGCCGCCGCAACCACAACATCCTGGAGCGTGAGCGC CGGAACG^ 

TCAGGGACCATGTGCCTGAGCTGGTGAAGAACGAGAAGGCCGCCAAGGTGG 
CrACAGGCCAACGAGCACCAGCTCCTGCTGGAAAAGGAGAAACTGCAG^ 
TCGGACTTGCTAA 

HUMAN SEQUENCE - GENOMIC

TACAGATATCTTTCCCTTATGGAATAAAAGCAGGGATAGGGCTCATTATTCCGAT 

T AC CTT AT ACTACCTTGACTCTGC AGT AC CTT C'TTTCAAG C CTAAAGTT ACITG ATT ACGG ACAAGCCATG AAG AGT AAAGGGGTG 
TGGCCCAGAGAGATTTCCAGTTGAAGAAAGAATGATTCAACAGTC^ 
CACACATGCTATGGTCTATTGCTGCGGGGTACATGGAGCCCCrrc
GACATCCATTATAAGCCAGATGGCTACTTTGTCTAATCCT 
TCATGCCTGTCATCTCAG(^CTTTGGGAGGCCGAGGCGGATGGAT 
TGAAGCCCCTTTTCTACTAAAAATACAAAAAAAAAAATT^ 
TAAGGCACAAGAATCGCTTGAACCCGGTAGGCGGGGGTTGCAGTGAGCCG^ 

TGAGATTCTCTCAAAAAAAAGAAAAAAAAGAAAGAAAGCAAGAAAG

TAGCAGAACTCGCTGTCATGTCTGGTGTCAGAGCTGTTCT^ 
ATGGGAAAGGGTGCTGCACTCAGGAACTGCTC^TCCTGTT^ 
GCCGGGCACGGTGACTCACACCTGTAATCCCAGCTCTTTGAGAG^ 
CCTGTCCAACATGGAGAAAGCTAAAAATACAAAAATTAGCC^ 

AGGCACGAGAATCGCTTGAACCTGGGAGGAGGAGGTTGCAGTG

TAAAAATAATAATAAAAATTAAAAAAGAACCCCCTTGTTGGCGGCAAGCA 
TAGGGTTAGGGGAAGGGAGGACGAGATTCCTTCCAGGAOVGTCTCCAGCT 

AAGG AAAAGCCTTTCT AGGCCAGAAGT AGCAG CTT CTGG AAATG AG CAGAT AGAATCACTG AT ATTT AT CAGTGATTGTT AT CTG A 
GTCCAAGGAAACTGAAAGC^CCAGCAGGCAGATTCTAGCAATTCCACT 

GGAATGTATCCAGCATTATAGGAAAATGAAATTAGCAAGAATCAGTGGGTG

AGATCTGAAAACTTCAGCACZAGCGCGAGGTGATCTGGACACATCCA^ 
AGACCCCTGTCCTTCGTAGAGAGCCTGCCTGTTTCAAGGGGCAGCGCCAAAT^ 
TGTAGCCCTTACAGATTCCTACTAAGATGAGGATCCCCTAATTGTGAGCCATTCCAGTGCACT 
CAAGGGGAAACCTATTTCTCCCCAAACCCCATACCTGTGGAGCCTGGCCCTGAGCAGCT 

4 0 ACTTTTAGCATAAGGAAACTAAGGGAAATGAGAATTCCAACTCCACAGCAGC^^ 

TTTGCAGTGC^TTTTGACTCACTCCATCTTAGCC 

AGGGGTCTTTGAGCTCCCTCCAAGGATGCAAAACTAGTCTCTGAG 

GTAATCAAATTACTGGCATTAAATGAAGATAATTT^ 

CTCTCTCAGT<^TGGATCTCAAACTCTCCrCCAG CCTCAGATGTC AAAAAACT CCTTGGGCACATCCAACATCATA 

TACGTTCAATGGCTTTGATGGGCATGATTCTAATCTTAATTAAAGCAGTGCCCG

CTCAGCTGACTCAGTTCAATAGCJU3TTTGGTGGGAAAAATG CC AACTGG CCTGGG AAACAAAGG AGAAATTTGGGGGTGACTTTAG 
GCTTCACAAAAGGGTCCCITACATCACTTCAAATGATT 
AAGAAATTAAATTTGTTCATAACCCCATTGCTTTAAAAAAAAAAAAAAAGT^ 
TAGCACTTTGGGAGGCGGAGGTGGGTGGATC^CTTG^C^TCAGGGG^ 

ACTAAAAATACAAAAGTTAGCCTGGTGTGGTGGCATGTGCCTCTAArrCCAGCT

ACCAGGGAGGTGGAG<;CTGCAGTGAGCCGAGACCGTGCCATTGC!ACT 
AAAAAAAAAATTAATTAGGCATAGTGGCGGGTGCCTATAATCTCAGCTACT 
GAGGCAGAGGTTGCAGTGAGCCGGGATCACGCCACTGCACTCCAGCCTGGATGACAGA 
AATAAATAAAAGTTATATCAAGAAGAAAGTrrCTCCCTCCACCTGACTGATT 

GTGCACATTCTTCTCAATTTGTTACTGGGGGGTTGATGCAGATATATCTAAGTACA

TATACCrTGCTTATTTTCCTTCATGTTCAGGAACATTTCCCGTCTG 
AATTCCACTGCATGGATGTATTACATACCATTCrCCTGTTTAATTTGTTTTAG 
TTATTGAGAATGATGCAGCCACAt^CATTCTTGTTTATATGCTTTT 
GAGTGGAATTGCTTTTCCTCTTGGGGCAGGCACACCAACAGTGTA 

TGGGCCAATCTGCTAGACCCAAAAATGTTAATCTCACTGAGATTTAATTTGCATT

TCATATGTTTATrGACAATTTGC ^T ' l 'rCl T ATTCAGTAAATTACCTGCT 

TCTATTCAATTTGTGGGAGCTCTTTGTATACCTGGGATCTAATCTTTGACTCATATATATC^ 

AACTTTTTATCATAGTAAAATTCTTACATCCAGCACCCCCACCCCTCCCCTG 

agagagctagcrrttactgttttccaaatgaagtcaatcgatcccaag^ 

tgagggaggtgtgaagatcctaggagaaactgtgctcttgtcccrrgaagacaga

caaggtctcactctgtcatccaggctg^sagtgcagtggtgtgatgatggc^ 
ttcctgcctcagcctcccaagtagctgggaccacaggcacgcaccc!accacgcccagctaat^ 
gtttc^ccatgttgcccaggccggtctcaaaatcctgggctcaagccatctcccct 
gcgtgagccaccatgcctggccaacccagcagat(nttaattcacaatcrgga^ 

gtcatgatttgggcccttctcaacatgtgtatgtgtgtgttgtttctto

taaaatattctcctctctcaaaaatcaccttaattataaaccaaatgct 
aggcatccatttcagatgtgggtctccttccctggtc(^cagtgagctccttgcggct 
cacccaggggctcagacacagtaatagatgggtacttgtctc^ 
cttataaagaaaagaaatgtatttctcatgtttctg^aggctgagaa 
tggtgtaccatcccctggcaggaaaaagg
CCATTCACAAGAGTGG AG CCCT CATGACCT AAACACCTCACATT AGGTC CTG AGAC^ TGG CTGCAGAAT AAT

AGAAGATTTTGGGCAGCAATTTCACATGACTAGCAAAAGGAAACTGTTG 
ACAGAGTGTAGACCAAGCTGGCTAAGACCCACTGAACCCAGCATGGCACTG^ 
CGCTCATTAACATACTAAATCACACACCCCACTAGCA 
CACAGTTCTGAGAAATCTCCTCCTTTTTCTAGGAATGATCATG

CCCAAACCCCXTTTCGT G TGAGTCrrCTCGAGTACACCCATATTC^ 
CTTTCACTATTTTCCAACTGGTCTrrcAATTCATTCT 

GCATTTGGGGACCTCCCCTCCAACACTGTTGCACTGGGGGTTAAACTTCCA 
GCAGTACTGAACAAAACCTTGGCTTTGAAACCATATGAAGATTTGTTTTGTG 
AAGAGGGATTTTCAGGCTTCTCTATAAGCAGACTAAGAATCAGCATTAGCATGT^
CTTACAAATGAAGGTAACCAATAAATATTTGTTGAATTTTATCATTCTGT^ 
TTGCATTAAAAGCAGGAGATTAGAGATTGAGGTCCAGGAAGTATAAATGATTTG 
TCCCTCTGTCCCC CITIT CTGCTAC^CCTTGTCC 

CAAAGGTAGGACTTTGAATCGGAGCAACTGAACCCATCCTGGCCTATTTTCACA 
GGACTCATTCAGTGATCTTGGTCTrGAATAATACCATGAGTACTTACTATATAC^

GGAGCAAAACTGACCCAGGTCACAGGTTAGTGAGGGAGACTGAGCCTGTCTTCCACAGTGTCACG 
CTCCCAGGCATGGATCATTACAAATGGAGGTGAGTGCTGGACAGGAAAGGACAG 

GTGGCGATTAGGGGGCT AGGGAAAAG CAGAAGTGGGGCTGT AAAAGGCTTTTGCAAAG ATGAAGGCTTT^ 
TAACAATAATAAAGGTG CTTGGAAGGGCTGCTCATCCG CATTC ATGGGCCTCTTCCT^ 
TCTGTAGGCGTCCAGGGTCAGGGCTGGAAAGGCCTTCACAAATGCT
ATCCTCCTCCTTAACTAGATCCAATGGTGAGGTGAAGA^ 
GAGAGGCGGGGAAGCCTTCCCACCCCAATCCCAGGGCCAG 
ATCCCACCTCACTCTGTATTTX^TTGTGAAGGGGGCTC 

AATGTCTGACCCCCTGCAAACAAATACACCTGTAAGAAAAGCTAGACCAGAGTGAAGTAAAGCC^ 

GGAAGGGGCCAATGCTGCrGCTGGACAGAGGGGTGTTATTTTCCCCAGAAGAATAGCATGTCGTC

GACACAACTATTATGGGCTGGGGACTTATAAGAGCTTTCATAACTCrCAGGTAGGAGG 
AAACTGAGGCACAAAGAAACGAAGTAGTGACAGCAGCTTTCCTTTTTCCCT^ 
TGGGTCTCAGGAGGTGTGGACAGCACTTCCATCTCGGTAAATCAGAT^ 
GGTTAACTTGGGAGCCCTGGGGACGGCTGCAGGGAACGTACGCCCATTCCCCCTAGGAAGC^ 

AGTGGTGAGGACGCGTGGACAGTCCCGCGG

CGTCCTCrCCCTGCTATTTTGCACCTTCGGACT AGCCTTCTTT CGTAATTACACAGGAG CAACCTCCCTGCAAGGCCTTGCTCAAC 
GTTGG CCTCGCGCTCAGCTGCACAACACGCAGTCAAAGCGGGGG CTGGGTTAGAAGCATCGGTCTCCCCTCCCCAACACACACCCC 

3 5 CGGAGCCCrCCGTAATTTTTTTTTCTTTT A ATGACAAGCAATTGCCAGGCTCGCAGGGTG 

C AG CTGGTTCTCAGAGTG CAGCCGGTGCAAGCCCGGGGGTCCAAAAGGGCGGGAGG AGCAC ACCCTGGG CTTCCCAGCTTTGCAG C 
CTTCTCTCTGCAAAGAAAAGCAAGTGGCTTTTGGCGCGAAAGCCTO 
TAAAGCCGCTTTCCTCTCCTTTCTCCCTCCCCCITGTCTGCGCCACAGCCCC 
TCAGTTAATAATATCCCCCGAGCTTCAAAGCGCAGGCTGTGACAGTCATCTGTCT 

GGAACTGTGTTGGAGCCGAGCAAGCGCTAGCCAGGCGCAAGCGCGC^

CCACCCGGAGACACCCGCGCAGAATCGCCTCCGGATCCCCTGCAGTCGGCGGGAGGTAAGGAGCAG 
CCCAGGGAAGCGACGAGCGCCGGGGCAAGGCAAGCCCTGGACGGGATTGCGACGTGCGCACCGGGCGCCCTAATA 
CTGTTTCTGCTTCCGAAACAAAACCATCTCTGGGTTTTCCCAGAAAAGCCAGTTC 
CCGCCCTAATCCTTTTGCAGCCCTTACCGGGGGGAGTAATGGCTTCTGCGAAAAGAAATTCC 

4 5 GTTTGAGCTGTCGGAGAGCCGGTGCGTCCCC^CCCCAGGCTGGGGTTCTTCTCCAAAGGGTGCCCCT 

ATTAGGCAGGGCGAGGCCGCCGCGGTCGCAATCTGGGTCACGGCT 
TCGCGCGGGCGCCCCTGCCATTCCCGGGAACAGGGGCTCAGCCTCTCCCT 

AGGGGTGGG CTTAGAGAGCTTCCAATT AAGCT ATTGG CAGGAG T AT C C CTGCAG CGGGTG AATG CCGAGGGG CGTTTGCTCAAATT 
TGGGGAGGGGAAGGATTTGTGGATATGGGTGTCTGTTGTTGGTCTCTGTCTAGAGAAA

AATCCCCTGCTATCATTTGCACTCCTGAGGTTGCATTTTTACAAAGGGGGTA

TCGGAGAGCCTGGGGCTTCCCCTGAGCAGCCGGCCCCACACCG CTGCGAGTGCGGTTGTCTGCGTG CTCGTGAG AGCTAGAATTCT 
GCAGCCAGGAACAGCCCCCTCCCCCAGGCAGTGCCTTGTGTG^ 
CCTGCATCTGCATGCCCCCTCCCACCCCCTGTCGT^ 
CTTCCTCCACCTTCGGGAGCAGTGGGC^GAGTGGGGGGCTTGGAGGGAA^ 
ATCCCCTCGGTCTGCCCCGTTTGCCCACCCTCTCCGCT^

GCGCCCCCCACGGGAAGGAAGCACCCCCGGTATTAAAACGAACGGGGCGGAAAGAAGCCCTCAGTCGCCGGCCGG^ 
ATGCCGAGCTGCTCCACGTCCACCATGCCGGGCATGATCTGCAAGAACCCAGACCTCGAGTTTC 
CCCGGACGAAG ATGACTTCTACTTCGGCGGCCCCX3ACTCG ACC CCCCCGGGGG AGG ACATCTGGAAGA^ 
CGCCCCCGCTGTCGCCCAGCCGTGGCTTCGCGGAGCACAGCT^ 

6 0 CTGTGGGGCAGCCCGG CCG AGGAGGACG CGTTCGGCCTGGGGGG ACTGGGTGGCCTCACCCCCAACCCGGT CATCCTCCAGGACTG 

CATGTGGAGCGGCTTCTCCGCCCGCGAGAAGCTGGAGCGCGCCGTG AGCGAGAAGCTGCAG CACGG CCGCGGGCCGCCAACCGCCG 

GTTCCACCGCCCAGTCCCCGGGAGCCGGCGCCGCCAGCCCTGCGGGTC^ 

GCCGCCCTGCCCGCCGAGCTCGCCCACCCGGCCGCCGAGT^ 

GCCAGCGCCCGTGCCCGCAGCCCCGGCCAGTGCCXCGGCGGCGGGCCCTGCGGTCGCCTCGGGGGCGGGTATTG 
GGGCCCCGGGGGTCGCCCCTCCGCGCCCAGGCGGCCGC

ACCCTG AGCGATTCAGGTAAAG ACCGAACTCGGGTCCGGCTGCCTCC CTGGGG CACTGG ACCCCGGGTCGCGTCCCCTTTGTTAGT 
GCTCGTATGTCTTGGCCTGGGGAGCATTlTGGAGGCAGTGCTAGGGGCAGAGAGGTCCTGTTTCCC 
AAGAGAAGGGGCTGAGAGAATGCCGTTGCAAAAGGGGTGCTCTCCAATTCTCGCCTTCACTAAA 
GAGCCCTCCTCTAGGCCATCACGGGCCCTCACCCX3GTCCCCCACCTCTCTTTTGCAGCGCAGTCT 

TTGGTGGCTAAACCGGGTGGGGGTTTAGGGGGTTGCTC

CTCTGGCCCAGGTCAGGGGAGGGAAAGACATCCCGAGAAGATTCAAGGGCTGTGCAA 

GAGGGTTGCACAGATGGCTAGAGCCGATTTTCTATTCTTTTTCTTTTTC 

CCCCGATCCTCGGTGGGTGGTGGGCTATTTGCTCCTGGTGCGTGGCCAGCAGGCGGCG 

AT CTG AAAGG CTGGGGG TGGTGGGGG CACCCT C CCTCC CTCCATTCAG CAGCTGG CT G CAAGTG CAACAG CAGTTGTGT ACATT CT 
7 5 CAGGGGGCCTCCT CTTTCCAGTCTGCAGTGGAAACTGG CTGTAGTTTTGTCT^ CCAGGCCTAATTTGAGATGTG 
AGTTGTATCTGTAACCCAGTGCCCTTGAAGGTGAGGGCAGGCACTCAGCAGCCTCTCCAGG
CTGATTAGTTCTATTGTGTTCATTTGTCTGTGTCTTAAGCrGAAGG 
ACAGAACTCAACCCAAAGAGTGGC^TTGCCTTGTCCTTGGAGC^^ 
AGAATGCAGGTGGATAAOVAGCCTAAATTTAC^ 
CTCCAGTTTGAAACGGCTCTGTCCCTTTAATGTCTGTGCCTTGACAGCTT^
TTGGCAGAAAACCAAAACATTGGCTTAAAGGGACCCACAGACTGG^ 
TTCAGCTTTCCGGTCCCCTTCAGATCAAGCAGAAGATA 

CGTGGTAGTCAATGAGGAGAGAGGGGAAGAATGCGCACATGATGCTACACGTTTCTGTTC 

GAGAG CTGCTCCCATTTGGGGGTTTATACCAACTGTGGATTATGG CTTTGTCATTAAGATTTG ATCTTTGTTAAATG AAAAACTGT 
1 0 TTATTGTATAAAACTCAGGTTTGTGGACGAAAAGTTGTl'l'T'l i 1 1C1 1CAGTTAATTAAATTGTTCCTCAAGTTTGTTTAAGGACT 

TAAAATCAAACACAACCATGTGTAAACTGCTAAATGAGGCTC 
TAAATAAGTCCACAGGGCAGACTGGTGATTATGATAAAAGCTACCA
GTCTACATGTTATCTGTCAAAGTGGTATCATCCCCATTTAATA 

GGTCATGCAACTAGTCTGCGGTGGAGCCAGGATTCTGTCTGACCCTAAAGGCCAAGTTCTTTA 
1 5 AGTCTTGAATGGAGGCTGAAAGCACAGTTGGGGTATGGGGAAGAAAAATATATATACATACATATATGTATATGTATC 
TATGGGGGGTTGTTTTGTITTTGTTTTTGATAAGGAGTTTrc 

CTGCAACCTCCGCCTCCCGGGTTCAAGTCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGA^ 
AGCAAAGTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTG^ 

GCCTCCGCTTCCCAAAGTGCTGGGATTACAGGTGTGAGTCACCGCGTCCGGCCTACAGATAT^ 
AAATACAAAACTGTCCACATCTATGTTGATGGACCCATAAAAATAGCAGTCTGCCAGGGTCTGCCGG

ATATTAACATGGATATATATGTGAATTTCATTCAAATGGTTCTCACATGAGAGTAACTAGCATCTTTCTCTCAGAT^ 

ATGAAGAGGAAGATGAAGAGGAAGAAATCGACGTGGTCACTGTGGAGAAGCGGCGTTCCTCCTCCAACA 

TTCACCATCACTGTGCGTCCCAAGAACGCAGCCCTGGGTCCCGGGAGGGCTCAGTCCAGCGAGCTG 

CATCCACCAGCAGCACAACTATGCCGCCCCCTCTCCCTACGTGGAGAGTGAGGATGCACCCCCACAGAAGAAGATAAAGAGCGAGG 

2 5 CGTCCCCACGTCCGCTCAAGAGTGTCATCCCCCCAAAGGCTAAGAGCTTGAGCCCCCCAAACTCTGACrrCGGA 

CGCAGAAACCACAACATCCTGGAGCGCCAGCGCCGCAACGACCTTCG^ 

GGTAAAGAATGAGAAGGCCGCCAAGGTGGTCATTTTGAAAAAGGCCACTGAGTATGTCCACTCCCT 

TTTTGCTGG AAAAGG AAAAATTG CAGG CAAG ACAG CAGCAGTTGCTAAAGAAAATTGAACACG CTCGG ACTTG CT AGACG CTT CTC 
AAAACTGGACAGT CACTG C CACTTTG CACATTTTGATTTTTTTTTTAAACAAACATTGTC 
TCAAATCtKnxrCCCTGTCGAGTTCGGCTCTGGGTGGGCAGTAGGACCACCAGTGTGGGGTTCT
TCCCAGGATGCTGGGTGGCCCTGCAGCCTCCTCCACCTCACCTCCATGAC 
GCTGTTGAAGTCACCTTGTGTGTTCCAAGTTTCCAAACAACAGAAAGTCA 
ATGCCACATAAGGGGTTTGCCATTTGATACCCCTGGGGAACATTTCrcTAAATACCA 
GGGTAATGAGAGGTGGCITTTGCGGCCAGTATTAGACTGGAAGTTCATACCTAAGTACTG 

3 5 ATGTTTTGTATACAAATATATTGTTAATCTCTGTTATGTACTGTACTAATTCTTACACTGCCTGTATACTTTAGT^ 

ACATAACTAAATTTGATACTTATATTTTCGTATGAAAATGAGTTGTGAAAGTTTTGAGTAGATATT^ 

AAGAAACTTTTGTAAAGAAATTTACTATATATATATGCCi rTTTCCTAGCCTGTTTCTTCCTGTTAATGTATTTGTTCATGTTTGG 
TGCATAGAACTGGGTAAATGCAAAGTTCTGTGTTTAATTTCTTCAAAATGTATATATTTAGTGCT 

T ACCTCATGTTT ATG AAAATAAAT AG CTT AAAATT AAATG ATG CAACTCAAC CTTTT C CTT AATGG CATT ACACT CTGTCC CTT AA 

4 0 GG AGCAACCATAAATAATCCATAACCTATAGAGGGAATTTGGTTTCCGTAAATAGCCCT?TTTTC 

GGGCATGAGTCATTGTCCCCACTTAAGGTGGAGGGAACTGAGTTTGGGGAAATTAA 
ATGTTATTAGCGACTATTGTGTCCATTGTAGCAATGGCATTTGATAATTT 
ATTAAGCTGAGACCTCAGATCAGTCCCTTTAAAAGAAAAGTAAAAATAGCCACAGGGT^ 
TAGTATTCGTCCCATTGGCAGACAGTCTTCTATTTTAAGGTAGAGCATAGT 

4 5 AGGATCTGTGGAGGTTCTAGGCTAAGCAGAGTAAGCAGAATAAACAGATGGAAATGGGITC^ 

TCACCAAAGTGTAATTCACGAGTGACAGTTACGTAGTCCAGAGCAAGTACCAGGTGCAGGAGAGACCCTGGG^ 
CCCATATGGGTGGCATGGCTCTCTACTTGCCATTCCTATTACAC^ 
CAATCITGOVGGCTCCAGAAGACAGAGCCAAGACTTGTGTGTTAAGGAGATAGTr^ 
AGCAGCHTCTCAGAACTAAAAGTGTACAAATAACCTATTTCTC^ 

TTCATCTCAGATTTTTAACTGTGAGCCGAAAAAAA

AGCTTTGCTTGCCTGGACACAGAGTTTGTCAGGGGCAGAGA 

ATTTGTTTCTTTTGAG ATGGAGTCTCACT CTGGTG CC CAGG CT AG AGTG C AGTGGTG CAATCT CGG CT CACTG CAACTTCCG CCTC 

CCGGGTTCAAGCGATTCTCCTGCCT(^GCCTCCTGAGTAGCTGGGACTACAGGTGTGTGC 

TTTTAGTAGAGACCAAGITTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTTCT 

5 5 TTCTGGAATTGCAGGCGTGAACCACCA.TGCCCAGCCGCTTTGGTCATTTTATAAAGGCAAGTAGG 

TATCXTTGCTT ATCGCACACCCTTGAG ATACGAGGTTGGTG ATACTAC CATGG ACATCTTTCTGATGGT CTCGCCATGTAG AAAAT A 
AGATAAAATGCCCTTTAATTCTCCTAGTCCCCATCCCCAGACTAAAAAATAATGCACC 
CTTCCGTTCGTTTGCATTTTACAGGCCCCAAATCACTCACGGCCTCTGCCCCTGTGTGTGTGCAAGAGC 
ACCTCACATTAATTGTGTACCTAAGATTCTTGAAGCCCAGCTCAGTCATCCCAGTTTCT 

CCTGAGTGTGTCCAATGAAATGGAATGCTTAACCGTGAAGAAAGAAAACT

CTATGCTCTCTTTTTGCTCGGGCTGCTGGGAAGCTOVGAGCCAAA 
ATCTTTACGGTTTCTTGTTTTCCATTGTTAGGTTAGTCATCTTTTT 
TTAGGTGGACCACAAATATGCACCTAAAGGATTAAAACTAAAGTGAAAAAATCCCAT 
TGTCACCAAATTTAGCTCTGCAATTATCTATAAAGTAATTGCTGTAGTCA 

AACCATCTCTGCATTCCAAGAGCAGTTTTGAGACTGGGA

GCACAGAGATCATCTCATTGAAATTCATTTCTCAGAATGAG^ 
AGAGCCGGTGTCTCCTATGACCCGCCCAGTGCCCTTCCCCATAACATTCACTC^ 
TGGCCTGCCCGTCCACCTACATC^TGCTTCCATATTAACCCTTCTAAACCACCTACTCTTTA 
TATTTACAGAATCGAGTCCCGACTCTCCAGCCCAGCACAGAGGCACTTCTACCCAGTCCCACCAGCT 

ATGTGCTGGACTCTGGCTGTTTCTCATCCGTGACTTGTC

CCTTCTCAATCTTGCACCCCCAGGTAGGCCrTTCCCAGCCCTTGGCCTCrGTC 
GACCCCCTAAACTGTGGGCTCCCTTAAGGCAGATAGGGCTCTCATTTGCCTCTGATTCT 
ACAGAGCCTTAGCAGATGCCAGCAGTATAGACGGTAACAGAGGAAGGAGC^ 
AACTCCAGTTCACAAGCTCTCAGGCTAATTCAGATGGGAAAAGACATCCCTTAG 
7 5 G CAGTTCTG AAGAAGTGAAG AT AAAATG CTCAT AG AGGACTG TCT ACAAGG AG CCCACAGTCTG CAAG AGAGTT AAAG C C CTGGGG 
GTTGTATAGAAGG CAAGAAGAGAAACCAAGAGG CACTGGGTTG AGGGGCTGATGAATGAAAG CATATGAGGGGCCTAGTGGCTG AG
TTTCGGGCTATCTGAAGGAGGAAGCTGGTC^ 

TGCCTCAGGCCACAGATGGTCTACACTCGTAGTGAAAAAGGTTGCAATGCCT 

TTTTTGT ATGCATGAGTCTCAAAAAT C CTTGAGG ATTTTTGACCAG TTT ATCACATT CC CACATT AT AAGTGAGGGAATCTGGTCT 
TCTGAGTTTTAGTTTCATTAATCCCTATGGTGGCTATGTTT^

TTGGAAGAGGCATAGAAGCCAACTAGTTTACTCTCCCATTTCAGGG^ 

TTATTTTAl 1 TTTC1 1 I ' ll 1 1 TGCAACAGGGTCTTGCCGTGTTGCCTAGCCTGGCGTGGAGTGGCTAATCrCAGGC^ 
TGCIX5ATCAGCACAGGAGTTTTGACCTGCTCCA 

TACTCCCAGGAGTTGCCATATTGATGCTAAACTITAGTGTGGACATCCAGTAG 
1 0 AAGTGATCCTCCTGCATCAGCTTCCrGAGTAACTCGGACAACAGGTG^ 
TTTTCTTTCCCTCTTTCCCATTCTTTCTCTCAACAAA 

AGACATCAATGAACAAAAAAGACAAAACTCTCCCARATCTTCTCTGCriTCr 
ACAAATAAACATAATAAGCAAATAAATTAAACAGCATCTAATATGGTGGCAAATGCC 
TTCAGGGATGCCAACAGCAGAGGTAGGAGGCTCAGGGAAGGCGCATCTGGTCCT 
1 5 GTCCTGCCCTGGGCACrCTTGTGCTTGCTCAGAAACCAGATCTO^ 

CTGGCTGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTT^GGGAAGCC^ 
AGCCTGACCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTA 
CTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAGACGGAGGTTGTAGTGAGCCG 
GGTGATAGAGCGAGACTCCGTCTCAAAAAATAATAATAAAAAATAAAAAATCTTCCCTTATCTCC 

2 0 CCCCACC^TTTCCATTCATTGTCTCGAGGAGAAACCATTGAGCATAGTAGTT 

ATGCCTGTAATCCCAGTACTTTGGGAGGCCAAGGCAGGTGGATCACCTGAGGT 
AAACCCCATCTCTACTAAAAATACAAAAAAATTAGCCAGGTGTGGTT^ 
AGGAGAATTGCTTAAACCCAGGAGGCGGAGGTTGCAGTAGGCCAAGAGCGCGCCATTGCACT 
TCCGTCTCAAAACAAAGAGTATGAATCTGAAGTTAAGACTC 

AAATACAAAATTAGCCAGGTGTGGTGGCTCATG CCTGTAATCCCAG CTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGG 
AAGGCAGAAGTTGCGGTGAGCCAAGATCATG CCATTG CACGCCAGCCTGGGCAACAAGAGTAAAACTCCATCrCAAAGAAAAAAAA 
AAAAAGTTAAAGACTCCCTGGGCTTGAATTTGATTCCCAAACTCACAGGCT 
GGATAACACTGGGATTATTAGCGGGTTGGCATCATTATTATTAATATGTTTTAAAATG^ 

3 0 GCAGTGGCCTGATCTCGGCTCACTGCAGCCTCGACCTCCTGGGCTCAGGCGTTTCTCCC^ 

ACAGGCGCATACTACCATACCCAACAAATTTTTTCCATTTTTAGTG 
CCTGACTTCAAGTGATCCTCCTACCTTTGCCTCCCAAAGTGCTGGGATGA 
TATATACAGCACATAGAATAGTGTCTGGCATGTGACAGGTGTGGT^ 
TTCCTCTCTCTATTTCCTCAGTATGTTACCAGGGCAAAGAACCTTTAGAA 

3 5 AGTTTAACAAGAACTTTGCAGACTTTTGTTCCTOiAAAGAGGCCT 

GAGCTTTGAAGCCTATTTTAAGACGGAGCCTAGGAATTTGAGTGCCGCAGAACATCTACGCCT^ 
CTTACTTGTAAAGGAAATCTCTTCATAGGCCACTGAC 

TCATCCTGAAACCTTTGACCAGGAGAGGTAACAGAGCCrrrrTATGGTCGAA^ 
GAGCACAC^CAGCACAAACTCCTCTGATAGCTACAGACAATTTGC^ 

4 0 CTCTGCATTCCAGTGGGCCTGAGATTCTCAGCAGACCTTTGAGGTAGGG 

TCAGTTGAATTTTGTGTGTTCTTGTTTTGTGGCTCAGGCTATCTTC^ 

CTGGAAAAATGTTGTTCCAGGTTCACTGGGAGGCAATCCTGAGTCACAGAAACACTTAG 

TAGATCAGTTATTGACTATTTTCAACCTTGAAATAGGAAAGTAA 

G AAGTCAGTGATCTCTG<K^CTCCCTGTCAACTTTGAGGATCCAG AGTCATTGGTC CTTTGCAATGGCTG CAGCTTTAAAGGTGGC 
45 TGCAAGACTGC^GTGTCTGTaUVCACCCTCTGGTTGGAAGATGTGCCCrGTCAAGTTCATATGATGG 

ATAGTGTTGGGAGGTGGGTACTAATAAGAGATGATCAGATCATGAGGGCTCCGCCCTCTCAAATGGATTAATGTGCT^ 
AGTGGATTAGTGACCTTGAAACCGCCATTACAAAATTGT 

AACCTCCAAGCTGTCCTTGTTCTTTG CTGGGCGTAGG CTGAACTAACTTTGGGAGG AACTTAGTTTGTAG ATTAGCCCTTTCCCAA 
AACAAACCTCCTrCTTGCCTGGGGACTAGACTCCCTTTGTGGGACr 

5 0 TGC AGCTGG AGGCT ACAAGATTC CGACCCTCCCTAAACTGCTCCTAAGATCAGTG CTTGAGATATTCTGCAGACCCTG CACTTGAT 

GATCAGCTGGCACCACCCAGATTGATAAACTGGCTCATCTGATCCTGTC 
CTTCGACTCCCTATGATTTCATCTCTGACCAAITAGGA^ 

TCTGATCCCCAAATGCTTGAGGAGACrTCATTTGAGTAATAAGAAAGCTCTGGTCTCC 
TTCTCTATTGCAATTTCCCTGTCTTTATAAATTGGCTGTGTCTAAAC^ 
GAGTGAGTGGGCTTATTATGAAAGCGAGTTAAGCAGCCT

ATGG AGTG ACACAGTAAG AAGG CCCTTG CCAGGTGCAGGTCTCCCG ACCTTGG ATTTCTCAGTCTCC AG AACTGTAAG AAATAAAT 
TTCTTGGCCGGGCGCGGTGGCTCACX5 CCTGT AATCCTACCACTTTGGGAGGCTG AG ACAGGTGG ATCACCTGTGGT CGGGAATTCA 
TGACCAGCCTGACCAACATGGAGAAACCCCGTCTCTACTAAAACTACAAAATTAGCTGGGAGTGA 

G CT ACT CAGGAGG CTGAGG C AGGAG AATCGCTTGAAACCGGGAG G CGGAGGTTGCGG CGAG CTGAGATCGCGCCATTG CACTCCAG 

6 0 CCTAGGCAACAAGAGCGAAACTCCGTCCCAAAAAAAGAAATAAAA^ 

TAAATTACCCAGTCCGTGTCATTCTATTACAACAGCAGGGAAGAGACTCAGATAGG 
CACCAACTCCATGGCTGAGGGTCAGTCACTTCCCTCTGAATCTCACTGTCT 
AGGTTCTCATAAGGACCAAATGACGTATAGGAGGGAGGTGCTAGCACTTGAAGATCGGGC^ 
CGATATTGACACATTCAGCCCTTGTCCAGC^CX^CCCACCCATTCCACTGlT'r 
6 5 CAGGTGGGAAACTCCTTTAAAGACAATTAGCATACCTCT 
TCATAATGTTTGTTAAATAATGACAACCAATGAAAAGACT 

AAACTGGTTACACAGGCTTGTCTCATCTGGAACTATGGAAAACTATGATTAATCT 
AATATGGCAACAGACATGTAACACCCTGAGTTTTTGCTTGGGTTGGTTTAGCCACT 
GAATTrAAACTAGAGATTTGAAACCAAGGAAGCTTTAGTTAGATGCATCTTT 
ATTATTATAAATAGC



HUMAN SEQUENCE - mRNA 

CTCCGGTGTGTCTGTCGGTTG CAGTGTTGGAGGTCGGCGCCGGCCCCOT CCTTCCG CGCCCCCCACGGGAAGGAAGCACCCCCGGT 
ATTAAAACGAACGGGGCGGAAAGAAGCCCTCAGTCGCCGGCCGGGAGGCGAGCCGATGCCGAGC 

GCATGATCTGCAAGAACCCAGACCTCG^GTTTGACTCGCTACAGCCCTGC i i <^ i ACCCGGACGAAGATGAu i i CTA^rrCG<iC\J<iC: 
CCCGACTCGACCCCCCCGGGGGAGGACATCTGGAAGAAGTTTGAGCTGCTGCCCACGCCCCCGCTG^
GGAGCACAGCTCCXLAGCCCCCGAGCTGGGTCATC 

TCGGCCTGGGGGGACTGGGTGGCCTCACCCCCAACCCGGTCATCCTCCAGGACTC 
CTGGAGCXSCGCCGTGAGCGAGAAGCTGGA^ 
5 TOCCAGCCCTGCGGGTCGCGGGCACGGCGGGGCTGCX5GGAGCCGGCCGCGC 

CCGCCGAGTGCGTGGATCCCGCCGTGGTCTTCCCCTTTCCCGTGAACAAGCGCGA^ 
GCCCCGGCGGCGGGCCCTGCGGTCGCCTCXK3GGGCGGGTA 

CGGCCGCCAGACCAGCGGCGGCGACCACAAGGCCCTCAGTACCTCCGGAGAGGAC^ 

AAGAGGAAGATGAAGAGGAAGAAATCGACGTGGTCACTGTGGAGAAGCGGCGTTCCT 
1 0 ACCATCACTGTGCGTCCCAAGAACGCAGCCCTGGGTCCCGGGAGGGCTCAG 

CCAC CAGCAG CACAACT ATGCCGCCCCCTCTCCCTACGTGGAG AGTGAGGATGCACC CCCACAGAAGAAGATAAAGAGCGAGGCGT 

CCCCACGTCCGCTCAAG AGTGTCATCCCCCCAAAGG CT AAGAG CTTG AGCCCCCGAAACTCTGACTCGGAGGACAGTG AGCGTCGC 

AGAAACCACAACATCCTGGAGCGCCAGCGCCGCAACGACCTTCGGTCCAGCTTTCTCA 

AAAGAATGAGAAGGCCGCCAAGGTGGTCATTTTGAAAAAGGCCACTGAGTATGTCCACT 
1 5 TGCTGGAAAAGGAAAAATTGCAGGCAAGACAGCAGCAGTTGCTAAA^ 

ACTGGACAGTCACTGCCACTTTGCACATTTTGATTTT1TTTTTAAACAAACA 

CAGGATGCTGGGTGGCCCTGCAGCCTCCTCCACCTCACCTCCATGACAGCGCT 

GTITGAAGTCACCTTGTGTGTTCCAAGTTTCCAAACAA 
2 0 CCACATAAGGGGTTTGCCATTTGATACCCCTGGGGAACATTTC 

TAATGAG AGGTGGCTTTTG CGG C CAGT ATT AGACTGGAAGTTCATAC CTAAGT ACTGT AAT AATAC CTCAATG TTTGAGGAGCATG 

TTTTGTATACAAATATATTGTTAATCTCTGTTATGTACTGTACTAATTCTTACACTGCCT 

TAACTAAATTTGATACTTATATTTTCGTATGAAAATGAGTTGTGA^ 

AAACITTTGTAAAGAAAITTACTATATATATATGCCTTTTrCCTAGCCTC 
2 5 ATAGAACTGGGTAAATGCAAAGTTCTGTCTTTAATTTCTTCAAAATC 

CTCATGTTTATGAAAATAAATAGCTTAAAATT 

HUMAN SEQUENCE - CODING 

ATGCCGGGCATGATCTGCAAGAACCCAGACCTCGAGTTTGACTCX3CTA 
30 CGGCGGCCCCGACTCGACCCCCCCGGGGGAGGACATCTGGAAGAAGTTTGAGCTGCTGCCCACG^ 
GCTTCX5CXX1AGCACAGCTCCGAGCCCCCGAGCTGGGTCACGGAGATGCTGCT 
GACGCGTTCGGCCTGGGGGGACTGGGTGGCCTCACCCCCAACCCGGTCATCCTCCAGGACTG 
CGAGAAGCTGGAGCGCGCCGTGAGCGAGAAGCTGCAGCACGGCCGCGGGCCGCC^CCGCCGG 
CCGGCGCCGCCAGCCCTGCGGGTCGCGGGCACGGCGGGGCTGCGGGAGCCGGCCGCGCCGGGGCCGCCCT 
35 CACCCGGCCGCCGAGTGCGTGGATCCCGCCGTGGTCTTCCCCTTTCCCGTGAACAAGCGCGAGCCAGCGCCCGTGCCCGCAGCCCC 
GGCCAGTGCCCCGGCGGCGGGCCCTGCGGTCGCCTCGGGGGCGGGTATTGCCGCCCCAGCCGG 
GCCGAGGCGGCCGCCAGACCAGCGGCGGCGACCACAAGGCCCTCAGTA 
GATCATGAAGAGGAAGJ^TGAAGAGGAAGAAATCGACGTGGTCACTGTGGAGAAG 
CACATTCACCATCACTGTGCGTCCCAAGAACGCAGCCCTGGGTCCCGGGAGGGCTC^ 
40 TTCCCATCCACCAGCAGCACAACTATGCCGCCCCCTCTCCCTACGTGGAGAGTGAGGATGCACCCCCACAGA^ 
GAGGCGTCCCCACGTCCGCTCAAGAGTGTCGTCCCCCCAAAGGCTAAGAGCTTGAGCCCCCGAAACTCT 
GCGTCGCAGAAACCACAACATCCTGGAGCGCCAGCGCCGCAACGACCTTCGGTCCAGCTTTCTCACGCT 
AGTTGGTAAAGAATGAGAAGGCCGCCAAGGTGGTCATTTTGAAAAAGGCCACTGAGTATGTCC^ 

Table 18

MOUSE NOMENCLATURE 
ICSGNM Myb 
Celera mCG282S


HUMAN NOMENCLATURE 
HGNC MYB 
Celera hCG32380 

MOUSE SEQUENCE - GENOMIC

NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

tTONNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGTCOVAACCT 

CCGGCGGTGCGGAGCGCCGCTGCGCAGCCGGGGGAGGGACGCAGGCAAGGCCGGAGGGCAGCGGGAGG^ 
4 0 CCCCGGGGCTCTTGGCGGAGCCCCGGCCCGCCTTCGCCATGGCCCGGAGACCCCGAACACAGGTAATGGGGA 

CCTCGGG CTAGGGCGCGCGGGG AT CCAGG CATCCCGGAGCAGGTGGGAATGCCTT CCCTGTTGGTTTGCGGX3GCT 
GCAGAAGAGACTCGGGAGAAGGATGAATGGAAAACTCTGCGAGAAGCGGACGGCT 

TATTTTATTTCCCTTTTTCTTTCTTT1 TTCrTTTrCTTTTTG CAAGTTGCTCGX3GG ATTCGTTTCTCTTTGGGX^GTGGGGTGGGG 
GGGGGGCTCGGC^CTCTTCAGCCGAGGACTGAACGGTTCCATCAGACCCt5AGGAGCCCACCAGGTGTTTGGCTTGATGCCCCGAGT 

4 5 GTCGGCGAGGGAGTCGCAGCCTAATTGCTGGATGCATTGAGGTGTGTT^ 

GGGTTGTTTAGTAGCTGCAGGTAGTTCATTTAAATTTCATTCATTCATTATGGAGGCGAGAGAGGTG 
CTGCAAAGATTTTAGGGGGACAAAATGGGTTTGCAGATTGTT^ 

GCTTTTGCAAACACTGCACGCGCCCGCGTCGCACATGTCACCGCACCGGCCGAGATGATGCGTGCAC^ 
CACCTTCGGGAAGAGGGATCCTCTCGGAACTGTCAGTTGCTGGCGCTCAAGTCCGGCCCGACTC 
50 GGGCTCCCAG AGAGCAACAG ATGCGGGGCTTAG CTCGGTGGCGGTCGCCACTCGGCGCAAAGTG CTGG AGGCAGGCGGCACCCAAG 

GCACTCTGCTCCAGCTGCTTTTGGTGGCCCX3CCTCTGGCAAAGCCACCGACCGACCGA 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNGGTGAATTGGGGAGACC 

5 5 CCTAAGTGCCACCCTTTAAGAGGTTTCGCCTTTCGCTrTTGGGACCT 

GAAACCTGCAGCAGGAGGAGGGAGTTAGGAAGGGGTGCTCCCAGGAGGTGCGGCCCAAGCT 
GTGGCCGCGCAGGTTCCCTGGGGCTCGCAGK5TAGCCCGGAGCGGAGGCGGCAGGGAGGGGCGGGAGCCTTGA^ 
GAAGAGAAACCCXK3CTGCGGGCTCGGAGCCGTTTCTTGCAGGGCTAGCGTCAACTCGGCAGGGAT 
GACTCTGAACCCCTTTTCTAGAATCACGTAGCGGCCACAGGCACTTCCTCCGGTCCCCCTCCCTCTTCCCAACC 

6 0 TTTTCGTTTCTTTGCACGGTTTGTTTCCrTGATAGAATGTAACATTCATAGTTGTACT 

GAGCAGTCGAGAGCATCT AGCTTGAAAT CAGCCITTGAAAGTCCTTTGACTGTTGGGT CTGAG ATCTTGTCGG AGCAGATAAAGGA 

AAAGTTATTTCGGGTCCTTTGTCGAAATTAATCATGCTACCGAG 

CAGAGGCAAGCCCCTTGTCAAAGCTGACAGAATCCTGGACTCGCCCAGCTCCCT^ 

CGCCACATTTCACC CCyuVGACTTTGGCTTTGCACAT CTG CCCAGGGCTTG CTTTTTTAAGAGGCACAG AAGACAGTAGAGGCATGCC 

6 5 TCTGGAAAGTACCTTAAACATAGAATCCCCTCCCTAGTCTGTA 

GACTAGGTGCGTGGAa^CTAGGGCTGTTAGAACTTTGCTCCT 

TATTGACTGCCCCCTGTGGGCGACTGAGGGTGATGCTTGTTTTTAGATTAAGTTTTGACCAG 
CACAATGCTGAAAGTGACACCrTCCAGATCTGGACAAGCCTAACAGGCGAAT 

TCCCGTTGG cnTTCTTGGTATCCCTG ATACTTCACAAAGGGATGTTTC1ATCCT CAAGTTCCTT AACAAAAGGG AAT 

7 0 TGTCACCCAAAGAGGCTGGTTTGCAGTTCCGGCCTCATAACTCTTTTTCCCTTCACAAC^ 

ACTGCACAGAAGTTATTTGCCGCCTCCTGTTTACTGTTGGAAA 
TATATTGCAGATCATGAATAAAGTTTCCTGGGATTTTTTTTTTCTT^ 
C^TGAAGCCTGAGCCATCAGTTTGAGGCTCAGAATAATATTTTAGATGATGAAAAT 
7TTTAGGGGGCCAGAAATATACATTTA£TGAGTTC!GT 
TTTTCATTATGATGTTGTCTTATATCATCTTGCTGGTTCGATAGGTTT
TAAGTGTAAAATTCTTCACATTTTAAGTCTTATAGAGTGATAACAAAA^ 
TGAGTTTGGAGTTCCTTCAGCTGCCATCAAAATTCACACrrGTTACTCA^ 
TTTTTTTTTTAACCTGTAGATTTCATCTGGGX3GAACCCCT 
5 TCAGAGACAGAGAAAACAAGAGTAAGTAGGAGTCTGAATAGCC^ 

AAACACAAGCACAGCTT AGGT AGAGTAGACACGGG CGGGCAG CGTTTG AATG ATTTG CAATTT ATGTTAATCACAAAACAGTTTGA 
GGGATGTCTCCTTTACCAGGTCTTGAAAATCTTCCTGAT^ 
ACTTTGGAATGTATTCTGTCrTGGTTTTTTTATTTTATCAGA 
GAAACTATTTAAATGATTTGCTATTTGCTATATG^CAGAGTTGGAGTC 
1 0 TTGTCCTT AACATTG ACTTTTAAACTC AAG CATCAATTGGTTGGG AGTTTG C CAGATGTG T AACTTGTCACTGATCT AACAGTAAC 
AGGTTCAAACATGCAGAGCAATAGAAAAATGCCAGGGCTCTTGGTGTCTCTG 
ACTACAGCACCAGAATGCAGAAAACCTCATTGCGCAACTGTGCAA 
TTCTGTCTGAGTTTCATAATCCAATAGTTTAGATCCTCTGCCTTTGGC^ 

AGCGATGAAGATGATGAAGACATTGAGATGTGTGACCATGACTACGATGGGCTGCTGCCCAAATCT 
1 5 AACTAGGTGGACAAGGGAAGAGGTAACTAGTCATTTTATCAAAACGTAGGGA 

CCAGTGTTTATTCTGCTTTCTGTTCCCAACCTGTTTATTGTTCC^ 
TTCTCATTCTGGAACCTCACCATTCCCTACACATTGCCTTTTCTGTGA 

CTCCCGTGGTCCCAACCCCCCTGGGC^CATGCTTGGCTTTATTGGTGTTGTTACATATTTGTCTTTCCC 

TCTCTGGCTAAAGTGATGCTTTGCTATTTTCTAAGCTAAAGTAGCCGTAG 
TTACACATGATAGTCACATGACCTATGTATGTGAAAGATTATTCTGAAGAGATCCAGTA^ 
TAGAGCAAAAATCATCAAAGTATGAAGAAAGGTTTCTTCCTTCTCTACTGAACTG^ 
CTGAGAACAGGTGGAAGCACTCCCAGGGAGGCACTTAGGGAATGCACGTTCTGTCTGTCTG 

2 5 CTTGGAGTGTGAACTTCTGTTGTTAACCATGGATTCC^ 

ACTGTGTTTACAAAGGTGTGCATGAGCTAGCATGTAG^ 
TTCTGTGTTCTCATCCATAAACTCTGAACATATTGATGCAGATCCTTAATGG 
TGTCAAATATCTTCTTCTTTGCATATGGAGAAGGTTGAACCAATGGGAAGCTGTC 
ACGGTGACTAAGTTTAACGTTGGAATTCATTTTCCCCCATTTAGTAAGAGTGTTAACAGTTACCAT 

3 0 CCCGGCTTCACTGCTTTACATAATGGTATTGTGTAA 

AGAAATCTGTTTATATAACTACTCAACAGCTGTGTCAAAGTAAAAGCTTTAGTGTG 
TATTACATTAATATTGGCTTTATTTGAGTGGTTTTGAAAGCTCC^ 
CTTAGACAGAGCTGTTCAGTAAACAAGTGATGCCTAGGATTGTCTTrACAC^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

3 5 OTCTTTCGAATGGAAAAATCTCATCTGGATTTAAGAAATTATTGGATAGTT^ 

AGAGGAAGAGAGCCATCCGTAATTTTTATATCTGACT^ 
TTTTTTTTTCAACAATTGCTTCTTGTTTTGTTTTAAAATAC 
ATTTTAAACTTGGTATTCCCTCCAAGTTAGCTTCTAAATTCTCCTATTCTCATCA 
ATCTAGAGCGTCTTTTGTCTTATAGCCTCCCAGGTCCATCTCAATGATGAGTC^ 

4 0 CCACTTCTTTCCTATGGTCTGGTCACAGGG 

TTCACCACGTGTATTCTTTGAGCTTACCACCAGTT 
CTCTGCTTGCCTAGGGTGTCTGAGTCCCTAGGTTTGGTCTCCTC 
GGTGACATAGGTTCTCACATGGCAAGTGGGCTACCTTGACCT 
TGCCCTCTCCCTCCTCTACCCGCTCTGTTTCCTGCT^ 

4 5 GGGCTTTTCTTCCTGCTGATTTCACCCCCACCCCCCACCCCCAGCATGTTCTCTOT 

TCCAAAGGCCGGATCATGAGCTCTCGAATGCCTCTGAAGGAAGCAGAACCTCTTCTTTAAGAGTCCTTTTCTC 
CACGTGGCTTAGGTCTCATCTTACAGATAAAGTCAGCCTTTGGTGTAATCAGT^ 
CTAAGAAAGTGAGAAAGTTTTATTGTTTTCATTCAGTCTAATACAAGT^ 
GCACTTCAGTTCCTTAGGATGGATTTTTATTGAGGCGTGGCC^ 
50 CAGACTGAGGTTAACGATAAGTCACGTTGGCATCTATGTGTGTACTGTGATGTTCCTTCTATGCACATGGC^ 
GGCTCCAATCTTAGATGCAACTTGGATACATTCTCTAGTCCG^ 

AAATGTT CTT AG ATGTTTT CACT ATGT AT CAT C CACATG TG AT C C CT ATTG AAC AAAG CCTT CATG TG TTCTTCACGC CTT ATCAT 

CCCAACGCTGTGTACTTTGTCTAGAACCGGACAGATGTGGAGTGCCAACACCGGTGGCAGAAAGTGCTGAACCCTGAA 

AGGTCCCTGGACCAAAGAAGAAGATCAGAGAGTAAGTTCTGTCCCCATAGATGTGTGACTTGCAGTTAAGAA 

5 5 GTTTCTCACCCCAAGGAAGCAATTTAAATCTTCLAAAC 

TTTCAAGATCTGAAGATCTTGTAACATTGAGGCTTGATGACGCTGACTCTTTCCG 

ATCTG AAGGTCATAGAGCTTGT CCAGAAATATGGTCCG AAG CGTTGGTCTGTTATTGCC^^GCACTTAAAAGGGAGAATTGGAAAG 

CAGTGTCGGGAGAGGTGGCACAACCATTTGAATCCAGAAGTTAAGAAAACC^ 

GGCACACAAGCGTCTGGGGAACAGATGGGCAGAGATCGCAAAGCTGCrrGCCCGGAC 

6 0 GGGAAGAACCAAGCGATGGTCGCTAAACATTCTCTGT^ 

CAGAAACACCAACTTTTAGGAGCGGGCAAGAGCTTGGAGTTTGTTCCGTTGGACAAGGCTAGTGACACCTGGG 
CCAAACCTTTCGGCTTTCCTCCACTGTTGCTCCTCCATTTGTGAAATAGCT 
GAATGTCAGGTATTCTGTAAGCTAAAGCACCTACCTGCCTAAGAAGGGAGTGCAGACAC^GAT 
GCCAGTTCAGCCATTACTGGTGA.CTCCTCCACTTTAGGAATTTCCGTTTT 

6 5 ACTGGGTGTTTTTTTGGTTTTGTTTTTCTGAGACTAAAAAAAGCCTC 

TAATCTCAGTGTTATGTCTTATTTTTACCAAACAAGGAGGGGAAATGACGCCTTTGTATAGGATAATT 

CCCACTGTCTACACATATATACATTGATTTTTTTTTTTTAGCT 

TTAAAAGTATTCTTATTCTTGAGCTTTTTGACCCCAAAGTGGGGAAAGGA 

TGCATCAATGGGGTAGCTCCCTCATGCCTTGAGAACTTTAAGAAGCCATGGATCTGGTCAAAGGAG 

7 0 TCAGAGGATATACTGTTGACTTTTCTAAAGACTAAAGGGGCAACT 

TGGAAATAGTAAAAATGTTTTTACrTAATTTrrCTTTCTTTTCCTTT^ 
AAAATGGGAAGAGTTTAGTAAAAGCACCTTGCAGTTTGAAACT 
CANNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
7 5 NnnnNnNnnnwwnNNNn^ 
OTJNNNNNIWNNNNNNKNNNNNNNNNNNNNNNNNNNNNNm
NNNNNNNNNNNN ^^ 

GGGGACTCTAAGTTCTATTAAAGTCCAGTGGCACGTGAATTCT^ 

TTTTGCAGGAAACTTGATACATCCCAAATCAGGTGAGCAAG^ 
5 ACAATTTAAAAGAGGGAAAGAAAAGTTAGTGGAACCACAGCAATC 

TTITTGTTTGl'ri^lTAAGAAACAG'n 1TCTGTTGATTGACTGTAATCT 

TCTACACAGGACTGATAATGCTATCAAGAACCACTGGAATTCCACCATGC^ 

CTTCCAAAGCCAGCCAGACGCCAGTGGCGATOAGCTTCCA 

CAGCTCTCTCCAAGTGGCCAGTCCTCCGTCAACAGCGAA 
1 0 AGAATGTAGAGGGAGAAGAGAGGTGGTTATATCACCTGGGGTCATAAAAATACTTTACAGTTGTGTC 

AAGCATGGTCACAGATGGAGATGTGTCGATGGCCCTCAGCTAGACCCACTGTG 

GTGTGAGGTAGGAGATGGCCCTCAGCTAGACCCACTGTGTGAGGTAGGAGATGGCCCTCAGCT 

ATGGCCCTCAGCTAGACCCACTGTGTGAGGTAGGAGATGGCCCTCAGCTAGACCCACTG 

GACC CACTG TGTGAGGT AGGAGAAG AT ACACTTGGCTT CCtHTTCT 
1 5 TAGTGCTGAATAGAACCAAAGTCX3TCTGATTCTAGTTTTCTGCTCCAA 

GTATATTGCTAAATCrTAAGAGTAGGTGAACATAGATAGATGCATAGTCTGTTA 

T ATTACATTTAAGGTCATAAAGAGGAG AT AG CT AACCTCTG AAACATGAAAGG AG ATT AT AAGTTCAATTCTGG CAATGATCTGAA 
GAAAATGAATTGATTTATATATCCAGGAAATGTTTACTTTCTTACTCAGAGTCAAGGGCT 

G CTCCTTCATC CT CX3CTTGGG TTGGT ATCATTTCTT CCT CATTTCT A^ CT AG CGAAAGAAATCG CTTACAGT ACAT 

2 0 GGGAACAAAGTCTAAAGAGGAGAATATC!AAGAAATGCTTATCATCC 
AAAGACCATATGCAAATGTTATGTAAAACATATGCATGTGTAGTTCT 
TTCCAGAAAGGTCTCCAGTGAATGAATTATCATAAAAGTTTCrc 
TAGTGTACTTTTTTCAOSC^TCCCACGAACACTTTCCC^TTGT^ 
CTCTTTGGGCCTCTCrCCCTGTTTCTTGAACTAGTTAGCCA^ 

2 5 TTGTTTTATTTTAATCATTCCATGAATTrTATCJ^CCAAGT^ 

TCCTGGCGCACTTGTTAAGTAGGAATGACTTAGTCTT^ 

CATTGCATGTT AAT AT AGTCAACGTC C CT CAGCCGG CTG CGGCAG C CATCC AGGTGAGAT CATCTGACGGG CAAACTCCTTAG CTG 
AG CAAGGCACTGCTTTT AAAG AAT ACTri ' CT^lTCT ACACACAC ACA CATTG C CAACCT ACT 

TTGGGCTAGCCTTTATTTACCCTTGGTGGCTTCTGGAACT CATCCATGAGCAGGTGCTCTAGCCTCTG CCTTACCCCT 

3 0 TGTTGCCTTAAGTCCCGTATCACTTTGAGCAGATAGTGGTGAATTC^ 

GGTACCTCTCCCTCAGACCTTGTCTCTCCCTGCTTCACTTGAAACTCT 
CGAATAAAGGAGCrTGGAGTTGCTCCTGATGTCAACAGAGAACGAGCT 
GGATGGGTGGCTTCGCrOVGTGCTCTTCTCT 
TAACAATGCTCAAACAACAGAAACTGCTACTGATGACTGATA 

3 5 CrGTTTTAGAATAGGACCACAGTACCCACTTCTGACTGTCAGGATCT 

TT CTTTTACTGTTATC CTTTT AACCTGG ATTT ACATTTCT^ CAAATTCCATTTG AAGTG T ATTT ATT ATAT ACATGTTT 
TTTAAAAG^KSATCTATACATAAGCCACTCTTTAAATGTTTGCTATCTT^ 
AAGGTGTTTGTGTAAACACATATATGTTCCCATAGATATGTGTATATATGCGTC 
AGGTTGACACCAAGTGTCTTTTTTGATAGGTTTCCAC^TTAAAAAATC 

4 0 TATGTTGAAATAAACTCTTTCCTCTCAAAGAAAAAAAAAA 

TCTTGGAGGTAAGAGGACACCGTTCCAGAATCAGTTCrTTCCITCTACCGTGGGCT 
GTCATCTTGCIWCCATCCAGCTTATATTTTGAAATAGGGTCT 
NCCAGTGAGCTCGAC^TGTACCTGTCTCTCCCTCTCCAGTGCTGGGGTC^ 
TCCAGGTTCTTACGATTGCATGGCAGTCTCCCITCAGAGTGAGACJ^ 

4 5 AGTCTCAGACATGCTAC^TTGCACAC^C^CAC^^ 

TTTCTGAATCCTTGAATGGGCAGCTCACGGAAAAAGCAAACTTTC 

GATACCCACTAGCGATTCTTCAGGTGAAATGTTCTTTGTTGTTGTTATTA 

TTTTGAACTTTAGTTACTATTTTGCAATTTCITCTCTTC 

AACCACATATACTCTGCAGCTCTAGCTTGC^TCAGTGCAACCTATAGCCAAAAAAAAAAAAAAAAAAAAAAA 

5 0 GAATAGAAGCAAGAGGATATGTACAGATAGAAGCAATTCTAACCCTTGAATGCT 

TTAAGTCATAGCAATGGGGAACrrGTTTTTAAATTAAACTCCAATGGAATTCT 
AAGGTTTAACATTTGACTAAAATCTTAGCAACTGAAGC^ATCCTCTAGGTAT 
AAACATCTTGACAACTGTCTGGCAGGCTTGAGTTTTGGATGAA^ 

5 5 TGCAGCTACCCCGGGTGGCACAGCACCTCCATTGTGGACCAGACCAGAC 

, ACACCATGCCACCCCATCTCTGCCTGCAGATCCCGGCTCCCTAC^ 
AGGGCACC^TTCTGGACAATGTTAAGAACCTCTTAGAATTTGCAGAAACACT 
GCACGTTGATGTTTCTGGAAGAGGAATTGAGAATTGGTATAGCATAAATG 
CATTCATGTAAAGTGAAAGACTTAGCTTTTACX^TTCCACTTCrCTCAGTGGTCTTGATTTTC 

6 0 TGCCTTCCAGGTGTGAATTAATGAAGTTCTGACTTACCACAAAGGGCT 

CTTATATATTTTGCTTACTTCrrTTGTGATTCTCTAGAAAATTTCT^ 
GTTTTGGGACAGCACATTTGGACTTACGTTTCCTCGTAACC^ 

ATACTGTAAATTGATTGACAATCATAGTAACACCAAAATAAACCATCTTGAACCATCTTG 

6 5 ATCIATCTAGAAATGTTTTCrGTTTGCTTTAAATTGC^ 

ATGCACCGCTGTAATCCCTGAGCTAGAAAGCAGGCAGGCAGGCAGGCAGGCCCAGCGTT 
TTTGGGTGATGGTGGCGCTGTCACITGCATACC^TCT 
CTCAGCAGTTTTGAATTCTCTGAAGAAGCGGCAGCTTTTTCACCTA^ 
AGAGGGCCATGGGACTAGATCTGCAGGAGAGCCTAGCCTCAGGGTGACCAGGCGAGTG 

7 0 CACCCCAAGCGAGGCACAGCAAGGTTCCGCTGGTC^ 

AGCCCCTCCCTCTTTGCTGACGTCATCAGCTCAACTCTCAAGCGTTCCCCTGTCAAAAGCCTACCCTT 
GCACAATTTCTGCAC^AGCTGTTACTAATTTTCAGCA 
TAATGACTGCCGTCCTGGCAGATTTAATTCCGCGTTCAATCTCACT 
ACTGGTGATTTTGTCGCATCTCTTTCTCTCGAAGTCTTG^ 
75 CCC7TTTCACTACATTTCCCTTTTTCCGTCCCTCCCC7\^^ 
CTCTTTGTTCATTGACTACTAAGTGATTC^
AGAAAATGAACCACTATGCCCCTrTTCTAAGTCAGG^ 
TTITCAACACTGGGTCTTTTAAATTCTGCTATGGGATTTTGCTAAGCT 
CTTAAGGTATATAACACTATAATGCTTAGCAGAGCCACGTGCTTTCCAAATTATAAT^ 
5 TTCCCAATTATATCCGTTGAAGATTTGGTACTTTGTTTTACTTT 

TAGTAGTAGTAGTAGTATTGTGGTTCTTAGAATCrAACCTGGAGCCT 
TATTTCTGGACrTGTTTTTAAAW 
CACCCCCATATTTTCCTAAACCGTGTTTTC 
CGTTTGGGAAATGCTAACTCGGGCTTTAAACGGCTTCTTTACC^ 
1 0 TTAATTTTTCAGAATGCAGACAGTCTTTTATTCTGCTCAG 
ACTGACTGCAGGCAGCAGCCACTGCTCCTGGCTTCCCTGA^ 
GCCrTTCCCAAACCTGTGTTACCCTGGGAGCGTCAGGATGCTGAGAGAC^ 

CCITACACAACAACAGTCTGAGTTATT AGACCATTAAAC CGAGAAAAC CAAAT ATG AAAG AAAAAG AAGAT AACCTTTTT AATGTT 
GATCAAGAGCACATATC A ' rrcr C" l l AAGTGACAAATCTCATTTACAGAGAAGAriTTCTTIXSAA^^^ 
15 " ACCCTrTAGATGTTCTTTCTTATAAATCTCTG 

GTTAACGTTGTCGTAGTTGGGGTTGTG AGTCAGT AGAGG ACACTTGCCT CATCTGGGTTTGGCTTCAAACTG CATGATCACAGA 

CACTTCCAGCAACCATGAAAGCrCGGGCTTAGATGCACCT 
2 0 GAGACCAGACTGTGAAAACCCAG AAGGAAAATTCCATGTAAGTCTTAGTTCAATAAGAAAAATG 

ATTAAAATGTCTGACCCCCCCTTGTATGTATATAATGCATTCACTGTATGTATCCTATGCATTCTC^ 

AAGTT ACAAC CAAC CAGGGAAGAAAGT AGAG AAT AATT AC CTTCTGT AGT ACATTCTG ATGTCTT ATGTG CATG CACAGGGAGTTT 

CTGTACCATTCCCCZACCCCCCACCGGCCAGGTTTTTCTTTCCACrc 

AGCATTCAGAATAAAATTGAGAATGGTTCATAGTTTATATTC 

2 5 TTTGCATACTCACAGGAATGAAAATGTATGTATGTATGCATGTATGT^ 

ATTGAAAAGAAATACATTCTTATTACTTCCAAATAAAAACAAAGTTTTAGAC^ 

GAGACTGGTTTACAAATGATCCCAGATATCACGGTGCTAGTCACCATTATACAGATCCAAGGCCA 

TCTCAAAAGGTGGGAACTACTTCAGAGCTCTTAGTCT 

3 0 GGCGTCATACGGGATACATGTCTTTTTATTCrcrrcCTTCT 

GAAAGCTCTCCTCGAACTCCCACACCATTCAAACATGCCCT^ 
TTCAAGAAAGGTCCITAGATTTAGTGGGCITAGGAGTGTAGACM 
AGACCCCCTCCCATGCAGTCGAGGACCTACAAGATGTGATTA^ 
AGTGGACCACCGTTACTGAAAAAAATCAAGCAGGAGGTAGGTCGCCCTCGCTG 

3 5 TAAGGAACAGTAACACTTGTCATGCAAGACTTTTGTGCAAGCAAGTTGAAAGGCAC 

GGAAAGTGACTTCCATCAACTTCTTTAGGCAGAAAAGATGCTAGAGAGGGG 

TTT CAAAATTTATTG ACAACAT AGT ATT AAT CG CACATATGCAAAGAG AAAG ACTCC CAACCCAAAT CATGAC CACATT AATT AG A 

GCAAGCTTTTTTGTGTGCACAGGCTACTTCTCTGTAAGTCAAG 

AGGAGTAGAGGGTTTCCAAATGGGGGTTTTGGTAGGCAAAAAGCC^ 

4 0 AAC^CCTCGTAGAACAAAAGCATAGTTAACAGGTGGTCATGACAAGGTGATCAT^ 

TGTCCTGGAACAGGCAGTACAGAGCCAriTGTAATTGAGGTTTCAGA 
ATAAACAGX3AATGGATCTAATTGGTCTTTACTTTAAGATGGCTTTCAAG 

TAGACAAGCAGTCAACAGAGAAACAGATTCCAGATTCATTCCTGTTGTCTTTCTAAGGACTAAGAAATA^ 
CCATGCCCAGAGAATGTAGATTCAAGTTACCCrGAGTGG^ 

4 5 TTTAGCCATCAAGAATCAGTGTTAACTCACTAGGGAGAG 

CAAGAATAGGTAGCAATTTACACCTGAGAGTAAAGGTCTCCAGAAAGTCTT^ 
TGGAGTCGCCAACTGAGAAATCGGGAAACTTCTTCTGCTCAAACCA^ 

GCGTCTCCTGTGGCAGATGCCCCAGTAAGTATCAGGCGGATCTCGGACTCTTCTGTGATCAC^ 
CATGTGGGACCCCATGCCTTCAGCACTCACTCTTAGAGCACCTGCTTCCTAAGTATTCACAGCCA 

5 0 ATTATTTGTCTTCTAAGTGCTGTGTTTCCTCCTAGTTAAGCAGTATGTTGTGTGGTTCCATGTGGTATCTAG 

TGGTGTGTACCAAATATTrGGCATGCTAGGAGCCTAGTCAAGGGGTCCTTATGGAGCCACAGGAGAATGGC 
TCACGTTCCACAATTATAATGGCATGGTATTTGCATATAACTCATTGGOT 

AACTCTCACATTGTATAAATGTTGTGTTAACAGCACTTCACCGTCTTAGTGTTTTGACCGTGAATCC^ 
TGTCACCTTGCTCTGGTTGTCACAGTCTAACAGTCTGTGGATAGCGTCTCCTACATCTTATGGAA 

5 5 TGAATACTTACTCAGTGTGTTAACTGTTAATTATCATTTC^ 

TAACATTTTC^CTTTCATTAAAGCAACTGCAAG 

CTGCATTAGAACTTTGAAAGTCIAGCTCATAAATACCACGCTTTCACCATTTGTAGTTGATGTTGCCGT 
CACTTCTGTGCCGATATATACAACATAAGATGACTCATTAGCACATCTCT^ 

ATAGCTAGGACATGCGACATGCTTAGAATTAAGCGGTGCCCTACAGTGTTTTCACCATTTTGGTGATGTTT 

6 0 GAGGCCTGTGTAGAGTTGGCTCCTTTCAGCTCCCAGAGAAGGTTAATTTCTGGTG^ 

ATCTCTCTAGAGTTAGGTCTGTATTCGCTCTTGTATTATCCTGCCTCACTTAAATGATTCC^ 
CT CTGTTTTAATGACACCTGTATCAGAAG ATG AAGACAATGTCCT CAAAGCCTTTACC 
CCTTGCAGGTAATGACTGTTCTAACATTTGGTGTCITAA 

ATACATAAAGGAATGATACTTTAAGACAATGTCTTGATGTCTAGAAGTGAGCCT 

6 5 GGAAATTCTTGTTC^TCGGGGGCCCCCAAACATGATGAAGTGAAACATTTTTTT 

CCTAGGCTGTGACAGTATCTCCTTCTGCTTrTAAACAATCTTTAAAATATGTATCT 

ATG CTTT ATCAIT AAGCAACCAAGG CCT CAG AGACT AAATTTCT AAT AAAATGGG AAA CTTT 

GACTCTCAGAACAAAAGGGCAGCCCTCAGTGTTGGTGGACCATCTGCCGTCAAGTTT^ 

CTGAGG TAG ATGTG TATTGATGAAATAGAACAAATTCCAAATT AAT CCTG AG CCAAGTGTAAATA^ 

7 0 AAATTAGTTGAAGCAGTTCCCAACATTAAATAGCGTAGCACCTGGCCAGCTGCACTTCTCTTCCCTCTGTTC 

AGGGGGGAAACTCTACCAAGCAGAGAATAGGCTGGCTAGAAGTAAAAAGTACTATTTGATGCT 

GAT AGTAGCCTTTGAGAGCTGGCCTCTCTGTTTAT AGGGC CGTCTTG CCACTG ATATTCTGTCTACCCCAAGTTGTGAGAATCTTA 
AAGTCTAACCTTGTATTATTTGCTATCATTTAAAATGTCTTTGTTAATTC^ 
ATCTCCAAGTCCTTTCCTTAACTCTTCGGACACCCCTGGTCTCCTAC^ 
7 5 TGACCTAATTTGTGCTACCCATAATACTCGAAGCAACTCACTGTAGTAAGCTCAGCCT 
CTGACTCACCCTCCCCCAGAAGCCACCAACTGTCC^TAC^
TGATTGGTCTTGTGTAGGCTCTGTGAGTTTGTGAATGCCGTGGTCCTGTC^ 
AGTCTCTGACCCTTAGAACCTTCTATCCCTCTTCTAAGTGTTGGAGAGAGCTG 
TCAGTCACGTGTTCTCTATGTTTAATAACCTATTGACTAC^ 
5 TATCTGATGAGGTCTAAGAGCTTCACTAATATACAGGTGAAGAGATAGGA^ 

AAGTAACAGTAGTAG ATTT ACAT ATGGGGC CTATGAACTGCCT AGC CATGGGTTCTTGGC CAAATTTG CAGGCAT ATG CT AACTCC 
TATGAAGCAAGTCTTAAGTTTAAGAAGAAAACGGTTGGTTGCCTCTCTGACGTTTGTG 
TCACGGGGGTCATTATTGAGGTTCACAGTGTTCACAACTGGGTAAAACTGATTGG 
TGTCTTCTGGTATTGTGAAAACTAGCAAGGATGGAGGACT 
10 TTTATTGCTGAATCTTTGTATTTGTGTGTTTAATTTGGATGCT 

GGGAAGAAAAGATCTTAAGGAGGGGAATGGTTAAACAAAGAGGAGAAAATGGGGAATAA 

GAGAGG AT AGACT AG AGGATGGGTT AGGAG ACAGACAACT AACACTGAG AAT AT^ CACTTGG AATC CCACT ATTTTA 

TAAACTGTATAATAATGTATCCITTAATTCATCCTACAAAGGGGAGTATGCTTATTT^ 

TAGGTTCAGGGTACCTCTCCACTAGTTGTTGGTCA^ 
1 5 T ATT AAGACT CTGTTG CIX1AAGACACCACATGCTTCAGT CACAAG ACATGG AG AAGT CAACTTGGT ATTTG CTGTCACCTTTT AGG 

ACAACTGG ATG AACTGTAGG CATGGTTATTT CTT AACT CAGTTGG ATTG GGTC CCTGT AG CTTTCCTCAGGT ACATG AACTG CTCT 

GATGAAAGGTGGGTGGGCAGGAGTACGGAAGAGGAAGACCAATACACCAAGTCAGCTCT^ 

TGTAGAGAGAACTCAGAATACCTCAAAGATATGGAAGGTAAAGAAGAAAGCACACTTCACCCAGGTCT 

GATTCAAAGGATGTCAGAGCAAAGGTAGCTATTTAATACAGTACTGATCCTTGGTGG^ 
2 0 CTTTGGTTTCCAAAGCATAGAATTTGACTAATGCTGTTTGTATGTTA^ 

ATATTCAGCCTATGTTTATTGAAGTATAAAGTGTAGCCTGACATGAGGTTCCATGT 

AGGCAGATTT CTGGGTTTGAGG C CAG CCTG GTG T ACAT AGAGTT CT ATATACCAGCACAGG T ACAT AGTGAGAACATGT CTTT AAA 

AATAAGACGAGAAAGGAAGGAAGAGAGGTAAAGAAGAAAGAAAGAACATTCAG 

GCAGTAATCATTTGATATATTGAAAACTCATGACT 

2 5 TAGGCCTAACTATTCAGAGCTGAACATGGGCTAGCACrrATTTTGCTATATTCT^ 

CCCCACAGTGGTACCTGAATTGTGGCTGATATCAAACATTAC^ 

ACTAGGCAGT AATGAGAT ACAG AAGT ATGAT AAT AAACTGTAAT ATAAAGCATCCT AATCCT CTGCATCTT 

CTGACTGCATTTGCCCCAGGTAACTGCACTACAAAA 

CTTTG'riTGTTGAAGCATAATGCCATTTCTAGTGTTTCCACCAATACTT 

3 0 ATTGGTGAGCATACTGTTCATTTATTAAATAGTCTAACTTC 

CAAATTAAGGAAGGAAGGGCTTGTGTTGGCTTGTACGTCGTTGGAGGOT 

CGGTCACATCTCCCCCAGGCACKIAAGCACAGAGCAATCGACACTACTG 

CAGTGCAGAGAACAGCACTGCGGTGTTTAGAGTATGTAGTCCCAGTTCAATTAACTAGT 

AAGGCTTGCCCCTGGAAGTGACTCTATTTTGrrCAAGTTGCCGGTCACT 
3 5 CT AAAACTTTT CT C ATTTGT ACTTTT AAG ATAAATT AAATTTT CAATGGATCTTTAGC CAAAC CATTTGG ATATTCTTGACACTGC 

ATAGTTTTTCGTTATTTTGGTTTTATTTTCTTGAAATGAGCTCTTACT 

TCCTTCTGCCTTGGCCTTCTGAGCTTTGGGATTGCAAAT^ 

CCCCAAATATTTTACAAGTTGACTGTTTTATAA^ 

ACCAGTTTTAATANNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNIWNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN>WNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 
45 NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNN^mNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNIWJNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNWJNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

CATTTCTACTGCCAGATTCAAACCCATGCTCTTGCTCTTAGCCGCCrGAGCTTCATTT^ 
CTGTTTGAAACATGAGTGCAAGCGCTGTGTTTATATGCGTTCTTTACTCATGCACTCTCCT 

5 5 GACA.GCCTTAAATTTTAATTTCCTGACITGGTTTGTCAAAGCCTTACTCTT 

GTTTGAAAACATTTCTTCAGTCTTTAATACTTCAGCCCATCCTCAGAAGTTCCAGAATTT^ 
CTCl'CTTCCCTCCTCTGTTCTGTGTAATTCAGCCGGTAATTGTCT 
ACCCTTTCCTTGTGACTGATGTTGCTCCGAAGTGACATGAGTTCTCACATGG^ 
TCATCCTATTTGGAGGACTTGATTAAATGGAGCAAATGAAATGAATTAAGTTACTCTTC 

TATTTATTATCAGAGAAAAACTGCATGCAGCCTCTGTCCTGTCAAAGAAATGCAG 

TGAAGCCCTGGGTTTAATCCAGCACCAGGTGTAGTAATGTAAATCTATAATCCCAGTCCTCTGGAAGATC 

CT AC AT AGT AACTTT AAACAT AGT CTTGG TT AT ATAATG AAAAAAT CT AAAT AT AG CTTAAAAAT CTT AAT AC AGTAGTT AAAGAG 
CAAGTAAGTAGGTCTCCCTATTGCCAGGACTTCTTGTATTATC 

6 5 CAGTGGGTCACTAAAAGAAATAGCTGAGC!AGAT(IAGGTAGTCAGGGCTCCTA 

GGGTTTCTTTTCTTTCCTGCTGTTGTGTGGGTAGCAGGA 
TACTGTGCTGCCGGGTTTAACTCCAGGGTGTCTTCGCAATTGTTGCTATTCT 

AG AAGTGTTT CAGTG AC CAGCATTCAATT AG TGAT AT AGATT AAGT AAGCTTTCTGTGTG T AT CT CTGGG ATGGAATTTCTC TGAT 
GAGCTGAGTCGGTCTGCCCTGCCCAAGTTGTTCCAAGACAGTAACTCTTCATAC^ 

7 0 AGATAGAAGrTGCCTTGGGTAGGTCAGAAAAGGTAAATTGCTTCAG 

AGGGG ATGT C CGAGG AAGG AG CTGTT CTTC CT AAAACTTGGGT AC C CCAAGCT ATAG AATG TAAGTTAT TG AGGTGT CT AT AAACA 
TGTTAGAATGAACACTTCT AGAGACATCTGG AT AAAAGCCCATATCTG CGAAAGATC CCTGTGCTCTACCCATTGTTAATTCAGCC 
TGGACACAGAGGCACTAATGGGTAGGGGGCATCCTGCAGTAATG^ 

CT AATGAGT AGTGGGG CATCCT AT ACT AATGGG TAGGGGGG CAT C CTGT ACT AATGGGTAAGGGG CAT CCTG CACTAGTGGGT AGG 
7 5 (KiGCATCCTGCACTAATGGGTAGGGGGCATCCrGCACTAATGGGTAGA 
TTAGGAACCCTGAAGTTGTAAGTATTTCATTGT
AGCAACCTCTTGATACAATGAAGATTTTCTGTGGTATCCTT^ 
ACTTTCAGAATCT CATT AAGGATGAGATG AGT AGTATTT^ 
CTTAGACACTATCAATCCATATATATTAGTTTTTTTAAAAGTAATATAAATAT 
5 AATATTAAATAAATAATAAAATATATATTAGATTCATCTCGCTGTCTTATTACATCTTCACAT^ 
CCTTGACCTCAGCAGATACTGTGTGGCTTTATGAATGTAATAAT^ 
TAATCTAGTTCTGTCAGAAAACCACCTGTGGGCGTGGTCTCGGCT 
TTTTTCAAAGTGCTTTAACfcGGATATCCCAGTTAA^ 

G CATT ACAGGAT AAG CAAGGTG ACTCTC CTGGC CGCCACAGGAGG ATG T AGGG T AGG ATTCT ATT AAGATTT AAATGAATT CAGAC 
1 0 AGGCTGACATCTACAGAACACTGGGTTTAGAAGCAGCAAGA^ 

ACAGAAATGCAGGAGTTGAAAACAGAAACACACAGAAATTGAGGTGACAGCATTCA 

CCTGGTTTGTCACAGCTATCATGTTCTTGAGAAGTTGCTGTGGGGTCCCCAGTTC 

ACAGTCTrTTTTTCTCTTCTCTT'CrTCT 

TCCCTCCCTCTTTCTGTGTATGTGTTTGGGAGAAGCACACAGATGGTATG 
1 5 GAGTCTGTCCATCCCACCCTACCATCTCTTTTGAGACACTCCCTGAAACTCATCAAATACCTAAGTAGGCTAGGCT^ 
TGAGCCCTAGGGATCCACCTGCCCCTGCCTTCTTAGTGCTAGGATTCCAAGTGCACACTACCGTGCCAATCC 
AAAACACAGTTTCTGGAAATCCAGCCCAGGTTCTTCCTCACTGAAGAGT 
CCATAGTGCCCGCCTACCTGACCTTGCTCTCCATTCTCGCCATTGGTATGTa 

CT ACTGTGTGCC CT CCT AGCTCCCAAAG CCG CTT AG CT CATGCG AG CTGCCTTTCTG CC CACAG TG ATTC CACACACGCT AGCTT A 

TCCGGCTTAGCACACGTGCTGAGGAGTCTGGGTATTITCATTATTGTGAATT^ 
GATTATTTGAAAAGTCCCCACAAAGTACrTCATGAGAACTATATAGTACATTCTTTATATATTGA 

TGGAGG AAAGACAAAAT AT CACTCCAGGGATG ATCAAC CTGGAGATTT ATCAAAGT CACAATTCTTTTCAAATTX3 AT AAGTTC 
TG ATT CTCTG AAC AGGCATGGACAGTT AAG AGT CATCT AACAT AC CAT ACAG CAAAGGACGGCTCAGTGTGTGGCCATCAGCAGGA 

2 5 GCAAACATAACTTATATTGTGGTGGAAGCTTAAGAAAC^ 

AAAA rrirrrriTlTTA GTAGGAAAAGCTCTTAGAAAAAAATAGAA^ 

T ATAT AT ATACATAT ATGTGT AT AT AT ATGGTG T AATTTT AAAT CT ACATG CACAGT AT CATT AGG TT AAGTT AT ACAGT AGC CT C 
CTCACTTAGTCAGTGriTCTAAGCACTACATATGCTTCTCTGTAGAGGATACACC 

3 0 TTGX5TTTGGTTTTTTTGAGACAGGGTTTCTCrc 

TGCTCATTATTATATGTCATATTTTGCTGCAAGGATTAC^ 

TGTCTGTTTGATTTCTGTGCTGCACTGGAAAACCTTCGCTCCTGCAAGTTAGCAGG 
ACTGAAACCCTGCGCAGAGCCATGTGTGTTCGGTCTGTGCATTTGGATCCAGGGAAATA 

3 5 CT AATCTGCAACCGTTTAATACTACTACTGTTTTGAATTGTAAACT 

ACCCTCCCCCACACCCTCAAAGGAGTTGCAATCC^CAA.GTTGAGAACAGCT 
CITATTGTTAGGAAAAATATCAAATGTGACTTTGTGTTCAGAAGTCAACCTGGGATGG 

ACAT G CTG TCCCTTGGAATGCT GT AGG ACGATTCAGTGG CT ATGG AGT ATGT CAAG CTG CACCTGATTGTTT CG AGCACACTTGTG 
CACTTGGCTGTGACTGAGAGGCAGCCAGGCAAGCTAAGAGGCTGTGACTCA^ 

GCCATAGTTACTCACCAACAAGCATGCCTCTTCTCACTC(^ 

GTGAT AAACCCCTGAATTT CCT ATG CACATC CCTG CT ACGTTCCTCAT AAG AGG T CTTAG AG GT ATGGG CTGCTG AATC CCAC CT C 
ATTCAGATTACTGTCCCCTCGTTCCTTAGTGATCTAGTCTTGAGTGTGTTGCT 

GTGACCTATGTGGGGTGCTACAGAGGTGGCCACACCATGGGTGTTCTGTGTGAGAGTTTAAGTGTGACAG 

4 5 TGTAAGGGCCGTTCCATGATGCCTTGCAACCACTACATCCTTTGTTTCTCCATAGTC^ 

TCCCCACCCTCACTATCCTTTCCCTCCGGCCGTTCTTAAACAGCGTTCTCAGCGTCCTGTGTCCTGTTCT 
CAAACCGCCArcCGCCGGGGTTTTTTTTTTGCAGGGTATAAAA 

AGAGCAAAAG CTTCAATGTTGGAGACAAAGCTATGTAGCCCATC CACTAAGTCTCAACATTCCCTTCCTAGTG 
TTCTTGCTTTTTGTTCTTTAAAAAATAATTTATTT^^ 

5 0 GTGTCAGATCTCCTGGAACTGGAGTTAC^GACAGTTTTGAGTTGCCATGTGGGTGCTGGGAA 

CAGCCAATCAGCGCTCITAAGCACTGAGCCATCTCTCCAGTCCTACTTCT 
GCTGTAAACCATCCTGAGAATTAAGTGGGACATAAACTGTACATATTCCAGTTTGAGTA 
GATACAGTGGGCAGAACACCGTTCAAGCTATTGTGTGTAGACCATAGCCT 
TTTACCTGTCCAGCTACGGTGTGTCrrGAACCTTTCCTCGTGGTGTGTCTGCACT 

5 5 ACTGTCCTC^TATCTGTATGTTGCTCTTCATTACTTTTATCTTGTGATTTCCAATGT 

TGGAGACATAGCTCAGTCCGCAGACTTCTTCTCTGCCATACATTAAACCCTGACCCA^ 
GAGTTGAGGCAGGGGAGTCGAAGTCAGCTCAACTACACAGTGAGTTCAAGACCAGTCTGAGATACATAA 
GAGGAAAAGGTTAAGCTAATCTAGGAGTTTGGGTTGTAGTTTAATGGTAGAGCATTTGCCT 
AACCITCAACGCCGCAAAAAGAAGCTGTGCTGATAGACATACTAATAAAG^ 

6 0 TGTGTTTTAGATATTTGTATGAATTGTGTGGGTTTTATCAGGTAGTGGTACATTAC^ 

TCAAGTCCATGGAGTGAGTAAAGGCCTAGGATCTATCTGCCTCAGGTATCTGTAGAGTGATGCCCAGATTCTC 
AAAACTGCCTGCGAGGCAG AAGGCATT CCAAG AACAAGATGTTACATCTTACG AAAGTTCTGAG AGACAAAGG CTTGCTAGCATGG 
ACTGCTTCCCGCCTGGCTGCCCTGCrAGGGTGGGGAAGCCTGGGCCTCCrGGCCTGACCCTAATGTCrCCT 

GCCIATGCAGTGGTG CCTGGGAGCCAGC ATCCTGTGGGAAG ACAGAGG ACCAGATGACGG CCTCCGGTCCGGCTCGGAAATACGTG A 

6 5 ACGCGTTCTCAGCTCGAACTCn"GGTCATGTGAGACATTTCC!AG^ 

CACATGGCTCCTCAGCGTGGAGCGCTCCATGGCTGAGAGAAGAGCCTGATTTTGTTGT 
GCATTTTTAGTTGCTTGAGATCTCACTTGATTTCACAGAACT 

TTACCTAAATTATTAGGTAATGAATTGTGACCATTTGTTAATATCATAATCAGATTTTTTAAAAAAA^ 
TTTTAGAGGATACAACAGATCAGTATTTTTGACTGTGGTGAATTTAAAAAA 

7 0 TATCTCAGTCACTAAACATACACAGAGAGATTTTTAAAAACCAC^AGAAGCATTATTTTGAATC 

TTAATGCIAACCCTCTAGGAGCTCATTTGTGGCTAATAATCTTGGAAATATCT^ 

GTTGT CAG CTTG CTTGAAAAGTTATT ACTGT ATG AAAT AG TTTT ATTGAAAAAATT AT ATTTTT ATTCAGTAATTT A^ TAA 

ATGCCAAATGGAGAAATGTGTTCX3CTGCTATGGTTTTAGCCTGTAGTCATG CTGCTAGCTAGTGTCAGGGGGCAATAGAG CTT AG A ( 
TGGAAAAAAGAGAAAGAGACTCGGTGTTAGATAACGGACTATGCACTAGTAT^ 

75 •i~reCTTTTGTAATTGGAAAACTTATTT 
TTCTCGGTTTATAAAAGCATTGCACTTCTTTTTC^
CCTGAATGTTCATCCGTTTGGGCGTTTTTCTGACTTGGAAGAACAT^ 
GTGGTTCCTGGGCTTCAGGGAAGACAAATATGGAAGTCAACAGCCAGTTTCTGC 
ATTCTGAAATGACAGTGTATCTACTGCCTTGTAGCAAAATA^^ 
5 GTGATTTGGGGTGGTGGTT ACAT ATTCGAGG AG ATG AGACAAACT CAAAAT AAT CTT CACTTT AAAAAAAAAATGTGCTTGC CATC 
T AACTTTTT ACATGG AGAGTCAT CG CCACTT CAGGG AACGAAAAG ACAAACCAAGGT AAGTCCACAG AAC CGTCACAACGAC CGTT 
GG AGGGCTcnTC CAGCCTGTGTCTlAACAAGGG CACTGAAG CTCTTT AATTG AAACAATG CACTCT AGAAGT AAATGCTGGT AT CTG 
AGTTAGGGAGATGGCTCAGTCTGGAATGTTCTTCTGT^ 
CACTCGGGAGGCAGAGGCAGGTGGATTTCTGAGT^ 
1 0 GAGAAACCCTGTCTCGAAAAAAC!AAACAACAAAAAAGAAT AT AAA CTGAG TCCCCATCTCCAG AAG C CCCACAAAG CCAGAC AAG 

GCATCCJVTGTCTATAGTCCCACCAGACAAGGCGTCCACGTC^^ 
AAGATTCTCXSGATGCTGGCATGCACCCAGCAAACCCACAC^ 
AGG43GAGAACItiATAGCCAAGGTTGTTCTCCA^ 

AGAATTTAATCGCATATAGGTAGGCTGTGAGAACTGAAAGGTCATGCT 
1 5 TGGTGATGCAAAGGCTTTCTCTTTCTGCTTCCC^GCCA 
TGAATCTGTCTrTCCACAAGCrGCTTCTATATTTTAT 
GTGTTCAGTATCCCTTGCACAAATACTGATTGATTCTC^ 

GTTGGTGTTCATCAAGCCAAGGATATGATAGATCTAATGACTAATCTTCAGAGCT 
CATTTTCTGAGTGTTGAGTAGTATAACTTGATGAGTAGCITTTGTCAGAAAATGGAG 
2 0 TCAAAGTATTAGAGATGATGAATGTGACTAGCAACGAGGAGCCTAAGTGTC 

TCCCTCTTATAGAGGCTTCGACCCTCTATAAACrCITACCTATATGTAAATCA 

TT ACTACC CAGATTTTT CT AG TGACTTT CAGGAAGG CAGTGGT CAT AGGTTT AT AACAC^ 

TTCTGGGTGCTAGAAAGTCCAAGTTCAGTAGGTGGATTTAAAGTC^ 

GTTGTGTTTTCTGTTTTCACATGACAGAAGGCCCTGGTT^ 

2 5 GGTGGGTGGGTGGTTAATTGGGTGGGTGGGTGAATGGTTGTTTGGTTGGTTG 

TGGTTGGGTGGGCGGATGGTGGTTATTTGGTTGXSGTGGGTGGTTGG^ 

TTGGTTTGGTTATTTGAAAC^GGGTC^CCTGTAACCCAGGCTGGCCTTC 

AACTTCTTACTTCCAATTTCCAGGCGCTGGGATTATAGGCATGGGCCAGTATACCCAACT 

TTTGTTT C^TGTTTT AGTG AC CCT ACT CTCTGTG ACACATG TACAAATG CATGG ATGTACT AT ATTT ACTAC CT AAATGAGAAGCT 

3 0 ATTCCAATGTCACTATAAAGTGGCATGTGAGGAACAGGAGAGACACTGAGAGGACCA^ 

GTAGATGCGAGTGTTGTCANNNNNNNNNNNNNNNNNNNNTT 
CTGATCTGAAGTCTGACTVAAATTTC^TTGTAATTCAGACTT^ 

GTTCATTTTACTTATATTGGATATCAAATATAGGTATTATGAAGATGTTGTGAAACAGG 
TrCrGTGGCACCTGTCCGTAAGATGCTTTGCTAGTTCIATCT 

3 5 GATTTCTGAGGCTCATGTCCCTGCTTGCAAGTTACATGTACTCAA^ 

AGATAAATTCTTCTTATGTAACATATATGCAAATGACAGCACAGCTTCCCTTC^ 
CACACTCGTTCTCATCCATTGCCGTTCTCATCCATTGC^ 
TCTACTGTCAAGGAGGGAGGTGACCTGGTTCCCAGCAGTGAGAAGTGAGAG^ 
TGGACTITGTCCTITGCGATATGGGK3GGTTATTAAAGGCTITTAGG 
40 ATCCTGTTCAAACTCTTTATTCACATATAGTATAAAATTAATCCTCAGATTAGTAGGC 
CAGTTAACTTATTTTTCATAGCATGTTTCCTGCTGCCTAATTTC^ 
AAAAAAAAACCAACTCAACTITGATCTGTGCTTCATGAAACATGRTTTO 
AGAAACGATTGCCCAGAACACTTTTCTCCTTTAGAAGTGGGACTTTTCCAACCA 

CACTGAAT CATT ATGTGGTCT AAT CCTTGGTTCTTT ATGG AT ATTTT G ATGG ATTTTTGT ATTCACTG ATT AAGG AAT CTTT CCAG 

4 5 CATTTGTTCCTGAAGAAATTTCAATGTAGTGTTATATTGTAGTCTACCCTTCACACT 

TTTATTGGACAC^AGGCTAAGAACACTGGTAGATTGAAGTTAAGTCAGGAATC 
CAGCCACCAAGTGACTCCAACACAGGAGGACTGAGGGAATTGAACT 
TTCTATCTGACCATTCCGCAAGCCTTACCCGTTTCTATTGAAAATGTCCAT^ 
AGCTACAGGGTTTTCTTAACTCTAAGGATTTAATTTGTTACTAGTGTTGGTAATTAGGT 

ATGAATTCAACATTAAAGCAAAGCAAAGCTGTTGTTCAGAGAGAT^ 
TTTGTTTGCCTAAACCTCTGAGCITTAGGTTTCCGTTCAAATG^^ 
TTTCTCAGATCAGAGATAAACAAAACTCAAAGAATGCITCIAGTAGT 
TAGGTCAAGAACACCCTCGAAGATGAACCTACTATGCTTGGA^ 

5 5 CTCTTT AAAAACATAAGTGTTTTCCCACTTTCTAATAGTACTAAATATATCTGACCATGGATGGAG 

GTTAGAACCTACCACTCATTTTTTTCCAAAAATTCTTTTTTTATT 

AG CTCACAAACAT AAT AAG T ACACACAAGT AT AG TGTGT ACAACATTCAC AATT AT AGG CAG G ATCATTTT CTTGGATACCCTCG A 
ACAT ATGAAGGAAAAAAATAGGCTTATACTTTGGTAACTATGTTT CTTT AAGTCATACCT 
GAGGTTAGATGGACCAGAGACCTACAGTAAGAAAGCATAAAAGATAGTGATAAAACGACCCCTAATC 

6 0 GATCAGTGGCTCTGTTCAGCCATCAGAGAAGCTTCCrCCCA 

ACAGAAAGACCTTAGAACACTCAGCTCCAAGTGGGATGTCTCCATCCATCAAATCTCTCCCCTCAGAGCTC 
ACACACACACAC^CACACAGAAGAGGAAGGAGGGTAAGAGACAGAGGAGGTGGGCCACCAGG^ 

TG AACAAAGCACAT ATGAACTCACAGAG ACTTAG CAG CAAGCACAGTGCCTATACACGCCTGCACCIACGTCCTCTGTGTGT ATATA 

ACGGCTTTCAGTTTTGTACTTTTATC-GGACTCCTGTCTGTGTTAA 
6 5 TCCTTCTGTTATTTTATCTTGTCCAACATCAATGCGATGGTGTTCTTTCTTATTG 

ATCACTTAGAAGCCTATTCTTTTCTCCTGAGAGACAGGAGGGGAGTGGATGCAGACGG^ 

ATGGGGGATGGGAAATGATCATCACrrcrATTCTGTATGAGAGAAGAATCTACTTTCAA 

TTITAANNNNNNNNNNNNNNNKNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 KNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNl^^ 

75 ATAAGCCAGTGAGCAGTGTCTCTCCATAGCCA7TGC i 1 i ACTTC CTGCTTT CAGG CT CCT AT"T CT AT AT CC CTG T C CTtCj C (JTT C C 
TGGGTTCCCGCTCCGATGAACTACAGTCTTTAAGGGTCCTACTTCCGTTTGGACCTACT

CCCTGGGCTTGAACTGCCTGCTCTTGCTATCCCTATACAGTGCACCT 

CAGGGCTGGTTTCTGTCATCCTACATCTGGTAAGTCCTTGATTCCTCTCC^ 

AGCCCCAGAAGTTTCTTn 1 rCTTC 1 l' L ' 1 ' 1 AATAAAAAGAGGGCGGGGCTCAAACCATGACTCAGAGTTGGAACATTAATTAATAA 
5 AGGAACCTTTTCTAGGCCAGCAGACAGGAGTTGCATTCTTG^ 
CCTAAGTCATCTTGAAACTGAGGOTAGCTGATGCrTTTATTG^ 
GTCTGCAGACCCTGGTGGTGAGOTGAAGGGGCAACGCTTTGCTCACAC^ 
TGAGCAGCCTGAGCAGAGATGAATACAGGACAGACTGATGAATGGTT^ 
CAATGTTAATTGTCAACTAGACAGGGAGTATGCGTGTATCTG^^ 
1 0 G T AG AGTTTCAGGGTTGG ATGACTACAAATCTAAAGGCTT CATAAAATCCCCT CTGACTTGAACG CC CTGAAGC 

TCATCATGCTTCTAAGTTGTTTTGTTTGTTGGTTGGCTGT^ 

GGAAGCTTTAAAAAATGGGT ATTTTTTCTT 1 1 CAAAAAGAG AGCTGGGAGT AGTGGCACACAC CTTTAATC CCAGCACTTGGGAGG 
CAGAGGCAGGCGAATTTCTGAGTTCGAGGCCAGCCTTGTCTACAGAGTGAG^ 
TCT CAAAAACAAAACAAAGAAAAG AAAAGAAAAGAAAAGAAAAGAA 
1 5 AAGG AAAG AAAAGAAAG AG AGCTTTAGGCTGGCCTGG AACTCACT ACGT AG C CCTGGCTGG CCTTGAACCT AT AGCCAT CTTCCAG 
CATCATCCTCCTGAGTATGTTTGATTACAGATGTGAGCTACCAGACTCAATT 
TGTGAACCCGTAGTCCGGGATTTCTTCAGGTCTCTGTATC 

CGGGGGCAGGAAGAGAAGT ACAGAGCAGTT AGTTCATCAT CACAAGATCAG CT AAGCATCTGAGAGAGGG C C CTGGGTTC CATCTC 
CTATTGGCAAGCAATAGACTCACATTGCAGAGAGTATAGAAGGGCAT 
AGTCACAAGACTTATGAAATGTCTCTATATATTAAAGGAAATTGTTG^
GACAACTGTGAATAGGAAGCGAAAGAATCTAGCTCAGCCCCACAAG 
ACAGATGTGCTGGCAAGTAAGAGCAAGCAGGTGAAGAATGGACCCTTCCTTC^ 
ATGGCCAAGAGTAAAGGTGTGTACTACCATCCCTGGACCTAAGT^ 
ACTCAGAGATCTGCTTGCCTCTGTCTCCTGGGAATAAAGGTGTGTACTGTC 

TCAAGATCTGGATCCCAGGTCTGCCCTCCATTT

ATT ACAGACTCT AT CAAT ATAAT CACC CCCTT AG CAGTTC AT CTTTT AAAAT AT CATT AAAGG ATGTGGAG AAATGG CT CAG CTGA 
TATGGGGCATTAGTTATGGAGGAAGTGTCTGTGCTGGGCA 
AATACGGTTAGCAAATGTAATTTCTCCTTCCCTCCCCTCCTTCCTTTGGATGA 
TCTCCCTTCTTACCTACACTTTGCTTTCTGAGCTTAAGTT^ 

3 0 GGAATAAACAATTCATGCATTTGAAACTGC Tl '1 ' lATTATAGTATAGTATAACAATTA 

TAATTTATCACrTAAACCTTACCTAGGTTTATAGGTACTGCAAACACAC^ 

CACACACACACACACACGTTGTGT ATGTTGGTT AGGGCCAAG ACCGCCA CAGG AGAAC CCG AACTC CAGCTGCTCGG CCTGATT 
CTGATGGAAACCCTGACTTCCGGCCAGCTCACTGCCTT^ 

AGACAAGTGTGCTCGGGGCTCCTCTTCACACCCTTGAGCCCACACTCCCACGGGCG 


MOUSE SEQUENCE - mRNA 

GGGAGTGTCCAAACCTCTTTGTTTGATGGCATCTGTTTACAGAGTTACACTTTAATATC^^ 
CCTTCTCCrCCTCCrCCTCGGTGACCTCCTTCTCCTCCCCTTTCT 

GCCGGGGGAGGACGCAGGCAAGGCGGAGGGCAGCGGGAGGCGGCAACCGGTGCGGTCCCCGG 
40 CTCGCCATGGCCCGGAGACCCa3ACAC^CATCTACAGTAGCGATGAAGATGATGAAGACATTGAGATGTC 
TGGGCTGCTGCCCAAATCTGGAAAGCGTCACTTGGGGAAAACTAGGTGGACAAGGG 

AAC AG AAC GG AACAGACGACTGG AAAG TCATTGCCAATT AT CTG CCCAACCGGACAGATGTGCAGTGC CAACACCGGTGGCAGAAA 

GTGCTGAACCCTGAACTCATCAAAGGTCCCTGGACCAAAGAAGAAGATCAGAGAGTCATAGAGCTTGTC 

GCGTTGGTCTGTTATTGCCAAGCACTTAAAAGGGAGAATTGGAAAGCAGTGTCGG 

TTAAGAAAACCTCCTGGACAGAAGAGGAGGACAGAATC^

AAGCTGCTGCCCG^ACGGACTGATAATGCTATCAAGAACCACrGGAATTCCACCATGCCT 
GCAGGAGCCITCCAAAGCCAGCCAGACGCCAGTGGCCAC^ 

CTCCATCTCAGCTCTCTCCAAGTGGCCAGTCCTCCGTCAACAGCGAATATCCCTATTACCACATCGCCGAA 
AGTCACGTTCCCTATCCTGTCGCATTGCATGCTAATATAGTCAACGTCCCTCAGCCGG 

CGACGAAGACCCTGAGAAGGAAAAGCGAATAAAGGAGCTGGAGTTGCrCCT

C ATTACCAACACAG AAC CACACTTG C AG CT ACCCCGGGTGGCAC AG CAC CT CCATTGTGGAC CAG ACC AGAC CTCATGGGG ATAGT 
GCACCTGTTTCCTG1TTGGGAGAACACCATGCCACCCCATCTCT 

AGCAAG^TGCATGATCGTCCACCTVGGGCACCATTCTGGACAATGTTAAGAACCTCTTAGAATT^ 
ATTCTTTCTTGAACACTTCCAGCAACCATGAAAGCTCGC^CTTAGAT 

5 5 CTGACACCATGTCGAGACC^GACTGTGAAAACCCAGAAGGAAAATTCCATCIT^ 

AAGCTCTCCTCGAACTCCCACACCATTCAAACATGCCCTTGCAGCTCAAG 
CCCCCTCCCATGCAGTGGAGGACCTACAAGATGTGATTAAGCAG^ 
GGACCACCGTTACTGAAAAAAATCAAGCIAGGAGGTGGAGTCX3CCAACTGAGAA 
AGAGAACAGCCTGAGCACCGAGCTTGTTCTCGCAGGCGTCTCCT 

CACCTGTATCAGAAGATGAAGACAATGTCCTCAAAGCCTTTAC^

AGTGGTGCCTGGGAGCCAGCATCCTGTGGGAAGACAGAGGACCAGATGACC^ 
CTCAGCTCGAACTCTGGTC^TGTGAGACATTTCCAGAAAAGCATTATGGTTIT 
CTCCTClAGCGTGGAGCGCTCCATGGCTGAGAGAAGAGCCrGATTTTGTTGTGGTAC 
TAGTTGCITGAGATCTCACTTGATTTCACACAACTAAAAAGGATTTT^ 

AAATTATTAGGTAATGAATTGTGACCATTTGTTAATATCATAATCAGATTTTTT

AGGATACAAC^ATCAGTATTTTTGACTGTGGTGAATTTAAAAAAA 
AGTCACTAAACATACACAGAGAGATTTTTAAAAACCAGGAGAAGCA 

CAACCCTCTAGGAGCTCATTTGTGGCTAATAATCTTGGAAATATCTTTATTATACTAAACCATTTCATG^ 
AGCTTGCTTGAAAAGTTATTACTGTATGAAATAGTTTTATTGAAAAAA 

7 0 AATGGAGAAATGTGTT CG CTG CT ATGGTTTT AGC CTGT AGTC ATG CTGCTAG CTTGTGTCAGGGGG C AAT AG AGCTT AG ATGG AAA 

AAAGAGAAAGAGACTCC^TGTTAGATAACGGACTATGCACTAGTATTCCAGACTT 
TTGTAATTGGAAAACTTATTTGGGAGAATTTTGCATTTGTTG 
CGATTTATAAAAGCATTGCACTTCTTTTTCTTTTTT^ 
AATGTTCATCCGTTTGGGCGTTTTTCTGACTTGGAAGAACATTCTCT 
TTCCTGGGCITCAGGGAAGACAAATATGGAAGTCAA
TGAAATGACAGTGTATCTACTGCCTTGTAGCAAAATAAAGCTATCCTCTTATTTTACAT
MOUSE SEQUENCE - CODING 

encoding 71 kd myb protein, complete cds ATGGCCCGGAGACCCCGACACAGCATCTACAGTAGCGATGAAGATGATGAAG
ACATTGAGATGTGTGACCATGACTACGATGGGCTGCTGCCO
GAGGATGAGAAGCTGAAGAAGCTGGTGGAACAGAACGGAACA^ 
TGTGCAGTGCCAACACCGGTXX3CAGAAAGTGCTGAACCCT 
TAGAGCTTGTCCAGAAATATGGTCOTVAGCGTTGGTCTGTTATTC 
AGGTGGCACAACCATTTGAATCCAGAAGTTAAGAAAACCTCCTGGACAG 
TCTGGGGAACAGATGGGCAGAGATCGCAAAGCTGCTGCCCGGACGGACTGATA^^
GTCGCAAGGTGGAACAGGAAGGCTACCTGCAGGAGCCTTCCAAAGC 

CATTTGATGGGGTTTGGG CATGC CTCACCTC CATCTCAGCT CT CT CCAAGTGGCCAGT CCT CCGTCAACAGCGAATATCCCT ATT A 
• CCACATCGCCX1AAGCACAAAACIATCTCCAGTC1ACGTTCCCT 
CTCCGGCAGCCATCGAGAGACACTATAACGACGAAGACCCT 
1 5 ACAGAGAACG AG CTG AAGGGACAG CAGGCATT AC CAACACAG AAC CACACTTGCAGCT AC C CCGGGTGG CACAG CACCTCC ATTG T 

GGACCAGACCAGACCTCATGGGGATAGTGCACCTGTTTCCTGTTTGGGAGAACAC 

GCTC CCT ACCTG AAG AAAG TGCCT CAC C^^CAAGGTGCATG AT CGT C CACC AGGG CACCATT CTGG ACAATG TT AAGAACCT CTT A 
GAATTTCCAGAAACACTCCAGTTTATAGATTCTTTC^ 
ACCCTCCACTCCTCTCATTGGTCACAAACTGACACCATGTCGAGACCA 
2 0 CTCCAGCTATCAAAAGGTCAATCCTCGAAAGCTCTCCTCGAACTCCCACACC^ 
TACGGTCCCCTtxAAGATGCTACCTCAGACCCCCTCCGATGC^ 
TGGAATTGTGGCTGAGTTTCAAGAGAGTGGACCACCGTTACTC^ 

G AAACTTCT'rCTGCTCAAAC CACTGGGCAGAGAACAGCCrGAGCACCCAGCrGTTCTCGCAC^ CGTCTCCTGTGGCAGATGCCCCA 
AATATTCTTACAAGCTCTGTTTTAATGACACCTGTATCAGAAGATGAAGACAA 

2 5 GCCCCTGGTGGGTCCCTTGCAGCCATGCAGTGGTGCCTGGGAGCCAGCATCCTGTGGGAAGAC^ 

GTCCC^CTCG<3AAATACGTGAACGCGTTCTCAGCTCGAACTCTGGTCATGTGA 

HUMAN SEQUENCE - GENOMIC 

TTGAGAAAAACATCTAACAGTCrrGAAGGTAAAAGGTATCTTCTXTi'AAAAAGTAi i -liGCATAGATATTG'lTdTlAAATGTTCCA 

3 0 TGAG G ATAT CCTTAACTACC AGTT ATTTAG AAGAAAG AAAG AAG CAGCAA 

AAAG CCAGCT AAGGTGCTATGCIACACCCTCT AAGTCCCZAGCTGCITG CTGAGGCAGG AGG ATCACTTX3AGCCCIAGG AATTC 

AAAACCAGCHTGGGCAACATAGCAACATCTC^TCT 

ATAATTAGAAATTTTATACATATATCAAAATGTCATATTGTTCCCCATAAATATA 
AAATAAGAGCTTTTATATCACGAITTTGGTGTACCTGACATTTTT 

AGCTCCATACTAAAGGATCAAATGAGCCTCCCCAAGCTCCCAGCAGAACT

GATTATAAACACTTTCATATAGTTTCCTTAGAGGCT^ 

ACTG AT ATGTTTCAT AAACTG ATTX3TTTTT ATTTTT CT CAG AACCG TTACCTTAACTGAAG AG T AAATTGG AAT AAG CAATTT CT A 
TAGATGTCAAAACTTTGTAAAATTGATTTTTTACATCATCCACAA^ 

TT ATGG T ACACTTCAT CCT AACAGGGTTCACATTT AAAATC ATGT AGTCCACTTCTGTGA^ C C CAAGAAAAAT AACTCCCTT A 

4 0 AATCCGTTCCTTTCITLTI'GTTCTGTTCGAGTT'AAAAACTTTA 

GGAAAGACATAGTGGTGATTAAAAAACATGCCATTTCTTAC 
TAAGCACTCATTC^CTTATTGATTTTATGACTAATT^ 

TAAAGTCAACAATCAGATAAATATACAATGATAAAGTAGGAGAGACAGAAATAAGATTCAGAAA 
TTTAAAAGAAAAACAGTTCCAAAGTCATATTGCCAAACAGACyiGAAATCTA 

4 5 TAAAATTATGATTTTGCTATCATTTGTAAGGTTATCAACCTTTATCTGAGGACAGAAT^ 

GGAAGAATTACAAGAACATAAAGGCATAGGAGTTAAAAGTTGAAGGCTO 

AAAAACCTTCATTTAAACAGTCCCAGCTCTGGTAAATGTTATCTGTGTGACCTTGTGCAAATTACTTCACCTG 

TACTTGACCTGGAATTGAGGAAAATAATACCTGCGGGGGAGAGATAGGCAGAAAGG 

ATATATGGCACAGCCTGTTGTAATCTCTAAAGCCAAATATAACGTC 

AACTAGTTCTCATCAGTTGTCATCAGCACAACAAAGATTTATGTACGTACTTATG

AGAAAGGTAAAAGATACTGTCTCTCCTCTCTGACCTATAATGAAGGCTCCAGATC 

ATGATGCTTAGATAGATAGATAGGTAATAGATGACAGATAGATAGATAGATAGATAGATAGATAGATACATAGATAGATAGATAC^ 

TAGATAGATACATAGATAGATAGATGATAGCAAAGAAAGGTAACATAGGCCATACACTCAA 

ACAC^CTCACAGGTGGGAGAAATGACTGTGGATTAGCAAGGTCCAAGAAGGCTTCAGA^ 

5 5 GAAGAATGGGTAGCACTT ACAT AAG C AG AG AGGAGAAAGGAAGG CATTG CAGG CAGGAGGCATG CTTG T AAC CAAAGGC CTGAGAT 

TGCCAATGGTTTTTTGGGG^AGGAAACAGAGTGAACAGAGTATGATAAGAGATAAGGCT 

ATGGCATGCAGTCAGTGTGCCCGTCAAGCCCTGCAGGTTGT CT ATAGC CAGGCCC ACTCTGATTGG CCAATACATGAGC CACTTCA 

GGTATTAAATACTATCACCCCTGGGGGTAAGGTTGTCnXSGGCACAGAGGCCAGAGCAAGCT 

AGTTCAGATCTCTATCTTGTCAACATTAGGGAGCCAGTTAAGGGTATTTAAACAGATGAC 

6 0 G ATTATTGAGCAGTGGG ATGC^GAATGCATGGGGGAAAAGTGG CCACAGGGTGG AAAATT AATT AGGGGG CTGTTATGGTAGTTCA 

GGGTGAAGTAATAATAGCCTAGAAGAAGGTGGTCACAATAGAAAGAGAAAACAG^ 

TGGAAACATTGTCAGGGATACAAAGGTAAATTGAGAGACCTGAGTATTTT^ 
TAAAGATCTCTACTGCAGCCTCCATC^TTGAAGGTTTGAAATATG^ 

6 5 C CTG C CTT AAACAC CCTGTT ATTTG TT ATT C CAATAAGCCC CT AAATG TT ATT AAT CTTTTG TTGTTGTTGTTT CT ATTAG CATTT 

CTACTAATAAAAGCTTACTGTTTTGTTGACTAACTTTGAAAGCTGCTATTTTACA 

GAGGGGAATTGACTAAGAACAGTTGTCAGAAGAGATAAGAGAGAAATTACATCTGTAGGCCGGGTC 

CACAACACTTTGGGAGGCCCAGGCAGGCG^ATCACGAGGTCAGGAGTTCAAG 

ACTAAAAATACAAAAATT* AACTGGGCGTGGTGGCAGG CGCATGTAATCC CAGCT ACTCGGGAGG CTGAGGCAGGAGAATTGCTTGA 

7 0 ACCAGGAAGTCAGAGGTTGCAGTGAGCCGAGATCGTGCCATTGCACTCCAGCCCAGGTGACAGAGCTAGACT 

AAAAAAGAAAGAAAGAAAGAAAGAAAGAAATTACATCTGTAAACTGGTCTTGAAGAATTTC 
TGTCACTTGGAGAAAAGTTAACGATTCCATAAAACAGATATTCTAAAGCAGAA^ 

TTTCAAACTG CGATTC ACT ACCT AT AAACAG AGCACGAAAACAACTTTGGGATC CHtLAC CAG AATG AAG AG AAG AGAAGAG AAGAG 
GTCTCAGATTGCATGCTTC^TAGTAAGGGGTAAGTATTGTTTTCT^ 
7 5 CTATCTTC^CCAGGCrGGTCrCAAACTCCTGGTCTCAAGCAATCCTCCCACCTC 
TGAGCCACCACACATGGCCAGAACTTTTCTTAGCT

TCTTATCGTGTCACATATAAAAATATTTGAGGCCGGGTGCAGTGGCTCACGCCTGTAATCC 

TGGACCACCTGAGGTCAGGAGTTCGAGACCAGCCT^ 

GCGTGGTGGAAGGTGCCTGTAATCCCAGCTrACTCGGG 
AGCCAAGATCGCACCATTGCACTCCAGCCTGAGCAATAAGAGCGAGACTCT

TGTATTTGAAAG C CAC (_"!"!' 1' 1' CCT CAAAT AAAT CAAAG C CTTT CAAAAT CAGTTGAAAAAAT ACAG AATTTT ATT AAAAGAAGAAA 

ACATTTGTGTGTtyTGTCTATATATATATAT 

CCCTGTCTTTTAACCTCTTAAATATTATCXAAAGACTTTCCCT^ 

GTTGCCTTCATGTAAATAAACTCTTAAATAAAGCATT^ 
1 0 AAAAGCCCrr ATTTGTAGCACTTGCOVTT^ 

T ACAT ATG AAATTGGAAAG AGATG CACAACAG CACAT ATT ACATTG TG TGAT AG ACAT AAATAAC CTCAAG T ATGT AGAT AAT AGG 

AAAATGT CATAAAAT AGGAAGTX3 ATGAGTTTTGAAT ATTT ATTG TCTTTT AAAAT AAAAT AT ATTT AACTG TAAGTTT AT ATAATT 

TACTTTTTAATAATGGCTGTGTCTAACAATTAACTCACAAAATTTAG 

GCTCTGATCCCACACTCTCTACCACCTGCTGAACAACTCTACCCGGATGTOT 
1 5 CTAAACATGCTTTCCTCTTTGAATCTCCCAGTGCATACTTCTCTI^ 

ATCCTTAGCAACCCAATCAGTTGCTAAGTCr j I l GAAGGv_iuitjCCTATGi i u"i j rClX3ATCTGCCTCCTCCTTTTCATTCCACTTACG 

GTTACTGTCCTACCCCAGACAATTACCTAGCCTCTTCCCCTTGTCTCTTCTCACT 

CTTCCACACACACAGCTCTGATAATATAGCTTTGCTATTCAACGCACATACATGAATCCGGCCT^ 

CAATCTGGCTAGCTCCTTCTTTCCTCTGCTGGGAACATGCCTTGTGGAT^ 

CTGGACTGTCCTTCCCTCTITATGGAAAGACTCAT

TCATAGGAATATGTAAGTTGATGAGAATCTGGAGAGCGGAAACAAACAAGAAGTGCOUUV 
TTATTTGGTCACTATCTGTTATGGAGGTTCTATGAAACATACTACACTGT^ 
CACTTAGTCCACTGTCATTACTAAAAGATTTTCrGGGGGAGTCAACATC 
CCACAGCTTTTCTTCATTAACrGAAATAATTAAGAATTGTCT 
AGCCAAACTGGGTTGAGTGGGTGAGGAAAGGTGGGTGGTAGAGACATGGAGCCCCTAAT
TGATTCATCTTTGGGATGTGGATCACGAGGATTAAAGATCCACCACGT^ 
AATTTCTATCTTTTATATATATTTTGTCCTTCAGCCC^ 

CAG CAGGAGAATCCT AATG ATG AGCAGCATGTG ATCAAG CGATT CCTTTTC CTTCAAG CTG AGGTTG AAGTTCAGTAG CGAAGG AA 
GAAGGAACAGGCAGCTACAGCCCTCTGGTCAGCAGCACCAACTTTCT 

3 0 CTTTAGC C CAGG AG AGGTG AAAGTG AATGAGGG AGAGG AAGGCGGTG AG AGACAAATGG CAAAGG T AAAAGTGGGCTGGGTGCAG T 

GGCTCACAACTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGAGGA 
AATATGGTGAGACCTCTTCTCAACAAAATATTTAAAAATTAACTGGGCA 
CTGAGATGGAAAAATCACTTAAGCCTGGGAGGTGGAGACTACAATGAACTGTGATGGTC 
GGGAGACCCTGTCTCAAAAATATACATATTTTTITATTTTTAAAAAAGGTA 

3 5 CTGAGATTATCAACCAGTGCTCTGCAGACCTTCCATGAAAACTACTGTTAA^ 

GAATCATGGTCTCTAGGGGGACAGCCTGGGACTGTGCATCTTTCATGGGTACACCCA^ 

GT CT AACCTGTG AC CCG CAGGCTG CAT AT AG CCAAGG AGAGCTTTGAATGC AGCCCAACG C^ 

TATGAGl rrrriTGTGG T ATTT CtitAAG CT CATC^GCT ATCG TT AGTGTT AGT ATATTT CATGTGTGG C CCTT CTT C CAATGTGG 
CCCAGGGAAG CCAAAAG ATTGGACACTTGGG CT CT AAAGATTG AGGAT CACTGCT CHAG TGTTGG CCAACT ACT CT AG TGTTGG C C 

TGCAGACTCAATACAGCCTTTGAGCTAAAAACGGTTTTTACATTTTTAA

G AAAG ACACAGAAG AAT AT ACAACAT AGC CT ACAAAGCATTT ACT AT CTGG TCTTTT ACAAAAGAG TTTT CTGACC CCTTTCT AG T 
GCACTTGCAAATG AAAGGCTGT ACCCAAATTT AG CTC CAGT GC^ 
AAAAAAAAAGTGATGAAATGTGTAAAGTTGTCATCTAAAAATACTTTTG 
AAACATTCTGTAATGAGTTTAAAAATATAGTTTCATCCATACAAGAATTTTGATAACTCC 

4 5 ATCAGATAGCAATATATCCAACATTAGCCTAATAAAATTACTGCATGTGAATTCTT^ 

ATTGTCATACAACTGAGGGAAAAATGTAAAACTATATCTGGGAAATGAAAAATTATAAAT^ 
TTTACAAAGAAAAAAAGGAAAACTGAACTTAGTATCTTTCTACTTTAAAGG^ 

AAAT CTTGAAAAAGAGG CT ATG AACTCACITCACAAAGGG TTC AG AT ACT CAAGAG ACT CAG CAATTGGGAATCT 
TGCCAGTTAGATTCATAAAGCAATCCGAAAGCCAGAATGCAATTTCTT^ 

AAAGATTTTGTTTATTTTTGATGTGAAGAAAGAAAATTATTATAGAATCTAGA

TTGTCCTTTGTAATTGTGATCCGAAATATCTCACAATAATTTTGTTTCA 

CCTTACTAGCAACTAACACTTGATAAAGATTTAAACTATATAACAATTTTA 

CGCATGCTTCACCGTTTAATAAAATTATGACTCATTTTAGTAAGAACATC 

TTG AACTG CTGGGGCACATGGG ACACC CACAAAC CC AG TCAGCAGAAGTCT CAAACT CT AAAAG ATT AAAGG AAAAAT ACATT AG 

5 5 TGTGTACAGAGAATGAAAAAAAAATCCCCAATGGTTCTAATCTATAAACGGAAGTAAAACT 

CATCAATCCTTATAAATTCAGCACTTATC'1'rcri'CCTTTACTTCT 
TATCTCGTAAAATAGAGCCOTCAAATGACACTAAGCAGGGATACCGACCTCCAAAATCA 
ATGCAAACGTGGGGTTTCCTGAGTTCTGCTAGGGGCGGGCTGTCTGGACCCCCGTGT^ 
TTTTTGCAGGTGTCCAATCCAGGGTGCGCGCrCTGTGCAGGGAGGGGAGCAG 

6 0 GCGGGGTGGGTGGGG CC CGGGTGTG TG CTGGGGTG AG AGTG AAGGACATG TG ACAAT AC G ACGG TGGCGG CTG CAGGAT CCGGG AG 

GGGAGTGAAAGCTCCXjCTGGGGCGCTCCATTAGTGAGCGGTGATGGTTGCCGCCC^ 

AGTTAAATTAGAGATGTTATTTATTTAAGAAGAAGGAAAAAAAACCCTAGCCAAACAGCCTATGAATACATATG 
ACTCCTCCAACTCCTAATT^CCCCGTCTCCAGAGGGCACAGTTGT 

CCCACCGCTTGCCGCCCCCG CGACAGTGAGTGGG AGCTGGAGG AG CT CTGGTCCCGCTGCCCGGG AGCACGCGGAGC CGGGCGACC 
GAAAAGGCGCCCTCGCGGCCCCCGGCCACCCCTCCCTGGCCCCGGGCT 

GCGCACCGaKXTGGCGCGAGCGCCGAATGGGAGCGGCGACCCGGCC^GCCCGGCAGCCCCGCGGGC 
AGGCGGCGGG CAGGG CG CGTGCGCACTGCAGGGGCGCCAGATTTGGCGGG AGGGGGAGTCTCCAAAGCT 
TCTGTTTACAGAGTTTACACTTTAATATCAACCTGTTTCCTCCTCCT 

7 0 CCTGAGAAACTTCGCCCCAGCGGTCCGGAGCGCCCTGCGCAGCCGGG^ 

CCCGGTCGGTCCCTOCGGCTCTCGGCGGAGCCCCGCCGCC^^ 

GGCGGG(X3GCCGAGGGCGGGGGCGGGGGCGCGCGGGGCGCCAGGCTCCCGGGAGCAGGT^ 

GGCTCTCAGACCCTCCGAGGACCTGGAGCCCCTGCCrCGGCAGCAGAAGCCGCTCC^GAGACT 

GAAATGTATCCG^ACGTTGGGAAGCACGCCCTGCGGCTCATTTTGCAAGTTGCATGGGGA 

GGTGCCCACCAGCCCCGGGGAG CC CACCAGGTGCCTGGCTTGATGCCGCGGGTGTCGGCG 
AGGGATCCGCAGCGTAATTGCTGGATGCATTGAGATATGTTTGGACTTGGAAGTGCAGAGCATATGGCAGATACAGGGAAAATGCT
GGATTGTTTAGTAGCTGCAGGTAGTTCATTTAAATTTC^ 

AAAAGCAGATTTTAGGGC^IATAAAGGGG TCTCT CAAATTGTCTT AAG ACCTGG T1TTGGAGGTTCGAATCATTTTTCT C 

TGAAAGGCTTTTACTTCCACTGCAAACATTGCATCCGCACC 
CCGGGAAGATGGGTGCAGTCCCTTCACATTTTTGGCCGGACAGATGGGAAGAG
AACTGTCAGTTGCTC^TTCCCCAGTCTCCACCGGCCTCTGGA 

ccagggctagctcggcagccgtcgcagtcxsccactc^ 

gggacgcgatccccgcgggctctgcccccaggcgaaaggtccctgcgct^ 

tctcgcaccctccttcagcosccttaggtcgccccggcc^^ 
1 0 gtccatggaggcaaag caaatccattcgcctttcctt agattgtttgattttctc 

ggtcccc^cgcggcgctcggaagttccaaagagcgtgggtggagaccctgc^ 

cgccggggcttggttgcgctctggggacgagagggcgacttgggg^ 

caggacggtgtgctgcccctcgagggctccacttttcgcctttaggac^ 

ttirttttttttosagagctgaaactggca 
1 5 ccccgaggtctcggccgcccccgcggggcggggccgcgcaggttccctgaggctccaggtagcccaga 

ggcgggagccttgaacctgcctcaggggaagagaaacccgactgcgggct 

cagggattatattaaccaggtcagcgaaatgagttt^ 

ctccctcttcctaacccccagcacct ittitct^ ^ 

gtactcttgtaaccacaccgggtctagcaggtggggagtcgagtgcatcaaccctgaaatcagcct^ 
2 0 gggtctgagatcttgtcagagcagataaaggaaaagttatttcaggttcct^ 
agcaagcaaatcgcatgcaaagttcaaaggcgagcctgggtggccagcc^ 

CCGGACTCGCCCTGCTGCGTCCTGGCCCCITCAGGTCCTGCAGAAGTCACT 

CTTTGC CTCTGTG CTTAGCTTTT AAG AGG CACGG AAGGACATGGCATGG TCATGTATTAG CGCTGG AAAG TACCTT AG AAAT AGGA 
TCCAAGCAAGCCCTTATTGTAAAGGCGAGGAACTGAAGTCTTGAGAGAGAGGA 

GCTAGACTAGAACCTTTGCCTCCAGAACCCTAGTCCTC

GAATAGGTTGCTTGTTCCT^GGTTGTTAAACCG<rrGCGGAGAGAGAGGTGCTCA^ 
AAATGTGACCTCTAGATCCAGCCAAACCTGAAAAGTTAATTCGACrrTGAATCTATGACTGCAG 
GGTCTCTCTCACATTTCACAAGGGACTGCATTTGGGGTCTTTCCAW 
CTCTGGGCTTTATAGCTTTTTTTTTTTTTT^ 

3 0 TGTTTAGTGCCTCCTGTTGACAGTGGAAACAGAACTAGGCTATT^ 

TATAAATATATATTTCCTGGGATTTTATTTG^ 
TCAGATTGTTGCTTAAAATAATAGCTTTAAATTATGAGAACC^ 
GGGCTGGTAATATGCATACCAGTGAGTTAGTTTGGGAAGACAAAACCT 
ACATTATGATATGGCTTTATATCATTGTTTTTATTGGTCTTAGTT^ 

3 5 TTTCTAAGATAGTATTTTTCCTATTTTAAGTTTTATAAGAAAAACTATAAAATAAA 

AGTTGATTAGATTAACAAACATTATGATTATGATTAACAAACAGTAAGCATATAGT^ 
GTTAGGTTAACTAAATAAAAGAAAAAGCITTCGAAAATAGTAATGG^ 

CTGGATTAGG TT ACATTTTAT AAT ATTTCAAACACC ATT CC AT C AG AG ACAG AAAAAACAAGAG T AAGT AAGACT CTG ACT AACAA 
GTGGCCTAATTATTCACTTAGTTACTCTAGAAACTAAGTATTCTA 

4 0 GCAATTTGGTATTAATTTTATTTAQVAAACATTAAAGCT 

TCTTAGATTCTCCTAATCCTCTAGACTTTATGGGATCACTATAATTCrGTTrTGCGCT 
TAATAAAAC!AAAAACCCCATTGGAATAGCATAGTTGAATTt3T^ 
AAAGATTTTGAGTGTGCACITATATAAAGGACATGGGTTCTTGTTCCTTTTCT 
CACTGGTTGGAAGGTGGCCAAATGTGTAACTTGTCCCTGGTCTAATAGTAACAGC^ 

4 5 CCAGGTCCTTGGCOSTGTCTGTGGATACCCATAACAGCAGAACCAGTTTA^ 

TTGTGCAAGTTTTCAAAGTTTTGTCTTCATAACCTTTGAAAAG 
CTAAATCCTCTTGTTTCAGCC(^CGTCTACCC^TT , CTTATTTCTGCAGCATATATAG 
TGTCTGACCATGACTATGATGGGCTXSCTTCCCAAGTCTGGAAAGC^ 
AATCATTTTATCAAGACGCAGAAAATAGATAGAGTGTATAATTTATAAAAAACA 
50 ATTAG<^GGCTCCTTGAGATCTTATCCTAGCCCGCATGTGCAGCGTGCCCTATGCCCCC^ 
CTATTTGCAGTTCTTTGAATGTCCTCTTCTCATT^ 

CACCTCTTCTTC CTACTTTTACATT ACCCATGAAGG CATCACCTCTT , CTGTGAAGCCATCCCTGGCTTCGCCGGAC AGGATTGGAT 
GCATCCTCCTGTGATCCCATGCATGGGTGCATGTTCTACTTTATTGTTGCTG^ 

5 5 TCTCTCAATAAGTCAATGACTTGCTACTCCCAAAAACAAAACAAACAAAAGAATA^ 

CAGTGTATTGATTTAATATCCITGATATACrrA^ 
ACACATCTGCTTCTCCTGTTTCTTTTTCGGCTGGCTTATCAAA^ 
TATTAATATGTGGAGAAAAGTTTCTCTAATGTTCTTAAAATTTTTGCA^ 
GGCTCTTCTTCAGTTTCACTCTTCTCTGCTC 

6 0 GG AAATTGATATTTTACCTATTAAGATTATCC!ACTTGGATAATCCT^ 

TCTTCTGTCTGTATTGTACCAGGGACCTCCACTGTCT^ 

ATGAACTGGTAGGATTGTGAAC^AGCATATAAACTAAAAGAACTGATTAGCTGGAAATGTTTAATGT 

TGGCCTATCTAAATCTAACTGATTTTCTTATATAGCT^ 

CTAACTGGAAATATTCrATTTCITGCTCAAGATCCTATTTATTA 

6 5 TTTCCTAACAGAGTAAAGCAGTTATTTATAATAAACACTCAATATATCCCAGGCAT^ 

GTAATCCCCAAATAACCCTGTTAATATCAGCCTCATGTTAATAAATT^ 
TTATAGATOTCAATACTAAGGCCCAGAAAGGTAAAATGGTATACCCGCTTT^ 

GTCTGATTCTAAATAAATCTATACTCCCAAGTCCGTATATTACATTAATATCTCCTCTATTCGGATA 
TCTAGATTCTGTTCCCTCATTCACTTTAACTTGAACAGAGTTATATAGT 

7 0 GGATGAAAAACTGAAGJ^GCTGGTGGAACAGAATGGAACAGATGACTGGAAAG 

TTTTTCTTATAACGAAAGGAAAGCTTCTCCCAAAGATTTCTATAAAATAT^ 
CACATTTAAGAAGTG<3GAGAAATGCCCTACCCATAACTTATATTTTA 
ATTAGAATCGATTATTATAATTTAGATTATCAAATTTTAAAAACAGCT 
ATTTTGATGATTTGTTACATCX3TAGTTG 
7 5 GCAATACTTTTGGCACCAAGGTTTAAAATTCTGAAGACCT^ 






WO 03/008583 



PCT/US01/51291



TAAGGACCCTGGAATTCCTTTATAGTCAGTCCOT 
GOACTATCATATAACTTCCTAACTGGCTTC^^ 
GTTTTTTGAGATGGAATCTAGCTTTGTTGCCCAGGCTGGAGTG 
TCAAGCGATTCTCGTGCCTCAGCCCCCTTAGTAGATGGGATCACAG^ 
AGAGACAAGGTTTCACCATATTGATCAGGCTGGTCCCAAACTCCTGACCTCAAG
GGATTGCAGCCATGAGCCACCGTGCCCAGCCAGTCITCrACTTTT 
ATGCAGCTGTACCTTATCTTTTCAAAACCAACCACAGT 
GCCCTCCTCTATTGCCCCAACTTACAGCTCCAGTTCT 
TTACCTCCTCTGGGCTTTTGCrCCTGGTCTTCCCACCTCC^ 
TCAAAGTCCTTATGAAGAGCTCTTCAAGACTCCTCCCAAGGAAACCATTTCT
ATCATTTTGCITTGGTCTTTTCTrCCAAGTGAAT 

AGTTTACAAACTCCTGT AACATAGG ACCT ATCTTTT ATG CATCTT" ri'TT (J'i'T T CTG CAAT AT AAAT ACTTTGGTCCTGGATTGAAG 

CACATCTCrTGTCTGAATGGGAAGCAAACTTTAGTCTTTTAGG^ 

CTCAGGCTATGTAGATACTCIGCAAAAGTATCTGAGGTTAAGGTTA^ 
ATGTGGCAGGAATGTTTCCCTCAGGAGGCAGAGTGTCTAATTTT

AAGTAATGAGGGATAGATGGGCTCTAAACATGCTCTAAAATGTTCTTAAA 

TACAACAAAAAATTCACGTGCTTATCACATCATATCTTTATGGTGGTGC^ 

CCAGCACCGATGGCAGAAAGTACTAAACCCTGAGCTCATCAAGGGTCCTTG^ 

TCATTGGTGTGTGACTCATAATTAAGAATATTCCCAAAATGCTAATATTTT 
AAGTAGAGGACTTCTATTCCCATATTTCCAGTGAATGAAAGCAAATTTT

TATACTGACTCATTACATAACTITAAAACATAGGTTATTTTr 

ttaAAACGTTGGTCTGTTATTGCCAAGCACTTAAAGGG 

GAAGTTAAGAAAACCTCCTGGACAGAAGAGGAAGACAGAATTATTTACCAGGCACACAAGAGACTGG 
CGCAAAGCTACTGCCTGGACGGTAATAATATGTCAAAAAATATATTGATCGGGAAGAATC 

2 5 TTAAATGTATGGTGAGTAAATTATATTTCATCAGTCGTAAGTGTTTTATTAGATAAACCAGC^ 

CATCTTAGAGTTTATTTGGTTGAGAACTTCCAGGATCCACCTGACAGTC 
ATTTCCCCATTTCCAAAATGAATCATACAGGGCCAGTGTTAGGATCA^^ 

TCT AAATGCATAG AAAGAAACTGAAGACAGTTTGTCTCTGG CAGTT CACGT AGCTGGTTAAGTTACTCAAAGGAACTCAGCTACCA 
TTTTCCTGAT AAC CATCATTCATTl' L'l'U"ITATT'ri"ri"l"l'AATTATTATTTTTTAGAGACAGAGGCTTGTTCTGT CAC ACATGAG TG 

3 0 ACAGAGTGCAGTGGCATGATCATAGCTCAATACATCCTTGAACTCCTGGGCTCAAGG^ 

TAGGACTACAGGTGCACCCCACTACACCCAGCTGATTTTA 
TATCAAACTCCTGGCCTCAAACAATCCITrCACCTCCCAAAGTGCTGGGATT 
TCATTTCATTATACCACTTTGCTTAATTTTTTACTTCATTGA 
TTTTTAAATAATGTCAGGGTAATAATGCTTTCTTTTTA^ 

CTCATTTTGTCCCATTTTTCCCACTTAAGTAGTAT

ATTAAAATGATGTAGATCAAAAATATTAAAAATAGTTTTATTCTTGAGCTTCATAATC 
GTGATGCTCCAAAGCATTTGCACCCTTGTGGTTCAGTATAAACTTAGAAATTTTAAGAGGT^ 
TATGGCCTGAGAGGATACACTGCTGATTCTTCTCAAAATTAACAGGGCTATTGAAC 
CCCCATTGAAATAGAAAAAGGGAGACTTAAACGGTTTTTTTGAATTGAGA 

4 0 ATGCGTTGCCTTTTGAAAATATACTGGGGGTTAGGATCTTTCCCTTTGGGATCC 

AAACAATTGGGGTGGCTTTCAAAGAGTAGGAGGTTTTTTGGTCTCAGAAGTT 

TTCTTCATCAGCAGCATCTTATCATCCACCAGACTTCACCATGTCCCTCTCGTGGTGTCTAGGCCCT 

AGTAATTTCCCCAAGGCAGCTGGAACTTTACATTGTGTTGGAGTCTTGTACCGCGAC 

ATTGAAAATACTCCTTTACCAAGGAAATGGAATGGTTAAAAGATACTGATATTTCT 

CTAAGAGAAATCTAACTAGGCAATAGAGAAAGCCTGATCTCAGAGG

TAGGCCCAAGGGAAAGGTCTCTGTTAGAAACAGTTTTCTGTCAATTCACTTTAATCTGAATTG 

GTGATATATTTCTGTGCAGAACTGATAATGCTATCAAGAACCACTGGAATTCTACAATGCGTCGGAAGGTC 

CTGCAGGAGTCTTCAAAAGCCAGCCAGCCAGCAGTGGCCACAAGCTTCCAGAAGAAC^ 

GC CT ACAG CTCAACTCC CTG C CACTGGC CAG C CCACTGTT AACAACG ACT ATT CCT ATT ACCACATT TCTG AAG CACAAAATGTAA 
50 GCCATTCCTGTGAATCTAGTTTAATGAGAGAGACGGTTAGTTTAGCTCCAGGTAATAATCATACTTTG 
AGAACTTTCCTGAGCATGATCATGGCCAGTCTTCACCTATGCCCTT^ 

TT ATGTCTC CAGAAACGAGG ATGGT AAG CXXTrTTGCCCAGGGTG ACACAGCTCATAGAGATA 

CGATTCTAAATTGCTGCTTCAAATCrAATTTGAAGGCAACCTTTGAATGT 

GGAGTCGCATTTCTAGATGTAATGTTTTTCTCTTCCAAAACGAAAAGGGA^ 

5 5 CTGAAATTCAGTGGTTATGGTATGTTTGATTCTGGC^ 

TCTTACTCTGAGTCAGGAGCAGAACTrrGTGACATTCATTTTCrGATCTCT 
CTTATTTCTAGATATTTATATTTAGGGAAAGTGATAATCCIATAATAGGTGGAAAA 
TTTAATCTTAAGAATAACAATATTAAATGATAAAACTTACCCAAGAATAGAGAGCTCATATG 
TAAATAGTTTTTATTCTTCTCTGTCCATGGATTTGACTACAAACTTTACAGA^ 

6 0 CTGTGCTTT TT ATT AAAACT CAAAATT AT TC CAAG AAAAAT ATTTTT AT TT ACAAACAGCTGGTG CCAT CAATTTTTCCCAT AG AA 

ACAATTTTGAAACATTGATTAATGTATTTTATAATATATTT^^ 
CCCAAAATTTAAAGTGCTCATCAACACCTCCCAAGGATTTTTC^ 
AATTTGACATTAAAAGGTTTTAAAAATTTGTTTA 
CAGGAAGCCAAGTCCCCTGAAGCATATGTAGCCCTGAATGACGTG^ 

6 5 CCATACCCTGTAGCGTTACATGTAAATATAGTCAATGTCCCTCAGCCAG^ 

CTGTrCAAAGCACCACTTTTAAGAAATGCTTAT/ITCTTTC^ 
GGAACATGGAGAGGAGAGCAGAGTCTTGGCCAGCAATCTATTAATCTTGTCT^ 
TTTGATGTTGCAAGGTGAAATAACTCATCAGTCAGCATCCACTCTAGCCT 
TTCACAAAGATTTCTGCAGGAGCCAAATTTGGTTGGTGGTGTTGTCAC^ 

7 0 ATTGGCTACACCCATTGTATTTGCAGCTTCCAATC^ 

ATGAAGACCCTGAGAAGGAAAAGCGAATAAAGGAATTAGAATTGCTCCTAATGTCA^ 
CTACCAGTAAG ACTGTCATCATGTGCITQ AATGAGGGGATAGCAGCTTTG CCT 
TACACTTAGCAAGGCTCCATATATCCATTCAGAATGTCTCAACACAAGA^ 
ATGCTGATTAAATTGGAAGCAGTCTTTTTGTAATTCCAATAAAAATGCAATATC 
7 5 t^GAAGGACT'ATAATCAGTTTAAAGA 
ACCATTTTGGTTTTTTTCTATTATTTTGTTAACCT

TACATGTAAAAGGTTATATATAAAACCTCTTCATAAATAACATTAGCTGCTCTCITTTATTTGC^ 
ATTACTGAATTCTGACATCTTTAGCGACTGGGCAGCCAACTGGGAT^ 

GACAATAAAAGAGAAGTGATG CC CAGT AGGT ATTCCT AGGGGT AAATTT ATTI'T T T G T T AAATAAT AAG ATGTTT ATGTGCACATG 
CTAGTTCGACCACTTTTTAACTTTAGTTAATGTTTTGCAATTT

GCTCTTCTGCTTTCCTAACCACGTTGATTTGGCAGTGCTTCACTGC^ 

AAGAATATGTGTAGATCAAAGCAACTCTAACTCCTGAGTCCTCTAGCTTTTTTGT^ 

TCTTCACTCACAGCAAGCATTTATGGACCTTTGG 

TATAGACCTATGATTTGACATTTTTACAAAATGATAACAAGGTAATCATATAGGTACAAC 
1 0 TGCATTGATAAAAGGAAACATCTTGACAACTGTTTCATAGGCGTAAGTTTGGGATGATGCAACTACTC^ 

CATCTGATACCTTGTGCAACriTCATTGCrAAGTTCCTTCTCCCTTTCTTCTGTCCTCT 

TAGACAO^GAACCACACATGCAGCTACCCCGGGTGGCACAGCACCACCA 

TGTTTCCTGTTTGGGAGAACACCACTCCACTCCATCTCTGCCAGCGGATCCTCGCTCCCT 

GGTGCATGATCGTCCACCAGGGCACCATTCTGGATAATGTTAAGAACCTCTTAGAATTTGCAGAA^ 
15 GTAAGT AGAATTGTG AAGGGAAGTTAACGATTCTGGAAGATAAATTAGAAAACCCATTTCTT AAAT CATTGCCATTTT CT AT AG CC 

AAGCTG TT ATT AGCATAT ATT AATGTAAAGGT AG AAG T ATGATTTT CATCTTCATG AATATGTTTTTTCAAAG AT CATA 

GCCTACCGAGAGAGACAAAAGTATTTATAAATACTTACAAAATAGGACT^ 

TCCTTCAAAAGTCATATTTTTAAAAATAGTCAGATTGATGACTTATAAAGCAGAGCTTGG 

TTACTTTTAA CTTTTTTTTr r C I T I G CTT ACTT CATGGTG CTT CT CT AG AATTTTGT CTTGCTTTTG AACAG CACATT AATG ATTT 
2 0 GGCTTACTCAGAGCTGAAAGC!AATCTTTTAGTGTATTTGAAGT^ 

GTGATTATTGATTACTGGTCCCATTGCTTCACTGAGATGAATCATCTTTCGTGAAACAA^ 

CACHTGGTT CCTTTCCCAGTTG ATTAAAAATCATTTATTGCACACT CATTGGTTAGG AGATGACCATTGC CG TAATATG CCATCT A 

GAAAAGGTCTTCATTTTGCTTTCCATTGCATGCA 

CAGCCCACTAGACCCTGCTCTACTGCAGTATAATAGAGCAACTGCTC^ 

2 5 TAGTGTTTGCTTTGCGTTGAGCATCTCTCTCTCAGTGCTGTTTO^ 

ACATTGTTTCAGGATTCrrTCATCATGGTGTGATCTCAGCAGTTTTGAATTCTT^ 
CACAGGCAAAGCCCTACAGCTTCAGCAAAGAGAGGGCAAT^ 
TGTTGAGTGAGAGTTCACTTGACCCACCCAAGGTCTTACCTCCTGCAAGGCAC^ 
CGGGGCCAGGCCAGCCCCTTAGCCACTGGAGACTGTAGCTCCTTCATATCT 

3 0 TGTCAAAAGCCTACCCTTCTCTCCCrCGCAGGTAGAACACAATT^ 

AATAATCTAGAAACTAATACTTCGGAACAAATGACTAATTACTGCT 
CTAAGTTATACTGCCTCCCAAAGTTTAGGCTGTCTGCAGGCTAACTGTTGATTTTG^ 
GTTTTGAAAGATAGTTGCATGAAACGTACTTTTATTTCCTTCAACACTAG^ 
TGATCCCTCCCTTTAAAATTGTTAACATCATTTCTTCCTCGTTGTTCA 

AACCAAATTGTGTTTATATAACATTTGGGTTGCAACTAGAAAATTTAAAGCT

GAAGAGTTTGGGAAGAGGATGAGAAACAAATGCTTAAGGAAAA^ 

GTTGTGCCATACTATTTTACTAGTGCAGTTTTTGCCAAGGAATATTTTTATTTCATAC^ 

TCTTATCAGAGATTTGTGTTCCTAAATTATTTAAATAATTAAATTAATAG 

CTGTTGAATTTGTTTTAAGAACAGATTTAGGTATGATATAGGTATATACATTTGGGA 

4 0 GTTTCTTAGCTGGGCATAGTGGCATGGGCCTGTAGTCCTAG CT ACTGGGG AAGCTAAGG CAAGAGGATTGCTTAAGCACAGGAGTT 

CAAGTTCAGCCTGGTGAACATAGCGAGATCCCATTTCTTTAAAAACTTTCI 
ATGCCGTTGTGCCTTGTGAATTTCCAGGAGGAGATTATGATATGTTGTGTTTCCCAAATTTA 

ACACCT AG TTTTTT CCCAG AACTTT AG TG TT CTG AG AG ACACAGTTT ATT ATT CAAG AATTTT CC AT AT AATGT AAACCTTCTGT A 
ATACTAATTTTGGTTCATAGATTATTAAAGAGAAAAAACAAAGTATGAAAGGAAACTATTAATTATTAACTA 

4 5 ATCACTAAAGCCTTTTAAATGTTAATAAAGAACACATCTCATTCTCTTTAATGACA^ 

GTTTTGCAAGTTTTTCAGCAATGACCCTAAATTTTATTTCTTGGAAACCACACCCAAATCCT 
TACCCTAGTCAATATAAC^TGCTTGTTAGCATCTTTATTATTTAA 

TTTTAAATTGGGGAGAATAAAGAATTACCATCTAATCTAAGTATTTTTTCTTTCTCTCCATATTTAGT^ 
ACCATGAAAACTCAGACTTGGAAATGCCTTCTTTAACTTCCACCCCCCTCATTGGTCACAAATTC 

AGAGACCAGACTGTGAAAACTCAAAAGGAAAATACTGTGTAAGTCTTTGGTTCAGGAATA

TTAATTAATGGAATATAGTTCCCAGATGTCTGATCCACTTGTATGTGCAT 

ACTCCAAAATTTCCTATATATGCACAAGTAGTTTTCATAAAACTTTTC^ 
GAAAGCCAGATTTTCAACCTCCTTTCCTCTCTAATTGTAAGCTTC^ 

5 5 T ATTT AATTTT ATTTGTTTATTATTTATT^ 

GGACCTCTTCTGACATC CCCAGGAATATTATATG ATTAGAAGCCAAGGGATGAGTAAG CAGTCATTTTTATTCAATAAAGCCTTAA 
TCAATTCAGAGAAATGGTAGAGAAAGTGTGAATTATCAGAAACCTTGAAAAGAAATGCACTGTTATTTCGAAGT 
TTTTAAAAGTTTAACATGTTTTGTAGTATGGGCTCTGTTTTCATGAATAGGGAAACCTGATTTG^ 
ATGCTCATAGTCTTATACAGATCATAGGCACTACATGTGTCTTCAGAGCCATTAAGGAAAGATGTTAAATCATT^ 

6 0 TCTTCGAGGTATAATATTTTGAAATATTTGAAAATATGTGATATGCATATTCTCTTTTTCTTAACGA^ 

CAC CCCAATTC C AT AT C CGG AAC AG AGTTT ATTGT ATTCAGTGG AAAAATGTT AAATGT AACAGGT AAAG CC^ 
ATTCTTTTGTATTGAGCCACTGCTTGTTTTCTTTTTAAAGTTTTA 

AGAACTCCTACACCATTCAAACATGCACTTGCAGCTCAAGAAATTAAATACGGTCCCCTG 
TTTTGGATTTCATTGGTTTATTAGTCCCATTAGGAGTTCTCTCTGATGCAAAAATACCCACTC^ 

6 5 CTCATCT AG T AGAAG ATCTG CAGG ATGTG ATCAAACAGGAAT CTG ATGAAT CTGGAATTGTTG CTG AGTTTCAAG AAAATGGAC CA 

C CCTTACTG AAG AAAAT CAAAC AAGAGG TAAAC ACGCAACCCTTTGG AAGCAACAAGCTG AG AATC CTGC CC CCTTT ATT C CTTG T 

GTGCAGCTTGATGTGTCTGCCATTGGCACTGTGCAACACCA^ 

CTGCCAGACTGTAGGCATC71TGAACCTTTCCTTTGGGAAGCCAAG 

AGAGTATCAAGAAAAGGGAGGGAAAGAGGTTTTTAATGTACTTTTAAACATTACTGATTATATAAAGTTG 

7 0 TGTGAAGAATTGGAATTTAAATT AAAAT AATAGTATCTTTAATTAATTGTATTAAAAAAAGGG 

CTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCAGATCACAAGGTCAGGAGTTCGAGATCATCCTGGCTA^ 
CGTATCCACTAAAAATACAAAAAAATTAGCTGGGTTTGGTGGCGGGCGCCT 

ATGGTGTGAACCCGGGAGGCAGATCTTG CAGTGAG CCAAG ATCGTACCACTGCACTCCAGCCTG^G AGACAGCAAGA CTCC GTCTC 
AAAAAAAATAAAAATTAAAATTAAAAAAGAGAATCTAACCTGCAAGGATO^ 
7 5 AGGTTAGATACAGTGCTTCTTGAATTTATTTTCATGTAAAGATAAGTATTTTTTTTTATA 
TAATGGAGAATTTTACCAGAATGTGATTAAAAACCAGTCTTTTGTCACCAAAAGATCTTTTATGTCACCAAAATAAAATCCTTGAC
GGATACAACACTACTAGGTGTATCATTAAAATATATTCTTAATAGAATAAACA 
GCTTTACAGGGGCAGTAAATTTGGAGCTAGTC^^ 
TT-GAGCTGATAGTGTAAGTTAGCAACATAGTTTTATA^ 
CATAGGAAGTTCTAGAAACAACAAAGCACTTTCTATATGCTTTT

CACACCACTGGGAAGGGGACAGTCTGAATACCCAACTGTTCACGCAGACCT 
CCAGCCCCCAAGTTGTTATGTGACACTGGTGCCTCATGAAATCCTGTG 
TAACACATCTTATTTCTAAATGTTCTCCTGCACATGATTTTT 
TGTATTAGAAACATTGGAGACTTTTTCATTATTTC 
AAAAACTATGTTCATTTATATATTTTCCTTTATTTGTCTTCTA

GTAGCATCTCATATGGTATCAACCACGTCGTCTATAGTAGATGCCCAGT^ 

TAAAG TTT AT AATTTGTGAAGGGTTG TCCTTATGTCTTAAGG AAAAT CAGAG CT AAT AACAGCTG CT AAGATTT ATTTAG T AGGT A 
CTAAGTGGTAAGTACTGCTCTAGGGGCTTTCGATGAATTGATTCAGT^ 
ATTAATTTTTATTTTTTTTGAGACGGAGTTTCGCTCTTGTTGCCCAGACT 
1 5 CTCCCTCCCGGGTTCAAGCGATTCTCCTGCCrrC^GTCTCCCGAGAAG 
TX3AATTTTTAGTAGAGACGGGGTTTCTCCATGTTGGTCAGGCT 

CCCAAAGTG CT AGG ATT ACAGGCG TG AG CCACCG CGCCCGGCCAGG TGGGT ATT ATT ATCATTT CT ATT ATGTCAACTGGAAAACA 
G AGG CACAAAGAGG CTT AGTCATTTG CTCAAGATTGT AT ACCT AAT AAGGG ATGGTG CTGGG TTGGAACT CAGAAGG TCTTGTTCT 
AAATTCCCTACTTTTCACCATTTTTTGCCAATAATTT^ 
2 0 TCTCTCCCCAGAACTTAGTAAATCTTCTATTTTGAAAGGCATGTATT^ 
CATGCCTAGAACTCTGGGATCCCAAAGGAGCTTTGTGTTGTGGTGA 
TCTCCTTTTAGCTCATAGTGGGGAGAATTTCTGGCAGATGACTC 
CTCCACAGTGTTAGAAAAATGTGGCTCTCATATTATCCAACCTCA 
CAAGCTCCGTTTTAATGGCACCAGCATCAGAAGATGAAGACA^ 

2 5 AGCCCCTTGCAGGTAATTACTATTCCAACATTGGTATTTT^ 

GGTGGTGATGGTAGTGCTGGTGGTCTGTTTTTCGTTTTCAAC^ 
CTAGAAAAATCCAATAATTCCrTTTTC^rrACTCATATGAGACA 
AATGAAACATTTCTTACTCTGACTGAGACTAAGATGTATTACA 
TAT AT CAAT AT AGTTT ACTTT ATGTT CCAAATTTTTT ATTTTTT ATGCTG 

TTTTAAATGAAGTCTGAAATTAGTCAGACAGAAAATAATTGCAATGTA

TGAGGCCGATCTGAAGTTGACCATCTGCTCTCAGGTTTTA^ 

ATGAAATATAACCAATTCCAAATTAATCCTGAGCCAAATGTAAATAAGAATGCA^ 

TCTGACATTAAATAGCCCAATACCCAGCCACCTGCAGAGCCCTCTCCrrC 

CTAGAGGAAGTGAAGAGCCTGGATTCAGAAGCTGAAGGTCATATTr^ 

TTTGAGTCTGCCCATTACCTTAAAGGGATTTCCCATC

GTAAGTATTTGTGATGCnTAGTTTTTCAGGGTTTTTTG 

CTGG AGTGT AGTGGGGTG ATCT CGGCTCACTGCAAC CTC CGCCTCC CAG ATTCAAG TG ATT CTT AT G CCTC AGC CT C CCAATT AGC 
TGGAATTACAGGTGTGAGCCACCXX5CAACCCAGCCGTTTTTTTGTATTTTTGGATAGAG 

4 0 AGTG CG T AG TATTATTGT AAAAT AAGGATTG CAAGCACATTTTGTTT AATT AAATC AG ATTG CTT CCTT C CTGG C CTCT CT CTG AC 

AGCAATTGTTTTGGTAAAATGGCTGTGGACTGGAATGTGAGAG^^ 

AGTGAGGCTGGAGTTGGCCCCGTGTGGAGAAGACTCAGAATACCACAAAGAGGCCAGACCCAGTC^ 

ACTTTGGGAGGCCAAGGCGGGTGGATAACTTGAGGTCAGGAGTTTGAGACCAGCCT 

ATCCCAGCTACTCGGGAGGCTGAGACAGGAGGATCGCITGAACCCAGGAGGCAGAGGCTC 

4 5 CTCCAGCCTGGGTGACAGAATGAGACrCCATCrrCAAAAAAAAGAATATCACAAATAATTGGGAGATC 

CCCAGACATAAGAGGAACCCAGAAAGGGAGTTACAAAGATTTCAAG^CAAAGGGC^ 
TTAGGTCCAGAAGAAAAGCTTTTGCTTTTTTTTTTTCT 
GAATTCAACTACTGCTATTCGTGTTTTAATTAAGTTAGATTAAGTTTTG 
TTGTAGTAGTACAGAAAAAGTCTGTTCTAAATTTCAAAAAGCC 
50 AAACCCATAGTTCAGTATTGAGCITTATCAGTATATTTGTATTTTATTTAATGGAAAT 

GTTATAGCTCTATTTTATTAATCTGTATTCTCCTGTCTATCAGTGAGTCATATTGGCACTT^ 
AGATTAAGTTGTGTATGGTGAAGGTAATATTTGTCAATTCTATCAAACT 

ACTCCTGTCTAAACCATATAGTTTACTGGTTTACACTGACCAGATATATTTTAATAATCTGCTATAAAGC 
TGCCTCTAATC^CAATACTTTGGGAAGCCAAGA 

5 5 GACTCAGTCTCT ATT AAAAAAT AATT AGCTGGG CTGGG CGTGGTGGCTCACGCC^ 

GCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCG 

GCGTAGTGGCGGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCATGAACCCAGGAGGCGGAGC^ 

AGCTGAGATAGCGCCACTACACTCCGGCCTGGGTGACTGAGCGAGACTCCATCTCAAAAAAAAAAAAAAAAAAA 

ATAATAATAATAATTAGCTGGGTGTAGTX^TACATACCTGTAATCCTAGCTACITGGGA 

6 0 CAGGAGTTCAAGGTTACAGTGAGCTATGATTATACCCCTGCACTCTAGCCTGGGAGACAAGACCAAG 

TTTTTTAAGGCCAGCCGTGGTGGCTCACACCTC 

GAGACCAACCTGGCCAAC^TGGTGAAACTCTGTCTTTACTAAAAATACAAAAAATAGCCAGGC 

CAGCTACITGGGAGGCTGCGGCAGGAGAATTGCTTGAACCCAGGAGGTGAAAGTTGCAGTGAGCCC^ 

AGCCTAGGCAATAGAGTGAGACTCTATCTCAAACAAACAAACAAAAATTT 

6 5 TACTTAGTGAATGTTGTTCTACATACTTGCCAATTGGGAGGAAAAAAGAAATATTCACT 

TCTCCTTCATTTTATGTCATGGAGTTCTAGCTTCTGATATCACAGAAATATGT^ 
TATTGGAAAATGCATCAAGTGACTCTAAATATTTAAACATACATC^^ 

G ATTTITI ATGCAATTTGGTAA rrrTGTTTGTTTGAAACACAG TG CT ATTTC^T AT ATTTCT ATTGAT AATCCAATTTG AC CTAAT 
TAGGCATATTTCTCACACATGGTATTATTGAACATATTGCACATTTA 

7 0 TCTATAATTTTTGTAATTTGTACTTATAATAAATCAAATATTAAGT^ 

TTTTTATTAAAAC^GCAAAATAGGAGTTTCAAAATATTTTAAAATTTG 
TGCAGTTGATTGTCTATCCCTCTAAACTGTCTAAATAACTTAGGTTCAAGAACAGTTTATTG 
TGTGTTTTATCATTTCAGTTTTAATCACCCCGGCTTGATTATGGCAACAACATTT^ 
CATATACTGGGCCATTATGGTTTTGCTTCTAACGTACATATAAATGCACATO 
75 TAGaAaTGCCGaGAAGC^ATGTGT^GACATTT-T 
GTCAACAATTGTTATCATTTTGCCATATTTACCTCAT

GGGTGCAGTGACGCAATCTCAGCTCACTGTAACCTCTGCCTCCCAGGTTCAA^ 
GACTACAGATATCTGCCTGGCTAATTTTTGTATTTTTAGTAGAGATA 
ACCTCAAGTGATCCACCTGCCTCGGCCTCCCAAAGTGCTGGGATTATAG^ 
5 TTATOAA glTTTTATTTTTA TTTTTTAATTTTTTT 

GACTACAGGCGCTTCTCACCATGCTAGCTAATTTTTGCT 
ATCTCCTGACCTCATGATCXXSCCCGCCrTGGCCTACCAAAGTGCT 
TTTTrTATTTTTTTG AGACAAGGCCTTGCTCTCT 
1 0 TGGACTCAAjGCAATCCTCCTCCCTCAGCCTCCTGAGTAGCTGGG 

TTTTTTAGAGATGGGGTCTCCCTGTGTTGCCCAGCCTGGCCTCAAACT 
GTGCTGGGATTACAGGCAGGAGCCACCCTGCCCAGCCTGAAGTATTTTAAAGTA 

TACTTCCAGATGTGTCTCT AAAAATAGACA I ill 1 ATATATAACCTAATACCATCATTACATATTTAACAATAATTCTTTAATCTT 
CTCATTCTCAGGTATATTTAAATTTTCCCATTAATCCAA 
1 5 CTCTTAAATCTCTTTTACTCTCTAGTCCAGTGCTGTCTAATA^ 

CCACT ATG AT AG C CAGTGGCTTTTCAGCACTTGAAAT ATGGCAAGTGCTG CTGAGGAG CAG AATTTTCAATTTGAATT AATTT AAG 
TTGCTTT AATTT AATAGCCACATTTCCTTACTCT 

ATGGTACATAGACACCAAAATCAGGGTGCTAAGTCTGTCCATTGTTATGTGGTTTGTC^ 
TGTTTTGAGACAGAGTCTCACTCTGTCACCCAGGCTGCAGTGCAG 
CAAGCTATTCTCCTGCCTCAGTTTCCCAAGTAGTTGGGATTA

GAGACATGGTTTCACCATGTTGX3CCAGGCCAGGCTCAAACTCCCAACC^ 

GATTACAGGCGTGAGCTGCCACACCTAGCCAA.GACTCAGTGAATTTT 

CTACAGTGCTTAAATAGAGGCTTTTATAAGCCATTATGTACACTCATGTTATGT^ 

AGGAATTTGCAACAG CTACATAATCTTATAATTG TTATGTTT CT/T rr AT ATTTTC ACAGTTGTTTTAATriT AAATTG T AAAAG CA 

2 5 AACAGGATATTCTTATAATAAAAATGTTCTGAGATGTTTT^ 

GTCATATCATTGTCAGTATTCAGCCTTTTCTCCACATTGTGTGGGGTAGAGGATAGAGTCTAGA^ 
GAGGAAGTAGAGAAAACATTCAGCCTAGAAAGCGTGCATTTGTGGTAra 

GTAGTTACCCGAGAGAGACCGAGGCAGACGCATAGCTCCCTCCTCAGCAAGGAGAAAAGAGTTGCCTGCTTTCCT 
CCATTTCCTTTGTTTTGAAAAAAG(IACATTATGCT 

3 0 TTTAAGTTATAAGCCTTCTGAACTTCTGTAT 

ATGTGTTC^TAGGCTTTCTACTCACTCACTTGCCTCTTTTTCGTAACCACA 
CTTGCCCATGGCCTGACTTTTTTAAAACAAGGAGTTTAAGTTAAAGAAATTTCTCCAGCCAT^ 

TATCAAAG AATTT AAAAATGTTG CT AGCAG TTG AT ACTC CCCTCAGCTTT CGGTTCAG CGTGTTTT AG ATTG CTCCT ATTTTG TT A 
TTTGTCTATGAACTTTTATATTTTTCAAAAAAATA^ 

3 5 CATGGGGGCAACAGAACCTGGGTTCCAGGCTAGACTCCACTGTTTTCAAACGGCAT^ 

G CAAAGT AT AAGTTTGGG AC CATGG AAT ATCCTTCACT CATCACACAGGTT ACTGTGAAG ATCAAATG G AT AT AAT ATTG CTTT CA 

AGATGGCGGCA.CACTTGCCACTATACACAGGTGATATTATTAGGCAGAGTCGCCACA 

AGCCCTCACCAAAGGCCAGGGTGGTTTTTTGTTTTTCGGGTT^ 

CT ATTCCCTG T CC CACT CTT ACAG AACTGTG TCAG TGGGTCACCAAAAG AG AG AG CTG ATT CAACTG ATTC CTTG AAATAAAGGT A 

4 0 GTCAC^CCTCGCGGTATGATGGAGATGAAGCAACTGATTTACTGGTTTTCTCTATT^ 

ACATGGAAACCCTCTGCAC^CGACAGAGATCTTTCCGAGCCTTTTTCATCTAGTGTGCTC 

CACACTCATTTGGTTCTCCCTTTTGGTCACTTTTTTCC^ CAGAAATTCACGTGG CTGTCAGTAGTAATTCAGTGAC 

CK3CTTTCAAAAAATGTTATAAATTAAGCAGCTTTTCTGTGT^ 

CTATGTTGAGCTCAGAGTCAGAATGCCCTGCCCCAGACATTTTGAGATGTGAATTTTTCATT 

4 5 AGAAT AAACATTCTGAGATAAAAATCTTCCTTGGG CAGGCCAG CAAAGATAGAATTGCTTT AG AG CAATTTGGCACTTGGTCTTAA 

TCTTAATAGTTTGGAACTCATATGTGACTAAGGAAAAGGATAT^^ 
AAAGTCTATCAACX5TGAAAGGTGAAGAGACCTCTAGAGACATCGGGGACAAACT 

ATTG TTAGTT AGGCTTGGGAAG TGG AGT CATTGGTGG ATGAGG CAACTT C CAAT C ACC AT CT CG ATTTTTATTTCC CT AACTTTT A 
AGATTATTTTCTATAGAAAATCAGTAGTGCn'GAAGTTGGAGTCA 

5 0 G AAAAGTTT GG T AAC CATT AAT CTT ACCTT AGCAAC ATTTCAGTTT AAG CTT ATCTCTG ACTTTCC AAAG ACAACT AAAG ATTT CT 

ACAGTCTCTCTTATAACTGATCCATTTTriCAAAAGATGTCCCTTTGGGA^ 
GTATTTTTTTT'TCAGATAGGTGAAATAGAATAAGCAGAAAATTCTTATTTTTTACT 

TAAAACAAAACACAATAAAAACATGATACCATCTGATGAGTTGCTTTGGTT^ 1111 rTTT TTTTTT T 

TTAGTTTATACAAGTTTCTTATTAGGTCAGCAAAT^ 

5 5 AGTTGCTGTATGAATATAATAATAAGAACCGTTTTTGTTCAGCTTGATGAAAAAACTATTCATGTTCT 

CATCTAATTCTGACAGAAAACCACCrGTGGACATGGTCTTGTTT 

TTT AAAGTG CTTT AACAG G AT ATCC CAGTT AAAG CTTTGACAGACTG TTT AT AGTTCACTT CTG CC CAGGGG CAATTAAAGG CACA 

TTACAGGATAAGCGAGATGACTCCCCGGCCACAGGAGGAGGATGTGTGGGTAGGATTATTAATGTTGAAATC 

GACATGTCTGCAAGAGCACCATGGGTT/AGTAACTTAACAAGAATTC^ 

6 0 AATGAAACTAAAGTTTAGGGAGAGAGAACAAGGAGAATTTGCAAAGATATGTATGACAGTATTTAAAAA 

GACATTAATCTTCCTTTGTCAAAATAATTCTTTCTGCATGAAAAGTTGGCATGC^G 
GGAGCGTTTCTTTTTTTTTGTACCAAAGTACTCAATCTTCTGA 
AGTTTGTTTCATTTATTGTGGTAAGATGTGCTTGTCTTGTGCTTTCCTGGTGCAG 
CTTITTGTTCCTACACACAGTGGTTCTCGTACATGGTGGCTACCACATCCT 

6 5 AGTC^TCirGGCTGCCCTAGCTCTGAGCTTCAGGGGATATATGACTCAGAAA 

TATTTGTTTCTTGGTAAATTCTCTTTTGACTTAGGAGAAGCTAACTATT^ 
GTACCTTCITTATATTGATCTCATTTTTAAATACCTAGTCCT 

ATCTCACAC^AACACAGTCAGATTTCAAGTTGAAAATAGATTCAGTTGATAATACTTAGTACTAGATT CTCCT 
GAATTCTGAGTCAACCTAAC^ACTATATTACAAAGGATCAGACCTTGACTTTTAGGCCA 

7 0 AGCACTTTAGGAAGCCAAGGCAAGTGGATCACTTC 

C CAAAAAT ACAAAAATT AGCCAGGTGTAGTGTAGCACCTGCCTGTAGTCTG AG CTACTGAGGAGGCTGAGGTTGGAGAATCAACTT 
TAACCCCX^ATATGGAGGCTTGCAGTGAGCTGTGATCATGCCAGCCTGGGCA^ 
AAGACACAGAACTTGACTTTTAACAC^AGGCATAGTAATAGCAAGC^ 
TTTAGAAAATTAATTCTGGCTGGTTGTAGTAGCTTATCCTTAATCCTAACCCTATC 
7 5 CAGGAGTTTGAGACCAGCCTAGGCAACATAGCGAGACCCCCGCT 
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GACACACTCCTGT AGT CC CAGCT ACTCAGGAGGCTGAGG CGGGAGGCTTGCTTG AGGC CAGGAGTTTG AGGG AG CTGTGATCACGC 
CCCTGCACTCCAGCCTGGGCAACAGAATGAGGCCCTATATCTAAAA 
CCAGCTAAATAGAAAAAAAAAAATTCTAATAGGCCATTTTAAAGACTTTGGAAAAAT^ 
l w rrCTTTCCTGGTTCTGA.CATCTCCATTCTGACCTTCATTCCCCTTCCCCTCCA 
5 TCTTTATGTAAATTTAAATTTACCTACAAGATATTATGATATTAAGATATTTCATTCTC 

G ATCCAAGTCCTTGGCTC CATGTGCAT CT AAT CCTTGCCAT AAGCC CTGCATCGT ACTTCAT ATTCAC CATATTG AACTATTCACC 
CTCAAGTAGTCACTATTAAATGTCACACTGTTGTAGCGACTACAAATGCAAGCTCAGTAACT 
TTTTAATTGGTTTCXrrGTGTCAATTTCATGTTT 
ATGCTTC^GGCTAGAGAAACATGATTTATTCAGAACTATTT 
1 0 ATTCGGAATAGAAGGCAATTCAGAATACCTGTCTGTTATCTTTC 

ATAATCCATCTTGCTATGTAGTATTTCAAGTTATATGTTGAGCCTCTGAGCT^ 
GTTGACATCAGTGGGGAGATGGCACAAGCTTTGTGGCCTAGGAAAGTTTAAA 

AAACTTTGTTGCAGCAACTAGGCAAGGAAACCCTACCACTGTACCCATACCTTGTTTTAGTAGAATT^ 
TCTGGTGACTCTCACACCCCATCTCAGCACTTTCCTGGATTCO^ 
1 5 ATTGTTTCTGATAAGAACTCGAATTAGAGGCAGCGTCTGGTACTCACCAGGGGTGTGCT 
TTTCTTACCCAGGTAACCTTGAGTATGGCGCCTAAATCCTCTCTTCCTTTCAAATT^ 
GTAGCTATGCCACAGGTGTTTTGTGTGAGAATTAAATGAGAGGAGTTATATCAATAA^ 

AGCTTCCTGCCTCTTTCCTGCTATCTCGCTCCCTGGATGACATCCCTGCGTTTCC^ 
20 ACCCTTTCCCCACAGCATTGCCTTGACAGTTCCTCAGTGTCCCATCCCCTCTTCCCATGGAC^^ 
CCTGTTATTATTCCTCCTGTTTCACCTGTGTCCATTCTGTCTCATAAAGGACACTGTAGGCT 
TCTTTTGTCCCATGGAAAATAATCCCTCCAGATAGCAATACCAGGAATAGTGCCTTTTCTAACCCACTG^ 
CAGCCACCAAATCTTATATCTGTTTGTTTCTGTTCCCCTCTCTGC^ 
CTAGCTGGGGCACTGTGCTTTATCCTGGTGTTTCGTGTGCAAATCT 

2 5 GGGGATTTGTTTCAGAAC C CGCAAGG ACACCAAAGT CCAGGGATGCTCAAGTCC CTGAT AT AAAAT AGCATTGTGTTTG CAT AT AA 

TCTACACACATCTTACCATATAATTTTTTTTATTATTTGTATA 

CATAGTGGTCAAAGTCGAGGCTGTGAGTGTATTTATCGTTGGAATAATGTACATTGTACCCGTTCATTCCTG 

TCTCTAGATTACTTATAATAGCTAATACAATGTAAATGCTATGGAAATAGTTGTTATATTGTTAGGG 

CCTATACATGTTCAATACACAGCAATTATTTTTTTCTGAATATTTTTGATCT 

3 0 GATATGGGACCCACAGATGCAGAGGGCCAACTGTGTTTATTTTTTCCAAATTTGTT^ 

TTTAAACTCrGTTGTCTTGTATTTCrTTTCTAATCATCT^ 

TTTGAGGCAACACAGAGGCATCGCAGGAGCTGGAGACAGCAAAGTTCAAAGCCAGTCC^ 

AAGTTACCTCCATTTTCTGTCTCCATTCTCGAGCCTGTCTAGAGAGAGAATGAACCTGCCTCCCGAGG 

GAAATAATGCATGTGAAATGTGAAGTGCCACTGTCCCTGGCATGGAGATGCTGTCCTTAGATATTAGGTCC^ 

3 5 GTGTATCTTTAAAAATTAAACTATACTGGCTTCATGCCTGTAATCTCAGCACTTTGGGAGG 

CAGGAGTTCAAGACCAACCTGGGCAACATGGTGAAACCCTGTCTCT 

CrGTAGTCCCAGCTGCTTGGGTAGCTGAGGTAGGAGGATCACTTGAGCCCAGGAGGCTGAGGCTACA^ 
ATACCGATCAGCCTGGTATGACAGAGCATGACTCTGTCTCAAAAAAAAAAGGAAAAAAATTAAAC 

AGT AATTGATGAAAACATATATATGACACTTAGC AGGTATTGATCAATTCCAGTGCGTG CATGAGCTGCTTTGCACTGTGG CTACG 

4 0 GCACTCAGCAGTTTGCTTAGTGAGTAGATGGCCATAGCTCACTTTAGGTATCTGGAGAATAATGCTCATTCCT 

TAAAATTGCCTAATAGACGGAAGGCATTAAAATATGAAGATGGGATTATATCTTATGAAGCTGCTTGGAATGGGTT^ 

ACAAATAATTTCTGATCATCCATATCCCTTAAGAACACCGGATTTCTGGGGGCC^ 

CTGGTGTCAACCACTTGCCATCTGTTGGTCAGTGCTGGCCCTGCTGGGTCT 

CATCCCTTTCTCCATCAGCCTTGTAGCAGTACCTGGGAACCTGCATCCTGTGGAAAGATGGAGGAGCAGATC 

4 5 AGCTCGTAAATACGTGAATGCATTCTCAGCCCGGACGCTGGTCATGTGAGACATTTCCAGAAAAGCA 

TCAAGTTGACITGGGATATATCATTCCTCAACATGAAACTTTTCATGAATGGGAGAAG 
GAGAGCAGCACCAAGTGCATTTAGTTGAATGAAGTCTTCTTGGATTTCACCCAACT 
TACCTAAATTATTAGGTAATGAATTGTAGCCAGTTGTTAATATCTTAATGCAGATTTT^ 
TGTATTTTAAAGGATCCAACAGATCAGTATTTTTTCCTGTGATGGGTTTTT^ 

5 0 CACTTTTCTCGATCACTAAACATATGCATATATTTTTA 

ATTTAATCCAGATTGTAAATGCTCATTTATGGTTAATGACATTGAAGGTACAT^ 
TAGCTTGCTTTAAAAATTATTACTGTAAGAAATAGTTTTATAAAAAATTATATTTTTA 

ATG AAAAACGTTTTTTGCTG CTATGGT CIT AG CCTGTAGACATG CTGCT AGT AT C AG AGGGG CAGT AG AGCTTGGACAGAAAG AAA 
AGAAACTTGGTGTTAGGTAATTGACTATGCACTAGTATTTCAG 

5 5 ATACATTTGAAAACTTGTTTGGGAGACTCTGCATTTTTTATTGTGGTTTTTTTGTTATT 

TTCTTTTTTGGGAGATGTGTGTTGTTGATGTTCT 

TTGATCCGCATCCCCTGTGGTTTCTAAGTGTATGGTCTCAGAACTGTTGCATGGATCCT 

GTGGTTGATAGCCAGTCACTGCCTTAAGAACATTTGATGCAAGATGGCCAGCACTGAACTTT^ 

CCTTGTAGCAAAATAAAGATGTGCCCTTATTTTACCTACTTTTGTTTTCTCT 

6 0 AAGAAAATGGAGCAGACTCAAAGCACTTTCCTCTAATAATTGCACACCATATGCACTTCGTGTGA^ 

TGGAGAAGAAAGAACAAGCAAACTGAGTTAAATAGTTCACAAACAATCACATGGACAATGGG 

TCAGGGTCTAGCGGCATTTGAGTGCGGATGCTGTAAGGGGAATTCGTT CACAAATGTACTGAT ACCACAGATGTGGG CAGGGGCTA 
GGAGAGCAACAGGGTGCAGGCCTGAAGGC^CCTGGAAGGCCGCCTGGAGGGGGACAGTCAGGAAAGCAGGAG 

6 5 TCCTATCTCTGCCTAGGGGTCCTCACTGGCAGAACCAACCAGAAGCTGGAGGACAAGGGAGCTCCC^ 

GCCTTCAAGGGTATGCGGAGTCGTTTGTTCAAAATAGCCCCATTTCACAGATC 
CTCTTCACTATGCTGTAATATCCCTTTTAACATAAA 

GCATGATACACTTGGTGTTCATAGAGCCAAGGATTTGTTACGCCTAATTACTAGTTTT^ 

CTG CTGAT AG ATG ACGTGACACGT CTT CC CAGTG TTGGGAAGCT CTTT AT ATTCATG CCCT CTC CG AGT ATTTTTC C AAGCG CTGA 

7 0 CTCTG CCACTTACT AG GAGTTTG AACTTAG CTT AT CTGGG CAATG AGTTCTTTC C 

CACAGTGAAACrTTTATTAAGTAATGCACTATACAAATGTTTGTTATTATGT^ 

GTGTGTTG AATGC CC AT CTTG CTGAT AATGTG CACTG C CAATT ACACAGTT AT ATTGAG AGG T CACTT AATT ATTAATG ACAAG AG 
TGAACTCAACATCTTACAGAACTGAAATAGATATTTATATT^ 
TCATGAAAAGGAGTGTAAATTTAGCrcTTATGGTCAAATACrCACACA 
7 5 TTC7GATCAATTTTGAAAATCATCCTGCTCTCTACAGCAGG 
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GAJUlGGCTGCCCTTCn'GAGTTACrGTGTG 

TGTGTCCCCGACCTCTAGATGGGACTGAAAATTa^AGATAAGAGAGATGC 
CAATATTAGAACTGAAAGGGACATTAGATATCATCAGGGCT 

TTTGAAGCCAG ACCAAT CTTTTTTTTT LI"l"l"ri'AAATATAGGTGGTGTCTTGCCATCTTGC CCAGACTGGTCTCAAATTCCTGGGT 
5 TCAACCAGTCCTCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGC^ 
TCAGACCCTACCCTTTACCAGCTGTGTGACTTTGTATGTTTT 

TGGAGAT AAAAATGCATTTCAAGTCIAAGGTTGTT ATTG TG AAGATTCTGCAAG T AAAT ACATT CTTGCATCT CTGT AAAATGTTG C 

CGTGCCTGTTGATTAGATAAGTCACCATGATGACTTATGATCAATTAGAGAAAAA 

CCTCCGAGACCCATGTCCCCAATTGTGTGTTAAAACTATTCTATGTTAAAAACTT 
1 0 ATTACTAATCATTCTAACATATTTTAGTTAA 

AAGCTTGCTGCTTTCTAATCCAAAAAAAATGTGATGTGCAGGT 

TTTCTCCTTTGAACCATTACTATTATAAAAATCAATTCTTCCIT 

ATACAAATCTCTTTGTAAAAGTAAAGTTTAGCGACTTGGGAGGCTGAGGTGGG 

CTGGACAACACAGAGAGACCCCATCTCTGAAAATAAATAAATTTCA^ 
1 5 TGCTCCCTATAGCAAAAGGAAGCTAAACTATAAATCTTATGGAACTTGTTAGTGTTC^ 

ATTAATATTCAGTGAAAATAATCTTGGCCAGGTGCGGTGGCTCATGCCTG 

CACCTGAGGTCGGGAGTTCGAGACCAGCCTGGCTGGCCAACATGGCAAAACCCCAT^ 

GTGATGGTGCGTGCCTGTAATCCCAGCTACTTGGGAGGATGAGGAACAAGAATC^ 

CTAAGATTGCACCACTGCACTCCAGCATGGGCGACAGAGGAATCCT 
2 0 CT AATATGGAAATGAAAG AGGTGGTTTGATATTTGTCTTTAGACT AT ATTT CTTT AATTCAACT AGT ACATG TCG AGG CT CT ATTT 

TGTGCCAGGTGCTGTGTCAGAAGTTGGGGTTAGAGCTCAGAACAACATAGA^ 

TGGGAGATACAGATGAGTAAAGGTTACTACAATATAATCAG 

AGGAAGAAGACAGGTTTCCTACAGAACTGACACCTAGGCAGGACTGACAGATAAAGTCAGGCAA 
AAGATGTGACATCTTAAACTTGCTGAATAAAAACACCCAAAT^ 

2 5 CATTCAGGCAGTTCTACTACCTGAGTATATCTAGTTAGTAAATTTTTTCC 

CAAACTCTCCATCCCGTTTCTACCCAGTCCCCGCAGATAACTTTG^ 
ACCTGCATGCCTACTCTGTCTCTTGTTAGCATGGGTGAGCATGCCCAGCTGCAGTCAT 

ATTCCCTCACATTAACATGGTCTACAACTTCC^GCAGGAAAGACAATGTTCCCT 

3 0 TTTT CTTTTTTTTT CK5GGGAGTGGGGACAGAG 

GCCTCAGCCTCCTGGGCTCATGTGATCCTCCTACCTCAGCTTCCCAAGTAGCTGGGACT 
ATTTTCGTACrTTCTGTAGAGAGAAGGTTTCACCATGTTGCCCAGGCTGGTCTCAAACT 
TATATACCCCATCATACCACCCCATTTATCTGCTCCTCAGCAAAACTGCTTAAGGGAAT^ 
GGTTGCCTCCCCTCTGCCCACTCCACATGAACITC 

3 5 ACTTCX3CCAAATGCTGTAGTCACTTCTTGGGCCTCATTTTACTTTACTT 

TCTAACCATTTTTCT CTCTTGGCTTCAGGCATGACTCTCrTGG TTTTCCTG CITCTCACTGGTTCCCTCTGTTTGCTGCTCCTCCT 

CACTTTGGGCCTCAAGTGGCCCTTTCCTCCTTCTCTCT 

ATCTCTTCACCCCTGAGCCCACTCTGAAATTCTAGACT^ 

TTAAGTATGAACAAAACAGAACTCATAATCATTTAAATGCTTT aT TCTCAT 

4 0 TTTCTCAAGCCAAAAATTTGTCATCCTTGGTTTCTCTCTTTACCTTATTA^ 

ATTCTATCTGCAAAATGTATCTTAAATGTGTCrGCTTTTTCTATTACCT 
GTGCTCTCAATATGCTAGGGAAGAAGAAAGAGGGAGAGAGAAAA 

GAAAGAGAAACCTGGAGGAAAAAAAAAGATACGAGGCCGGAGAGAAAAGCGGTGTAAATAGAGCTTGAACTT 
TGGGGAGTTATGGAATGGTTTCTGCAGTGATATAAAATTACTTATTTATAGCTGATTCATATA 

4 5 TTCATAGACAGTATAAAATTAAACCTTAGGTTAGOVGGGCCAAATG 

ACATAGTGCACTTCCTACTGCTTCTTAATTTCAAAAGAATGTGCTTTAATCCCAGAAA 
CGAAGTTCATAAGGGCTGTTTTCACGTTTTATAGTCCATA^ 
TTTAAGAGCAGAATATTATCAACAAATAATTTCTCTAAAAGTTTCCT 
TATGGAATAATCTGATTATATTTTGATATTTTGGTCATTAAGAAATTCT^ 

5 0 TTATATrri'CrfTTACTCTTTAATCTAGAATGCATTTGTAGAGTCAATGATCCT 

GATTCTTGACGTCACCCTCGGATTCTGATTTATGAGTCAGGTGTAGACCTGGGAAGGTTTAACAAGCTT 

AG GTG GT CTG AG AGGACCACACCAAAACAC CCTG C CTT ACT CCC CCAAC C CTGTGTTTTCTGT CT AAACACT CCC CAAT AG CTTTC 
AATT C CTTTGGGGC CCATAATTTTT AATT AAAATG CCCATTTGT CTAT AG AGT ATT AGAAAAATAT^ T A 
GATTTTCTTAAATCCAGGGAATTATTATGTTCCAGATGTTGGTACTAATAATGG 

5 5 ATTTTTATAATATATGGGGAAGCATAATTTTGTATATATCTAAACCTCTTTAGAAATATGGTTCAATTGC^ 

GGGAATTTATATCTCACXAAGTTAAGCXSCATAAAGAAA^ 

ATTTTCATTCAGAAATCCTTGGTTCTAAACCAGTCTCTAATCAACATGCCAAAAAC 

CTTTGGGATTTCTATTTAAATGGGGTAAGTATTTAATCCTGTATATAGTTACAAAGCCCAGGGAG 

TTGAAAGAAAGCTTTAGTAGGCCAGAGACATTGGAGTACCAAGGATATAATTGTG 

TTTCTAAAGAATACAAATCCATCCACCrATTCTTTAATGGTCCTAAATAGCTA^ 
CCAGTTTCCGAGTCAAGGCX5CGCAGTGTTCAGTGTATGACCTGAAGCCTTCTGGA 
TGGCCTTACTTTGAACATCCCAAATTTATTTCCTTCCCCATTTCAAATATTCCGAAATG 
TAAATACGGTGATTGAAAATACGAATGAATGCCACTCCATCATGAAATTTCACT 

6 5 AATATACAAGTTAAAGAGAACAACIAGGTAAGAGATTATTTGTCAAGAGCCTGTAACrC^ 

GCACCTCCTCTGCAAGCAACACAGAATTCAAAACAATTAAATATC 
TAACATATATAAAGTTACTGGCAGAGTAATATTTTCTTGGATAGTCAGAG^ 
TTTAATAACCATATTTCTTTAACATATTCTTTTTCATCl'1'riACCATT 
TCTTTTGTTGTAAACnTATTAAACCCATAGAGTCTTATAATTGATGATACC^ 

7 0 GAAATTrCAGAAAGGACCCCTrAGGTTTC'rTTTT T TT ^ 

GTGCAGTGGTGAGATCTCAGCTAACTTTCATCAGTAATGAAACCAGCAGGTGTTCTGCT 

GAOVCTCAGAGCCACATGCTTAACTGTGGCTCATGGGTTAATGCATGTCT 

GTGTCTAACCCAATGCCCAACATGACATAAGTGCTCAGTAAATATTC^ 

7 5 ATC R CCTCAG CCAT AGATTCCCT CCAACT CTAATTAT CTCTG AG T AAATGAAG T CATCAAAAT ATGGGAAACTG ACCCAGG AG AAT 
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AGGGTCTAGAGG CAGGGAACCT AGCX3CTGATTCACAATTAATTCCT AG AACT AAAT CAAAAGG AAAACC CC CAACTTG C CATGCCC 
AACTATCAAAAGGACCAGAGGCTACTTCCCTTTACAACCCCCC^^ 

TGGTCC CCTT CTGCAACCAAT CAGGTTGGTCATGGGTCAAGT CTTCATTTGCAT AGGAGT AT AACT^ T AT AT ACTCT ATTTT AT 

CCCCTCCCACAAGCAATCAGACTGATCATGGGCCACCACTTCATTTACA 
5 AGGGTATTTAAGTCCCAGAAAATTGTGTAACTGGGCTCCTGAGCCACTTGCT 

TTTrCAATAAATCTCTGCCTGTGTTGCTTCATTCrn-C 

AACCTGGA(^CCCTCTACCGGTAAOtfUU^CTACAGTGATT^ 

TTGATGATAACATTTGCTAAACACCACGTAAAGGAAGGGAGGCATTTCCAC^ 

TAAAATTTCTGCTTAAATGGGAGTCGAAGGAACCCAACAATACTCAGTTGTGTCACTTA 
1 0 TGTATGTCTTACAGCTAGTTCATATCACAAGCCAGCCGTTTCATCATGCAAAAGTCTAAAACAA 

CTTACGGAAAAACCAATACTGTTGAACAGAGGTGGACAAATTATAGCTCCT 

CAGTTTTATTGGAACACAGCCACACCCTTTTATTTCCCATATTATCT 

GGAACAG^GACCATGTGACCTCCAAAACCTGAAATATTTACT 

AGAAAAGTAATCAAAAATCACAATGGCAGTATTAGTGGTGAAGATAATGGATTCT 
1 5 ATTTTTCTTTTCCTTCTTTTTTTT^ 

GGCTTACCGCAACCTCCACCTCCTGGGTTCAAGCGATTCCTC 

GCCCAGCTAATTTTGGATGTTTGTAGAGATGGGGTTTCTCCATC 

CTGCCTCGGCCTCCCAAAGTGCTGGTATTACAGGCGTGAGCCACCGCACCCAGCCGTGATTTTTCT 
GGCTACACACCATGACTGCTTCTGTTTGGGAGTTGAGAACATCTG^ 
2 0 ATGTGGTCAACTTCATGTGACTCAACCTAATACTTTGGCITCCTTCAGTGTGT^ 
CCAGCTACCTCAGTGATTCTTAATCTTAGCTGTGTATTAGA^ 

ACCC^GGACCAAGTGAAGCCGAATCTCAAGGGGGAGAAACCAAGCCCTTGGTAATTTTGGAAAGCACC^ 
CAGTAAGGCTTGAGAACTATGGAACTACCAAGAGCTTTTGGCAATACGCATGTGCTAGGTGCTA 

AAG AATGTGGACAATTG T AAT AAATAATTGGGG CAAG CCGG CAT AG TGG TTCTCAAACATTAGCATG CAACCAAT CACCTGGTGGG 
2 5 CTTCTGAAAACACAGATTTTTGGTGCCAACCCCTCA 



HUMAN SEQUENCE - mRNA 

GGCGGCAGCGCCCTGCCGACGCCXK3GGAGGGACGCAGGCAGGCGGCGGGCAGCGGGAGGCGGC^ 
CGGCGGAGCCCCGCCGCCCGCC<5C6CCATGGCCCX5AAGACCCCGGCyiC3^ 
GATGTGTGACCATGACTATGATGGGCTGCTTCCCAAGTCTGGAAAGCGTCACTTGGGGAAAACAAGGTGGAC
AAAAACTGAAGAAGCTGGTGGAACAGAATGGAACAGATGACTGGAAAGTTATTGCC^ 
TGCCAGCACCGATGGCAGAAAGTACTAAACCCTGAGCTCATCAAGGGTCCTTG^ 
TGTACAGAAATACGGTCCGAAACGTTGGTCTGTTATTGCCAAGCZACriTAAAGGGGAGA^ 
ATAACCACTTGAATCCAGAAGTTAAGAAAACCTCCTGGACAGAAGAGGAAGACAGAATT^ 

AACAGATGGGCAGAAATCGCAAAGCTACTGCCTGGACGAACTGATAATGCT

GGTCGAACAGGAAGGTTATCTGC^GGAGTCTTCAAAAGCCAGCCAGC 
TGGGTTTTGCTCAGGCTCCGCCTACAGCTCAACTCCCTGCCACTGGCCAGCCCACTGT^ 
TCTCAAGCACAAAATGTCTCCAGTCATGTTCCATACCCTGTAGCGTT^ 
AGCCATTCAGAGACACTATAATGATGAAGACCCTGAGAAGGAAAAGCGAATAAAGGA^ 
40 ATGAGCTAAAAGGACAGCAGGTGCTACCAACACAGAACC ACACATGCAGCTACC CCGGGTGGCACAGCAC CACCATTG CCGACC AC 

ACCAGACCTCATGGAGACAGTGCACCTGTTTCCTGTTTGGGAGAACA 

ACCTGAAGAAAGCGCCTCGCCAGCAAGGTGCATGATCGTCCACCAGGGCACCATTCTGGATAATGTTAAGAACCTCTTAGAATTTC 

CAGAAACACTCCAATTTATAGATTCTTTCTTAAACACTTCCAGTAACCATGAAAA 

ACCCCCCTC^TTGGTCACAAATTGACTGTTACAACACCRTTTCATAGAGACCAGACTGTGAAAACT 

4 5 TAGAACCCCAGCTATCAAAAGGTCAATCTTAGAAAGCTCTCCAAGAACTCCT^^ 

TTAAATACGGTCCCCTGAAGATGCTACCTCAGACACCCTCTCATCTAGTA 
GAATCTGGATTTGTTGCTGAGTTTCAAGAAAATGGACCACCCTTACTGAA 
ATCAGGAAACTTCTTCTGCTCACACCACTGGGAAGGGGACAGTCTGAATAC^ 
CIACCGAATATTCTTACAAGCTCCX3TTTTAATGGCACCAGCATCAGAAGATGAAGACAATG 
50 AACAGGTCCCTGGCGAGCCCCTTGCAGCCTTGTAGCAGTACCTGGGAACCTGC^^ 

TTCCAGTCAAGCTCGTAAATACGTGAATGCATTCTCAGCCCGGACGCTGGTCATGTGAGACATTTCCAGAAAAGC 
CAGAACAGTTOUU5TTGACTTGGGATATATCATTCCTCAACATGA 

ACAACAGTTGAGAGCACGACCAAGTGCATTTAGTTGAATGAAGTCTTCTTGGATTTCACCCAACTAA 
TAACAGTCTTACCrAAATTATTAGGTAATGAATrGTAGCCAGTTGTTAATATCTTAATGCAGA 

5 5 TGATTT ATCTGGT ATTTT AAAGGATCCAACAG AT CAGTATTT 

TCCAGTATTTCACTTTTCTCGATC!ACTAAACATATGCATATATTTTTAAAAATCA.GTA 

ACCATGTGACATTTAATCCAGATTGTAAATGCTCATTTATGGTTAATGACATTGAA 

AGTTTTCTGTTAGCTTGCTTTAAAAATTATTACrGTAAGA 

TAAATGCCAAATGAAAAACGTTTTTTGCTGCTATGGTCTTAGCCTGTAGACATGCT 

CAGAAAGAAAAGAAACTTGGTGTTAGGTAATTGACTATGCACTAGTATTTCA

TCCITCTGCAATACATTTGAAAACTTGTTTGGGAGACTCTGCATTTTTTATTGTGGT'r 
TGCGTTGCACTTCTTTTTTGGGAGATGTGTGTTGTTCATGTTCTATGTTTTGTTTTGT^ 

GACAGAAACTGTGGTTGATAGCCAGTCACTGCCTTAAGAACATTTGATGCAAGATGGCCAGCACTGAACTT 
65 GTACTT ACTGCCTTGTAG C AAAATAAAG ATGTG CC CTT ATTTT 



HUMAN SEQUENCE - CODING 

ATGGCCCGAAGACCCCGGCACAGCATATATAGCAGTGACGAGGATGATGAGGACTTTGAGATGTGTGACCATGACTATGATGGGCT 
G CTTCCC!AAGTCTGGAAAGCGTCACTTGGGG AAAACAAGGTGG AC CCGGG AAG AGGATGAAAAACTGAAG AAGCTGGTGGAACAGA 
7 0 ATGGAACAGATGACTGGAAAGTTATTGCCAATTATCTCCCGAATCGAACAGATGTGCAGTGCC^ 
AACCCTGAGCTCATCAAGGGTCCTTGGACCAAAGAAGAAGATCAGAGAGTGATAGAGCT 
GTCTGTTATTGCCAAGCACTTAAAGGGGAGAATTGGAAAACAATGTAGGGAGAGGTGGCATAACCACTT 

AAACCT CCTGG ACAGAAGAGGAAGACAG AATTATTT AC CAGGCACACAAGAGACTG^GG AACAG ATGGGCAG AAATCGCT^AAGCT A 
CTGCCTGGACGAACTGATAATGCTATCAAGAACCACTGGAATTCTACAATGCG 
7 5 GTCTTCAAAAGCCAGCCAGCCAQCAGTGGCGACAAGCT^ 






WO 03/008583 



PCT/US01/51291 



CTCAACTCCCTGCCACTGGCCAGCCCACTGTTAACAACGACTATTCCTATTACCA 
GTTCCATACCCTGTAGCGTTACATGTAAATATAGTCAATGT^ 
AGACCCTGAGAAGGAAAAGCGAATAAAGGAATTAGAATTGCTCCT 
CAACACAGAACCACACATGCAGCTACCCCGGGTGGCACAGCACCACC^ 
GTTTCCTGTTTGGGAGAACACCACTCCACTCCATCT

GTGCATGATCGTCOICCAGGGCACCATTCTGGATAATG^ 
TCTTAAACACTTCCAGTAACCATGAAAACTCAGACTTGGAAAT^ 
GTTACAACACCATTTCATAGAGACCAGACTGTGAAAACTCAAAAGGAA 
CTTAGAAAGCTCTCCAAGAACTCCTACACCATT^ 

CTCAGACACCCTCTCATCTAGTAGAAGATCTGCAGGATGTGA

GAAAATGGACC^CCCITACTGAAGAAAATCAAACAAGAGGTGGAATCTCCAACTG 
CTGGGAAGGGGACAGTCTGAATACCCAACTGTTCACGCAGACCTCGCCTGTGCG 
TAATGGCACCAGCATCAGAAGATGAAGACAATGTTCTCAA 
CCTTGTAGCAGTACCTG«3AACCTGCATCCTGTGGAAAGATGGAGGAGC^ 

TGCATTCTCAGCCCGGACGCTGGTCATGTGA
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Table 19

MOUSE NOMENCLATURE 
ICSGNM Sox4 
Celera mCG11673 


HUMAN NOMENCLATURE 
HGNC SOX4 
Celera hCG36747 

MOUSE SEQUENCE - GENOMIC

ACTATTCX3TAACTGTGATG<jTTAACrTT 

TATGAGCATATTTTGGGTTGAGTTTAATATTTCAGTCT 

TCCAACATCCAGCCAGAGTTCCTGTGCTCTGAGCAGGCAGACGCAGGC^ 

ATCTAA.GCCTCTGACCCCACTCGGCGGGGGGTGGACAAGGGGCAGCCCCCAACT 
GGTTATAGGAGAGAGGGAATAGGGAAGAGGTTCCCAACACTGACCAGAGT

TTATTATAGAAATGCAGGGAGAAAGAGAGAAGGTAGAAAAGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGG 

AGGAAGAGGAAGAGGAAGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGA 

GAGGGAGAGGAGGAGGGAGAGGAGGAGGGAGAGG7VGGAGAGAAGAAGACAAGGAAAAGAAAG 

GAGAGGGGGGAGAGAGTGAAGGGTAAGAAGTAAGAGCATAAGAGTAAGAGAGAGAGGTGGGGGCTGAACA 
TTACTGCTGCTAGGTAACTGGGGAGGAGTTTAACCTGAAG

CTTGTGGAGGCTGTGGGAGGTGGTAACTTAGGCAGGGGCGAGAGCTCGAGGAGC^ 

ATTATGACCTVTCAGGGTTCAGACCTCAGCTCGACCGGAGACCAGCCT^ 

AGAAAAGAATTAAATATCTGATCCTCTCCC!ATATCTACAAGACGT^ 

TTCCCTTGCTTTTTAATTAGAACTATGTCATTGGTCCTCCTGTGTGATAAGT 

2 5 CAAC CTGACAG ACAACT AGAATT ATCTGAAAGG AGGGAAC CTT AATT G AG AAAATG CCTCCATAAG ATCCGG CTGCAAAG C ATTTT 

CTTAAATTGTGATTGATGTGGCCGGGCCCAGCCCATTGTGGGTGGTGCCAT^ 
AGCAATCATTGAGGAGCAAGCCAGTAAGGAGCACCCCTCCACGGCCTCTGCATCAGCTCCTGC 
AGTTCCTTCCTGTCCTGACITTCTTTGATGATGAACAGGAATGTGGAAGTAT 
TTTTTTGGTCATGGTGTTTCATCACAGCAAGAGACCCTAACT 

3 0 AAAATTCCTTATAGCAACTCACTAATATGCCAATGGTCAAAGGAAGT^ 

ACACGGGAGGTACC^TTGCCAGGAAGGTTGGATTATCAAGACTCCCAAGACACT 

GAATTCTCACTGATTGGCACTTGCCTGGTAACra^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNTGTGTGTGTGTGTGTGTGTGTGTGCGCGC 

CTGGTGTAGGCATGACAGTCAGAAGAAAACATTAGGGAGACAGCTCTCACAAAG

TCACAGGTTAGATAGCTCACAACCACTCAGGAGTCTCCTGTCTCTACCACC^ 
ACAGCCTTT<^GGAGTTGCCTGTCCCTGCCTCCTGTCTAACITTCGAGTC 
TAGGTCCTGGGGAGTAAACTCAAGTCATTAACCTTGTTGTAGCA 
GTCTGTGTGGTTGGTTGGCTGGACTTTTTGTTTCGAGATAGGATCrAAGCTAGA 
40 CC CAGGCTGGCCTCCAACTCAGTGCTTGC CAT ACTTCAAC^ CCATCAAGCCT AGCACC 

GTGCATATTTAAATTGTATGGTTTGTATTATACACTATATAGATTC 
ACATATATAGTAGGTAATGTCACACACAAATTTTCTTTCATTTAAGGTTACATT 

ATGCCAGCTCITAGAAGCAGGAGGT AAAAGAATCCAGAGTTCTAAGTCATT CTAACTGCTCAGATAG TT CAAAGCCAGCCTGGGTT 
GCATTCAAACTTGTCTTTAAAAAAACAAGTGACTCGACATTTGTACAACACAG 

AAATTTGAGTGTTGTAAGGATGAGCCTTCATTTATTTCT

TGCITCTGTAAGTCCAGGCTGTCCTTCAACTTGTGTTTCCACAAGCTCTTGCCTCA^ 
ATTTGAAATTATTTTTCACTTGGGCTGTGTGGGACAAAC^GCC 

GCAGTTCAGTATGAATTTATTTTACAGCATA^ 

5 0 GCTGAGCGGGGGACAGACAGTAGGGCACTCACTACGTTGATTGGCAGTTGAAGT^ 

TACTCITATCTTTCCTGGCCACACCCCTGGGCCCAGGAGGCAGCTCTGAACAGATGGTGAGTCTCCGGTGCCAGGTG 
CCCTGCGCTGTGTGAGGCACAGGGCTCTTTGGTCCTTGCCGCACCCCACC 
GTCTCGGTGCTGTCCGCACGCGCGGTTAGCCGAAGGTGAAAGGCTGGGAAA^ 
GGCCAGCTGCTCGTGGCGAGCACGTTCCTCCCCTCCTCTGACTCCCATCACTTC 

5 5 CAGCAGCAC^CACGCCTTCGCACTTGCTCCCCTCCCT^ 

ATGCCCTTGGTCCCCGCCCGTCTCTTAGCCTTCAAGGTGGCTCTCCTCGTTCCCTAATATTTGGGG 
TTTTCCATAATCGTAATCATTTGCrrTGCCrrCCTGGGACTCCAAACTGTAATTGC^^ 

TCCACTGAGAG ATGGT^TGGGGGTGCTGGGCTGGGAAGAGGATCCTCTATTC CG ATTCCG AGTACCGTTTGCGGGGTGGGGGCTG C 
AAGGAGGAAAGTGCGCTCAGCCTTTCCTTTATCCACCCTCCCCCCGTCGCCCT 

6 0 TGTGGACACACAAACACCTCAATCTTCCTGATCAGGCTAGAGCCTG^SAGGTGG 

AAGJUVGCCOTGGCCXjCTGTGCGTGCACTGCCCCTCCCGCAGTCAC^GTGAGAGCTGAAGCGTC 
ACGCTATTTAGTCGAGGAGAGGCTCACAGAGGGTAAGGGGCTGCT 
CCGGTCAGTGGCGGAACTGGGCCCCAGCGGCTAGGTCCAGGCTGGAATCGG^ 
TCCCCCCTCCTGGAGTGGCTGCTCCAGCrGCCACTTGCAAGGANNNNNNNN^ 
65 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGCGG^ 

TGGTGAAACGCCGCTGCGAGCGTCGCCCGAGCC^CGTTTACAGTAGGAAAGACTTTCCTCCGAA 

AATTTAGCTCCTCCCAGTAAGAGTCACAGAGAGGCAAGGCGAGGCCGGT

GTTGCTGAGCTCCACGGGGCGGGCGTGCCGGGCTACGGAAACCAr 1 T C1T1TCAGGAGCAGGGTTGGCCTTTGGGACTCCTGTGGT 
C ATGCACCGTGTGTTTCTGCCGCCGCCGAAGCAGAGCGCGCAG CTOTCGG AGAAACCGCATTCTCGCT CCTACCCGCCCGAGGGAG 
ATGACCCGTGACCTCCCGAGTGAGGTTACCAGATCCGAAAGGGGCGTCTCGGCTTCCCCAGCCATGGGGGTT^ 
TGGAAGGTGGCCCTGAAAAAAGTGGACGAGCAAACAGGGAGCAGGAATCft^ 
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TGCCCAGAATTCACCAGCACACCCACCGGTCC 

GGTTGGAT CTTCCCCCACCnTTTCCCACGTCGCTGCATGCTCAGGCAGGCTCAGCGCTG CCACCTGGCGGCTGCCTCTCCCGGACT 
GGGAGGAGCTGTCTGCACGCACCAGTTCCACAGATCTTGAAGGGTC 
AATGCTTCCTATCTATGTGGCCCGTGCAGACACACTTGTTTTCTGCTTGTTCT 
GGCATGGGTATAAAGAGCTGACTTGAGCCTGCTGGTTATGAG

TCTGTGT AGTCTAACAGTGAAT ATGAACGACAG CTTTCATTT ATCCTG CTTAAT A 1^ 
GGAGAGATAACTCAGGCCTTGATAAAGGCTAAGGCTCACAACCAAAACCT 
TCCAGTTTC^UVTACAACTTCCTCAGCACCAAGAATGGAGTAGTG 
GGTTGACCTGAAAACAGGGCTGATGGCATTCAGTTTCCCAGAGCAC^ 
1 0 TGTTTATTTGAACTGAATGTTCAAAGCCAGAGGGGCTCAC^ 

GTGCCAAGCCAGACTCGTGAAACTAATTTCCATGCAGGTTTTC 

CTT CATC CAT ACC C CATC CAT ACCATTCTGT AG ATGTGAGGTTTT ATGTTGTGCAGGTG TGT AGGGG CAGGAG AAGCACATT AGCA 
GTTATTACCCTCTCATGTCGAATGGATTGCR.TTTAATGGAATAA 
CATTTCCAATAAAGACCATTAGTTGATGATTCTAAGGCA^ 
1 5 GGGGAGCGGAGGGAAAGAGCACTGAGCACTGACTTACTAC^GGCCACCAAATAGCT 
TGGAGAACTGATTACAAAGCTTAGGGCATTAGGTAATGA 
CTGGTGTATATGTACITTAATCAAAAGCrATTTCTCACATCT^ 
TCAATAACTCCAAACTGTCAGGTATATACTCTACAG^ATTACATTGTACAATG^ 
ATTTTACCAGGTCTAAGAGAAACATAGACACAGGGGGTGCATCTr 

2 0 TGGCTCAGCCGGTACAC<X:CTCTCTAGGCCAAACCTGGCAATG 

GCCGCACAAGATGTCCTCTGACCTCCATCTATGCACAGTGCC^GCAGTGGACT 
CCCTTTTCCTTAAAGGCCITTTGTCCTTTGATGGCA^ 
CTAACAAGAAAAATCAGGGCAGGAATTACTTTTTTGATATGCT 
GGGTXJATTTACACCACTGGAAATACCACCAATTACACCAACACTCAT 
25 ' CAAGGAATACATCTCCTTATTTTTGTCATAATAAAAATACTC^TTTT 

TGATGGCATCTTAAGAAGGTACATCrGTGGGTATCAGAAAAAAAAACAAAAAAAAA 
CACTTTTCTTGGTGGAGGTTTTTTTCTT^ 

GGGTAGCAAATAGTCAAAAAAAAAAAAAATTACTTGCTTTTTACCCACAA 
CAGCTGTCTCCATAGGGTACTCTTTCCAGTAAATGTCTAATTCTTGACACCCT 

TGACAAGGTAAATGGAAATATACCACCCAGCACTACTGTGCTCTCAATCAA

CTGAGGGAAATC^TTTATCATCGTATTAAAGGTGAGTTTrCACTAGA 

TAACTAAGTTGTTCTATATTCTAAGTATTATCTGCATGTGTTCGCT 

TTCTTTTTTTT T TrTTTTTT CCCTITOTTTT^ 

CAGGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCCCAAGTGCTGG 

CCTGGAATTCTTAAGCATCAGCGGATGATCTTGAACTCCTGATCTTCCCTGAAGGCACCT

ACC^CCAAAGGGGCTTTCGATTACGTTTCACC^CCGGTGACCACCTT 

AAGTGGCATTrcCATTAGGGAGAACTCGGAATGTGTGTTAGAATCCAGAGGGCT 

CTCACCGGGCACTAACACAAGCCCTCTATAAAGCTCACAAGGTTCCCA 

AG ATCAGGAGAGGGACTTG AGATAGGTCCAG CAGCCCACT CCCCTGTGCCACCCCTTTCT CrTCTTGGTCTTTTAGGCCCAATTGC 

TTCAAGTAGCACAGACACACGGGAGACAATGGGTAAGAAAGAGAG

ATTGTGGAAAACAAACATTTAGAAAGTGAAGTCAATCCCATCCTGTATTGACC^ 
CTTATTGAGGTAATTATGGGCCACTGCCTTAGCGTTCCACA^ 

TCATTAAATAACTAGTTAACCTCTGTTrGGCAAAAAGAGTAAGTAATTACTGAAAAGGGGAG^ 
CCTTTCACAAACAGCACAGATAGCATCCTGAAGGGATTTACACAAT^ 

4 5 TGTGATGACAGGTGTCTAGGTACCACTTTCTTAATGGTTTTCAATATCCATTGATTTATTATTGTGGCT 

TAGCTCTAAGAAGGGGCTAAGAAAAGAGATTTTGCACCAAAGGCTGATTCTTTATTCCTGGGCTGA 
CACTCGGCTTTTAAGCAGAAAGCTTCCATTTACAGTATTTCACCCT 
AGAATACATTTCCGTCTCTCCCCTTAATAACTTCATGCATTAACATCCr(TTTCACA 
CACACACACAGRCACACACZACACAGAGGC^IAACGCGCGCGCA 

TCAGCTGTACTTCTAGCCCTTTTAAAAAGTTTGCTCTGTAAATTGGAATG

TATTATTGCATCGGGTTCGAAGCCAATGGGAAGGACGGGGGAGGGGCTTGGCACGAGGAA 
fHWNNNNNNNNNNNNNNNNNNNl^^ 
NNN>WNNNNNNNNNNNNNNNNNNNCAGTGGGGGAACT 
TTTATTCATGCCGGTCTGTTGCATGCAAGCTTCTTGGCTTCCT 

5 5 CGCCCACAGAGAGTCCCGGAGCCAGAGTCGCTTTGGCTTTGCACTGCAGGAAAGGGACTTA^ 

TGAGCTACCGCGAGCTCTCGTGAACTGCAATCGACTGCITCAGGGAAAG^ 

CTTGCGTTTGGAAGATACrCCGGCTACCAACGTTTGGAGAAACTCCTCGCOT 

CTTCIAGCGGTGAGGGAGGACGGAGGGCCTCGGGGACTCTAGGTTGGCGGCGGGAGGCGGC 

GGACCCHGGTCTCCGCCCCGGGGAGCCCGCAGGGCCCGGCGACCGCGCCGCGAGCGTGTGAGCGCGCGTGGGCGCCCGGC^ 

6 0 GGGCCATGGTAGAACAG ACCAACAACGCGGAGAACACTGAGGCTCTGCTGGCCGGGGA CCTGG AG 

CTGGGCATCGCGTCCTCCCCGACGCCTGGCTCCACCGCGTCGACGGGCG^ 
TGGCCACATCAAGCGGCCCATGAACGCCTTTATGGTGTGGTCGCAGATC^ 

ACAACG C CGAGATCTCCAAGCGG CTAGGCAAACGCTGG AAGCTGCT CAAGG ACAGCG ACAAG ATTCCGTTCATCCAGG AGGCGGAG 
CGGCTGCGCCTCAAGCACATGGCTGACTACCCTGACTACAAGTACCGGCCGCGAAAGAAGGTGAAGTCC^ 
6 5 ATCGGCGGCCACAGCCAAGCCAGGGGAGAAGGGCGACA^ 

GCGCGGGCGGCAGCTC CAAGCCCGCGCCCAAGAAGAGCTGTGG CCCCAAGGTGGCGGGCAGCTCGGTCGG CAAGC CC CACGCTAAG 

CTGGTCCCGGCGGGCGGCAGCAAGGCGGCTGGATCGTTCTCTCCAGAGCAAGCTGCCCT 

CTACAAGGTGCGGACTCCCAGTGCGGCCACTCCGGCCGCCT 

CCGACAAGAAAGTGAAGCGCGTCTACCTGTTTGGAAGCCTGGGCGCTTCGGCGTCTCCCGTCGGGGGCCTGGGA^ 
70 CC CAGTG ATCCACTGGGGTTGT ACGAAGATGGAGGCCCGGGATGCTCGCCCGATGGCCGG AGTCTG AG CGG CCGCAGCAG CGCAGC 

ATCATa3CCAGCCGC<^CCGATCGCCCGCTGACCACCGOK5CT^ 
CCTCGCAOSCGTCCTCCTCGCTCTCCTCGTCCTCTTCCTCCrCCTCG 
CTCGACCTCLAACCCCAGCTCAAACrTTGAG^ 

TAACTTCGAACCCGGCTCAGGCTCCCACTTCGAATTCCCGGACTATTGCACGCCCGAGGTC 
75 TGG AGTCCAGCATCTCT AACCTGGTCTTCACCT ACTGAACCCAG CG CCCC CCXsGGGAC AACCTCGG CCAAG AGGCAGGAGAGG AGA 
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GAGGGAAAAAAGAAAACAAAAAACAAACAAACAAAAAAACAAA 

AAATGAGTTAAAAAGAATAGATGGAGAGTAGAAGGAGAAAGGGGAAAAGAAAAGGAAGAAAAA^ 
CX3TCCCGCCGTTGGAGAAGGAGCGCGGGGGCGTTAGTGGACCCGCGCTCCCATCCCCCACT 
CGGAGAGTAGACGGGGGCGACCTTTTTTTGTTGTTGTTATTGATGTT 
5 TTTTGCTTGAAGAGACTCCCCCCCACCCCACCCTTCCAACGAGCT^ 

AAAGCTTCTAAAGACCGAAGGAATCTTTCCCTCACCCCCACCCCCTCGCCTT 

CnTTTCTGCAATGAAGACAGAAAGAAGGCTCTGGGGTGATC 
AACTCCACGCTGGCXIAAGTCCCGGGCTGGGCTTTCTCCT 
1 0 GGGAGTAGCAGGCCAGCCTCGGAAATGGACATGGGGACCTCGTGGAAGCCACAGC^ 

CACAGAGGGCGTTGGGGTTCCCGGGCCTCCGCCTGGGGTCTGTGCAAAAATAAAGAATTAA 

AGGCTACCCAGAGCCGG CGGGAGGACAGGAGACTGTGGGATGTGGTCCAGGGGCCG CGGAGTGGTTTGGGGGGGAAAAGAGATTTT 
TTTCTTCTCTTAATGGGAATCGTGATGGTGTTGAGTTATTTCACT 
GATTTCTCTATAAGACTGTTGAGCAGTTTTTAAAATAGTCT 
1 5 CAACGAACCGATCACCAGCTCCTTTTCATTTTTAACTCTTTAAA^ 
ACCACAATTCAGGACCAAATTTTTTCTCCGTGTGTGTTTATTCCT^ 

ACGG ACTCT CCAT CTTT GT ATTTTTTTTT C CCTTGTAAATGTAATCAG ATG CCATTTT AT ATGTGG ACGT ATTT AT ACTGGCCAAA 
CAGTTTTCTTATTTTGTCCCCCCITTTITICCCT^ 

TTCTTTCTTTTI LI 1 CGTTTTTTTTTTI 1 T CTAGTGTTGTTACCCACGCCArrrr ACGTCTCCCTTCA 
2 0 AACTTTTAATTTTTTATATTTAAATGTAGACTTTTGAC^ 

TTTCTCAGTGTATTTTTGTAAAAAATATATAAAGGGGGTGTTAATCGGTGTAAATC^ 

GTGG CTGGTT AAT AT CT C ACACAGTTTGAAAAAT CAGCCCCTGGTTTCTCCATGTTTACACTTCAATCTG CAGGCTTCTTAAAG TG 

ACAGTATCCCTTAACCTGCCACCGGTTTCCACCTre 

CTTTAAAAAAAAAAATTTTTTTTTTTTAACGAATTGCTG 

2 5 TT CTGT AG CTTT AACTTG T AAACC ACAT CTTTTTTG CACTTTTTTTT AT AAAGCAAAAACGTGCCGTTT AAACCACTGG AT CT AT C 

TAAATGCCGATTTGAGTTCGCGACACAATGTACTGCGTTTTCATTCTTGTATTTGACTATT^ 

ATAATTGTITCAGTCCTATGGCATGGTGATAGCATATGTC 

CATCTTTGTACATTTAAGTCTGTATTATAATAAGCAAAAAGATTG 

3 0 CATCCATCCTGTGCAATATGCCGTGTAGAATATTTGTCTTAAA 

AAAACAAAAAACAATATTTGGGGGAAGAGAAAAAGAAAAAAAATCATGCCAGCT 
CTATATACCTTCTGTAAATAACTTTTTTTGAGAAC^AAATAAAATC^ 
AT^CCTGGGATCTCrCCTTTCACTGCAGGCTGCTGCTTTGCT 
TAAGCCATCTTAATGGCAAACCCTAAGTTCCTCTTGGCCAGATT^ 

3 5 TCAACCCTCCCTCCGCACXSCCAGCCTTCGAGGAATTCT 

GAGGGGGGAGGGGCAGGCTTCTCTGCTCTCAATCCAGCCG 

GCCACCATCCCCTTTACACACCAAGATTTGTTTTGCACAGACCAAGAAGAACA 

GAAAAGCCTTCGATAGTTACAAGATGGGAAAACGAAAATCAACre 

TGCAAAGG T CAG T AGGGTG TGGTGT AAT C (2AAG T ACTTG AG AGG CAGGC CAGG CTCAAGG ATCCTGAGTTT AAGG^ CTG AG TGG AC 

4 0 TACTTGATTGAAACTCTGTTTCCAGAACTACCAAAGTTACTATGGCCCTAAGCrrGT 

ACTGTGAGTAAAATATAGCCTACCTGCTCACCACTGGAATGACAGGCCAAGTAGGAGAGTGTAGATTT^ 
GCATTAGTGAAGCCTGCATGCAGCTGGACAGTAGTCGGTGTTTATTTTAGAGATCAACG^ 
CCCITCCCAAGGCTCATCITCACAAATGTGTGCriTTCCAT^ 
CTTGGAAACAATCCTTTCTGAGGATCGGAGGCAAGGTGCTTGCATCCTGGGAT 

4 5 CAAGCATTTTTGCAC^GTAAAAAAGCCATTTGCTAGGGGTTGGCTGT 

GGGGGAAAAACTGAAAAAGCAATTCC CTCTCTAAAT ACTGTT CT AATATCATTTTTCCCTGAGACCTTGAAACAGTATGGTGT AAA 

rmT CTri Tr r AAAAGAGCTTTGGGAAAGAGTTGATATGCT^^ 

GGGTCTTGGAGAAGAAAGCCTTTC^TAGTTACAAGACG 

GGGTTTCCCCTCAAATCCAAAGGTCAGTAGGGTGTGGTGTAATCGAAACACTTAG71GACATGCCA 

5 0 AGGCCATAGTGGACTACTTGAGACCCTGTTTCCAGAACAACAAAAGACCT^ 

ATCGAAATCCTACTGTAAATGAATCCTCTATGAATCATTTTTAAGTGATGCATATTTTCAAATAAGTTTC 
TCTGCTACAGCCTTGrTTATTITCCCCTGACTTTGCATGGGA 

GCTCAGGGTGAAAGTGGAGCAGTGCACTTAGCTAGCGTGCTAGAGGCCCTGGGTTCTGTTCCCAGGCCC 
TTCTGGCAC^TCACCCTTTTCTCTTTAAAGTGTTGATACCGCGTGCTTCCT 

5 5 TGTATTCAGGAAGGAGACTTACATGCCAGACTGTGCAAGGGAAA^ 

TGGCTCAGCAGAAAAC^GTGAGCTCCAGCTAGCTTTTCGCTCTTGTTCT 

G TGG ATT AGTGACAC TTGT CACCATTGTCAGTT CTT AGCTGAAT C C CCTGAAG AG C C ATG AAAAT AAAT AAAT AAATAAAT AGG CA 

GCCTCCAGGCTCCTCTGGTGGGGTGAGTGCCTGCAGGGGCCTCAGGTTACTGCTGACTCTAG 

AACAGAGTTGGAAGGCCAGCAACCACTCGGTTTATCTGCTGTATAGATGGGATCAGAGTC 

6 0 GGTTGGTTTTGGGTTTTCCGAGGCAGGGTTTCTCTGTAAGGCCCT 

AAG<5ATCTTCCCGTCTCTGGAAGTGAGTGCTGAGCCTGAAGATGCGCACCCAACACCACAC 
TGACCTAGAAGGTTTTGGATAGCAGGGCATGGCGGTGCTGTCCATTTAAAA 
TCTGTTAGATTTCTCAAGGTGGAAGTTTTCTGTCATTTTTTTTTTTAA 
TTTTTTTTGTTTTTTTTTTTTTTAAAGATTTATTTATTTATTATA^ 

6 5 GAGTOVGATCTTGTTAGGGATGGTTGTGAGCCACCATGTGGTTGCTGG^ 

TCTTACCTGCTCAGCCATCTCACCAGCCCTTGAGTTGTTTTTAAGATTG^ 

ACTAAGTTCAGGTTCTCTGGAAGAACGGGAAGTAGTATCAACTGCCAGAGCCT 
CTGGCCCCAG<3ACTATAAATACACCTCTC^CTTCTGATCTCTTGTG 

7 0 CTCTGATTCATCCTAGAAAACAACCTGAAAATCCATGTCGGTTTTTTTTTTTC^ 

GCTTGTCTGCATTTTCACCCCACCCACCATTTGGAAGCTGAGTTGGGCIAGATGGG 
TCTGt^TGGTGTTTCAGACCACACTAACCATCATTA 
CTCTGCACAAAATGGTGGTGGCTCTCCTCCTCCAATCTCCACCACCCACCA 
GATC^TTGGAGCCCAGGAGGCAGAGGCAGGAGGGTCTCTGTGAGTTCACAACCAGTCT 
7 5 ACATAGAACAGGATTAACCCTrTAATGCCTGTTTCTCCACACTTACrTTG^ 
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CCAAAATATATTCATTTCCCAAAATATTTGCATTTTC 
GAATACTGGCTCTGAGCTAGGCTGGCTGGAGCACGGTGGTCAAGTT^ 

AAGGCC AC CTGACTGAGAAAG CTGTTGTGAG AATTGAGTGTGACCC CAT ATGTGGAACACATGCAG C AATG C CTGGCAGCAATCTG 
TGCTC^TTAGGTACCAGCCCTTCACCTTGTtSCGGCTGATTATAA 
5 AGAAATCAGACAGCACTAAATAACAATAAATATGCTGTCTGCTTCTC^^ 
GCTOJAGCCCAGTOTCTAAGTGCTCGTGAA^ 

CAGACCTGGTGAAGCCATGCCAGTACTGGGGAAGATAGCrCAAGGCrrATCT^ 

TTGTGAGCTCTTGTCTGAAAACAAAAAGAATGGTTTGGGAGCGTGTGTGTGTC 

TATTGGGAATTGAACCTTGAGTCTGGGACATGCTAGGT^ 
1 0 ATATATTTTTAAATTATTGCTAGTGAAT1TCATGCAAGTATACAAT 

CCCCGCCCCCTAGCCAGGACTCTTTAAAAGAGCAGAACCAGACCACAGTCTTA^ 

GAAAATGACTACCCTCAC^GTGTGTTAGCATGATCTCAATAAGTATACAGGCAAAAGAT^ 

TGGCGATGATTCAGTAGTAAACCTGAGATCATAGAATCAGAAGGGGAAGAAGGGTG 

CTAACATGTATGAGACCCTGGGTTGGAGCCCCAGCACTCAACTC 
1 5 TTCCTGAGCCCAAGAATGAACTTAGATCCAGTGTCTCTCCAAGTGTGGTGGCACATCATTC^ 

TGG CGACCAAGTTT AAGG C CAGG CTGGCCT ACAAAGTG T ATTCCGGGAT AG TT AG AACTG TT ACAT AATGAGACTCT CTGTCTCTA 

GAAAACAAAAGCAAGCCAGGAGCTGGTGGGCCGTGCATGCCTTTCATCCCTGCAC^ 

TCCAAGCCAGCCTGGTTTCCAGAGTGAGTGCTAGACCAGCCAAGGCT 

AACAAAAAAGAAAGAAAAAAAAAGAGAGAGAGAAAAAGGAAGGAAAG 

2 0 ACAACAAAAACAATCAGAATAGGTGTCCCCGTTTGAGAACT 

CATCACCTCAACCCTCCACATCCTTCCTTCAGCTCTGATCCATTTCTC 

TAAGTAATTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTGTGTGTGTGAATACATATTAGTTGAA 

TGAATGCACGGTCTTCATTTCAGGAGTCAGCAAGGCAGAAGAAATGATCT 

AAGGTGGGGTGGGTGACCACGCTGCTGGTTATTCAGAATGCCTTCACACAGAGGCrC 

AAGCTCACGTCACrAAACTGAATTrCCCAGGGGGCTTTTGTAT 

TATGTGTGGATATCTGTAG AT ACAGGTGATAACAAAGTTAGCC CA T Tl 'T C ' i ' f TGCTT CTG ACAGTTTCCATTCAAG ACAGTTGGTT 

TAAATGGAGTCCTGCCAACATGCCAGGCACAAGTCAAGTCGTGGCATGAGCTTT^^ 

TTGTGGTAAAGGCCAAAGTTACAACTGATTTATAC^CCACCrTGCT 

3 0 ATCTGATTGGAAGGTTTTGGATTTTATTTTT 

TTTAGTTATTTTCAATACAATTATTTCTCTATTTCCTGCTCCAGGGT^ 

GAAGAGAGGGAAATGGAAGCCCTGTCTGTCAACTTTAGCAGGCTCATTCTGTAGTGAATGCACAG 

TTAAGGAGC^\GGCAATTCCTTTTCTAAGAGCCAAGC 

ATTGTCACCTTCTTACCAGTGGCTGTCCTAACTCCCrCAGCTCTGGC^ 

CCTCCCTCCCACGCCTCTCACTTTCTCACCCCCCTTTCCCACACT^

GAGAAGGCZAGACTTTGAATAATTTATCTAAAGTTCCTGGGAAGTTGT^ 

AAGTCCTCTGTGTTTCTGAAATAGTTTAGCCAGAGTAT^ 

CAATCCGAAGACTCAGGATATGGAGGTAGAAGGGTTGAGTTCAGGGCCAGCCTT^ 

4 0 TACAAGAAGACACTTTGTCTTTTTTTTTTTTTTTTTTT^ 

GCT<rTGGCTGTCCTGGACCTCACTTTTGTAGACC!AGGCTGGCCTCGAACTCAGAAATCCGCC 
ATTAAAGGCGTGCGCCACCACGCCCGGCAACACTTTGTCTTAAAATGATGA 



MOUSE SEQUENCE - mRNA 

AGAGCAGCAGAGTGAGGGGAAGAGGGCCGTCTCCCTCCCGGTTTCCAGTTCTTGCA

AACTCTGCCGGTAACCAGCTCCCCTTCTTGCAGGAGGGAGGGAGAAACATA 
TTGGCTTCCTACCTTGCAACAAAATAATTGCACCAACT 

TGGCTTTGCACTGCAGG AAAGGG ACTTAGG CGCrAGAGACGATGTCGCTTTCCTGAG CTACCGCG AGCTCTCGTGAACTGCAATCG 
ACTGCTTCAGGGAAAGGGGTGGGGGAAAGACTTGCCCCGGAGGCGGTOAGAAACTTC 

TTGGAGAAACTCCTCGCCGCGGCCGACTCCAGCCTCGGTGCCCGCAGGA^

ACTCTAGGTTGGCGGCGGGAGGCGGCCGCCCTGGCCCGCGCGCCGCTCAGGGGACCCTGGTCTCCGCCCCGGGGAGCCCGCAGGCC 
CGGCG ACCGCGCCGCG AG CGTGTGAGCGCG CGTGGGCGCCGGCAAGCCGGGG CCATGGTACAACAGACCAACAACGCGGAG AACAC 
TGAGGCTCTGCTGGCCGGGGAGAGCTCGGACTCGGGCGCCGGCCTGGAGCT 

CGTCGACGGGCGGCAAGGCGGACGACCCCIAGCTGGTGCAAGACGCCCAGTGGCCACATCAAGCGGCCCATGAACGC 
5 5 TGGTCX3CAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGAGATGCACAACGCCGAGATCT 

GAAG CTGCTCAAGG ACAGCGACAAGATTCCGTTCATCCAGGAGGCGGAGCGCCTGCG CCTCAAGCACATGGCTGACT ACCCTGACT 
ACAAGTACCGG CCGCGAAAG AAGGTGAAGTCGGGCAACGCGGG CGCGGGATCGGCGG CCACAGCCAAGCCAGGGGAGAAGGGCGAC 
AAGGTCGCGGGCAGCAGCGGCCACGCGGGAAGC^GCC^CGCGGGGGGTGGCGCGGGCGGCACGTC 
CTGTGGCCCCAAGGTGGCGGGCAGCTCGGTCGGCAAGCCCGACGCT^ 
60 TCTCTCCAGAGCAAGCTGCCCTGCTGCCCCTGGGGGAGCCCACGGCCGTCrTA 
GCCTCCTCCTCGCCGTCCAGTGCGCTGGCCACCCCAGCCAAACACCCTGCrc^ 



CGGGATGCTCGCCCGATGGCCGGAGTCTGAGCGGCCGCAGCAGCGCAGCATCATCGCCAGCCGCCAGCCGAT 






CTCCTCCTCGGGCTCITCGTCGTCCGACGACGAGTTCGAA 
CCCTGGGCAGTTTCAG CTCCTCIATCGGCGCrCGATCGGGACCTGGATTT^ 

CCGGACTATTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGATTGGCTGGAGTCCAGCATCTCTAA 

AAGGGAGCGCGGGCCXXK3GAGAAGGTGGGCCAAGAGGGAGGAGAG 

AAAGA 

MOUSE SEQUENCE - CODING 

ATGGTACAACAGACCAACAACGCGGAGAACACTGAGGCTCTGCTGGCCGGGGAGAGCTCGGACTC 
C^TCGCGTCCTCCCCGACGCCTCGCTCCACCX2CGTCGACGGGCGGC^ 

ACAT CAAGCGGCC CATGAACGCCTTT ATGGTGTGGT CGCAGATCGAGCGGCGCAAGATCATGG AGCAGTCG CCCGACATGCACAAC 
GCCGAGAT CTCCAAGCGG CTAGG CAAACGCTCCAAC CTG CTCAAGG ACAG CCACAAG ATTC CGTTCAT CCAGG AGG CGGAG CGCCT 
GCGCCTCAAGCACATGGCTGACTACCCTGACTACAAGTACCXSGCC^
CGGCCACAGCCAAGCCAC^XXJAGAAGGGCGACAAGGTrc^ 

GGCGGCACGTCCAAGCCCACTCCCAAGAAGAGCTGTGGCCCCAAGGTGGC13GGCAGCT 
CCCGGCGGGCGGCAGCAAGGCGGCTGCATCGTTCTCTCC^ 
5 AGGTGCGGACTCCCCAGCTGGCCACTCCGGCCGCCTCCTCCTCGCCGTCCAGTCCGCrc 
AAGAAAGTGAAGGACGTCTACCTGTTTGGAAGCCTC<^^ 
TGATCCACTGGGGTTGTAOIAAGATGGAGGCCCGGGATGCTC^ 
CGCCAGCDKrCAGCCGATOSCCCGCTGACCACCGCC^CTACGCC^ 
CAtt^GTCCTCCTCGCTCTCCTCGTCCTCTTCCrCCT 
1 0 CCTGAACCCCAGCTCAAACTTTGAGAGCATGTCCCTGGGCAGTTTCAGCTCCTCATCGGCGCT 
TCGAACCCC^CTC^GGCTCCCACTTCGAATTCCCGGACTATTGC^ 
TCCAGCATCTCTAACCTGGTCTTCACCTACTGA 

HUMAN SEQUENCE - GENOMIC 
TATGAAGTTTCCrCTTAGCACTGCTTTTGCTGTTATCTCAGAGGTTTTA^

TTTTTAATTTCCATCTTG ATTTT ATTTTTGACCCAGTG ATCATTCAG CAGCAGGTTACTTAATTTCCATGTATTTGCATGG CTTTG 
AAGGTTCCTTTCGGAGTTGATTTCTAGTTTTATTCCACTC 

TGAGGCTTGTTTTGTGGCCTATCATATGATCTATCTTAGAGAAAGTTCCATGCACTGATGAA 
GGGTAGAATGTTCTGTAAATATCTGTTAAGTCCGTTTGTTTCAC^ 

2 0 TGATGACCTGTCTAGTGCTGTCAGTGGAGTACTGAAGTTCCCCAGTATTCTCGTGTTACTTTCTA 

GTAATTATTTTATAATTTGGGAGTTGAAGTTAGGTGGATATATAT^ 
ATTATATAATGTCCCTCATTGTCTTTTTAAACTGCTGT^ 

CCAGCACTTTGGGAGGCCAACACAGGTAAATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCC^ 
TACTAAAAATACAAAAATTAGCTAGGTGTGGTGGCAGATGTCTGTAATGCC^ 
AACTTGGGAGGCGGAGGTTCCAGTGAGCTGAGTTCACACCATTGTACT
ATAATAATAAAGAAAAAAATAAACTTTGTTTTACCTC 
TCCCAGCACTTTGGGAGGTCAAC^CAGGTGAATCACCT 

CTCTACTAAAAATACAAAAATTAGTCGGGCGTGGTGGTGGGTGCCrGTAACCCCCGCTACTC 
TTGAACCCGCGAGGTGGAGGTTGCAGTGAGCCAAGATCATGCCACTGCACT 

3 0 AAAAATAAAATAAAAAAAAAAAGAATAGGTACTCCTCXrTCGCTTTTGG 

TCAATTTTAGCTAACTCCGATATACACTATGGGCATTGGTGAAAATACT 
AAAGTTACCTTCCTGCATGAACTTATATTTTGGCATCTATCAAATGGCATTGTAATTAAT^ 

ACAAACTAC CCAAAG CAT AGTG ATTT AACG CAACTT ATT AG TT CT CAT AATT CTGTGGGG TGTCAATCTGAGTGATTCTTCTGTTG 
TACCCATCTGGAATCGCTCCTGTGGCTGCAGTCATCTGACAAACCAACCAAAA 

TGGGATCCTCGCCTTGGAGGGGCACCTTGGTTTTCCCTAT

AACATGATGTTTTCATGGTCTCACTGGAGTAGGTTCA 
GTCTGTTTGTCTCATGCTTGCTAATGCCCC^TTTCCCAAAGCAACTC^^ 
AAAAGCACACGTACTAGGAGGACGATTTATTACAAGAATCTACCACA 
CTCATTATTTAGCGTTTGCCTGCCACCCATAGTGCATAATTCACACCTAGGCAGTAGTACT 

4 0 GACAACATTTTGGGTGTTCTAGTGAAATACAGCCCTACCATGCATATTTGAAATGTATATTATCT^ 

TATTTCTCCriTAACATTGTAGTTTGAACATTATGGTTTCTTTTTTAATTTO 
TTCAGGAGAGTAAAGTTATATTACAAAATATTTGTTACAATAAC^ 

TTTGAGTGTTTTAAGTACTAGCCTTCAGTTATTTTTGAATCGTGATAGCCAGATAAAGTGTCACT 
ATGTGGGA<^GGAAGGATGGAGTTTCATGCTTCAGGAAGCAGTGAAAAATC^ 

4 5 CAG ATCCCCAAACTATCT CT AT CACCCTCCTTTCATGGCAAATTTGCATATGCATTATTGTCAGAAT^ 

CATATACACAGAATTCTTTTCTAGCTTGCTGCTTAACTGAACA 
TGCGCTCGGCACACAGTAC^AGCACCATAAACGTTGATTGACT 
GGTACTCTTATCTTTCTCCAGGCCCACACCCCTGGCCCGAGAGGCGGTA^ 
CTGCACTCCAGGCGCGGTCTG^SGGCACTGGGTTCTACGGTCCCTGCCCCACCCCT 

5 0 GCGCCAAGTGTGACGTCTGACTACGGCGGGAACATGGGAGGGGAAGGGCTG 

GTGCGTCCTACAGTCGCTACCCGCCACXSCACGCGTCCCTCTCCTCCTGACCCGCGTCACT^ 
TCCCACTTCCCCXJCAATCAAGCGCCTTTCTTCCTC^ 
CnrCCCAGOSGGATTCCCTCCCTGGCAGACGaSCGCGCCCCM 
ACGCTGGCTTAGGGTTGCTCTTGAAGTGTTTTTCCAAAAGCGAAATCAC 

5 5 AACGGGCTGTTGCGTTGGGGGTGGGAAGCCCGTCGGGCGTGCCCCACCTAGACCGC^ 

CCCCTCAACCCCTGCCACCCCGGGTCGAGTAGGCAGCTCCACCCGGACGC 
AAGGCAGAGACCAGCTGATCCAAGGAGGACAAAGCGAGCCXSAGACCTTCCGCAGAGCTTGGCT^ 
CAGCCTCTCCCTGTTAACTACCTGCCATATGTTCCCrTCTCTCAGACCCGAAACCGCCTCCTA 
AGCnTCGGGCGCGGCCCTTGACAAGGGCGCCTGGCACCCTGGGGCGCCTCCGTGGCCCT 

6 0 AGTCTTTCCTGCACGAGCGAGGGATCGAGGGAAGGCGCCAATGCACCT 

ACTGGGGTGCAATGAATTGTCGACrrCCCTC!AGCrCTCAGCCCTTTT 
CTTCTGAAAGAGACXjCTGCAGCTGCGCTCTCAATGGGGCCT^ 

CCGGTGTACGAAGGTCGTGTGCCTGCCGAAGAGTGGGTCCCAC^CGCACAGCGCGCGCCGTACCAACCAAGCAAC^ 

GCATTTAGTCGTCCAGAGCAAAGGATGCTGGGCACTCAAGAGAGCAGG^ 
6 5 GGACAGCATCTCCAAGC^CCAGGGGATGGGGTAGGAGGACGGAGGAGCCAAGCTCCT 

CCGGTTGCAAATTCAAAGGAGGGCTGGGAGCCCGCGAGGGTGCC^AGGTGTCCT 

TGTGTGTGCGCGCGCGTGTCTTCAGCTCCTTCCCCGGTCGTAAAACCTAAACGG 

GCCTGAAACATGTTTACAGTAGGAAGGACTTTCCTCC^GAAAGCACAGTCTGAGTCGAT 

AAGCGAAAGGAGAGGTTCGGCTGCCGCGGCGGAGCGAGGCTGAGCACCATTAATCCTTGGCGAGCTTAAGC 
70 G CCTGCGAG ACCGAGTCGCTGCGCTCCGGAAGGCGTTCTGCCTG AAGG AAACLTLTTTTTTTTTTTT TT TT 1 1"1 TTT'l TT'lTl" H'G 

AGTAGGGAGTGGCCTTTGCACTGGAGTCCTTAAGTAAGGCGCGTCGTTTCTCT 

CACTTTAGCAAGAAGACAGGGAAACTGAAATTGATAATCCCCCGAGTTGGGTTA 

CCCCGTCACCCC^AGCTCCTGACAAGTGAGC^GGAGGCGGCGGGTGGTTGCCCTAAACTTTGATTTATC 

ATCGACATACCCTGTTACTCGTTGAGCGTTTAAGCCTCTAACTAAA 
TGGCGGCTGCTTCTCCCCAGATTGGGAGCCGGCTG
TAGAAGCAGAAGCCTCTATTCATGCrTGCAATCCAAGTGGCCCT 

GAAAGGAAT AATGAG TTGGCATTGG CATGAACAG CTT AAG C CT ACTGTGTGCT^TCACTGGTTATG AGAGGAAAATGGTACAATT 
GTTAG AAATG CAGGTTTCTT AACT CTT AGCIT ATTCT AGTG AT AAAAATAAATG CTAGG TTTCATTT ACCCCAG AATCCTATTTTG 
ATATTTTGCAAATATGAGTTATTTCCATATATTATTTTTCAATGCA
GCAATGTCATCAGCCCAAAAGGATGAAGTGCAAGTTA 

TCAGGT AAAGT AACAACT ACACAGl'riTr C CT CCTCITT ATTTT ATTTGAACTGAAT ACTTGTG CTGACT AGCG C CT A CAAT 

CATAGTTTCTATAACCAAGGCAAGGCAAAATCTTGCAAATCAGTC 
CCTCCATGCCAAAGGAATATATTAAGCAGTGGAATAGA

TGTTGTGTAGATGTGTGAGAGCAGAAGAAATGCATTAGCAGATATTACACTTACATCT 

AAAATAAAATACCTAATTCAAAATTGGAAATGTCTAACATCATTGCTAACAAAGACAGT^ 

TTCTGCAAAATTATCTTGGCTTrCTCTTAATACAGCTAAAAGACTAGCATTAA 

GAACACTGCGTACTGATTTAGTAGGGGCCACCAAATATCTGGTATTTGGGGAAAAAAATGTCAGT 
CATATGAAGTTAGGGAATGAGAAGGGAGAAATCTTTTCTCATTA

TAGTTGGTCACAAAGATGACTTTAATAAAAGGGCATTCGTCATATGTT^ 

CT ATTTCAG AAGT CT AGGCACTTGTGT AAT ATAT AAAT ATATATG CAGT AATTATAAAGT CAATGG AAGTT ACTGAT AC CAT ATTT 
CATCAAGTCT AAGAGGAAA 1TT T T CTriT CATTT CAACTCTTC CTG AAATAAGGAGG CATCCT ACATCT AGGGGGAT ATCACAGTT 
TCATTGGCAGCTTrCTTCCTT^TGGAATGTAA^ 

GTAACAAAAGAATGAGGCCAGTGGCTGTGTTTTAGTAC^

TTAACTTCCACTCrrCCCTATGATATTCGATGATGGAAGCATAAATGACTG 
CTGAATTATCAAACTAGCTACACCCTCAAATC^TTAGTTAGTACAAA 
TGATCTTATAGATAATAGCCCrcrrAAAATAATTTTTTTTTGTCAGATTGGT 
AAAATCTAGTATATAACACAGCAACACTGTTATGCAAAAAAGG 
25 TGTTTCTCCXTTTTTAAAAAAATTATGTAATACAAGTAGCTAATCGTCAA 
TGTGGTTGATCCITACAGATCC^TCAGATGGCTACATAGGGTGTTCTTTC 
GCCAGCCTACATACTTTTAAAAGTTTGACAAGATACATAGAAATTTACT 
ATTATTTTGGATTGACTrrACCTAACCACTTATGGTTAGCCT 
TTACrACAATTCTAATTTrCAGACACTGGAAAA 

3 0 GCCACATTCITCTTAGTATTCITCATGGAGTTCCTT^ 

TTCACATC^TGATCTGTGATTA CiUT ' lT ' n 
TTAGCTCACTGCAACCTCCTGGGTTCAAGCAATTCTCCCG 
CCAGCTAATlTriTAATTTTTTAATTTTTAGTAGAGACAG 
TGATCCACCCACCTTAGCCTCCCAAAGTGCTGGGATTACAGGCGTC 

3 5 GGCTATTCTGTTTTAGCCrCTTGAACAACATGCAGCTTTTACTAGG 

TTGATTGAGGGCTTACAATAGATGAGGTGCn^CATAGTC^ 
GTGTTCCCATATAATAGATGAGGAAAACTGAGACTTGGCGACATCCCCATAGT^ 
TACTATGAATCCXK3AGGCCCCATTCTCTTCTGCTACCAGTTATCACCTCCTTCTA 
CTTCGATTCAAGTAACAAGAGGAAATATATAAGAAGGAAAGGAGGAAGATGGC^ 

4 0 TCTTTAATTTTGAAAACAGGGAAAACAAATATTTAGAA^ 

AACrCTGGAAATCTGCCTATTGAGGTAATTACGGAGCACTACCTAATGTGTTTCAAG^ 
AT/ITACAGGGAGTTTAAATCATTAAATAACCATAGTTGACCCTTGCTTGGTTAA 

AATTGTGATGACAGGTGTCTAGGTACCACTTTCTTAATGGTTTTCAATATCCATTG 

4 5 TTACAGCTTTGGCTTTAAGAAAGAGCTGGGGAACAGATTTTGCACCAGAGGCTGAT^ 

AAAATAATCACTTGGCTTTTAAGCAGAAAGCTTCCATTTACAGTATTTC^ 
AACATAATGGTTAGAATACATTTCCGTCTCTCCC(nTAATAATTTCATGCATTAACATCCT 
GCGCGCACACACACGCGCGCGGACACACACAGCAAAAGGAAAAAAGAG^ 
AAGTTTGCTCTGTAAATTCGAATGAGGTCAGATTTGGAGCrTCT 

TGGGAAGCCCGGGGGAGGGGTTTGGCATGAGGAAGCGTTCOT

AGGCGCGAGGCXjGAATTGGGGTCTGCTCTAAGCTGCAG CAAG AG AAACTGTGTGTG AGGGGAAG AGG CCTGTTTCGCTGTCGGGTC 



ATACATTTATTCATGC(IAGTCTGTTGCATX5CAGGCTTTTTGGCTTCCT 

GATTCCGCCCACAG AGAGTCCTGGAGCC^C^GT CTTTTTTGCTTTG CATTGT AGGAG AGGGACrAAGTGCTAGAGACTATGTCGCT 

5 5 TTCCTGAGCTACCGAGAGCGCTCGTGAACTGGAATCAACTGCTTCAGGGAAAAAGAAAAAA^ 

CCGCGAGAAACTTGCATTGGAAGCITCAGCAACCAGCAT^ 

AGACAGCAAACTGCAGCX5CGGTGAGAGAGCGAGAGAGAGGGAGAGAGAGACTCTCCAGCCT 
GCGGAGAACrCCTTCCCCAAATCTTTTGGGGACTTTTCTCTCTTTACCCACCT 

CCGCCGCGCG(XTCrTCCCGTTCGGCGTGTGCITGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCGCOT 

6 0 GTGAGCXjCGCGTGGGCGCCCXjCCGAGCCGAGGCCATGGTGCAGCAAACCAACAATG 

AGAGCTCGGACrrCGGGCGCCGGCCTCGAGCTGGGAATCGCCTCCTCCCCCACGCCCGGCTCCACCG 
GACGACCCGAGCTG^TGOUIGACCCCGAGTGGGCAC^TCAA 

CAAGATCATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAA 

ACAAGATCCCTTTCATTCG AGAGGCGGAGCGGCTGCGC CTCAAG O^CATGGCTG ACTACCCCGACTACAAGTACCGGCCCAGGAAG 
65 AAGGTGAAGTCCGGCAACGCCAACTCCAGCTCCTCGGCCGCCGCCTCCT 
TGGCGGGGGCGGCCATGGGGGO^CGGCGGCGGCGGGAGCAGC^ 

ACTCCAAACCGGCGCAGAAAAAGAGCTGCGGCTCCAAAGTGGCGGGCGGCGCGGGCGGTGGGGTTAGCAAACCGCACG 

ATCCTGGCAGGCGGCGGCGGCGGCGGGAAAGCAGCGGCTGCCGCCGCCGCCT^ 

GCTGCCCCTGGGOSCCGCCGCCGACCACCACTCGCrGTACAAGGCGCGGACTCCCAGCGCCrCGGCCT 

7 0 CGGCCTCCGC^GCGCTCGCGGCCCCGGGCAAGCACCTGGCGGAG 

TCGTCGTCGCCCGTGGGCGGCGTGGGOJCGGGAGCCGACCCCAGCGACCCCCTG^ 
GCCCGACGCGCCCAGCCTGAGCGGCCGCMC^CGCCGCCrrCGTCC^ 
CCAGCCTGCGCX3CCGCCTGGCCCGCCCCGTCCAGCGCGCCCTCGCACGCGTCCTCCT 
TCCTCGGGCTCCTCGTCCTCCGACGACGAGTTCGAAGACGACCTGCTCGACCTGAACCCCAGCT 
75 GGGCAGCTT CAGTTCGTCGTCGGCG CT CGACCGGG ACCTGG AT"TTT Cu AG C C CGGCICCGG CTCG CACTT CGAGrrC C CGG 
ACTACTGCACGCCCXSAGGTGAGCGAGATGATCTCGGGAGACTGGCTCGAGTCCAGCATCrr

GCGCG CAGGCAGGGAGAAGGG CCGGGGGGGGT AGGAG AGGAGAAAAAAAAAGTGAAAAAAAGAAATOAAAAGGACT^CGAAG AGT 



CGTTTTGGACCCGCGCTCCCATCCCCCACCTTCCCGGGCCGGGGACCCACTCTGCCCAGC 
TAGACAGGGGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGG
CrTGAAGAGACCCCCTCCCCCTTCCAACGAGCrrCCGGACTTC 
GAAGGAGTCTCCCCCTTCCTGCATCACCACCTTGGTTTTGTTTTATT^ 
CCCCTCCCCCC C T' rrX T T TAAACGCGTGATGAAGACAGAAGGCrCCGGGGTGACGAATTTG 
CGCCGGGACTGAGAGACTCCACGCAGGCGAATTCCCGTTTGGGGCTTTT^ 
1 0 GGAGGAGGAGATGTTGAGGGGAGGAGGCCAGCCAGTGTGACCGGCGCTAGGAAATGACCCGAGAACCCC^ 
GGGAGCTAGGGGCGGGGGCGGAGGAGGACACGAACTGGAAGGGGGTTCACGGTCAAACTC 
GCGGCGGCGGCTGCTGGGCCTCCGCCTTCrrTTTCTACGTGAAATCA^ 

GGAGACTGTTTGATGTGGTACAGGGGCAGTCAGTGGAGGGCGAGTGGTTTCGGAAAAAAAAAAAGAAA^ 
AAAAAAAGATTTTTTTCTTCTCTTAATCGGAATCGTGAT 
TCTATCTTTTAAAGATTTCTCTATAAGACTGTTGAGCAGTTT^
TGTTTGATTCCTAGAACGAAATTATCACCAGCITTC 
GACTTTAAAAAAACAATTCAGGACCAAATTTTTTCTCAGTGTGTGTG 
TCCTTCACCGATGCTCCATCCTCGTATTTCTTTTTCCTTGTAAATGTAATCAGATC 
CCAAACATATTTTTTCrTrTGrrCCCTTT^ 

2 0 cttttitttttcttttttttttt^^ 

ttaacttttaattttttaTatttaaatgtagacttttgacacttttaaaaaaca 

ATTTT CTCAGTG T ATTTTTGT AAAAAAT AT ATAAAGGGGGTGTT AAT CGGTGTAAATCGCTGTTTGG ATTT C CTGATTTT AT AACA 
GGGCGGCTGGTTAATATCTCACACAGTTTAAAAAATCAGCCCCTAATTTCTCCATGT^ 

TG ACAGT AT CCCTT AACCTGC CACCAGTGT CCAC C CTC CGG CC CC CGTCTTGT AAAAAGGGGAGG AG AATT AGC CAAACACTG T AA 

2 5 GCTTTTAAGAAAAACAAAGTTTTAAACGAAATACTGCTCTGTCCAGAGGCTTTAAAACT^ 

GTAGCTTTAACTTGTAAACCACATCTTTTTTGCACTTTTTTT 

CCGATTTGAGTTCGCGACACTATGTACTGCGTTTTTCATTCT^ 

TGTTTTAGTCrrATGGCATGATGATAGCATATGTGTTCAGGTTTATA 

TTGTACATTTAAGTCTGTATTATAATAAGCAAAAAGATTGTGTGTATGTATGTTTAATATAACATGACAGGCA 

3 0 CTTTTTAAGGCAGTTCCGTTAAGGGTTTTTGTTTT^ 

TCTTAAAATTCAAGGCCACAAAAACAATGTTTGGGGGAAAAAAAAGAAAA 
TCAGATTGTTGATATATACCTTCTGTAAATAACITTTTTTGAGAAGGAAATA 

TTGTCGTTATTATGCCCAATGCCTAAGATTGGAAAGGCCCTACAGTATCTGGACACTACACAATCTGCCTTAG 
AGTTCTCAGAGGAGTCTTGTCAAATGCTACCTTATCGAGGAACATTAAATGTCrrCTTGG^ 

3 5 TGGCATCAGCTGAATATGTGTCAACTCAACACATGCTTTCTAGGAATTTTCTGAGTTTACCATATTTAAAA^ 

GAATCTGAAAGAGGGGGAGGGGCAACGTGGACAGAATTCTTCACAGATGAAACTGGCTA^ 
CTGGAGCTGCTTTCTCCTTGATTTGCCCATGCrTACCTCCAGGAGTGGCTGTAATTGGTTT^ 
TTATTATCAGTAGATTATACAGGAGGAGGCTTTTACCGATTATGAAATGCAAAACGT 
GAAAGGTTCATGTGCTACAATCTTCATTTTTTGCCATGCCCAACATG 

4 0 GAGAAGCTGGCTGCATGTGGGCTCAGAGGAAAGCCCCCTCTCCAAATAGCACTTACTT^^ 

AGAATCAGGAAGGATTGTTGAGACTGCAAGGTGAAAAAGTACTTGGATTT 
CTTGCTAAGTGGCACTATTTCCTGTTTATTTTGAGAGTTAACAATTG 

AAATGTGTCITTTACTTGTCTTGAGTTTTTCAGCACTGGAGAGGAGGGGTTAACAAGCAGAAGTGT^ 
ATTTGGATTGCTTGTAGATAAGGGAGTAGTATCCTGGAAGGTACATCTGGAAAAAGAGACCTTTTGAAGACTAGTC^ 

GGGTTTTGTGGTTGTCCAACGAAAAGCAATAATGCACCCCG

AT AG CT ACT AGTTTTCAAC ATTTTTTT CATG TG TGGTG CT CAT AT ATTT AGTTTT AATTG CAT TTGCGTTCTG CAATT GG AAT T C C 
ATTTTITAAAATGACCTTAGAACAACATGGCATCTAAGTTT 
CTTTTAATTTTTGTTCATTGCAGTGATAATATAATGGTTTAGCTTT^ 
CITAAATGATGCCITTTTTAAATGATGCATCrCTT^ 
50 CCCTTGACTCTCTATGTAAAAGCAGATGTTTTAAATATTTCCT^ - 

GTGTGTGTCCTTCCTGTCAGGTGGTAAAGTAAGAAGCATTTTTGTACAATGTCTTTTTCT 

tacttacttcttaggtggtacaaggcatcctaaacctgcgaaaccaaacagtttagtgtgatl^ 
caatagaaaagagtaagtttcagattttcttttaagaatatccagcaattgcttctgg 

gtg at acttgctt aag caag ttttgt caatt c ct agttg aattc cttt aagtgc caagaaaaaaaataaaaag catgt aag ctg ag 

agggtgggcccagctcctgggaaggcttttagggtttgtgtttaccctg

aaagcaagaaagcaaagggttaaatggacatcaacatttaccaac^tttcctgtgtttc 
ggtctggggaatactatg^tcagaatgcrrtaatcatgaattr 
aatgtttaaaaaaaaattttttttttttaagaatacaaacccatctggcc 
gggaggccaaggcgg<3cggatcacttgaggtcaggagtttgaaac(^gcctg 

6 0 acaaaaattagttgagcgtagtggtgcacgcctgtaatcccagctactcg 

gctgaggttgcagtgagccaaaatcgcat cagtgcactctagc ctgggtg aca^ 
atacaaacccatctacccatttggatttctcagagtgtaaggttggtatcct 
tcctcctggcacccctcctattaaggcctacaaaatacctctctaacccatattc^ 
cctgaaaaatgatatctcc^tacgcatttctcaattttcactatcatttgaaagctaaagt 
65 agacatccgcatggtgtttcaggctctgccaagtgtctaccaccattcccctgtgcccaggataaagtgcagaatt 
tcttcaaggccctgcacaaaagggttttaagcccaactttcca^ 

gactggaattcttgaaggacagaaact aggcattgaatacttcttg cattccccagtttgcrttccctgctt ctttt cacataat a 
aatg ctaaatt ctt atttcttg acttt acatggttt ccatgggttcct aagagctt ccaacaggcatttgcattttatgctgcttg 
tggtaagcgccagcgccggattgcatgaaaacctggattagaatgctgccxctc 
70 tctcag cctttttaagtttc cg tgttttaaatcgtcatatagaaag ag agctatctaatgtagacagctgtgttggg aatagagtg 
cagcggcacgtgtgtgacatgtcgtaatgcctggcctaccccatgggctcg^ 
aaaagctaatgtgtgtaaagcctaacacagtgccagcaacacagaagatgctttajuvttctaa 

ccg^ggtccg agctggggttctgtccccactgggctgt cgc cc cagctggactaccaccatggaactcagggccgaatacctcagg 
gagaagctgcagcgc<;acctggcggcggagcatgtggcggtggaggacaacacgaccctcaaccattgcgcctgtagctt 

7 5 GTuTCGC^CaaGTtCGaGGOGAAGCCGCTGCT CGCTCCTGCGCAT CCA 






TGCCTTTGAACAGAAAACCCTAATCCCAGAGC^^ 

TACAAATCTGGAAAAAAAAATTTCTAAAAATAAATCTCCTGAATCAATGTTTCTC^ 
TACCTGGAGATTCTTGTTAAGGGTAGCGATTATACAGATCGCCAGCCACCGTCAGACT 

AT AG ATTTCCAGTG CT CACTGAAGTTTGGAAAT CAGT ACCCTCACTGTATTTTAGCAT ATT CT CACAAT AAGTATG CAGGCG AGGG 
5 . ATTATGACGCCCTCATTGTTTTAACATGGAGAAATTGAGTTGAGAATGGCCC^ 

CAGAAAGGGTATTGGAACTCTATCTCCTGAGCCCTAGAATCAGCCTAGAA 

CCAAGGCTTGGTCCCTrGGCAAAAATrGGCAGTCCCT^ 

TCCACCATCCTTGTTAAAAGTTTAAGTTTGGGCCGGGCGCCCGTC^ 

GGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGACCAACATGGT^ 
AGTGTAGTGGCACATGCCTTTAATCCCAGCTACCCAGGAGG

GAGCCAAGACTGTGCTATTGCACTCCAGCCTGGGCAACAAGAGTGAAAATCCATCT 

TAATTAGCCAAGTTAATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTCCAG 

GGACTGATTTTTGGTAGAAGTATCCAAGGCAGAGAACAGAAATAATATAAATATAGTG 

GGCAGAGCTTCCAGCAAGTCAGAAAATTGGAAATTATTCAGAGCACACT 
15 TCCTAATGAGATTGACCGTTGGCTACAGTTAACTTGAAGGGTTT^ 

AGAAGAAAAAAAAAAACTCTCACATCACTAAACGGTATTTCCCAGGGGGATGTCTGACTAG^ 

AGTTAACAATGGACCTCTATGTGTGTATGTTTGTGGATACAGGTGATAATAAATTTAGCCC^ 

ATTCAGGGCAGTTGGTTTATGGGGTACCGCCAAACAAGCCAGGCCTAAGTCA 

TTACCCTGCAGATTTTTTGTTGAAGGACAAAGTTCTAAGTC^ 
2 0 ATTCTAAGGGTATTCCATTTGAGTGTTTTTTTTAAAGGGCRGGACATTAAGAA^ 

TTCTTTAGATATTTTTCCAAATACATAATTCTTTGGATTAGTTT^ 

GTTGAAAGGCAAGTGAAAACACTATTATCCTTAGTAGGTAGGCT 

GCTAATACATCTGATGCATGTAACrcCTTTTATAAGAGCTAAGCAGAAGG 

GTCGTTAATTGTTACTTTCrTTCCTACCGGCCCCTGCCCTAACTT 

2 5 GGAACCCCTGACCTTCAGTCTCTGCCCCTGCTTTCTCTCITCCCAT^ 

CCAATCCCCCTCAGACTTTGAATAATTTATCTGAAGTTCCTGGGAAGT^ 

GTTCCTTCTCTAGGCCGAGGTAGGTCCAAAAGCCTGTCTTTGAAAGAGGTGGTACTAAGTACAT^ 

AATTTTTTGGATGAGCACCCAATGAATCCAAAAGCTC^AGGAAAGAAA 

AATTCTGATATCTCCCAAAGAATGTCAAGTAAA1TAAGGGAACACTGATTAAGATTTCCA 

3 0 TAT AGATGT CTTTT CAAATACATGAG AGTTTTC CTTG AGTAGTT ACTAG AAGTGG AAA CTGGAT CTT AGTCTTAT AG AC AT AG 

GCTTTTGACAGTTTCCGTACCCTGGAGCTGGTT 

AGGTTAAAAAGTAAACCAAACTTTGAAACAATCATTTTAACCCCCGGAGAAGGTGAATA 

TCTGCAAATCATACTGAGATGATTTTTCTATCCCAAGTGTCTGATAACATATATGAATAGAAAAGCGAAC^ 

TTGACACTGACATGGTGACAAAGTATGGGGATTTTAGTACTTCCGAAATAAACATC^ 

TCTTCTTCCATTAGAACTCATCAGGCTGATTCCCATTTCT

GAACAGAATCACTCAAGTAACCTGAAATCATGGGGCTTTATTACAAAGA^ 

AGGGAACTGAAAATGTCATGGTAGTTCTGGGAGAAATAGTGTTCAAGATCCACGGCAGCTCTGGAGA 

TAGCATTAAGATTATGCAGATAATTAATATTGCTCAGGTATTCTCTTAGTATTCCTGTGTG 

CTTGCTTCTGTTCTCCATATTTTGTCCTTCATATTCCCTTTTAAGACAGCTCTGGG 

4 0 AGC^CTTTGGGAGGCCTTGGTGGGCGGATCACTTGAGGTCAGGAGTTCAAGATCAGC 

TT AAAAAT ACAAAAATTG TTGGG AGG C CG AGG CGGGTGGATC ACG AGGT AAGG AG ATTGACACCAT CCTGG CT AACACGG TG AAAC 
CCCGTCTCTACTAAAAATACAAAAAAATTAGCCGGGCGTGGTGGCGGCCGCCTTTAGTCCC^^ 
CAATGGTGGTGAACCCAGGAGGCZAGAGCTTGCAGTGAGCTGAGATCGCGCCACTGC^ 
GTCTCAAAAAAAAAAAAAAAAATTGGCCGGGCACTGTGXSCTCATGCCTATA 

4 5 ACGAGATCAAGAGATCGAGACCATCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAA^ 

CACAAGCCTGTAGTCCCAGCTACTCGGGAGACTG AGG CAGAAGAATCACTTG AACC CGGGAGACGGAGGTTGCAGTG AGCCAAGAT 

GGTGCCACTGCIACTCTAGCCTGGCAACAGAGTGAGACTTCGTCTCAfiAAAAAAAAAAAAA 

CTTGTAATCCCAGCTACTCTGGAGGCTGAGGTAGGAGAA 

ACTGCACTCCAGCCTGGGTGACAGAGCAAGACTACATCTTAAAAAAAAAAAAGACAGCTCT 

5 0 TTACCAAGCAATGAAACGGCTGCTTTGGGGTTGTGTGCCTACTTTTGGACA 

CATCCCTCTTTGTTTTG CrTTTGTTCCTAGGGTCTCATGGCC CTCATACGTC CT CTT AT ATTTTTrCTTTGATCTTCAT ATACTAA 
GGGAGGGTG AACC CCAGGCTTCATG<3AACATCCACCCTTCCTCCTCTACCAT CC^ 

CTTTTATCT ATTT AAAATTCTAACCTTTTCCCT ATAG CTCGAATTTTGGCACTCAGGC CT AGAGATATAAAG AGT AG AGATT AAAT 
ATGCCACATTCTAAAGGGAAGCCAGGTGATGCCCTTACCCCAGTAATCCAGAGCAGAAAGGATCCTACCATAGTCAGTATTC 
CCTCAACACGG

HUMAN SEQUENCE - mRNA 

TTCCCCAGCATTCGAG AAACrCCTCrCTACTTTAG CACGGTCT CCAGACTCAG C CGAGAGACAGCAAACTGCAGCG CGGTGAGAGA 
GCGAGAGAGAGGGAGAGAGAGACTCrCCAGCCTGGGAACTATAACTCCTCTGCGAGAGGCG 

6 0 GGGACTTTTCTCTCTTTACCCACCrCCGCCCCTGCGAG^ 

GTGCTTGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCX3CGGCGGCCX5CCGCGAGGGTGTGAGCGCGCGTCGGCGCCCGCCX5 
GAGGCCATGGTGCAGCAAACGAACAATGCCGAGAACACGGAAGCGCTG 

GTGGGCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCA 

6 5 CACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTC 

GCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCCCAGGA^ 
GCTCCTCX^CCGCCGCCTCCTCCIAAGCCGSGGGGAGAAGGGAGACAAGGTCG^ 

GGCGGCGGGAGCAGCAACGCGGGGGGAGGAGGCGGCGGTGCGAGTGGCGGCGGCGCCAACTCCAAACCGGCGC^ 
CGGCTCCAAAGTGGCGGGCGGCGCGGGCGGTGGGGTTAGCAAACCGC^ 
70 AAGCAGCGGCTGCCX3CCGCCGCCTCCTTCGCCGCCGAA 
CACTCX3CTGTACAAGGCGCGGACTCCCAGCGCCT 

CAAGCACCTGGCGGAGAAGAAC^TGAAGCGCGTCTACCTGTTCGGCGGCCTG^3GCACGTCGTCGTC 

CGGGAGCCGACCCCAGCGACCCCCTGGGCCTGTACGAGGAGGAGGGCGCGGGCTG 

AGCAGCGCCXSCCTCGTCCCCCGCCGCCGGCCGCTCGCCCGCCGACC^CCGCGG 

7 5 GTCCAGCXiCGCCCTCGCACGeGT^ CTCCT CITCCICCTCCTCGGG CTCCTCGTCCTCCGACG ACG 
AGTTCGAAGACGACCTGCTCGACCTGAACCCCAGCTCAAACT
GACCGGGACCTGGATTTTAACTTCGAGCCCGGCTCCGGCTCGCACriT 
GATCTCGGGAGACTGGCTCGAGTCCAGCATCTCCAACC^ 
GGGTAGGAGAGGAGAAAAAAAAACTTGAAAAAAAGAAACGAAAAGGA 
5 AAAGTAAGCAGGGCTCGTTCGCCCGCGTTCTCGTCGTCGGATCAAGGAGCGCGGCG^ 

CTTC C CX5GGCCGGGGACCCACTCTG CC CAGC CGG AG<K1ACGCGGAGGAGGAAGAGGGT AGAC^GGGG CGACCTGTG ATTG TTGTT A 
TTGATGTTGTTGTTGATGGCAAAAAAAAAAAGCGACT^ 
GCITCCGGACTTGTCTGCACrcCCAGCAAGAAGGCGAGTTAGTTTTCT 
CITGGTTTTGTrTTATTTTGCTTC^ 
GAAGACAGAAGGCTCCGGGGTGACGAATTTGGCCGATGGCAGA
ATTCCCGTTTGGGGCCTTTTTTTCCTCCCTCTTTTCCCCTTGC 

G CCAGTGTG ACCGG CG CT AGGAAATGACCCGAGAACC C CGTTGGAAG CG CAGCAG CGGG AG CT AGGGGCGGGGGCGG AGG AGG ACA 
CGAACTGGAAGGGGGTTCACGGTCAAACTGAAATGGATTT^ 
TTTCTACGTGAAATCAGTGAGGTGAGACITCCCAGA 
1 5 CAGTGGAGGGCGAGTGGTTTCGGAAAAAAAAAAAGAAAAAAAGGG 

HUMAN SEQUENCE - CODING 

ATGGTGC!AGCAAACCAACAATGCCG AGAACACGGAAGCGCTGCTGGC CGGCG AG AG CTCGGACTCGGGCGCCGGCCT CGAGCTGGG 

2 0 ACJ^TCAAGCG ACCCATGAACGCCTTCATGGTGTGGTCG CAGATCGAG CG<3CGCAAG ATCATGGAGCAGTCGCCCGACATGCACAAC 

GCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAAGATCCCT^ 

GCGCCTCAAGCAGATGGCTGACTACCCCGACTACAAGTACCG 

CGGCCGCCGCCTCCTCOUteCCGGGGGAGAAGGGAGACAAGGTCG^ 

GGGAGOVGCAACGCGGGGGGAGGAGGCGGCGGTGCGAGTGGCGGCGGCGC^ 
CAAAGTTCCGGGCXX3CGCGGGCGGTGGGGTTAGCAAACCGCACG

CGGCTGCCGCCGCCGCCTCCTTCGCCGCCGAACAGGCGGGGGCCGCCGCCCTGCTGCCCCT 

CCTGGCGGAGAAGAAGGTGAAGCGCGTCTACCTGTTCGGCGGCCTGGGCACGTCGTCGTCG 
CCGACCCCAGCGACCCCCTGGGCCTGTACGAGGAGGAGGGCGCXK3GCTGCTCGCCCGACGCGCCC^ 
30 GCCGCCTCGTCCCCCGCCGCCGGCCGCTCGCCCGCCGACCACCGCGGCT^ 
CGCGCCCTCGCACGCGTCCTCCTCGGCCTCGTCCC^CTCCT^ 

AAG ACG ACCTGCTCG ACCTGAAC CCCAG CTCAAACTTTG AGAG CATGT C CCTGGG CAG CTT CAGTTCGT CGTCGG CG CTCGACCGG 

GACCTGGATTTTAAC!TTCGAGCCCGGCTCCGGCTCGCACrTCGAGTTCCCGGACTA 

GGGAGACTGGCTCGAGTCCAGCATCTCCAACCTGGTTTTCACCTACTGA 

Table 20

MOUSE NOMENCLATURE 
ICSGNM Tcof1
Celera mCG6035

HUMAN NOMENCLATURE 
HGNC TCOF1 
Celera hCG38609 



MOUSE SEQUENCE - GENOMIC

TTAAGAAAGCTGGGGCCACTGGACAGTGCTGGCCCACGCCT 

TGAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGGACAGCTGAGACrAC^ 

AGAGGAAGAAGAAGAGGAGGAGGAGGAGAATTNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNl-INNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNKNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CACGAGACCCTGTCTCTAAAAAGAAACAGCAGCTGGGACCTGAACCTGTAGTCA 



2 5 AGCCTAGGAGAAAGGGACCAACTGGTCTACCTTTACTCCCTACCT^ 

TTCTTTAACCATGAAAGGATGGAGCTGGCCAGGAGCTCT 

CACACCCAAATAAATAAATAAATAAATAAATAAATAATGCTCAGGAGACCAGTGGGCAATTAAATTGGTTCT 

CTCTCCCTGTTGCCCACTGATTCCACCCTACAGTTTTAAGACTTCAGT 

ATGTCCTGCCTCCCTCATCTGTGCCTGTGGTCCAGCCTGGGTCATGTTAGGAGGCCCTG 

TTGTACCACCGGTGGTAACTCAGTCCTGACCTCTCAAAGTCT

GTTCTACGGTGCTAAACTGATG AGTG CTT ATTGGCCTGAAGGCT AGCTCCGGCTCGCTGC CTT 

TGAAGGATCCACTCTGAGCCTATGCAGACCTCAGTATCGGGAGAACTGGCTCTAACATGTTACT^ 

AGGGTTTTATGTATACTGTGAACACTCTAAGAAATAAAGGAAGGGAGAAAGGGTGGGAGAAGGGGA 

AAG CCCAT AGGGAGGTTGTGAGTACTTGCXX3 AAACTGGGCAGGG ATC CGGCCTCTGA 

3 5 GGGTGAGGGTACTGAAAGAGAAGTAACTCCCTACAACTTGGAATGTCTTAAGCTGCTACTTTAGT 

AGCTTTCTCTGGATTGAAGAGAAACAGAAATTGCAGGTCAGATCTGGGA^ 

GAAAGTCCGGTGCTGAATGGATGGAACTAGAAAACCTCATCCTGAGTGAAGTAACTGGG 

G TGG ACG TT AGCCAT AAAGT ACAGG AT AAC CATGCTATGATCCAC AG AG C CAAAG AGG CT AAGT ATCAAGGAGGG C CG AAAGGAGG 

4 0 ATGCTTGAATCTCACTGAGAAGAGGGAATAAAATAGACATCTGGGGTGGAGGGAGGGA 

AAAGCAGGAGGGATCTCAAGTGTGGGGAGGATGGTGGGAGACAGAACTGGGATAGAGAACT^ 

GGGAAGCATCTCTGGGACAAGGTAGAAATCTAGGACAATGGAAACTCCCAGGAATCTGTGAGGATGACCCTAGCTAAGACTCCTAG 
GAGTCAGGGATATGGAACCTGAAACAGGCACTTCCTGACACCAAACGAGACTT 
AAAAATCTTTGGCCCTCAATTTGCCCTGCCTATAAGAGGTGGTGGGATAAGGAGGGAGCAGAA 
45 ACTGG CC C AAATTGAG CCTCATGC CATGGGAGGG AGC CCAC CC CTG ACG ACAG T ATT AACAAT ATT CTGC CG TACTTG CAG ACAGG 

AGCCT AG CATAG CTGTCACCTG AGTGGCTCTGCCCAG CAGCTGATGAAAACAG ATGCAGAG ACCC CCCAGTCT 
GCTCGGAGAATCTCACGGCAAAGGGGGAGGAAGGATTGAAAAAAGCCATGTCAAA 
ACCTGGGCACAGGGGGGCTCACAAAGACTGAATCACCAGAGAGCATGCATGGGACA 
TGTACAGCTTGGTCCTCGTGTGGGACTCCCTAACAACAGGAGCAGGAGCTATTTCTGA 

5 0 ACTGGACAGCCTTAACAGAAGATGTGATATGACATGGCATGTTGATATTCAGAGGTCTCCCTTCCTCrGG 

ATAGGAGGAGGGGAGGGGAGAAGGAGGGACTGGAAGGAGAGGGGGTAGGGAAAGCTCTGAT 

AAATGGATGACTAGAATAGAATAGAATAGAATAGAATAGAATATAGTGCTAGAACAGACCTTGTTCACAGATCCATGTAAC^ 
GTCGTTTCAGGGGACTAGAGAGGACAAGAGACTGGCC CACGACCACG TAGC AG AAGTTGGCTATAAAAGCTCTTTACTGGTCCCTG 
GTT CTGCTGCTTCCAACCTGTGTGCAGACCATTGCTG CTGGGAAGAG CATGTAGAACCCTACCTGCTGAGTGTTTTCTTCTTAAAA 

5 5 ATAAAAAAACAAAACAAAAGAATTATTATGCTACACACCAGGGTGCATG 

AGGC^GATCTCTGTGJUATATGAATCCAGCCTGGTATACATAGTGAGTTCCAATATAGTCAGGGACACATAGTG 
TAAJU^CAATAAATAAATAAATAAATAAATAAATAAATAAATACATACATAAATAAATAAATA 
TTTTAAATTATGTGTATATGCGTATGGGTATGTGCCCATGTGTAGAGATAATCTCT^ 
AATTTAAAAAAACTGACCAATGAGCTTAGGGAGAAAATCAGGA 

6 0 GAT AGAGACAG ACAGG AGATTGGCCCCACCCTCAACCCCCAAATGAAGATATAGGCAGCAGCAAC^ 

CGTGACAGAACTTAGAATAAAATAACAGGATGATTAACTTATGAGCTAGTTG 

AAATATATTTGAGTCTCAAGAGTCATTATTCCAGAAGAGGGACCAGGAGGAAAAAACAGGAGCT 

CCTTGAGT0K3AGGTCCCTGAGGAGGTCAGAAGAGGGTATTGGATCCCCT 

TAGGTGCTGTGACTCAGACTCTGGTCTTCCACAAGAGCAGCGAGCGAGTGCTCCTGACTACTGA 
6 5 TCCTCCTTTTCCTGACCrrGCCTGGGAACCTGAGGTGACTAAGGCAGGATGCTTTTCCCAAGGA 
GCTTCCZAGCTTCTTCCATGAACAAGGTCAACTTTTCCAAACT 
GTTTGCTTCTGCCCTGACTGTGGCCTTTTTATATTTTATGGGCTTAATGTTCT 
AAGGATGTGATTTTAGAAAATCCAGGAAATAAAGGAATTGTCCGCCAAAGAGGCATCTGAC^ 
AATC^CCTGTCGCCAGACTGAGGTATTAGCATGTCCTGXSTTTAATGTAGGTTTGTCTTTTTAA 






ACTTGCCAGTAGTGACCAGAGAGCCTACTGTGCACAGCCTCTGAGCTGGGGGCAGGTCCATGTGCT 

CTTTTTAAGGGCCTGTTCATTATCGTTTGGAGAATCCCCCAGAGGAGGA 

GTCTTTGAATGCTTGCCTGCTGATTCCGCACGCTCATCAGCCT 
AGGCTGTATCCCTAAAACCTGACCCITrGCATCCCTCTCCTACCAATTTAATTCCT
CACCTTAGACCCCACACCCTTTTGTTTCCTO 
GCCCTGCCTTCAGAGAACCCAGCAGACTGGCCACCGTAGCTCT 
CAAGTTGCTTGACTCTCTGAATGCTGGCTCTCCCTCOT 
5 ACTTGGAAGAGGAAAATAGAAGTTCAAGATGAAATCTCCCTTAACCTCCCACT 
Ca^CGAGCCTGCTTTGGGCrTCTATTGAAGTCCTCCCTCCTCCGTT 
CGTACCCTCAGACATCGCTTCCCAGCTTCCTAAA^ 
CAACITCGGTCTCTTCCATCTCAAAACACTCCTCCCCCAATAA 
AGTTCTTTTTCTCCCCTCCACGCAAACCTG^ 
10 CCTGTTCTGCATGCAGCACTTTCTGAGTCAGTCTTCAGTGATAGCAA^ 
AGAGCA.CTTCCCAGCATTTCACATTACTGAAACTCTGCGCT 
AGGCTTGTCTTTTGAACAGCCCTACCTGTGGGCATGCCTCAAAAACACCTCTG^ 

GGGCATGGTGTCTACATCTAGAGTCCCAAAGAACT'l'Cr'i 1' C CATTG ACAAATTT C T 1 1' LT TTTCT CTG ACTCAGAC CTCT AATTTG 
TGCCCAAACCAGGTTTCAAAGTACCTACTAGGGACCTGGGACTCTC^ 
1 5 TGTCCCTCTAGGGCTCAAATTGCCCCTGCTCTCCAACCAGCCTCAAGCTAAGGAAACCTCTA 
CCTTTGCGCCCTTCAGTCCCCACATCGGAGTCCTCCTGTGCCCTG 
TTCTTGTTTAGCCTTCACAAGGCTCTGTTCTCCAACCCCAGCACCCCAT 
TCTATTTGCCTTTTGCCCTTTCTAGCCACAG 

TTCCCCAAGCCCACTTCCTGCTACCTCATCTAAAAGGTTCCGCCTACCTGTCTGCACAG 

AGAAGACACACCCTGAACTGCTTCTTGCriTACCATCC

TAGGTCATTGTTCTATGCCACAGATGTTGCTCCACCCTCTGTTTGCTACCCreCT 

CCTTCACT CT ACT CTCTGG ACACTTGGACAGGCCATC C 

TCCAACCAGCCTCAAGCTAAGGAAACCTCTAAAATGGAGTCT 

CCTCCTGTGCCCTGCCAGCAAGGCCriTCACTCGCCTCTCCCCTTAGTTC 

CTCTGGAGTGCTAGGATTTCAGGCATTGGACACCACC^ 

TCTGACCAGGTGCCTCTGACCAGGTGCCTCTTTTACCAGGTGCCCCTCTGACCAGGA 

GCTCAGGTTTCTTCATAAACACCATTTCCTCTGGGAATCCTTCTACTTCCCCT 

ACAGTAArrGCTAAGGTACTCAGTTCTATTGCTAAGGAGCTTTCTAAACATTT 

3 0 CTTGGAGGTAGAAAATGGTAATTATCTCCACTTCTCAGATAAACCAAGGCACAGAGAATGCCT 

AGAAACAAAGCCAAAATTTAAATCCAGATCGTGTGACITCAGTC 
ACCCTTTGGTGTACGCTACAGCTATTTGTTTAACA^ 

TCTCCAGTAGTAGGTATTCAATAAATATCTGCCTGGTGTCTGAATGTAGATTAGTCTCCTC 
ATTCCTGTCACCTGGTCCTGTGTGCTTCTCCCTGGAATGTGCATTTGTCACTTTTCT^ 

3 5 ACTTATATTAGAAAGGTTTATTATGGGGGCTGGAGGGATGGCTCAGCAGTTAAGAGCACTGACTGCT 

CAATTCCCAGCAACCACATGGTGGCTCACAACCATCTGTAATGGGATCAGATGCCCTCT 

GT ACTCACATATAT AAAATAAATAAATAAATCITT AAAAAAAGAAAGAGGCCAGGTGGTGGTCGCG CACG CCTTTAATCCCAGCAC 

TTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTCAAGGCCAGCCTGOT 

AAACCCTGTCTCX1AAAAAAAAAAAAAAAAGAAAGAAAGAAAGAAAGAAAGA 

4 0 AGGCCATGAAGGAAGGAAAGATGGTGGCAGGAGCTTTGGTTACATGTC 

CCCTACCCAGACATATCCACAG^AGTATCTCITAGGTAATT 

GTGGGTCCTAAACTTTCGTAGAAATGTATGTTGACACATATTTCTAGAGTGAAAGCTATCAGCAGA 

AACAGCCGCCTCAAATAT^CCCAGGGTACTCCCATTCTCTCCTGGTCAGTGCCTC 

CTTTTCTGTATTTTGGACTTTCTTCTTCCCAATTCTACAGCCTAGCAGTA 

GGTGGTTCTCACATTCTGACTCACTCAGTACCTCCAGC

ATGTCTCrTGGGGGTTGGGTGCTCCTCCTCCATGCTTCCATGAGCTCTGACTTCTCTCGAC^ 
AATTACTAGGTTATTCACTGTGTCTCCCACATGGGTGATTTTTGAGTCAACrrGTTTGCTTTG 
AGCTAGGCGCGATCCAAACATATAGAGCTCAGGTAAGGGTTTGTTTAGAGATTCAACACA 
GGATCCTAAACACTGTGTGAAATGAATACTCGGAAGCTGGGGACAGTGATCAGGAAGTTCAAAACTC 

TCAAAAAACCTAGACAGTAAATCCCTTCAAAGGGTAAGAATTAAGAT

GG CGGCCCAGAGC^CGGGTCTAGATCGGGACACGCCTTCGAGCCAATG^ 
TGGGATTGCCCACTCCAGCTGCAAGACTACACrTCCCAGCGTGCCGCGCGGCCCGGTGAC 

GTTCTAGGGCGCGAGGGAAGCGGCTAG CGGGGGCGAGGGGGCGTGC ACGTG AATCCC CTGGCCAG ATCTCTGGCATGGCCGAGGCC 
AGGAAGCGG CGGGAGCTGCTTCC CCTCATTTACCATCATCTGTTGCAAGCAGGCT ACGTCCGCGCGGCGCG^ AAGTAAAGGAG CA 
55 GAGCGGCCAG<jTAAGCGTCAGATGGCCCAGTACAAGATCnAGAGAGCAG 
GTCAGAGATGCGGGCTAGGGTGGAGGAAGAACATTAAGCTCTGCTCTACT 

CCCCATGGGTGCAGTGGGCGGGCCGGAAACTGGTCCAACTAGTGCAGCTTTGTAATTCCCGCCCGGG 
TAAGTTTCCCTAGTATCTGGCCACGTGGCTCTATGCAGTTTCCCTGGGACACCACATGGGAGTCCAGCTGGAG 






TTATGACAAGACTTGAACCCGTGGTGCGGTGAITTCAAAGTTCATTCATTTGGCAAC 



CCAAGTGCTGGGATT AAAGGCGTGCGCCACCACTGCCCGGCTGG CAACTTTTAATTTGGCCCTGT CGTGCGGATGATGAAATGGG A 

CGCGCCAGTTTCAGACTCCTCTGCCCGTCTCACAGCCTAATAGCTGGTAC^ 

GTACAGTTAACATATTAAGCTATAGAACGAGAAATTGGAGATGCTAAA^ 
65 TGCTTCCCTAAGGATGTAGTAGCTGATCTGAGATCTCGAGGGAGATGGGTATTAGAAGCTCTGATGCGAGGCTTCTGAGTGTCCTA 

GCCACCTVGAAAGTTTGCTCAGCCTGAAGTTTGAAATACATGACT 

GGGTAGTCTTCCCAGAACTGTGTGTGATTAAGTCACTGGTGAAATGGATATO 

TTTAATCCCAGCACTCTAGAGGTAGAAGCAGGTAGATCTATTTTT^^ 

TCTGTATAGCTGGCCTTGAACTCAGAAAGCCACCTGC 
70 TTGGTAGATCTCTTAAGTTCAAAGCTAGCCTGGTCTACATAGTGAGTTCAAGACCAGCCAAGG 



GGATGGATTTCTGTGAGTTCAAGGCCGGCCTGATCTACATAGTGAGTTCCAG^ 
AAAAACAAAAAAACAGACTTAAAGCCCAAAGTTGCTGGGCCCTTAAGATGTT 

CCATGTCCATTCATACTCAAACTATAAAACAGAGGAAATGGGTTATAGTGCCCATTGTTGTGGTTCCTGTGCC 

CcT'GCTC^CACAGAAAAGATGCAaATACAA^ 
ATGTATGTGGCTTAGCATATTGTGGCAGGTC

AGAAGCTTTAATGGGGAAGGTGACGTTGGAGTAGAGACTAGAAGGAAATGAG 

TCTCAGTAGCAAGAATGGCAAAGCCCTGTTAGGAATTGGTAGCC 

CTGACTGACTGACTGACIGGACTGTGCCCTTCMACCTGA 
GACCAGGCTGACCTTGAACTCAGAGATCTGCCTGCCT

TATGGTGCreTGAAGAAGGATCCTGAAGTTGGCAGAGGTAGAAGATGAG CTGAACGGTCCAATAGGTGCTCTCCCTGACTC CAGGC 

CAATCTGTTTO^TGCTAGCATGTTCTTTCCTTTC 

ACAGAGTGTGTCCTCACCTTAGGCCTGGATTCTAC 

ATCCACTGGGTTCCGTTATGTCACGTGGAAGCTTTCAG^ 
CTGTCATGTGGCTTCCATCCCTGTACTGTAGT

CCCTTTCCGATGTATCCTATTGAAAGGTTTGTAGCTGGAACTGTTTGTCC 

AGCCCAGAGGACCTGGAAGAGCAAACGGCATTTTCCAAAGC^ 

ATGCCTATCGTGTTGGGTATTTTTTTTGAAGCrrCTCT 

CTTCTGGACATCTATACACACTGGCAACAGTAAGTCTGGTGGAATGGAAGG 
1 5 ATTGAGCTAATATGTGAATGTGTTCTAATGGCCCACTGTGGGCATACTTGTCACCT^ 

CCATGTGTGTTCTGTTTTAAAGAGAGGATCTCACTATCT 

CTCTTGCATGCTGGGACAATTGGAGTGTGTCTGGGATTAAAGGCGCACGCCTTGG 

TGAGTTCCAGGACAGCCAGGGCTACACACAGAAACCCTGTCTCAAAAAAA 

CACACCTTTAATCCCAGCA.CTCCGGGGATATTGAGGTAGTCAGGTCTCTG 

2 0 AGAGTCTCAAGAAATAAAGGAAAAGAAATGGAACAGCTGGGTTTTTAT 

GTCCCTGGGTAGCCCTGGCTGTCCTGGAACTCCCTCT^ 
GACTAAAAGGCATGAGCCTGCACGGCCTGGCTTGTT^ 
TGGTAGTGAGAACTAGTGAAGGCAGGAGCCTGGCGCCCAACCT 
ACAGTTCTATCATGGCAGCACCG 

CAAAGATCTC^CCCACTGAGCCAGTCCAGAGCAGCGATTA 

GTTTGTGACCTCCTAGAATAACATCATCTGTAAAGTGAACTCTATCATCACGTATTGGGAA 
GATAAv^GTCCTAAAACTGGCCAGTGAGATCTGTCCrGG 

3 0 GAAGGGGGGGAAAAAAAATCCCTGTTCTTGTGAAATTCACATTCTGTTGGGGGATGG 

TAATAAAGCCTCTTCTAACAGCTTCCTGGCAGGTGGCCTTCCTAGGAAGAAG^ 
AGTGTCAAACCCAGCTAGGAAGAATGGGGCTTAATC^ 

TG AAGACTT ACAGCACGCGTG CGCG CACACACACACA.CACACACACZACACACACACT C CC CACTCTGCTT CCTTCCAG ACT CCAGT 
TCGATGGTGGTTCCTTTCTCrAGGTGTGTGCCAGATACTGGGTGGGATGCTGTATGCT 
3 5 AGTCGTCCAGAAATCAACCAGAGAAGGAAACGAAACTGAACGCCAGGGC 

GATGCTTCACGCCAAACTCGAGAGTAACGAGAGTGCTCCTTCCTCTTTGATGACTCTAGCTTC 
GTCAGAGCTGGGCGTGGTGGCGCACGCCTTTAATCCCAGCACTCGGGAGGCAGAGGCAGGCGGATT 



CAGGACAGCACCACAAGMSAAGGACGCAGTTC^ 
4 0 GCCGATAGTAGTGAGTGAGCTGCTGCCGTCCTGTCJVCCTCGTCGGCTGCTTGGGACCCTGAGACT 
GGGTGAAGAACAGATCATTTCreCACTGTGTCGGCGCCACTGTTTAAACTCTGTGTGCC 

AACAGGT ATCTTTCCAAACCTCATGG ATTTGTCC CATCTCCCCTG CCCTGCAGG ACCTCAGAG CTTGGCCAGAAGCAGAAGGCAGA 
GGATGATGAGACCCTTCAGGCTAAGAAGTCTCGAGTGTCGGATCCT 
AGGCAGCAACCGAAAGGG CCAAAGCCAGTAAGAGCCCTCCTTGCTGTGT^ 
TCTGCTCTACCGGGGTGTCTGCCTCTGAATCATT

CrGGGAGGCCTTCCTTTGGCAGGCATTTTAACCCCTGAGGGCAGGCATT 
CCAGGAGGCTGGGTAAAGTGGCGCACACTTTTAATCTCAGCACAGAGACAAGTGAATCTG^ 
TAGTAAGTTCCAGGCCAACCAGGGCTACAAAGTGAGAACCT^ 
AGTCAGTTGCTCAGTTCACTTCACAAGGACTGTTTCCACAGCTCCTGGATGC 

TGCTGGTTGTTTTAGTTACTGTTCTGCTGCTATGAGGAGGCACCGT^ 

GGCTGG CTT ACAGTGTCAGGGGAGTTCATCCATG ATCC CCATAGCAGC AGCCAGGC^ 

TTACATCCTGATCTGTAGACAGTGAACAGAAAGCTGGAGACTGGGCT 

T ACACCTCCAT C CACCAACCAGG AAC CAAACATT CAAAT AT AG TCT C ATTCAAACCAC CAT ACTAATTGTCCT CTT ATTTT GGCGG 
TATGTGGTGGGTCTGGTAGCTATGTGGCCTCTTCT

GTGTCCCACAAATATCATATTGCTTATACAGAAGACAGTTTC^ 

TTATTATAAAATGTTCATTACAGAACATTTGG^ 

CCCCCACAATCTTGTCTTTCATTATTCCTCCTGTGTATGTTCT 

TAAT CTTAAATT AT AAATG CAAAG CAGGT CCAGTTTAT AG ACAGAAGCACAG AG AAGG G AAATGT AAG AATC CTTGTT G CCAG CCC 
CATCACTTCCTACCCATGTCTCTTGGTCGGCAGCCCATAAACCTGCCCTGTTGCT^

GTGGACTGACCCACAGCGGGCTTCAAACAGAAATGGCAAATGCTTGTGCTATTAGGTATCTGATA 
ACCTTACTCTTTAACAGTTTCATACAGTATGTGTTGATCATC 
TCCTTAGTACCCTCTTTTTGTTTCCCGAGTTTGTTTTTTTATTGTTGCAATCAA 
TGTTGACTGATCCTGTGTAGGTCACCGGGCTACATCAGGTTCATGAGTATGACCTTGTTO 

CTTTTGTTTTTGTTTTTGTTTTTTTGTTGTTGTTATTGTTTTG

TAGCTGTCCTGGAACTCACTCTGTAGACCAGGCCGGCCT 

AGGTGTGCACCACCACTGCCCAGCCAGCTCTGCATTCTTTTTTTTTTTTT 

TTCCAATGCTATCCCAAAAGTCCCCCATA^ 

GCATTCTTAACACCATCTATTCTATAATGCTCCCTGAGCCATGTAGGCTTGATAAATGTC 

CACTCAGTCTCAACTTTGACAAGTTAGGAGTCTCTGCATTGCTATTGCAGAAAGG^

TTCTCATGGCAAACTGACAGTAGTAAGTCCCATCACTTATACC^ 

TTATTTTATATATATATATATATATATATATATANNNNNNNNNNNNNNNNN^ 

NNNNNNNNNOTWNNNNNNNNl^^ 

TGAGCCACCATGTGGTTGCTGGGAATTGAACTCAGGACCTCTGGAAGAGCAGTCAGTGCT 

GCCTCAAACTCAGGGACGTCCTG C CT CAGCTTTCTGGGTGT CTGGCTT AATTT AAG AGTCCTCAG AAGT ATTTATGGG T AACATGA
CAGGGTGTGAAGTTCTTCAAAATAATGTATGATGGGGAAGG 
AAAGTTACAATAACAGCTGAGGCTGCATATCATACATACACACAC^ 
AATATTTK^AAACT^GCTCATGTGCAGAAAAGG 
CTGTCTCTTTAGTGATAACTTCTTCCCCTGAATGCCTGAGATCCAA^
GTCTCCTACTAGTCCCAGGCTCCTGGGTCGGCACAATTGACGACCAC^ 
TGCCCTGCCTGCCAGTGACTTATCAGGACTGATGATGGTTGG 
GTGGTGGGTGGTAAGATTATAAGCCTGTCCnTTCCCTCTGGGGGCCTCCAGATCT 
GATGCAGGCACAGTAACCACCATGGGTTCTGGGCGAGGCCTTCTCATGG 
GCAACTT AGTACCTGTAGGACCATTAGGTGGTGTAAAGTCTGCCCATCri' CI

TGTCTTGCAGCCCCAAGACCGACACCTGTCAATTCT 

GAGTTACTCATTACCTGTGGAAGCTGTATCCAGTGGAGCATAGTCTGCCGTCT 
TAGGTGTCTGTGACrTCATCTAGCTATGCCTCGGGTGCCTTT^ 

CCATCATAAGGAATGCACTGTACACATTTTCTAGATCAAAAAATACTT

CAGACACATAAATGATGTGTCACCCGAACCAGGCGATAGCTATCGTATGTAAAAGCAATGATGAAATAATAATAG 
ACTATAGCTTCATGTAAAGTAGGAACTACTCTGACCCAGAAGCAG 

GCTGTAGAGATGAGATGACAAACCAGTGGACTGTGAAAGGCCAGAGCTGGCTGCTCTTAGTCTG 
GAGCTCGGAAGTCGCCTTGTAGGACCCAGTGTCCTCCTGATGCTTTTCACATGTC 

CTCTGTAGGGAAAGACCAAGACAGCCAACAAGACGGTGAACTCTGTG

TCTGGGAAGTCACCCAAAAAGTCAGCAGAGCCCTTGGCAAACACT 

CCTCGGACCCACTGCCAAGTCTGGTGAGACCCCCAGAGCCTGGGGCAAGGGGTCAGGACT 
GTGGGTTTGTCTCTGACAGCCAGGGTCTTACAACAGCCATTTCCACC^ 

TTATATGCATCGGTGATTTGCCTGCATGTACATCTGTATGAGATATCACACTTTGTAGTTATAGAC^ 
GTGCTGGGAGTTGAACCCAGGTCCTCTGGAAGAGGAGTCAGTGCTCTTAACCTCTAAGCC^
TTTAGAGGGTGGCGTTTCCGGCCTTCTCTCTCTGGTGGATTATAGGGAACACTG 
GTCTTGATGCAAAATAGGCCTTTGCCCAGCTTGTCCrc 
TTCTAGCCTTTCCAAGACCTCAGAGAGGCCTGCAAAAATAGGT 
GATAGCAAGCCCHX3TACAGTTGAGAAGCATGTGCTAOT 

ATCTCAAGCGATGAGACAGATGTCGAGGTAATTGCCACCCATCCCCAGGGATTGATCTCTTCCAGCTT 

TGTTCAGATCCCAGTGGCCAGTCATCAGCCCAG CAGTTGTGTC CG ACAG CTGCATGGTTACAGACAGGGCTAGGCAGTTGGAGAAT 
GAGGACCTCAGTACTCCTCTGGATATGCTGGGTATCAGAGTCCAGTGCT 

GT AACTGGGACATGTGGCCAGGGGTAGG ACTG C CTCGAGGATT CTGAGCCCAGTGG TG AATG TG CG TTTGGGG AT AGAATTGGATG 

GGATCCTTGGTCTGCAGCCTTGTCTCTTTGTAGGATCGTTGGGGGG

GGCCTGGCTCGCATCTCAAAAAGATTGCTGTAGCGTGGCCTGGAGGAAG^ 
TCAGGTAACAGCCACAGGTGCCAGTGGCAGGAACCAGTATGCCCCATATGTATG^ 
ACCCCAGTACAAAGCCACAAGCTTATAAAAACAAAAGACAAAA 
AACTTTGTAAATCACAAGACIXn'ATCACACTTTCAAAAGGTTGG 

GCCGTCTCTCTGTCGGGCCCTGAGTGTAATAAGGCCCACG^ 

AGCTCCTACGTATACGTATACAGCACGTGTACAGCACTGGCAACAGGAAACCAGCGCGGAGCTCCTG^ 
GTGGTGCCATAGGCTGGTGTGGGGCGTGAGGGGCAGCGTGAAACAGGCGCTGACTG^ 

TTTGGAATGCTAAGGAAGTTCCAGGAAGAAGTTCAAGTCAGTGCAGATCCAGGCTCCTCTGTCCGGAAAGTA 

TTTTTAGTTGAGTGGCCCCTCAGGATGGACCCTCTTATAAAC^

GGTAGCCAGGGGTCCTTATAAGCCTTTTGTGCTGCCCTTAAATGATCTGTGCCTTTTT 
TCTGGGAGATGGAAAGGAACATGCTTTTAAGAAGGGGCTTGTATTCCTTTTCT 
CTGAATGCTTGGTCACCAGGGAGTGGAACTATTTGAAAGGATTAC 
CGCTGGGGGTGGGCTCTGAGGTCTCAAAGGTTTCCAAAGTCCTTCT 

CTTCTCCAGCAACACXSTTCTTCCrTTTAA

AAGCAAGTTTTTTAAGAAAAAAGTTTAGTTGAACTTATGGTTTCA^ 
CTGGAACCGCTCGGAGCTCGAACCCCGATTCACAAGCAGGAGGC^ 

CCCTCCAGCGGGGCAAACACTTCTCGTAGTTGGGAGCCTATGGGGCCGTTCCCATTCAGAT^ 
GAAGGCTCTTCATGTGCTGGATCTCTTGCCTGCTTCTAGGTGAAATCrCCIAGCAAAAC 

TGCCAAGGATCCTCCAGCAAGAACAGCCCCAGGCCCTA^

CAGCGGCAGCTGCTGCTGCTGCTGTGGCTGCTGCrGCT^ 

GAGGACGAGGCCCCTGCTGGTCTGCCCAGCCAGGTGAGTCCMGGGCTGCrrATCCGTGACCCGCTGCCTGTAGTTCCCCCCCCCCC 
C ATG CTGGCAAAGCCACTGTGTG ATGGAGAACTCCTGAAGAGGG CCCATTCTAG AGAG AG AAGCAATGGCAACTGGCCAGGGGGCT 
TCGGTG CCTGTGGAAACGTCC ACTGGGCCCACACCG CAGCCAG AGTGGCTGATG CCTTTGGGGAGTTGGGGGGTTGTCCTG CACCT 

ACTCTGGTGATGATGTTAATTTTCTTCGTTTTCGTGTGTGGTCAGrTGTAGCTGCCCT

CTTTCTTCATTCTGAGTCGGTTTCCCTATAGCCCAAGCTGGCCT 
CTACAGGCACGTGCTATCTTGCTGTCTTAAGTTTTCTATCG 

CAAATGGG TG CTTG CAGTTC CATGG CCGT C CTG ACTGAG AACATGG TG T CAG C CAGG CATGGCACTGG AACAGT AGCT AAAAG C CT 
TAAAACTGGTCCACAGGCAGAGAGAACGCTACCTTCAAAGCCAT 

CCACCGATGGAAGGGGTGGGGATGGGGGAGACACAGCAAGTATTCAGATATCACA

GGCTTGGTTCTCTCITTAGCCAAACCAGACCCTGAGCCG^ 

CAT GG AGTGT AATTCAGAT AGCAGTCTCACAGT CCTGCACACATGCTG GG CGG AG CAGAAAGGG CTGG AAAG CTTTT AGAGGT AGA 

ACAAGAAGAAGTGAAGGGGTCTGTATCCAGGACTGGGGCAGGGGTAGGGGGTGGGGCGGG 

GAAGAGTTGGTTGACAACATGAAGTGGACAAGGAAGGACCCAGAAGGAGATGT 

TATAGTTGGCTGCTGAAGTTCTGTGGCAATAACTTGG^

AGAGG AGC CAAATGCAGCTTGTGTACAATT ACTGCCCCTCC ACACAGAAGGCCTT AG AAGGTC CAAAGCCATTTATTTTCTAGG AA 
GCTTAGGGCAGAGCGGACAAGGCCGTCATGAGAAGACTGTAGAAGCTTGTAGCCACCCTACCAAACAGGGAGGCAG 
TTGGCTGAGAACAGTGAGGAGAATGGATAAAGAGGAGGCCAGTGAAGGGACTGCATCTAGAACAGTAGTTCT 
CTGTGAG C C AAT ACAGTT C CTCATG TTG TGG GG ACCCCCAACCACAAAATGATTTCATGGCT ACTT CAT AACTG T AATTTTGCT AC 
TG CT ATG AATC AAAATGT AAATATCTGGTATGC AAG AT AT CTCATC TG C GTCC TC AC CCACAC C ATG AAG AC^GCTG ATCT AGGAG
TCCTGTCTACCACTGGAAGCCCTGGGGAGTTTGAGCCTTTTTCCATGCrr

CTGCCTTGAGCATTGCTGGCTAGTTAATGAGCCGGACAGTCTCAC 

GGATTTAACCATTTAjCGTGCTAGGGCTCTGTTCAGGGCCAG 

catgtcagatc 

cacccaggccaaggcagaaaggccagagaaggactcggaaaccagcagtgaggacgat^ 

ctgtgaatact cctcaggtaaagc ctggggggag aaggg aggcatt atag cc cag acctgaagg ctgct aacctgtctcctcagct 
ggtcccagcccctgtcccttcrcctcagctgggcccaggccctgtcccttctcagctgtgc 
tcctaaccccgtcccttctcttcagctgggcccagccc

tggtcctaacccctgtcccttctcctcagctgggcccagcccctgt^ 

ctggtaaggctccctgctcctgtcaggtctctgcccacatctgcctaatgatttactcctcaa 
ctgggaagagccctcgggccagaggtacctcagcccccgccaaggagtcatccct^ 
ggaaggcctgtggcagcccaggcagggaaaccagaagccaagagcagtgaggagtcagagagtgacagtg^



agagcacgtcccctgatcttgccctctgtcactcccaggtga 
accccggggtcatcgggaaaacgtgccaacctgccctgcrc 
agaagatgtctcggagagcagcagtgcagagctggacagtgacgggcctgggagcc 
ccgggagcctacatggccctacacxtccaggtcictcgtctgotrctct
tcattccaggcaaaggcctcccttgcgctccctcagaaggtgaggcctc 
ctcagggagcagtgaggagtcatctgacagcgaagaggaggcagcaccagcagcctctgctgct 
ggcagtctggctgtcctggcccgccccagtcagaaaccccacctctcagaattagat^ 

agctgcatctcagaacgaattagtccaggagtctcccccaacctccatctgcactttctccacaggctaagccagctctggaaa^ 

cagatgaaagcttcctctaggaaaggcacgcctgcatccgcaacaggagcgagcacctcgtcccattgtaag^

ctcttcagccagcctgtcatccccagctctggccaagggcacccag 

gagctgctcccagcaccccowsggtacaggtaagggctggatttaccc^ 

cccttgagaaactctttcttctccttgactttctaggggaagtct 

ggtggccccagtgcacactcagaagacagggccttcggtcaaagctatggctcaggaagactcagagagcct 

gcagtgaggaagaggatgagaccccagcacaggtgaggtagggaggaagcagcctgaaccaagact

ctg cccttgg ccct aggagc acagaatcctggg actc aggctcaactaagt ctcacgttctcacacgctcttgtttcctgagcccc 
ctggctctaggttcctacacatctgggtgtgccacgcattctgcccactgccactct 
gaagttgccacctggggcaaggtgtgatatagtacacaccgttacagcaggatagggatgtc 
cccgagatattttaagaggagtgtcatgtcggtgaggaactgaaagtcgtctggtgaggtagagggtagact 

ag aaagttgg ctcgggtcgcaccacaagcaggattaaatggcag agg ccaagagccaggttac ctgggctcaagtccagttct att

cctggggccttggctacagtcccatctgtcatcaatccacccttgtgggtatgggtaaggcctcgcctg 
tgccttcctggccgtggg ctaggctgctgggggcag agttg caaggcac accacctggcccgagag agcaggaaatgaaaaggagc 
ttgtcacggtctgcgctatgggggaccttccttaagtcctaactctgtcgtacacatgag 
aaactaactagatggtgatatctgagtgagtcggggcctttctcaggtcc^ 

TGTCTTACTCTGGCTGAGGCCATAGCTCCTAGCTGGGTGGTATAGTGTTTGCCTACCATGCACCAGGCCCT
CACAGCACTGACTCCGTAATGTGGGTTTGTACAAGCTGAACACCCTGACAGGCATGGGGG 
GACTCTATCGTGTTCTGTTAACCTGCCCAGTTCCTAGTTCCTGGGCTGAGACGGATGCTTGAGCTCTGCAT^ 
GCACAGATTGACGATTCTATATGGCACTCAGCTTCCATTCTGTAGAGTAGAATTTAATGACAGAC^ 
ACCTAAAAAGCAGTTGTGTTTGGGTAGGAGTATATTTAAAAGGCTTTACAGTGCTGAATT 

TACTTAAACACGTAGTCTGTTTAATCTTTGCAGAAATCCCATAGATAATGTTTTATTTTTAC

AGGCCTGGTAAGATTAAGTGGCTTCTTCTTGGTTGTGTTTTGTTTTTGGAATCAGGTCTCACT 

CTTGCTCTGTAGACCAGGCTGGCCTCAGACTCACAGAGCTATGACTATCCCTGCCTACCAAGTGCTGGTTA 

ATACCTGGCATCAGGGACTTCCTl TrmT lTT Tn^ 

ACTCTGTAGCCCAGGCTAGCCTCGAACCCAGAAATCCTCCTGCCTCTGCCTCCCAAGTG 

GCCCAGCAATCTTTTTTTTTTTTTTTTTTTrTGGGTTTGGTTTTT

TCACTCTGTAGACCAGGCTGGCCTCGAACTCAAAAATC CGCCTGC CTCTGCCTCCCGAGTGCTGGGATTAAAGGTGTGTGC CACCA 
TGCCTGGCTTGGCATCAGGGACTTTTTTTTTTTAAACATTTATTTATTTTA 
TCAGAAGAGGGCATTGGATCCCATTACAGATGGTTGTGAGCCACCATGT 
AGTCAGTGCTTTTAACCACTGAGCCATCTCTTCAGCCCAAAATATTC 

TCAGGGTTl'Tl'CTGTATAGCCCTGGGCTGTCCTGGAACTCACTCTGTGGACCAGGCTGGCCTCGAACTCAG

TGCCTCCCAAGTGCTGGGArTAAAGGCATGCGCCACCACTGCCTGGCTTGG 

TGG AAGGAGG AGCCAG CATCGGCT CAGTGTCTGCAGCATCAGTGTTCTTTG ATG ACTGCTCTGTTG CCCTTTGCCCTTCCT AATTC 
CCTTCCCTTCCTTCCAGGCCACGCCCTTGJGGGAGACTTCCTCAGGCCAAAGCCAACCCACCTCCCACTAAGACACCTCCAGCGTCT 
GCATCTGGAAAAGCTGTGGCTGCTCCAACCAAGGCAAGTAGGACCAGAAACCACAAGATAC 
AGTCCC1ACAGTGATCCAGCATGGACCACCCGCTGTCCCACCCCCTGTGATATTACAGCCACACACTAGTAGAAGGATGACTGCCGC
CCTTTCCTCCTCCAATACCATTGCTACCTCTCGATTCAGGGAAAACCACCTGTTC 
GTCTGTGCCAGCCGCGGGAAAAGCAGGGGCCCCRGCAACCCAAGCCCAGAA 
GCAGCAGTAAAGAGGAGTCTGACAGTGAAGAAGAGACGCCAGCCCAGGTACCCA 
TTTTGGCCCAGGCCTATAGGCACACTTGACIATGCCTCTCCCCTTGCTGTTTCTCCAGAT 

CAGAGCTGCCTCAGCCCCTGCCAAGGAGTCTCCTAAAAAAGGAGCCCATCCAGGAACCC

AGGCCCAGCCAGGGAAGACAGAGGACTGAGACAGCAGTAGTGAAGAGTCTGACAGTG 

TCTGCCAGGCCCGAGAGCCACCGTCACCAACTCTGCTCATAAACAGATCGTGCCCCAGAGGGGAGAAAGAACAGATAGAAGTCCTA 
ACTCCAGGATCCCTTGCCTCCAGGTCCTCCCCACCATGCCCCTGGATGAAATGAG 

T AGGACATCCTAGGTACTCCTACAGCTCCGCAGGGGCCTCTGCAG CACTGGGCACGG CAGTCCCATGCAAGCTGAGTCAGGTTCTG 

CTGAGGAACCTGGAGCCATCCTGCAAAGACACCCCTGTCAGAGTCACCAGGAAGCTTGGGCTACGGCGCAAGTCTCTGTCT

GGGC CCGTCTCG CTCAGTGATGCCTAGGCCCAAGGAGTGG AAAAGAAAG AG AAGCTGTCTCAT ACTCAAACCATTGTCC^ 
GGCCCGGTTGGTTGCAGCGAGCAGTGGGGCTTGCTCTGGCTGACAGTCTCrAGTCTGAGCT 
TGATGGCCTCTGGTGTCGCACCTTAGGGAGGTGGCAGCAAGATCAGAAAGAAAATGGTTGGTCCTT^ 
GCTTTATAGAAACAAAGGGAATGTCACATATAAGTGGAGATTTTGTA 
ACAGGCCACCCGAGCAGGGAGACATAGAGCCC~f CCCTGGa^
AAGTTCTGTGGCTTTGGACCACCXr<X5CCCGTG
CTGGGAACCCTCCCGTGTCACGTGCTTACTCAAGT 

AGGTCCAGCGTCCCC^GAGAAGAGCATAGAAGGGTCCTCAGAGAGCAGTGATGAGGATCTGCCCTCC^^ 
ACTCTAAAGGCTGCCCCTGGCTGGGCCAGGCrcCCGCTCTCTGTGAGA 
AGGTAGCACTTCACAGCTCCTAJGCTGTCTTCTGATACACTCCTGGTGTGGTOT
CTGCCCAGGAGCCTTrrAGGACTCTGATGTTTTATGTAAACTGGA^ 
CGGGCACGTTGTCCTGGCGGCCTGTTCGGGGCCGGGGTTGAGGCCCC^ 
ACTGCCCAAGCCTCTCTGGTCCCCATGGACAGTTGGGGCTCTAGATCCTCCCT^ 
CAGGTCAGGCCTATGCAA1TTATGGCACCTGCTCATTGTCTGCTGGTC 
GGGACACCTTCTCTGATTTCAGGAGCCCA^

ACTGCCCTCTTTCTGAGGGAGTAGGGTATTTCCATTTGAAGGAGAGAG 
TCCTGGCAGTCTGCACTTTCTCCTGCCC^ 

CTGAGGTACCTCCCTCACTCCTAAAGGACAG AOVG CCAGTATGGAAGGTTGCC^ CACAA 
TTAAAATGAGCATTATGCTAAAATTGAACCTTTAAAGTGGTCTTGCAGATCTC 

CGGGATTGCTGTCGCTGTTGGGCATTTATGAAGCACTTCTCTATGGAGAGCT 
GGTTGTTTTTCATTAGCCTGTCCGTGGGAGAAAAGGCCGGGTAGTCCAGCTCTGGGCCAGCT 
CTCTGTGTAATGCCGTGTGGAGTGTAAGCCTCAAGCTCAGGCTGGCTGGAACCCGGGGTGGCT 
GTGAGGTAATGCAGGCACGAGACGGCATGGCTGATCCATTCTGTTGTAGCCATGTATCGTACT^ 
AACCTCCCACGTAAATTATTATTAAATGTATCAGCGTTTGCCTGCATGTATGTATGTATGTGCAC
TGGAGACGAGAAGAGGGTGTCAGATCCCTCTGGAACTAGAGGTACAGACAGTTGTGAGGTGCT 
CAGGTCCTCTAACCTCCGAGCCATCTCTCCAGCCCCCTTAAA 

CATAGGCTATGTCCTTATTGAGTGAGAACAGGGGGCCTG1TAAAAAAAAATCTAAATTTTAGAA 

GAGAGCCTCAGCCTTAGCTCCGAGTCAGACTGGGGTTTATAAACCTCAGGTCTCCTA

GTCTCATGGGGAAGGATGAGCCCCGAGGTAATAGTCATCTTCCTCCTCTCTGGGGGGGCGGGG 
GCTGAGGACACACCCTCCACTGCACTGCACCTCTAGCCCCACAAAGCTGCTTTGAGGG 

AAAGGGCCCCAAATCTGCTTCTGTAAGCATCATTCCTGC^ 

TGGAGGTG AGATG CT CAGGTGGCTGTGGTGGTAGCGTCCTCTTCCACCGCTITGCCTCTCTGATCAAAGCA

GGGCTTTGTGCTTTCCCCAGGTGACCACACGCAGCACCCACCACAGAATGAGG^ 
GTGCCCAGCCCACCCTCATCAGCC^GGCCGCTGCCrCCTCTATCGCCTGCCC^CTGCCTCCT 

TCC CTGCCTCGAGATGGGGTGGTGTGGGCAGGTGGGG CTGGAGGAGGCTGTGCTGCAGAG CTGCTTTT AAC ATTCCAAGACGCTAT 
TGTTTGCCTCAGTGGGTTTGCTGCCTCTCTTGGTCCCCAACAATCTGTTCTTCTG 

CCACCGCAAGCCTTCTCCGACTTTCCATC^

CGCGGGGGAGGCTGTTTGGCAGGGTAGGGAGGGCCTGAGCGCAGCGGTGCCGCCGCCACACTTGGTGGG 
TTGTTGCTTTCTCTGTGTCTCACAGCCCCTGGGGCGGTGGGAGCTTGGTAAAGCACTG 
CCCAGCTACCCCTCAGGCAGCCCTCTCATTAGCCAGAGCACCAACAGCCTTGAG^ 
ATTCTAGTTCCCTAACTACTGCCTCTGTACCTTCATCGGCTAGGACTCTGAAAGGAAA 

GACATATTTGGGAATCTGGTCCACTGCCAAATTGACCCTCCCAGTACTAGGATCTGCTGCTGCT

AGCAGACCCACAGGTGAAGTGCATCAGGGCATGTTAGAACATATCTGTAATCCCAGCATCCAGG 
GCCAATGTGGTAGGCTATACAATGAGACCCTGTCTCAGGATTAAAAAAGTAAAACTGAACCCTACGCAT^ 

GTCCCGGTCCTGAG CAGGTTGGCTTTGTGCACATTCTACTTGAGTCCTTGACTTTGAAGCTGGTCTCAGGCCCGT CACGTCATCCA 
TCAGAATTTCAATATGCAACTCAAAATGATAAGAGTCTTCCTAGTCCTTGTAGACACAACCAC^ 

CTGACCGGCAAG CAGG CCTTGTGCCTGTTTCTCCAAAGTATCTGTCTGATG AGTGGCATTTTTCATGCTTGTCCT

AGTCTAATAAATAAGGTTAGTATGTGGAGATCTGTTCCTCTAATTCCAGCAGTGGJ^ 
GAT AG ACTGGT CTACAG AGCAAAACCTGTCTCAAAGCAAATAAGGGCCACATAT^ 
CTGGCCTACCCATCCCACCCCCCCTGTTACTCCAGGCAGTTTGTGTACTAAAGAAACT 
GAGCCACATCTCAGGGAGGATGCTGTCAGGGTCTCTGGGGGAAGGGACAATGAGACTCCAGGGAAATCATA 

AGGAAATGCTTTTTAGAATTCCCCACTCAGCAGGTGGCCATCTGATGTCACTAGAA

CCTTCOTCCTTCCCATOrGGCTGTC 

TGGGTGG ATACTGCTT AGCTTGGGGTCACTGTG AAGGTCAGCTGGGCTCTAAGAGCT CTG AAGTGCTGTG CT 
GCTGTTATGGGGTGGGAGTGGGCCTAGAAGACCCCAACTCCTTGAAGTTTGCATTACAAAGCCCCTCCCTC^ 
CCTTCCCCTGACCAAGCCTTTAATCAGAGGCGGGATGTTTCTGTTTTTGGTTTTCAAGGC 

ACCGTAACAGTAGTCCAGCTGTCCCAGCTCCTACCCCAGAAGGAGTCCAGGCTGTGAACACCACAAAGAAGGCCTCAGGCACCACT

GCCCAG AG CT C CTCCTCTG AG AGTG AGG ACGGGGACG AGG ACTTG ATTC CT G C CACACAACC CTCCAC CT ATGGT AAG CGCTGCTG 
CAGTGGGTGCTGTTCGGCCJ^GACGGGC CCCAGCCACGCATGCCCTC CC CACTG AGG CCAGAAGGCACTAACCAGGTGGCCTAGCA 
GAGGCATTC^CGAGGACTGGGGACTCAGCCTTAGGCACTGTATTGTTCCCTTAATGAGGAAAGCTCG 
ATTACATTCGAGGCTTCTGTGTGCTCGCTCCAAGCCT 
ACCAGAGCTCTGCTATAGCCACCTGGCCGTGGGGTACACTATTAGCCTGGCAGTGAGCTTCC^TC

ACAGG TAACATTTT AT ACTTATTCTCTTCCCAGAACCTAAGGTGACTGAGTGT CTTTGT ATGTGGCG AGGCCCTCTGGTAGCGCAA 

AGGCGGATTTCCG AGTTT AAGGCCAG CCTG ATCTACAAAGAGAGTTCCAGG ACAGCC^ CTGTCTCGAA 
AAACO^AAAAAAAAAAAAAAAAAAAAAAAAGGACACTGTGTCGTTAAGTATGGACAGCAAGAT^ 

TGTTTATTTGG CGAGTACATAGTGGTGACATG AGCTCACATTAGG CTGTGGT AAACATCTCAG

CCT AACTG AGCAGTTGCTCACTGTGACCTG CACAAGAGAAAAAGATG AGGGT AAGG CTTGGAGTCACCGCCACAGCAG CCATTTTC 

AAATGACATGGTGACAGCTGCTGTCTGGAAGGCTTGGGAAGCCAGCX5TCCC 

AAG AGTG ATCTGGTAATGCCTTGCTGGGGAGTCCTACGGCTTTGTATCCT 

AACCAGTGTGACGACGCCCGCAGCCCTCTCACGAGCAG CTTCCCAACCCAGC AAG AGTG AGCAGTCT AG CCGGATGCCAAAAGGCA 

AGAAAG CAAAGGCGGCGGCGTCCGCT C3\GGTACTGTCGGGTAAGAG AG CCTTGCCATGTG

TCCTCAGGCCACACGCCAGTCACTAGCTTGCTCCCAGCCCCCTGGCACTGT 

GTGTTGTCCGGAAGAGCGGGCTGGGAGCCCGAAGGCAAGCGGAAGCCCCTGTCCTCCCAGCTTGCCTGAGT 
GGATTTGAGACGCTGAAACCGACTCTTGAGTTGCITGCCTCTCTTAGTAT^^ 
TCACAGTCCAGTGCATTATTAATATCTAGGGCAAGATTAGTGGAC^TATCTAAATAAAACAG 
CAGG CCAG AAC CT GGGT C CTCCAC CT CCAAG TT CTTTGCT CTCCAC

ACACCCAACACCCTAAGGGGCAGGAGAAGAAGGCTGTACTGAAGCCCCTCCC^ 
GGTGCATCACAAACTAGCCCAAAAAGAGCCTCTGGCGGCCGGGTGATCCAGTGGGTAGTGAG 
GGCACCGAGGCTGCAGCAGGCACGAGACCTTCACGGAGTCCTAAACCAGCAGAG^ 
TTAGTTGT CACCAACTGCT AGGTG TTGCAAAGGGAACAGCACAGA^ 
GAAGGCrrCTCTGGGAACATGAGCAGAGCCTGCCCTGCAGGAAGGGA^

GGGTAAGATAAAGT AGCAAGGGGGTCAGGC GAT ACCTGGACTGCTTTGTGTCTGCAGCT CCAGGAGG CTT A 
TTGTCTGTGTGCTCTTCCTGACTTCTAGCAGGATGGGAA^ 

AGACACAGGCGATGGCCACTGGAGTTGGGGCTGCGCTGTCCTCCCCTGGGCCCAGA 
GTACCCATCTTTCCAGCTGCGATGGCTCCTCTGTTCTCCACCTTGGCCT 
CCATGCCAGTCATTTCTGGCTCAGGGTAGCACAGATG

TAACAGAGGGACCCTGGGTCTCTCCCACTGGTGTTGAAAGCTC^TTC 
GTGGTTAAGAGCACCGCCTGCrrCTTCCAGAGGTAATGAATTCAAATCCCAGCAA^ 

ATATGAT ACCCTCTTCCAGTGTGTCTGAAGATAGCTACGGTGTACTT AT ATAT AAT AAAT T T T AAGCCAGT A 

AGCATGGGGGAGGGGCACAGCTTACTTACTGTGCAAGCATGAGGCCCT^ 
TGGTAGTGGGCGTTTGTAATCCCAATACTGGGGCACGTTAAC

TTCACCTCCCTGGGCCTCAGTGTG CT AGTG CGAGGTAAGGAGCCACAGTGCCCACCAGAGGGCrGTGGG AAAGTCATA 

CAGCAGTGCCGTGGAAACACTCCCCATGATGCCTCCCCAGAGCGC ACCCATCCAGCCCAAAG CC^ CTCGGG AAATCCA 

AGCTCCCTGAGAAGCAGCAGCTTGCCCCAGGCTACCCCAAGGCCCCC^

G AGGAGGATGCCAAGAGACCC CAGATGCCCAAGT CGG C C CACAGG CTGGGTGAGGATTCC CAG AGGGAGGGTGGG AGATG AGGCAC 
AGCCTGGTAGAACTGTAGG CTCTGGCCTTCTCAGCCCACCCTGGCCTTGTCTCATCC CCTG AT AATACCTTTACAACTTCAAAGAC 
CCAACTGCTGTTCGACAGCTGGCAGCTTCTTGGTATTAAGCTCCGGTTATTGAA 
GGGGAGGGGGGGCAGAAAAGTGGGAGCCTCAGGAACCAGAGGGGAAATGATGCCATTCGTGTGTC 

CATGGGTGGACAGCAGGAGGCTTCGTGCAGG AGGATGAGGAACAGAG AAAGACCCAGG CCTCGGAGGCCTCCTGGGGAG CT CTGCC

TGGCCTGAATAACAGGGCCGCCAGGGACTCACACTCAACGGGAAACCTCTGCTTCCAC^ 
ACTGTGGTAGAGGAGACCCCTACAGAATCCAGCGAAGATGAGATGGT 

GGAAGCTGCCTTTAGACTTCAGCCACCCCCTGGATGAGTGTCTAGATCCCCATTCCCAGGGCCCT 

GTG CCTCTAATCAATCGGATAGAACCTCAC CTCATGCTCTGGGGC CCAGGG AAGAG ATTCAG ATGGATT CTG AG ACATG ACCAGAG 

AAGACGACTTCTTAGGTCTCTCTTAAGGCTCTTCCTATACAGGACAGACTCC^

TTGGGGTCAGTCCTCTCCTTCCACTTTCACGAAGGTTACTGGAT^ 

CCTCTGAGCCATCTTGCCAGCCCAAAAAACGTTAACCATCTTCTGTGATGGT^ 

ATGTGCCTTACATATGCGTTAATTCACATAGCTTAGAAGGCACAAGCTGAAGGC 

CATGCTTGGCTGGGTTGGCCTGATCCGGGGTCACCGACTG ATTCC CTTG AGCCTTGCCTCTGGGATACGGCT AGGGG CTCGATGCG 

TCTTTCACACTGGACTTCTTCAGTCTCTCCTCT

ACTCCTAGGCCAGACTCCAACTCCTTAGCTTCrrCTGCCCCGGCCACCAAAGACAA 
ACACGCAGCAGACACCGCACTCCCTAAAACCGGTGAGTGGGGACAGCAAGACATGGAGA^ 
TGGGCAGCATTTGTCTGTGTCCCTTCCAAGAGTCTCATATTCGAGCCTGTGGGGT 
GCTGGCGCCCAGGACCAGGTTGCCAGCTCTGGGCAGTCTGGGAACAGATA

GGAGGAGGCCTTGGCTCTCCGTCCGTCCCTACTGACTTCT

GACTAGGCCAGCAGCCTTGTTTAGAGAATTCAATACCTAGAGGAGACCCTGGATAAGGGACAG 

ACACTGCTGTAGCCTTGACAAGCTCCTC AC CCTGTGTCTTAATCACACAAGGTAAGAGCAA CGTGCAGGGTCAAAAG CAG 

AGCATAAAGAAAACTGTCACATGACATTTGGAGGGACCCAGGGGAAGGGACT 

TGAAGCATCCCGTCCCACTCCTGTCCGCTTCTCCTTAACCGCACT 

TCAGAGGCGGAGCTGTTGCTCACTTTCTTCGCCAGTGTTCTTGTCTATGTGAGAGGCGGGTGCGAGC 

GGGGGCTGTGGAAGG rn T'n'T'i 1 I 'll 1 1 I X TACACGTG T AGTCAG CACACAATT AT CAAAGG CAAG CAG AGGCGT AG AAAATTCT 

AAACAAAGGCTGCACCAGAAGCCACTCTGGCAGGCAGTAAACCTGCnTAGTCAGC 

GCCCCCCAATCTGCCTGTCGCCCGftGCCTTGATGGGCrAAGG 

GACCCCCTCGTGTGTTATAACCCACAGTTCATGGCAGAAGACATC^

TGAGGGACTGTGACXSCCTTGAGGGACTGTGACCnx^CCAAAGCCCTACACTAA 
CATGTCCCACCTACTGTGCCCTCCGTCTCATTAGCAGACTT 

ATAGCTAAGCATAGGAGAGAGATAGAGGAGAGTGACCCCTGGATCTCAGTTTCCCATTTAT^ 
TTGTOVTGCATTGTGCAAGATCTCACCCrCAGCTTTCCAGACrTGTGACCT 

CCTCGGGCTCCACGCCTCAGAAGCCCAAGAAGCTCAAGAAGAGCAC

ATCACCCAGCX3CCTCCTGGAGCAGGCCTGGCCCCTGAGCGAGGCGCAGGTGCAGGCCTCTGTGGT 
GGAGCAGGAACGGCTGAAGGCCACAGAGGCCATCAAGGAGAGTGG^AAGAAGAGCCAGAAGCGGAAGT^ 

CTGGGGCC CCAAAGAACAAGAAGAAG AAGGAGCAGCCAGTGCC CAGGGCAAG CGCTGTTTCCCCAG AAAAGGCTC CCATGACTTCC 
AAGGCAAAATCAAAGCTTGACAAAGGGAGTGCTGGTGGCAAGGGGAAGGGGTCT 

CGGCGAGTTGCTGGGGATAAAGCTTGAGAGTCK3CGAGCAGAGCGA

AGTGACTGCTTTCCAGTACGCGCTGCCGGTGCAACCTGGGTGGCAGGGC^ 
ACAAGAGCCTGGGAAGTGGTTTGTCCAGAGTTTAACAAGTGTCTGGTGGCAAAGGCAAAAGG^ 







GGCCTCGGGCAGTGTTGTTTTCCTTTTCTAAGGGGAAAGCAGGCTCT 
AGTTTTGTCTTAAGACAGGGCCTCACAATGTAGCCITGACCGGCCT 

CAATCCACCTGCCTCrGCCTCCTGGGTGCTCGGATTAAAAGGGTACTCTACCACGCCCAGC^ 
TGTGGCAGGAGATGTGTCACCTGATCrCrCCCAGGTTCrTGTCCCTTCCCTCACACT 

CTGAG^CCAGATTGTGTAGTCCACAGTCAGTAAGAGTGGAGCCTGGCACATGGATGGTATCCT

AGAGCCGCGAGCTGCCGTGGGCCTCCTGACGCTGACCCATCAGTGTCAAAATCTATTGCTCAGG^ 

AAAAAAAGACAAGGAGAAAAAGGAAAAGAAGAAAGGAAAAAAGTCCCTGGCCAAAGACTCTGCCTCGCCGATC 

AG AAGAAGGTAGATTGAGTTCCreAGGGGTCrCAGGG CT AGGAATTCAAACCCTTCCCCTTCTAACACCTAGACCAGG CCCATGCC 

ACCTTTCCAGTCCCTTAATCTAATGGACCTTGTGAGGAGGAGCAGGGGAGG^ 

T CAG CTTTCAGGT"l\KiCTGG C CAGCGGGG AGC C C ACTG AAG AAACAGC CAG C C AGC CG CTTCCTGGG AT CAGAAGGCAGT TCTG C C
CAGGGCTGGTATCAGTGGGTCTCGATCCTTACACCTATGGGGGGCCGTGGATCCTTTO
CAGCGGTGGTGAGGAGAAGCCACAGAACTGATAGCTTGCTTCNNNNNNOT 



TTGGTTGGACAGCAGCTGCTGCAGCATGCAGAGGGTC

TGG T T TlXiTTCCCITTTGTGGAGAT AAAGGTCGGCAAAGCTG 
GTGACCATCCATCCCCAGGCCTCTGACCTCTGGC 
ATGGCTGGCGCGTGCTGGCCAAACCAGTGAACCCGCAGGGAAGCTC 
GAGTGCTACGrGACATGTGTATAGTATATCTATTTTTTTAAGCGACTTGCCTCTCT 
CGGACCTCCCTCCATGTTATCTGCAGAGCCAGCAAGGTCCCGCGTGTGGTTCATCCCAC^

C CTTGC C C CATCAATTTTTCCAGCTATTTTGGTTTTGTTTTTTAA CAGAAT AAAAG ACTTT AAAT AT AAAGGTTTG CCTT 

CCCCATGCATCTGGGACTCGGACTGTTACTGGCTAC^ 

GCTCCTTCCrniACTTCAAGCCAAGGCCCTAGGTACTGTGAGTAG 

AGGGAGTTGATGCAGGGACGGCTCCCCGTAGGAAGTGGCCTATAAGCTTAAAAGGAGCTGAAG^ 
GCTCAAGTGAC CAGGGGG C CTTTGGAAACAAGTT ACG ATGTGGAGGG AGG CAGTGGCG AACAG TGGTGG CTGG CATGAG AAG CGGG



GTATGTTGGAATAATCACTGAACATTCACCTCAACAAAACAAAAAACCAGTACAGAGAAG 

GTGCT CAGAGCCTTGGG ACGGGCACG CCATACCCAACAGTCAGCCATTCCATCCATTCTCAAGTGTGGATCAGAGTGTGGGATGTA 



ACGTTCATCTCGGAGTCCCTGGTGGTGAGGGATCCGCTCCCTTTTCCTGTGACAGTGGGCT

CCAGACAGACAGCCACTGGCAGCCCCCACCTGCATCCATTCCTGA 

CGCTGTCCTTAGTCCAGAAATGGAATTGGGGACGGGAGGAATGGAAGCT 

GTCTGTTTCTT7CTGACTGTTAAGTAGACACCACGTGTGTCTGGGACCCATCTCC 

GGTTGTCCTCGGCCTTGGGGTAAACTAACTGGATCCCAACTGAAGAGGCGX5 
CTTGAACCATGATCCTACAGGCTGCTGACAAGCTTGGCTGAGCAAGGAACCTGAA^

TGTGAGGAGGGGTTACTGGGAGGAGGGGCTTGGGTTCTT 

AGGGCATCCCTTGTCCCTGTCTGCGGGCCTTTATCTCAGAGCCCCACATTCAAGACTACTAA 
CAGTCTCAAGCCCC^GGGTGCAGGGAGACGGGAAGTGGGGTGTGACATCAGGGAACATAAGAAGGGCTGA 

TTGTTGTCCCTGCCTCCTCTGCCCAACATCTTGGAACTGCAGATCAGTCTGTCCCCCTGCTAAGCTC^ 

GCCACCATCCCATTCTAGAAAAACCATCCCAGTGATAGTGATGAATGTCCCAGTGTCACCAGAGTCT 

TGTGATCCCCCCCCTTACCTTGACCCAGTTCCTGCCTGGTCACTCCCAGGCCAGAAGATAGGTCT^ 

AGGGAAGCATCAGGCTCACTGTTATTTTTACCCCAACACACACACACACAC^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNKNNNNNNNNNNNNNNNNNNNN^ 

NNNmWNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNl^

NNNNNNNNNNNNNNNNNNNNNNNNKNN^ 

tflJNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^WINOT 
NNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNIWNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNJ^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNOT

NNNNNNNNNNNNNNNNNNNNNNN1TO 

NNNNNNNNNNNNNNNNNACATTAGCCACACCAAGCCTGGC 

TGGAACTCCAGTCAGGGTCAGGAGCTG AACCAGCACCTCCAGTTTTTTCCCTCTGT CTGGGTCCTCTGCCTGTCCTCACACAACCA 
GTGTTCCCTGCTGTAGGCCAAGAGCCATGTGCCATGCACTGTGCCTGACACGAAGCAAGCATGTGCAATATTAAT^ 
AACATGGTATTGTGGCCAAGTCAATCCATTGGACCTTAATTTGATTTTTTTAAAAGATTTATTTATT^

TGAACTCAGGACCTCTGGAAGAGTAGGTGGTGCTCTTAACCTCT 
TCAAAAGTGAGGTGGCTGGAACCTGGCCTAGTGATGCACGCCT^ 

CAAAGCCAACCTCTTCTACATAGTCGGTTCCAGGACTACATAGACTCTTATCTCAAGGTGGGTAGGGGATG^ 
TTAAGGACCACCTTGTATTCCTTGGACTAAG

GACCCAGGAGACTACCCGCTGGCTCTGATACTCACCGTGAGCAGATGCATCACATGGTCCTGGGTCATGTTG 
GTTCTTCACAGGCTGTGGGGAAGGGAACAGGGTTACCAGCATTGCAGGGAT 
ACTCAACAGCTCAAATCTGCCACCTCCTAACATGGAAGTGCCAGCCCCTCACGCCCT^ 
GGGCAGAAGCTTGTCAGAGCCTTCCCAATAGCAGAGTTrGCTTCCTCCTCCCA
TGCCCCGTCTGTATCATGCTAACTTATGTATGTTTT
GGTACCAATAGTAATCCTATGAGTAACAGCATTTT^ 

AAGCACTTTGCAAGCTGTGGGACTTGCCCTTTCCCCACCAGGGGAGCCCTAAGAG 
TCTCTGAGGTGTGCCCTGCTCCTCGCCAGTAGTTAGTGCCC^^ 

CCATCCGCATCTGGCTCACAGGTTTGGCAGCTGTGGGGCA^ 
GTTAAGATTTGAAACCC!GGTGCGGGGGAGGAAGGGAGATCGGGAGGGGCT^ 

TGGAG CACGCACATTTCGG AAGCTTCATGCGAAGCCTCTCCAGTTG CAGGTTCTGGGAGGTG ATGGTCAGCTTGTCTAGGCGGCCC 
TGTTGCTGGTACAGGAAGTAAGCAGTGGTGGCCTGCCCAGCCAAGAGCAGAGCCACCA 
ACGGCTGCACCTGTTGGGATGGAGTAAGAGCATGAT^

CTCTCAGACTCAAACCCTCCCAATAGTGAGGACATCTGCTCCTGCCGCCXCCCAG 

AAGCTCAATGGGAAGGATGAGCCACGTTCCTTGCCTCCTCTGG^ 

TCCTTCCGAGTCCCCTCTACCCTAGAGATCTCAGGA 

GCAGAAAAAGATAGGTGAGTTTACTATGTAAGAGGAAGAAGGGTTTTCATCTACTGTGCC^ 
GTTTAAAGAGAAAAGGGAAGGAAGTGCGGTAGGGGACAGCAAAAGCOSGATTTATA
TTACATGTGGAGCGGGAGGAACATTGTCAGGCTCAAGCAGTGCCAGGCACGTGTGCG 




ATACTCATGTGTGCTGTGCATGTGTATGTGNNNNNNNNNNNNNNNNNN^ 



GGAAAGTTCAAGAGCAACACAGGTACCTAGGCTTCATGCTCTTGGACAGCCAGACT
GGGCTTTTTTGGGCTGTGTTGGGGTGCATGCACATGTTACAATATGGAACA 
CCAGTCTTCCCCACTCACCCTTGGGTAGGGAGAACTATCAGTAATGTTATTTGCACACAT 

TGGGG ATTT AGCT CAGTGGT AGAGTGCTTGCCTAGCAAG CG CAAGGCCCTGGGTTCGGTCCTCAGCTCTGGAAAAAAAAAAAAAAA 
AAAAAAAAAAAGACAAAATAACAAAAAGACCAAAAGAAAAACAAACAAAAAAAAACAAGGTAAAAGA 

TTAGGCACCGTTGTAGGAAGGCTTAACAGGGGAGTGAGTCTCCTGCCTTTTCACTGC

ATCATTAGGGGCTGGAGAGATGGCTCAGCGGTTAAAAGCGCTGACT 

TGGTGGCTCACAACCATCCATAACGAAATCTGATGCCCTCTTCTGGAGTGTCTGAAGACAG 
AAATAAATAAATATTTTTTTAAAAAAAAGAAACTCATCATTAGATGCCAGCGCCTTGATC^ 

TTATAGTAACACATACTCAGACTAAGTAAGACGAACTACATTGGAAATCCTCGTGATAAGTTT

AGGAAGGGTAATTTAGGAAACTCCAGAAAACCTGTCTGCCTGGTAACTTATGACATCACAGACACAGCT 
CAACCCTCTGTATTTCCTAAGCCAAGGGCTGGAGAAAGGAGATCAGACAGGAG 
GGAAGATGGAACCTGGGCTCAAGGTCTTCTGCTTCATCTACTTTGGTGACTCA 
CAGATCTAGTCCCAGCTCTACCTGCTACCACATGAGCTTGGGGTAGCCACGTCCCTCTCCT^ 

CAGTGAACACAGGTTTGTATTGAAGATGTGAACAGTAGCCATAACTGGCATCCTGGCTTGCCTAACT

ACTTCTGCGTGTGGACCAGGTTAGGGCTTGGAATGAGCGGTTGTTAAATAACGAAGAGATGAA^ 
CACTTGGCCCCACACCG<5ACACACATTTCCTCTACCA 
GGAGGCTGGATGTCAAGCCCAGCATCATCTCTCCTTCCACTTTTATT 
TAGATGAGTCTGGCCTTGAACCCCTGATCCTCCTGCCTCCA 


MOUSE SEQUENCE - mRNA 

TAGCGXK3GGCGAGGGGGCGTGCACGTGAATCCCCTGGCCAGATCTCTGGCATGGCCGAGGCCAGGAAGC 

CTCATTTACCATCATCTGTTGCAAGCAGGCT ACGTC CGCG CGG CGCGGGAAGTAAAGG AGCAG AGCGGC CAGAAG AGTTTCCTG AC 
TCAGCCCGTCACCCTTCTGGACATCTATACACACTGGCAACAGACCTCAGAGCTTGG^ 

CCCTTCAGGCTAAGAAGTCTCGAGTGTCGGATCCTGTT AGCAG CTCAGAGAGCTCGGATCAGGAGAAGGAAGAGGAGGCAGCAACC

GAAAGC^CCAAAGCCACCCCAAGACCGACACCTGTCAATTCTGCAACC^ 
CAAGACAGCCAACAAGAQ^TGAACTCTGTGTCGC!ACCCTGGGTCCGGAAAGACGGTGG 
AGAAGTCAGCAGAGCCCTTGGCAAACACTGTCTTGGCCTCAGAAACTGAGGAGGAG 
AAGTCTGGAACGGTGTCAGCGGGCCAAGGCAGCAGTTCCAGTGAAGATTCCTCCATCT 

ATCTCCAGCAAAACCAGCCCAGGCCAAAGCTTCAGCAGCCCCT

TAGGGAATGTGGCGCCCACACCTGCTAAACCAGCCAGGGCGGCAGCG^ 

GCAGCAGAAGAGTCTGAGAGCAGTGAGGAGGACTCIAGACAGTGAGGACGAGGCCCCTC 

TGGAAAAGGTCCCC2ATGTCAGAGCCGACTCGGTGTCTGCC!AAGCGGATCTCTGGGAAAGG^ 

CTGGGCCTG CAGCC ACCCAGG CC AAGGCAG AAAGGCCAG AG AAGG ACTCGGAAACCAGCAGTG AGG ACG ATTCTGAT AGTG^ 

GAAATGCCAGTCACTGTGAAT ACTCCTCAGGCAAGG ACTTCTGGGAAGAGCCCT CG CCAAGG A

GTCATCCCAG AAAGGGGCTCCTG CAGTCACCCCTGGAAAGGCAAGGCCTGTGGCAG CCCAGGCAGGGAAACCAG AAGCCAAG AGCA 
GTGAGGAGTCAGAGAGTGACAGTGGGGAGACACCAGCTGCTGCGACTCTGACCACGAGTCCTG 
AGCTCCCAGGTCAGACCTGTTTCCACCGTCACCCCGGGGTCATCX5GGAAAAGGTGCCAACCTGCCCT 
AGCAGCTCTCAGGGTCCAAATGGTAAAGAAAGAAGATGTCTCGGAGAGCAGCAGTGCA^ 

(^GCCAAGGCAAAGGCCTCCCTTGCGCTCCCTCAGAAGGTGAGGC

TCAGGGAGCAGTGAGGAGTCIATCTGACAGCGAAGAGGAGGCAGCACC^ 

G CZAGATGAAAGCTTCCTCTAGG AAAGGCACGCCTGCATCCG CAACAGGAGCG AGCACCTCGTCCCATTGTAAGGCAGGAGCGGTGA 
CCTCTTCAGCCAGCCTGTCATCCCCAGCTCTGGCCAAGGGCACCCAGAGGTCAGATGTGC^ 

GGAGCTGCTCCCAGCACCCCCAGGGTACAGGGG AAGT CTGGGGGCAAGGGCCTC CAAGGG AAAGCTGCCTTGGGGCAAGGGGTGGC 
CCCIAGTGCAC^CTCAGAAGACAGGGCCTTCGGTCAAAGCT

AGGAAGAGGATGAGACCCCAGCACAGGCCACGCCCTTGGGGAGACTTCCTCAGGCCAAAGCCAACCCACCTCCCACT 
CCAGCGTCTGCATCTGGAAAAGCTGTGGCTGCTCCAACCAAGGGAAAACCACCTGTT 

G CGGTCTGTGCCAGCCGCGGG AAAAGCAGGGGCCCCAG CAACCCAAGCCC!AG AAGGCTCCCGTGGCTGGCACAGGGGAGGACTCAG 

agagcagcagtaaagaggagtctgac!agtgaagaagagacgccagcccagataa 

gcctcagcccctgccaaggagtctcctaaaaaaggagcccatccag^

gccagggaagacagaggactcagac^gcagtagtgaagagtctgacagtgacacagaga 
crccagtttctgtcaaccgtaacagtagtccagctgtcccagctcctaccccagaaggag 
gcctcagsgcaccactgcccagagcttcctccrctgagagtgaggacggggac 
tgctctcagaaccagtgtgacgacgcccgcagccctctcacgagcagcttccca^ 
caaaaggcaagaaagcaaaggcggcggcgtccgctcagaccagca^



CCCATCCAGCCCAAAGCCACCAACAAGCTCGGGAAATCCAAGCTCCCTGAGA 
C^GGAGCTCGGAGGACAGCAGTGACACTTCTTCCGAGGACGAGGAGGATGCC^ 
TGGATCCAGACCCIT'CCCAGAAGGAAACTGTGGTAGAGGAGACCCCTACAGAATCCAGCGA 
TCTCTCCTCTCAGGTTACATGACTCCCGGCTTGACTGTGGCTAATTCCCAGGC^ 
CTTAGCTTCTTCTGCCCCGGCCACCAAAGACAACCCGGATGGCAAGC^

CTAAAACCGGTAGGAAAGAGGCCTCCTCGGGCTCCACGCCTCAGAAGCCCAAGAAGCTCAAGAAG^ 
CCCACACAGACACTGCCGAACAGCATCACCCAGCGCCTCCTGGAGCAGGCCTGGCC 
GGTGAAGGTCCTGACAGAGCTTCTGGAGCAGGAACGGCTGAAGGCCACAGAG 
GG AAGTTATCAGGGGACCTAGAAGCTGGGGCCCCAAAGAACAAGAAGAAGAAGGAGCA^ 
CCAGAAAAGGCTCCCATGACTTCCAAGGCAAAATCAAAGCTTGACAAA

CCAAGGAGCCAAGGAGAAGCCCGACGGCGAGTTGCTGGGGATAAAGCTTGAGAGTGGCGAGCAGAGCGACCCG 

AAAAGAAG AAAT CCCTCAAGAAAAAAAAAG ACAAGGAGAAAAAGGAAAAGAAGAAAGG AAAAAAGTCCCTGG CCAAAGACTCTGCC 

TCGCCGATCCAGAAGAAGAAAAAGAAGAAGAAGAAGTCAGCCGAGCCTGCCGTGTGA 

MOUSE SEQUENCE - CODING

ATGGCCGAGG CCAGGAAGCGGCGGGAGCntKrrrCCCCTCATTTACCATCATCTGTTGCAAG CAGGCT ACGTCCGCGCGGCGCGGG A 
AGTAAAGGAGCAGAGCGGCCAGAAGAGTTTCCTGACTCAGCCCGTCACCCTTCT 

AGCTTGGCCAGAAG CAG AAGGCAGAGG ATGATGAGACCCTTCAGGCT AAGAAGTCTCGAGTC CTCAG AG 

AGCTCGGATC^GGAGAAGGAAGAGGAGGCAGCAACCGAAAGGGCCAAAGCCACCCCAAGACCG^ 
AGCTTTGCCATCAAAAGTAAAAGAAAAGGGAAAGACCAAGACAGCCAACAAGACGGTGAA
AGACGGTGGTCCACCTGCTCTCTGGGAAGTCACCCAAGAAGTCAGCAGAGCCCTTGG 

GAGG AGGG CAATGCCCAAG CCCTCGGACCCACTGCCAAGTCTGGAACGGTGTCAG CGGGCCAAGGCAGCAGTTCCAGTG AAG ATTC 

CTCCATCTCAAGCGATGAGACAGATGTCGAGGTGAAATCTCC7VGCAAAACCAGCCCA 

ATCCTCCAGCAAGAACAGCCCCAGGCCCTACCAAGTTAGGGAATGTGGCGCCCAC^CCTGCTAAAC 

GCTGCre-CrGCTGCTGTGGCTGCTGCTGCTC

GGCCCCTGCTGGTCTGCCCAGCCAGGTAAAAGCCTCTGGAAAAGGTCCCCATGTCAGAGCC 
CTGGGAAAGGGCCCATCTTAGCAACCCCAGGGAAGACTGGGCCTGCAGCC^ 
GAAACCAGCAGTGAGGACGATTCTGATAGTGAGGATGAAATGCCAGTCACTGTGAATACTCCTCA 
CCCTCGGGCCAGAGGTACCTCAGCCCCCGCCAAGGAGTCATCCCA^ 

TGG CAGCCCAGG CAGGGAAACCAGAAGCCAAGAGCAGTGAGGAGTCAGAGAGTG ACAGTGGGG AGACACCAG CTGCTGCGACTCTG

ACCACGAGTCCTGCCAAGGTGAAACCTTTGGGGAAGAGCTCCCAGGTCAGACCTGTT^ 

AGGTGCCAACCTG CCCTG CCCTGGGAAGGTGGGGT CAGCAGCTCTCAGGGTCCAAATGGTAAAG AAAG AAGATGTCTCGGAGAGCA 
G CAGTGCAG AGCTGGACAGTGACGGGCCTG^GAGCCCAG CCAAGGCAAAGG CCTCCCTTGCGCTCCCTCAGAAGGTGAGGCCTGTG 
GCCACCCAGGTCAAG ACTGACAGGGG CAAAGGCCACTCAGGGAGCAGTGAGGAGTCAT CTG ACAG CG AAGAGG AGG C AGCACCAG C 

AGCCTCTGCTGCT CAGGCT AAGCCAGCTCTGGAAAAGCAGATGAAAGCTTCCTCT AGG AAAG^

CGAGCACCTCX3TCCCATTGTAAGGCAGGAGCGGTCACCTCT 
TCAGATGTGGACTCTTCCAGTGAGTCTGAGTCAGAAGGAGCTGCTCCCAG<^ 

CCTCCAAGGGAAAGCTGCCTTGGGGCAAGGGGTGG CCCCAGTGCACACTCAGAAGACAGGGCCTTCGGT CAAAG CTATGGCTCAGG 
AAGACTCAGAGAGCCTCGAGGAGGACTCCAGCAGTGAGGAAGAGGATGAGACCCCAGC^ 

CAGGCCAAAGCCAACCCACCTCCCACTAAGACACCTCCAGCGTCTGCATCTGGAAAA^

ACCTGTTCCGAACAGCACCGTCTCTGCAAGGGGCCA^^ 

AGAAGGGTCCCGTGGCTGGCACAGGGGAGGACTCAGAGAGCAGCAGTAAAGAGGAGTCTGACAGTGAAGAAGA^ 

ATAAAACCTGTGGGGAAGACCTCTCAGGTCAGAGCTGCCTCAGCCCCTGCCAAGGAGTCTCCTAA 

CCCCC^CAAGACGGGATCTTCAGCTACCCAGGCCCAGCCAGGGAAGACAGAGGACTCAGACAGCAG 

ACACAGAGATG CCATCAGCCCAGGCGATTAAATCCCCTCCAGTTTCTGTCAACCGTAACAGT AGTCCAG CTGTCCCAGCTCCTACC

CCAGAAGGAGTCCAGGCTGTGAACACCACAAAGAAGGCCTCAGGCACCACTGCCCA 
CGAGGACTTGATTCCTGCC!ACACAACCCTCCACCTATGCrCTCAGAACCAG 

CCCAACCCAGCAAG AGTG AG CAGTCTAGCCGGATGCCAAAAGGCAAG AAAGCAAAGGCGGCGG CGTCCG CTCAG ACCAGCAGTGCC 
GTGGAAACACTCCCCATGATGCCTCCCCAGAGCGCACCCATCCAGCCCAAAGCCACCAACAAGCTCG 
GAAGCAGCAGCTTGCCCCAGGCTACCCCAAGGCCCCCAGGAGCTCGGAGGACAGCAGTGA
CCAAGAGACCCCAGATCCCCAAGTCGGCCCACAGGCTGGATCCAGACCCTTCCCAG 

GAATCCAGCGAAG ATGAG ATGGTGGCCCCCTCACAGTCT CTCCTCTCAGGTTACATGACTCCCGG CTTGACTGTGG CTAATTCC CA 
GGCHTCAAAAGCTACTCCTAGGCC AG ACTCCAACTCCITAGCTTCT CCCGGCCAC CAAAG ACAACCCGGATGGCAAGCAGA 

AGTCAAAATCCCAACACGCAGCAG ACIACCGCACTCCCTAAAACCGGT AGGA CCTCAGAAGCCC 

AAGAAGCTCAAGAAGAGCACCTCAAGCTCGCCGGCCCCCACACAGA

AGGCCATCAAGGAGAGTGGGAAGAAGAGCC^AAGCGGAAGTTATCAGGGGACCTAGAAGCTGGGGC 

AAGGAGCAGCC^GTGCCCAGGGCAAGCGCTGTTTCCCCAGAAAAGGCTCCCATGACrrCCAAG 

GAGTGCTGGTGGCAAGGGGAAGGGGTCTCCTGGCCCCCAAGGAGCCAAGGAGAAGCCCGACXK3 

AGAGTGGCGAGCAGAGCGACCCGAAGAGCAAGTCGAAAAAGAAGAAATCCCTCAAGAAAAAAAAAGACAAG

AAG AAAGGAAAAAAGTCCCTGGCCAAAG ACTCTG CCTCGCCGATCCAGAAG AAGAAAAAGAAG AAGAAG AAGTCAGCCGAGCCTG C 
CGTGTGA 

HUMAN SEQUENCE - GENOMIC 

CAGGGAAAGGTAGACAGGTAGTTCCCTCTGCCTCTAACCTGCTGGGGTCCTTGTCGAGTCACTTCCTC

TATC7VGCAAGAGGAGTGAGCTGGGCTAGATGCTCTCCAAGGACCCTTCTAGCTCTC 

GCTGATGAGGCTATGAATTGCTACAACCCTTTT^ 

CCACTTX3CAATTACATCATACAGAATTAAAGCTCCAGTCT 

GCCAAAAAACTAGAGK5CGAATAACCTGTATCATATCATCTCCCATCTCTGTCTGTTGATGGCCCTACT 
CCTCTGGCAGCTC^AGCCTGTCTTGAGCCTCAGTTTCCTCATCTACAACC^T^

TTGGTTAGTGTTAGAGCCTGGAAGCACAGCTCCAGCTCACTGCACACACTCAACAAATGA 
GTGGCAGATCCCAGCATCTAGAGAAATAGCTCTGCTGTATTGTTATTAGGTGAAAAAAGAGAGTT^ 
GATTCCCTTTTGTAAGAAAAGAAGCAAAGAAGCCAGGTGCAGTGGCTC^ 
TGGATCATCTGAGGCCAGGAGTTCAAGACCAGCCTGGCCAACATGGTC 

ATGGTGGCACATGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCATT^
CAGAGATCATGCCACTGCACTCCAGCCTGGG

AAAGAAGTAAAGAGAAGTGAAAAAGGAAAAGGAGAAGGAGGACTAAACCTTATGAAGGATGTCT 
GATGTGAAGGGATTGAATACCAGCCTTCTTTCTGATTGX3GAAATC 
AGGGAGGGAAAAATGATGGACTTTGTCTTAATATATCTCTGAATTGTT^ 
AGCTCCTGTGAGATTAAAGATTTCGTGTCATTTCACGAAAGGGAAGGAAGTCTTGG^
GGGAAAACCTGTTGTTACAATGGATGAAGGGAGCTACTGGGAAGGGCACCC^ 
CTGGAAGGGTCTTTGTTCAACAGCTTCATTTAACAGAAGGTAATGCTGA 

AGCTGAGCCAGGTG AAAAGGGCCTGACTCCTGACAGTAACACAGGGGAG AGGCC CAGTAACCTTTCCCCTGTATCGCTGCTACCT C 

TTCCTTCATTCATTTCCTAAGTGACCTCGCCGTGCCCTGCATGGGAATCCAAGGTGACTAAA^

T CG CAGTT CAGTGGT CTGGGAGCAT CTGG CTTCT ACCACT AACAAAGG CATTTT CT CTT ACTGGT AG AGAAATG CT CTG AGGCTAA 
TTTTTTTTTTAATGCACTGnTTTGTTTGTTGCTGTCTT 

TTGGGCCCTGAGGCAATTCAGTGAAGGGTATGATTTGGAAAATGAGGGAAATA 
GAGCCGAGGACAAAGCATAAGTGCCGACAATGAATTTTATTAGGAGCAAAACAAAAACAAC 
CAATCCTGGTGTGGTTTTGTTTTCGTTCTTTAATTTAATTCCAGTTCT

TTCCTTCCCCACTGCCTTCCAAAGCCTTCCACATTCCTGCCTCCAC^CCT 
TCTTCTTCAGGAGAGGTCTGTAACCATTCCTGCACTTGCCAATACTTATTAAGTGCCT 

GGAAATAATG CAACCAACAAGG CAAAGGTAAGTCTGCCTTCAGGGAGTTT ACATTCTACT AAGGCATGCAT TCTC AAA 
TGATGCCCTCGAGGAAATGAAAACTGGTTCCTGGAGGTGTTGGTGGAAAAAAATCTTAGATCT 
TGTACATTCATCACCCATCCAGCCACCCAAGCAGAAAC

ATCCCAGGACCCAGAATTTTAGCATCTCTTTGCCAATACTTTAGACTCCT 

GGCCTGGATAAATCCCACTATGTACATACATCTCCCCAGCCTGCAGCTGGGTAGGTGAGCCCTGATGGAGGAATT 
ACAGGACGAGACAC^CATACACTAAGGACCACCAACCTCAACTGGGGAGCCAAGGCTGTC^ 

TTGCTTCTCAACACTCTGAATGATCATTCTTTCAAACTCCTCACCACCT

TGCAGAAAAAATAGAAGCTCAAAATGAAATCTCCCTTAAGATCCCACTATAAACCT 
TTCCTGCrTTTGGGTTCTAGAGAACTCATCCCTCCTCCTCCCCAAGACGAAC 
CACATGCTTGAGAGAACTTTACTCGCTGATTATCTCCCCTATTTCTTACACTCTTGGTTAACCT 
ACTTTTAAACAAATCTCAGGTTTCTTCTACCTTGAAACACACCTCCCCTAACATACAT^ 

3 0 CCTTTTT ATACAAAAACTTCTCGAAAGACCCACTGCCTCCCTC ATCC CTCCT CTG ACC CCCTCC 

ATCATGCCACACATAAATAGCCCTCCCTAAGTCCC^ 

CCTATCTGACCrCTCAGAGCATTCCCTCCATCAAGTCTTCCCAGCTrTCTAAAATG 
CTCCTGGGCTTTCTCCAAGCTGTCCAGCCACTCCTGATCAGCCTTTGC^ 
TACCTCAAAACTTCATCCTGGGCCATCTCTCrTCrcrCTCCAC 
TATCATCTGTCTATCGACAACTTCTCATCITCATCTCTAGCCCAGACCTCC^
TCAACATCTGCATCTGGTGACTGTAAGGCATTTCAAGCTCAAGATGTCTA^ 
CTCCTCTCCTCAATCAAACCACCCCCCTGTCrrCAGTCCT CTTCT AGTTTTTTCT AT CT 
ACAGAAACCTGAAGGTCCATTCTGACATTGCCTTTTCCT^ 

AACTCTGGGAACCTTCCACTCTCTTCAACTCCACTGCCACCAGCACCCTCCCCGCTGCCCCCGCC^ 
40 CCCCCACAT ACACTGTCTTCTCCCTGG ATGCCTG CCTGCACTCCT AACC CTGACTTGTCTTCCCACATCT ACT CAT ACCCACCAGA 

CACACTCCACTCTCCACTGGGCAGGCACACAGGTCACAGTAAA 

GCTTCCAATGGCTCTTAGGATGGAGGAAAAAAAACCCTGATC^ 

ACTCCCCAGTCCCCCTGCTCTCCCTCCGGACTATGCTACAAAGGCCTTTT 

CCCTTTGCACAGGTTCTTCCCTCTGCTCAGG^ 
45 nnnnnnnnnnnnnn^^ 

NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNN^^ 
CTCCGTTAGACTGTAAGCTCTATTGGAGTAGAGACAATTCCATCACTAGACT

CAATGGTAGGTACTCAATAAATATTTGCTCIAAGAAAATGGGAAATGTTGAATGT^ 

AAGACAAAGATAACCACATACCTGGACCTCCTAGGGTTAGGGACTCCTGCGCCCGTAAGAAACATGCTGCCCT 
GGTCGGTGAAGTTAAGAACAGCTCACTCTAACATTCTCCAGCCTACCTCACCTTTT^ 

AGC CTCACATTTAGTATT AATGGT ATGTG CTTTGGTTTT AGC AATCTTTT AAAAACA(IAAAACAGGGTCAACCTCCTCG 

5 5 CAGTCAGAG ACTATCTAATG CCCTCAGAGCAAAGTCCAAACTCTGTTGGGG^ CTTGTCTACTTCTTCAT C 

CTCATGTGCCCTCATTTGTTCCCTCTTCCCCACCTCACCCCACCCCACCT 

TAGGGTTCCCAGGCAATAGCAGGCTCTTTTCTCATGCCTAGACTCATGCAGTGCC 

TTTCGGGTGTCTTTATGCCTCAGCTTAGATATCTCTCTGGC^TTCGGTGCTCCTCCTTGGGG 

GACAAGGCIACACAAGATATTGAACTATAGTTGCrrGGTTACGGACAGTCGT CTCTGTGAGCTTTG AGGGCAACTGC CATGTATTAT 
60 GCATCTTTTG AATCCC CCTAGCCTAGCCGGGTGCTAG CCCAAGTTAGGTGCTT ATT ACAGATTTGTTCAGTTAGCG AATG AAT CAA 

TGAATAAACTGGCAATTCGTGTCTGGTTC^TGGGCACCAAAAACCGTGCGAAATGGA 
TCGGATCTAGAGAAAGATCCAGATTAG<K3AGTTCAGAGATGAG 

ATCTCTTCAAGAAGCCAGCCGGAAGGATTAAAAGAGAGTCGTCCAAGTCCCGGCCGGCCCCGGGGGCGGGTCCCTGTCTAGCCCCG 
CATG^TCCGCGCCAATGGGCGGTATTGTTGATCACGAGTCTCCGCCCCCTTCGCCTTGAGGGG 

6 5 GGGACTACGTTTCCCCMCGCGCCGCGCGGCCGGAGCGAAAGAGGAGCCG^ 

TGGCGGGCGGGGACTAAGGCGGGGCGTGCAGGTAGCCGGCCGGCCX^ 
TACTTCCCCTGATCTACCACCATCTGCTGCGGGCTGGCTATGTGCGTGCGGCGCGGGAAGT^ 

GTTCGTC^GCCGTGTGCGAC^CCGCGTGCAAGATGTGGAGATCAGCGGCCCGOSGCCCGCGCCCCGTCCCCAC^CGACCCC^GCAG 
GCGCCCGGAGCCGGGTCCCGCAGTGCTCGACGGCGCGGCCAGGGCTACCGGA 
AGTTCCCCTC^GCCTCACTTTCCTCATCCC^CAGTGGGTTGC^CCAGAAAGTCTGC

TTCCCTTACCT CCACGCCCTTCCTGGTAGCGGGT ATTTTAAGTTTC CTAAGTCTCCCGCCACGTGG CTAGGCTCTGCGCGGCCCCC 
CCTG<3GGCAAGGAGGTTGCTGCGAGTCTCGGGG^TGGATTGCGACCAGCCCCTCGGGGG^ 
TCTTAGTTGAGCTGGAACT CAGGCCTTTAGCGATTGTCTGTACTGAG ACGGTCAG CTTTGGT ACTCGAACTTO 
Trrr a rrTft ATT AATTCAACTGTTAATCCGGTACTCAGG 

7 5 G AGCTTACACTCTAGTAAGGT AG ACAAAT CAAAG AATCACAAAAGTGACTATATG CTTAGCACCGCAATAAGTGTT AAGAATG AGA 
GGGATGGGCCGGGCACGGTAGCTCACGCCTGTAATC^
GAGACCAGCCTGGCCAACATGGTGAAACACCGTTTCTACTAAAAATACA^^ 

TGGCTACTCATGTGGCTGAGGCACAAGAATTG CTTGAACCCGGGAGGCAGAGGTTGCAATG AGTCAAGATCGTGCCACTG CACTCC 

AG CCTGGG CG AT AAAG CG AGACTCTG CCT CAAAAAAAAAAGAGAGATGGATGG CAGATTG C CCTG GT CAGGAAT AG CTGAGGTGT A 
AATGCCITGTGGTGGGAGGGAACTTGTGTACACACTGGCCrGGAGAAAGTCCA

T G AG CCATGATCATGCCACTGCACT C CAG CCTGGG CAACAGAGCAAGACACTG TCTCAAAAAAAAATTTTT TIT AAGTCCAT AG CA 

GGCCAGGTACGGTAGCTCACGCCTGTAATCCCMCACTTTGGGAGGCCGAGGCAGGCGGATCA 

AGCCTGACCGACATGGTGAAACCCTGTCrCTACTAAAAATG 

CTCGGGAGGCTCAGTCAGGAGAATC^CTreAACCCAGGAGGCAGA^ 
1 0 GGTGACAAGGCGAGACTCCATCTCAAAAAAAAAAAAAACGTCCATAGTGGCTGAAGTGTGAAGT 

GAGATGGAGGCAGGGGCTGGCTC^AGTGGTCTGGTGGGCAGTTTG^ 

GAACTGGTGAGACAGTATCCTAAGTAGTCAGATCCCTITGCCCTGTGGTGTAGGCTCT 

TGGAGGGACTAAAGGGTGATGGGCCCTAAGACGTTATGTATGCTTTTCCCAGTTTCT 

CTATGGTACTTCACAGATAAGGGGAGCAAGAGATAGTACCCATCTTCATTGGAGCAAAATATCCT 
TAGTAGGAGCAGACAGGTGACAAATAGAAAAAGTTTAATAAGTTAATGATATAGTATATCC

AGGCAGGCAAGGGGGATAAGGGATGTCAGACTTCGATTATAAGTAGGGTAGTCAAGGAAGTCT 

AGAGACCTGAAGGAAATGAGGGAGTGAGCTGTGTAGTTAACAGGGGAGAGAACATTCCAGAC^ 

GAGGCL?W3GGTGGCCTGACACATTTGAGGACAGCAAGGAGGACA 

TAGTGTTACCGGCTC^CTGCTGGGCACTGACAGATGGTCT^^ 
CCATGTTAAGAAGAAATCTGAAGTTCGAAGACGTGACGAGAT^

TACTTGATATCCTGGCTTATTGACATTTACAGAGACATTGGGCCCT 

TCTGCTTCCACCAAGGGACTGGTACCATTTGGGCCCAGTC^ 

CAGGTGTCTGGGCTGTGATTCGTCCTTTGGGAAGATTGGCTGGCCACGTCTT^ 

CCCTGTCTCATGGCCTCTAGGCATGGATAGTGCCAGAGAGGCT 

2 5 TTTCAAAGGGGTTGTCTAAAAAGGTTTTAGTTGGGATTGGTTGTTCTTC 

AAGCAGACATACTTCCCAGAGCAAAAAGACCCTL."l'TrT i r , l''l' , l''l 1111 riTTTTTTAAAGAAAGGGTCTTGCTCTGTTGCCCAGGCT 
GGAGTGCGGTGGCACAAGCATGGCTCACTGC^CCTCGATATCCTGGGCTCAAGTGATCCTCC 

GG ACCACCAGTGCATG C CAC C ATT CT CAGCT AATTTTT AAATTTTTTT AT AG AG ACAGT CT CACT ATGTTG CCCAGACTGGTCTTG 
AACTCCTGGGCTCAAGTGATCTTCCTGCCTCAGCCTCCCAAAGTGCTGAGATTAC^ 

CCCTCTTCTGAACCACCTGTCTATACAAGTCATGAGTT^

AGTGGTGGGGCCTATAGGGTGGAGTAGGGACGGACACCCCAAGCAACTCAGCTTGGAATAA 

TCTGTCCCCATAGCCCACTGTGGACACCATTGTTGCCTCCAGAAAGCCAGCTCACATCACATGGATGGGTATTTCT^ 
CTGGTAGTGGGAGGGGAC^TTGAGAGCTGTGATCTCTTAAGAACAAGAGCTTCT 

3 5 TTC^GCACCTCCTGAAAGCCAAGTCCTGTTCTTACTTAGTATC-CAAATATT^ 

GCAGCTGACATTGTAGCCAAGGAGGTAGGCATAAGTCAAAGAATGAAACAAATCAATGTTAAAT^ 

AAAGAGAGG AATCTG TTGC CACAGC CACAT AAATGGGT AAGT CT GG CTT CATCTGGGGGTC C AGGGG AAGCTT C CCTGAGGTGAT C 
TTTGACTTGAGAGCTATAGGAATTGGAGGAGTGAACTATGCAGTGGGTAGAG 

GCT AATG CT AAGTCCTGTGGCTGGTGG ACAGTACAGATGGGG CT AGGTCATCGCATGGATAGCC ATG CATGGGCTGGTTATCCATG 

4 0 GTG AGGAATTTTGTCTTTATCCAAGAG CAGTGGGGCCATGGTAGGGTG AAAAAGTGAGGAG ATTG CTCTGCACCATGGTGAG ACTG 

GTGGGGCCATGGTCACCATCGTCCIAGGCAAGAGCrGATGGGG 

ATAGGGATGAAACCAGGGCCTCTGGATGGGTGGGAGAGGAAGTATCTTATTGACTCC^ 
CTTGAAGGGAACTCAGTTTGGAGGGGAAAGATTTCAATGACAT 

CTTCAAGGAAGGGGGAATCATCCAAGGATATATCAACAGAGGAATGGTTGGAAAATGGAGAGAGTAA 

4 5 TGGCCTGACCACAGCTGACTAAGGGAGTGAAGGACATGCAGAC^TATCCCATG^ 

GGAAAGGAGGCAGGAGAGGGGTGTGAGGGACCACTGCACATCTTTGAGGACCAAGAATAGTGCGTTCT 

AAGTGGGCCCCAAATGTACCTGCTTC^GGAAGACAGGTTTCAGCTTCTGAATATAATGACATC 

GCTAGGTGGAAGTGGTGGGCTCCCTGTCATTAGAGGTGTGTAAACAAAGGCAAAAATGGTGTCG 

AGAGCTAAGGTCATTGAAAACCCTGCC AACCCTCAGTCCTCTG CTGCTAAGGAGAGCTT AG AGCAC AGGGCACAGC CCACACTTCC 
TCACCCTGACTTGCCCAGGTGAGTGGGTGTTATTCGTAGTACTTCCTTCCCCCAAAAAACT
CTTCCCTGCCACTGGCCCCTTCTATGGGTta^ 

GAGTTCAAACCCAGCAACTAGCTGTATGTATGACCTTGAGTGGATTTCTTGGCCT 

AT AATGATGGCAATT AT CT CATTG CATTGTGAGGGTT AG ACACAGTG CT CC AT AT AAAG TG CT AAAACT AC CGG TT GG AGGCT CC C 
CC^CTTCCTGCACCCGCCTCAGTATTTATTAAGCATCTACTACAACCGAGC 

5 5 AAAT CC CTG CTCAACATTTTTGTTGGGGG AAAAG AGG CAGAAT AAATT G AT ACATG T AAAAAT G AAAC CTG TTTTT AACAGG AG AA 

TCCTGACAACTGGCTTTCCTGGGAACAAGGGACCCTACT^ 
CCTGGGGCCTAGACAGTGACTCCCTGGGATGGGGGTCAATGGGCGTGAGACT^ 

T CCT ACC CATG CC CT AG ACT CACTTTG CCATG AG CAAG TCACTT AG TTG TT CAGTTTTTGAG ACATGTGTG AGGCACTGAGG AT AC 
TGTAGTGAGCTGC CAG CTG CCTTCCTCCCTGGGCCTTTCTGTGT AGGGG AGG AGTCTAC^ 

6 0 GAAAAGCTATGAAGAAAATGAAACAGC^TCACAGGGCAGAGCATGCCTTGGAAGAGGGGCAATC^ 

CTACATTAAAGTTG AGACGGGAAGGACGAGGACCTGGG CAG AGTCTTCCAAG AAGGCACAGTACACACAG CATGTTCTG CAGGGGG 
AAG AGCCTGGGCGGTTCTGTG AGGTG CCTGCATGGGTCAGCrCCT ATCACTT AAAGACTGGTAGCTGCTG CAGCTGTTATATAACA 
TTATCCAGCATTCCTGATTATTTTAAAGACTAATCAAGAATTGTAAAAGCACTTTATGAAGTGAGTGAAAA 
TGTGCCTATACTGTGTTTTCACCACrGTTTAGATTCTATGCACATTC 

6 5 AGGCAGAGGAAGATGCGGCACTCCJVAGCTAAGAAAA 

GAAGCAGAAGCCGAAACCGCCAAAGCCAGTAAGAGCCTTGC^GCTTT 
TCTACCTCCAGCTTCTTCTCTTATCACrAGAAGACCTAAAAGACCCTGGGGAGG 
ATTTGAGCCCTTCCTCCGTGC CAG GCATT AT ATCT CCATG CTCTGGGGGTGCA^ 

AGTCTCATTTGGTGTATGGGGCCCAGGAGCTGCCCTATTACAGGCT^

AGAAGTACTGCTGGAGGGATTTCAGAAAGCAGCTAGCGGACACATTTC 
TCATCTrCGTTTAGAATTGTACCC^TTAGCACTGGAACGCTGTACTGGGC^ 
AAGCrrTTTATAGCCCCTTCTTCTCCrAAC^TCCCAGGG 
TGTGTCTCCTTTGCATCGTATTATCITCTTTTGCTTACATAGAAGAC^ 
TCTTGCCATGTTGTCCAGGCTGGTCTrGAACrCCTGAGCT
CACATACCACCATGCCCAGCITCCTGAGTCTGCTTTT
GGGGTTTCACTCTTGTCACCCAGGCTGGAGCACAATGGCGTGATGTCAGCT 

TCCTGCCTCtAGCCTCCCGAGTAGCTGGGACTACAG^ L TTTCTTTTTCTTTCTTT 

TTTTTX TI rTTTT TTTl AAT AGAGACGGGG TTT CACAATGTT AGC CAG ACT AGT CTCAGG TG ATC C G CCCACCTTGG C CTCCAAAA 
GGGCTGGGATTACAGGCATG AGCCACTGCGCCCTGGATAT ATATTAT ATAT CG CTTAATTTT ACATTATAAATG CAAAACGAGTTC 
ACTTTGAACTGTAGAAGAGCATATAGAAGAAAACTTAAAGGTTTTT^ 

C CTGAT AGGT AACCTTT AAAACTT C CT CCTTTGCCTCTAT ATT CATT CAAACATG T ATT CTT ACATGC AT ATAT ACAGACCTG CAT 

GCTATTTGAAACACAAATGGTAAAATTTCTTACTG 

ATTAGAI TT l Tl 'lCnTTGTTTGTTTGGGG ^ 

CAAGTGATCCTCCTGCCTCAGCCTCCGGGTATGTAATTTTATTTTTT^ 

ACCGGTAATCCCAGCACITTGGGAGGTCAAGGTAGAAGGATCACTTGAGCCC^ 

ACCCTGTATCTACAAAAAATTTAAAATTTAGCCAGGCATGGTGGCGTGTGCTTGTAGTT^ 

AGGATTACTTGAGCCCAGGAGGTCIAAGGCT^ 

CAAAAAAAAAAAAAAAAGGAAGAAGAGTCCTCGTTGGCA.TCCT 

TAGCTGGGCATGGTGGCATGCACTGGTGGTCCCAGCTACrC^ 

TGCAGTGAGCTATGAATGTGCCACTGTACTCCAGCCTG^ 

CATTGAGAGAGAAACATGCTGAAGTATTTGTGGGCAACGTGGTAGGATCTATGGCArr 

CTTAAAAATTTTCOVrAATTACAGAAAGAGAGAGATGGAGTGAAGTGC 
ATTCITCTTAACAAAAAGAAATCTGAGATTTAAGGAG^ 

CCTGTAATCCCAGCACTITGGGAGGCCGAGGTGGGC!AGATCACGA^GTCAGGAGATCAAG 

CCGTCTCTATTAAAAATACAAAAAATTAGCCAC^CATGGTGGTGGACGTCTGTAGTCCCAGCT 

ATAGTGTGAACCCA^SGAGGCGGAGCTTGCAGTGAGCCGAGATTCTGCCACT 

CCAGTAAATAAATAAGCACTGAGAATCAGTGAAT ATCGATATCTAAC^ CT CTCCAC T AAG 

AAGCCATTTGAATTCCAGGCAACTTT 

CAGGGCTACAGTCTGGTGGCCTGTCTGAATGGTCCCCAT^ 

TTCCCTCCAGGGGCCTCCACGTCTCATG CTGCGGGGG AGTTGGGAATCCATCAAGGATG CAGACACATAGCTTCATGTTACCATCC 

TCTTCTCTCriTGCrrCCTGGTGGACCTGGCTGATGCATCA.CAGAGCTC^ 

AGCACCAGGCAGCCAATACOATAGAATTGTTAGGTGAGATGAAACOTGTTTGCCA 

TTTAGCAGATGTTTGTTTCTTGCAGCCCCAAGACTAGCATCTACCAACTCCrrCA 

AGAAAAAGCCAAGGTGAGTGGGACTGCCTTCCAAGCTATTGCCT 

GAGAAGGATAGGGTTGTGAGTAATTGCCCCACCTGGATTCAGAGCTCAACCTGTGTCACCAGTTGTGGGATCT^ 
TCrCTGGGCCTCAGTTTACTTTTCTATAAAGATAAGGATGATGCCTGCT 



AGGTGTGGAGGCAC CGT AGTG ATCAAG ACAT CTGTGGTTCTG AAACTCTAAAAGC^ATTTTCCAAACTG^AAACTGCTCAAAAAACA 

AGGTGTCIATTTTCTAATAATCCAGAAAGCAGAGTTAGATACATACATGTATAAAGAATTGTGCCT 

G CAACAATGTCTCAAAGT AATTATGAAGT AATGAT AAG ATCAAOU^CAAACT 

CAAGAACrGGGATTCTGTTTTCTCAACACCrrCrGGTGAG AGGAGGTG CTGTTTTT AACCT CtACTTT AAAGATTGGG AAACAGATTG 
AAGGGGT AGTTTACCCAAACTC ACACAGCT AAGAAGTGGTAAGATTGGGTTCAG ATGCAAGTG^ CCTGTG CTCTT AGTTCACCATG 
CCATACCAGATGTGTCTGTCIACTCTTGTTCTCTGTAGGCAGAGACAGAGAAAG 

GAAACTGAGGAGGAGGGCAGCGTCCCGGCCTTTGGAGCTGCTGCCAAGCCTGGTAAGAAGTCCCCACCTC 

AGAACTTGGGCTGGTGAGGCAGGCCTAGGCATTGCTTTAAGT^ 

GGTTCCTGCATCTTAGGACTTTTCTCATC^CCTAGACGAGGGAGGGTCAGCT 

TGGTGGTTGGAG CATTTTCTGCTTT ATG ACTC C CTGG TG AG TT ACA.GGG ACTUVCTC^ CAG CTC AG AG ACT AAACTG AAAGG C CTG A 
TTTGCCTCTGTTTCCTATGTAGAGTAGACATCTGTCTGGCCTGCATC^ 

CACCTCAGAAGGGGGCTTCACAGCTraGTGCATGTGAGGCACACGAG 

TTGTGGGGAGGTCCTGGGGAGGGGCAGGTGAGGCrGGAAAGGGAGTCCCTCAGTCCCCTCCGTGTCCGATC 

AGCGGGCCAGGCCGACAGCTCCAGCGAGGACACCTCCAGCTCCAGTG^ 

AGTTGCCCrCTCCCAGCCrCTAACCCTTCCAGGAACCrGTCTGGGGCTTCAGA 

GTAGGGTGACCAGGAGCTGGAAAGGCTGCAGCCGGAGAGGGGC^GGGAGAGGAGTCTCGTGCTGCTTGCTGCACT 

ATCAGCCCTGGGGGCAGGAGAGGGACACCCTCCTGCCTCTGAGACT 

CAAAGCIAQ^TGGGCTTCACTGGGAAGGGATGTGGAGGAGGCAGGGCATTCCAGAC^ 



GGCTAGAGTTGGGCAGGACTCTTGGGTTTGTGAACTTTGACCTGGAAAAG^ 

GTGGCAGACCTGGGG AAT ATAGTGATGAGCyU3G ACAGGCAGAGTTCCTGCTGC CATGGGGCAAGGG AG CGTCCCTCAG AT ATTTAA 
TTGGTGATGGCTXSCTATGAAGTGGAAATGTAGAGGGCCACGC^ 

TGG CATGTGCAGAGAAGG CGGCAAAG AGAGTTTGGTGCTTTTAAAGAACATTGAT CTTTTTTTAAGACAGTCTCACTGTGTCACCC 

AAGCTGGAGTGCAGTAGCGCGATCTTGGCTAGCTGCAACTTCCACCTCCCGGGTTCAAGCGATTGTCGTGCCT 

AGCTGGGATTACATOTGC^CGCCACCACGCCTGGCTAATTTTTGTATTTTTTC 

GTCTCGAACTCCTGACCTCAAGTGGTCCATGTGCCTTGGCCTCCCAAAGTGCTGGGATTACAG^ 

AAACCTTGATCTTTAAATGAGAGGGAGAAACCAGG 

GATCATTTGAGGCCIAGAAGTTTGAACCGA.GCCTGAGCAACATAGTGAAACCCT 

ATAAATTATGGGGAAAGTGACACAAGATGAAGTCCTAGGTAGACAGGGGCCAGATCT 

AATTTGGGTTTTATTCTAGGGGCTGTGAGAAGCCATGCCAGCAAGGAGAGATG 

AGCCTGGGTTGGAGGGCAGTGCAGAATAAAAGCIAGTAAGACCATGTAGGAGACTATTGGAG 

TGGGGGCTGGGAGACCAAGGTGTCCACAGTTGGAAA^ 

CCCAGGGAACITCTCACTTTACTGTGCrGCCTGGCCCAGTAACCCTGTGAAGGTT 

C(MGGGCTGCCCTCCACCTTCCGTCCTCACCCTTGCGCCCTA^ 

GGCCTTGCCAAAG^GAAGAAGACCACTCCCAGCCCTCCATAAACACA.CAACCT 

AGACG CCCCAGAAACCCAGGG CCGCAAGGGGTCTGATGG ATCTCAGAGGT AGAAGTA 

TTGTGGAAGTGTAGGAAAGAGCTCACAGAGGGAAGGAATTTGCCCAAGGTGTGGCAAG^ 

rjaarTCCTGGCCCTCAGGAGGGCTGCTGTGATCAGATACTTGAGTT^ 

GCCCWCTCTCAGGCCAAGCCCGGTC 

GGGTTCCAATGGGGCTCTGTCCCTTCATATTGATC 

TAAAGCTAATAGCCGTCTCIAGGGAGTTGCTTAAAAATCC^ 

GAACCTTAGGGGGAAACAGTAATTATTATT<2ATTTTCTAATTCC^^ 

GTCAAAGCCTCATCAGTTTCnTVCTAAGGAGTCTCCAGCAAGA 

GGTCAAAGGAGGGGCCCIGCCCCC^CCAAGAGGGCCAAGAAGCCAGAAGA 

AGGAGGAGGCCCCTGC^GGGACACGAAGCCAGGTGAGGCCTGGAGGAGG^ 

TGAGCACTCTGCCATGAGCACCTCTGCCA.CTGGAGTTGGGGAGAGGGAGGACTATG 

TTCCCACAGGGGAGTCTTGCACTCCACTTTGTCTGT^ 

TCCCAG CCAGTCCTG CTGTG AGGG ACTTG AGAAGTTTTG ATTG C CT AAT CCTCTG ACTTCTCAGGACAAACCAG ACCCTG AG GAGG 

ATCCCAAGGGCAAGACAGGGGGGCTGGAAAGCAATGGAAGTCAGAGCAGGAAGGAGGAAGAGTGAGGGTCACCCA 

GCCTGGG CAGG ACAAGG CGGGGGCATGCAGTAGATAAG CAGATCCCCTGCGTTAG CTGCTGGTAAATGGTAAAG AGGAAGTGGGGG 

ACATAGG AATGG C AACTTG AAGCAGAGTGCTCTGGG CTGG C CTTGGG AG AAGGCG T CT CAGTAAAG ACCAG AAGGAGG AAGCACTC 

TTAGGAAGGTCATTCCAGGCAGGGGCTGAGCAAGTGT^ 

CCCACCCCATCTACAGGGAACTTCTACGCCTCGGCTGTCTATTTTGCAGTGTCrc 

TTGAGAAGGAACACAAATCTGAATlll'l'l'l'Tll'GAGACAGAGTCTTG CTCTGTTGCCCAAG CTGGAGTGCAGTAGCGG CAATCATA 

GCTCACTGCAGCCTCACTCCTGAGTAGCTAGGAATACAGGCACACACCACCAC^ 

AGACAGGGTTTCACTGTGTTTCACAAGCIX3TGGACTCAAGCAG 

AGCCACCGCACCTGGCCATAAATGTGAACTTTTATGTAAAATCTGCTTTCTCC^ 

AAGCAAGACCTGTGTGGAGCAGCCTGCACTTTCTGGA 

CCAGGGCTGAACACCCAGGGCCTTCCAGGGAAGACAGGGCCTGCAGCCACCCTGCCAAAG^ 

AGGAGTT AGTGAAGAGGGGGCCAGGTGTGCAAACCTTG CCCCATGAGGCTGJIGGGTGGGCTTCCAGGGGTCCTG CTX3CTGTCCATG 



CACCCCCTCCCCACAGCCCAGTCIGATGATGCTCCCGATAAGCCAGTGCCTGTGCCTCCAGAGCTCTCT^ 

CAGAGGACAGGCTCTAACTACCCAGTGTGCTGGGGCACCAGGGGGAACCATC

AGAGGAGGCAGTGGCTAAGTGAAATGGAACAGGGGGCATGGAGGCTCCAGGAGTGA 
ACCAGGAGGCCAGTGTGGCCAAAGTATCAGTCAAGGTGGGGAGTGGGGAGGGAAGCAGGG 
TCTGGACTTTATCCTAAAGGCATCACCAGAGAGTTTTCAC^GCAGGAGAGCAAGCT 
CTGAAAAAATrCTCCAGGTCAGAGCTGCCTCAGCCCCTGCCAAGGGGACCCCTGGGAAAGGGGCTAC 

3 0 GCAGGGGCTGT AG CCTCCCAGACCAAGGCAGGG AAG CCAG AGG AG GACT CAG AGAGCAG CAGCG AGGAGTCATCTGACAGTGAGGA 

GGAGACGCCAGCTGCCAAGGCCCTGCITCAGGTGAGGCCTGAGGAGGGAGACTCCATGCAGCCAGGCC 
TCAGGACTTGTTCTCCCACTCTGGGCCAGAGCCCCGGGCGTGCCTCAGACCCCAGCCCCTTACTCCCCT 
TTTGCCCTATCTGGTCTTCTGTACGTGGTGTCCTGTGTCTCCTCACACGTCCATCCTCTGGGCTGTCTCCCCTTG 
CAGGCGAAGGCCTCAGGAAAAACCTCTCAGGTCGGAGCTGCCTCAGCCCCTGCCAAGGAGT 

GCCCCCTGGGAAGACAGGGCCTGCAGTTGCCAAGGCCCAGGCGGG

ACAGTG AGG AGGAGGCGCCTGCTCAGGTG AGG CAG AGGGGAGGGGTGG AGAGTAGCCCCATGCCT AAACCCCAG CACCTGCATGGG 

TGTGGCCACCTTTGCCACATCCAGCTCCTGTCTCCACACGTCCACCCTCTGGGCT 

AGCCTTCAGGGAAGGCCCCCCAGGTCAGAGCCGCCTCGGCCCCT^ 

AGGAAAACAGGGCCTGCAG CCGCCCAGGTCCAGGTGGGG AAGCAGGAGGAGG ACTCAAGAAG CAGCAGCGAGGAGTCAGACAGTGA 

CAGAGAGGCACTGGCAGCCATGAATGCAGCTCAGGTGAGGCTGGAAGCCGCCCTGCATGGCCTC

TG^TGGTTTTGACTTTTCCTCTCTGAACCTAGAGCCCTGTGCGGCTGGCTTCG 
GGACTCCCTCCCTAATCTTGTCCTTTGTGTCTCCCAGGTGAAGCC 

GCATGGGGCCCTTGGGGAAAGGCGCCGGCCCAGTGCCACCCGGGAAGGTGGGGCCTGCAACCCCCTCAGCCC^ 
GAGGAGGACTCAGAGAGCAGTAGTGAGGAGTCATCAGACAGCAGTGATGGAGAGGTGCCCAC^ 

4 5 GCCCCTTCCTGTAAGGCTCTTTCTTTTTCCCCCCCACTCAGAGTGGAGCTGCCT^ 

TACAATCTTAGTTTCTTAATGTCTGCACACACCTACCCTGGGCTCCCTCTCCCGATCCT^ 
OKSGGAACATCCTCCAGGCCAAACCCACCTCCAGTCCTC^ 

CTGAAAAGCCCATGGACAACTCGGAGAGCAGCGAGGAGTCATCGGACAGTGCGGACAGTGAG 
GCTCAGGTGAGGCCTGXKK3AAGGAGGCTGCTACATGGCCTGATCTGCCACACCACAGTCAGCA 
AGGCAGAGCATGGGTTTTGGCTGGTGGGGCAGAACAGATGGGGGACTCTGAGTTCAGG
CCTCAGGCAAAACOteCTCTGAAAATTCCTCAGACCAA 

TGTGCGAGTGGGCACCCAAGCCCCCCGGAAAGCAGGAACTGCGACTTCTCCAG 

AGAGACCAGCAGAGGATTCTTCAAGCAGTGAGGAATCAGATAGTGAGGAAGAGAAGACAGGTCT^ 

AGGCCTGTGTTTTCTGGGCGGGCCTCAGGGCCGCCCCTACGTGGTCCTTTGGACTCCTGGAGACACCTCT 

5 5 CAGGCAAAGTCTGTGGGGAAA(^CCTCCAGGTGAAAGCAGCCTCAGTG CCTGT CAAGGGGTCCTTGGGG CAAGGGACTGCTCCAGT 

ACTCCCTGGGAAGACGGGGCCTACAGTCACCCAGGTGAAAGCTGAAAAGCAGGAAGACTCTC 

GTGAGGAAGCAGCTGCATCTCCAGCACAGGTGAGGCCTAGAAGGAGCAGGCCCATCCCAC 

AGGATGGGCTTGACTGGGGGCTAGTGTTGCCTGCAGGTGTGCAGAAGCCTCAGAGGTAGCCT 



GGTTTTCACAAGCTCTGGAGTTCCCACCCTGCCAGGTGCTGCCCITGTCTGGGGTTG
TCTCTGTGGAGCATGGACATGCTCCTGCAGTGTGGAGCGATGGGG 
GTGAGAGCATCTAGGCCTTTTACAGGAGGrcGCAGCATCTGAGCCATGGGCITG 
TGTTCCAGGCAGAGGAAGCAGTCTGTGCGAGGCCAGGGAATGAGAGAGTGCCTGGCTTGGGTGCCT^ 
GGTGGGATGGCTGGCAGTTGTTTAGAGCCAGACCGCCTGGGTTCAAGTTTCAGCTCCAC^^ 

AGCCCTTACTCTCCTATAAACAGGAAATCAGCCCTCACAGGTTTGGGTGAGAATTTAGGTGAGATCTCACCT

GGGAGCTGCACACAGCAAGTTGTCTCTGTCACCCTGACGATGGGCC^ 

TGG AAGAG CTGAG AG AAG ATGG CTCACTCC CTGCACTCAC CCACC CACTGTTT ATT ATTTTT AATTG TCG CT CTGTTTTTTTCTGT 

AACCACTC!AGAAATTAGTCATTTCTTTTTTCTCTGTACCAGAGC^ 

TGTGGGACCTCAGGGCTCrAGAATTCCTCCATTGAGGTTTTTA^ 

7 0 AAGCTGC AGG CACATTCCAG ACC CT^TG CCTT ACAG AGTTT CATT AT G CAG CT CACTT AAAT ATTTG TCTTC CCTTCTG AG CCT CC 

CACCTGTACCATGAGGTGGGTGTTTTATTTTTATTTTTTGAACCATGGGCT 
GGATTACCCAGGTTTCTTTTTGTTGATTTCTAATTrACT 
TACTGATGAGCACCTCGGCCTGAGTCATACrrGGTCCACACT
TCATGTX3TTTGAGGAATGCTGGGGGATGGGGTTGTTTTCT 
GCTGCCAGAGTCCTCATTCTGTTTCACATTAGGCTTG
TTCTCCTTCTATTTGTTGGAATCCCTGACAGGCCITTTGGGGGTTCT 
TTAAACAACAAGTATCTTGTTCTGCCGTGGCTGCITGAGCTCCAG 
CATGTTTCTGACCTTTTATGGCACTCAACTTACATCCT^ 
5 AAAACAT CTGTTTT AGGT AGT CTTGTGCTT AGCT AATGCTAAGG AAGG CACG CACAATG AG TTTG AGTGTTTATT ATGTTC CAG AC 

ATCATGTTAAACTCTGAATCTCCrATTTAGTCTTCATCATAACCCGTAG^ 
AACTGAGGCCCAGTGAGATCAAGTGATGAGCTCAGGTTCAC 
TCCAGAGTCTGTATTCTTGGCTAGCTGCITTACTCAATCT 
TCAGTAAAGAAAACCCAGGCCAAAGCCAACCCAGCTGCCGCCAGAGCA 
1 0 TGTCACTGCAGCTG CTCAAGCCAAGCAGAGGTCTCCATC CAAGGCAAGTGGGCCCAGAAGCCACAGGAGX3TGTGGAGGGTTGGGGT 

AGAGAGGAGGACCAGTCACTGAGCCCAGCCAGCAGAAGGTGCGTGCATGGGCAGG 
GTGGGACCTGAAAGGAATCACTTTTGCCTCCAATACTATTATCCCCCTG 
CAGTACCGTCTTGGCC^GC^CCCAGCATCTGTGCCATCTC 

AGGAGGACTCAGGGAGCAGTGAGGAGGAGTCAGACAGTGAGGAGGAGGCGGAGACGCTGGCTC^ 
TCATCCCCTACATGGGATGTAACACCTTTGCCACATCCAGCTCCTGTCTTCT

CTCTTTCrCCCTCCAGGTGAAGCCTTCAGGGAAG 

GGGCTGCCCCAACACCTCCTGGGAAGACAGCrGCCTTCGGCTGC^ 

TCAGACAGTGATGGGGAGGCACCGGCAGCTGTGACCTCTGCCCA.GGT 

CAGAACGGGGGTATAGGGCTGGGCGTGGTGGCTCACACCnXSTAATCCTAGCACTTTC 
2 0 TCAGGGGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATATAAAAAT^ 

CCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCCCTAGAACCTGGGAGGCGGAGG 

ACTGCACTCCAGCCTGGGTGATGAGAGTGAGACTCTCTCAAAAAAAAAAAAAAAGAAAA 

GGGCCAGGTAAGGAAAAGAAAGGAAGGCCAGGTTAAGGCTTAAGGCTCCAGCrCCACCATGCCCTCCT 

ATCTCCTTAGGTAAAACCTGCTACTAAAACCCCCCAAACGAAGACTTCCGTAGCTAAAA 

GTCTCCACCAGCTCTATAAGCTAGGCCAGTATTTCCCACCAAGGC^^

TTAACGTTATACAAAATGACTCTCCCTGTCAGAGTCCCAGGGAAGCTTGGAGTGTGGCACAGGTCCCCCTGAGG 
GGAGGAAGTCTCATCCTGCACCTTGGCCCGGAGCCTCACATAGGCAGGCGCATCTTCTCAGGC^ 
AGCCTGGACCCTGGAGAGTGGGGAGAAAGAAAATCTGAACAATTCCCTCTGTCTGCTTGCAGAAG 
TACAGAAGTGTAGGATGACAAACTCCAGCACTGCCAAGGCACAGGTGTGTGATGGA 

3 0 AG ATGTCGGCCGGGTGCGGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGG CCAAGG CGGGCGGATCATGAGGTCAGGAGTTCG 

AGAC CAGC CTGGCCAACATGGTG AAAC C CTGT CT CT ACT AAAAAAAT ACAAAAATT AGC CAGGCGTGATGGCAGG CGTCTG T AGTC 
CCAGCTACTCGXMAGGCTGAGGCAGGAGAATCGCTTGA^ 
CACTCCAGTCTG^GCAACAGAGTGAGACTCTGTCACAAAAATAAAAAAAAGA 
TAGCAAGTAGGGCAGACCGGAGAGCCCITGCCCC^TCCAGAGGGGCAGT^ 

ACCTTCTGTGTGGCCAGGTCTTCCATTTCTTAAGGGAAGCTGGAAGTCCCTGTG

TTTCITCATGAGTAAAACAAGCACTGAGATGAATGCTCCCTAGCCCTAACCTCTG 

GGTGAAAGACATTGGCCCCACCCTCAAGGAGCTCACAGTCTGTGGGGC^GACAGGTGACCGGATTATCGC^ 
TGCTGACAGCTGAAAGAGCAGGTCTCnX3AGAGGCCAGAAAAGGAGGAGATGATTGAGCT 

GTGGAGAAGGCCCCTCAGTGTGTGCAGAGGGGCTGGG CATGAAGGCACATGGTCACTGAAGGCAGCTGTCCTGCAGGCCGTAGG CC 
40 CTC CCAGG TG CTTGTT ACTGGT AG ATTT ATGGAGG AC CT AAATG CC AG ACT AAGAC ATAGGC C C CGGC CAT ATTTTGGG CTGTG T A 

GTTTTTTTGTTTTTAAGTTTTCAATGAGTTCTTAGTCAGAATGGCCT 
AGGGGATTGACTGTCTTAAAGTGCACAAAGTCCGAGGTTCCAGCT 

AGATTTCTTCATCTCTG C ACTCCACTAG CTTTATGGTCATAG CAGGGCTGCAG CATCTCCTAAAGTG CTGGGGGCAGGCCAGGCAC 
GGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGACAGGCGGATCACTTGAGGTCA 

4 5 ACATGGCGAAACCCCATCTCTACTAAAAATACAAAAACTAGCTGGGCCTC^TGGCACCAGCCTATAATCCC 

ATCACTTGAACCTGTGGAGGAGGAGGTTGCAGCCTGGGCAACAGAGTGAG 
GGGGGTAGACTTAAGGCTACTCAGACCTCAGGCTGGACGTTCTA 

G CTGGG AGGAGGGCCCACCCTCCCTGAAGCATGTGGTAGGGG CCACTGAACAGAGTGAGGGTTCTATTAGCAAGGAACAGATGGGG 
G CATGGCTGTTAGGAGGCAGCCTGCAGTGTGCATTGT AGTTG CCCCATTTAAAAATTGGAAATGACATGTACAAATCTTC CAGCTT 

5 0 CTCTTGATAAATTGGAAGCTCAGGCAGCCTGGGTCCTACTTTTTTG 

GGG ACCACTGAG CTGCCATCACCACC AGCATTTCCACCCAGATCTCTGAGTCCTCCTCTGACTG CAGTGCTTGCTCATCCTTCCAG 
CCCCCATCATCACACCTGCCCCAACAGGAGGGGATGGTAGCCAGTGGAACACrTTCTAGGTGGCT 
AC^CCGGAGGTGAGCTGTGCTGGGACCACTGGCITGTGCCTGCAAC 
GCCAACAGCCCTGCAGCCCTCAGGTTTTAGCTGCATCGGAGTCATAAATCCCT 

5 5 CAGGGTGGAAGCIATGTTTCCAGTGGCTGTGGGG CAGGTGAGGCCAGCAGGTATCCCTGTGCATGTGCAGGAAGGGG CCACG CTGCC 

TCCCCTGGAAACCAGAGTGCCTGAGGGTCATTGCCTACrGGCTGTTTCCCCCT 
TCCAGGTCTTGTCCCCTCAGAAGGACAGTAACTCCAAACCTGCCAGAAGCAAGACCCTG^ 

CTGGG CTGG CAGGC CTTG CAGG AG AGGG CAGAT CTTGCC CAT AGGGAAAAT CTCGC CAT ATTT AAAC AG CACCT CACAGCTTC CAG 

AGCTCTTTCCTCTATTGTGACATTCATCTGAGCCCTGTGAGGCAGGCA

GGTGTTGGGGCGAAGCCCAGCTCTTCCACCCAACAGCCCCATGGTCCT^ 
CAGCCATAAAATGGGGACAGTACTGGCACCTCTCTCCTGCCTCTGTGCACTGGTGC 
GCCTAAGAAAGGGTGCCTGTTAACTGTGAGAAGGGCAGGGAGTGTTGCCCCATTTGACAGATTG 
GCAGTTTCCCCAAATTCTCATGATTCATAAGAGGCTGAGGAAAGGTCC^VAACGCT 

CTGTTTAGGACCAC^GAGTCTTCTTGAATCCCATGAGCTCTGCCCAGGACCCTCT

CACTGCACCCAGAGCTTAAGCCACAGTCCTGCTCAC^CTACACCTCC^ 

GAGGCCCAAGTC^GCCATGCTGGTGTGGTCCAAGCTTCTGTGTGAACCCTGGCCC CCTTTGT CCTCC 

TCTAGCCCCAAGCTCCCAGAAGCTTCTGCCAGGCACCTCAGAGCACCAGCTCCCCTCT 

TCTGCAACCTC CAGAC CAG AG CGGTCTCATGAACCACCTC CCCTTG AATATCACAT ACCACATTGTT ATCAATT CATCAGT AAGGT 

CAGATGCTGTTTGCACACCTCCAGCAATAGGGTGCTCACTGCCTCTCA^

ACCACCAACATCGGTAACTCCCTGTATTCTCATTTTTCTGTTTAATC^ 

GCATTTTAAGGAACTGGGAGTGAGGCTCAGAGAAGTTGAGGGACCTGTCCCAGCATCACATC 

GTGTTTGAGCAC^GGCCATTCACATTCTGACCCGTTGCCT 

TGTGGGGGTGGACTAACACAATTAAAATGAACATCCTGCTTAAACrc^
TCTCTATTTAAAAAGCAGAAAAGAAGTCAGCGGTTTGTACATTCCCCCAGGAGGATGCGGGGAAGACGG^
TGCAGCCTCTTCCTGATTGAGAGATTTTTAGTTCCTAATCCCCAGCTCACTA^ 
GCACCTCTCTATTGACAAGTTTCAGTCTGATTTTATTAGCCCATCAGTGG 
CAGGCTGGGAGGCAGGGATAGATACCTTGGCTCCATTCCTAGACCCCTCACTGTGCCTCCGC 
GCCAGAGTCTGTGCTGGGGTCAGGACAGCTTCC^GATTTAAAAACAAGT^

CCACAATGTATATGTGTATCAAAACATCATGTTATATACCACAAATATATGCAGTTTCAC^ 
LTTCrrCTATTAGTTTGTCAAAGACGTAGGCCCTGCTCTCA 

AATTCTGGTGTTACATTGAGGCCTAGCATAATTGGTTCTAGAAGGAGCTCCAGTCT 
CTCTGGCACCAGTTTGCTGGTTGCCCTGGGTGAGGCCACTTGCCTTCT 
CTGGGTTGGTGGGGTGGACCTTGAGGCTCCAGGCCACAT^

ATCTGTTAAGCAGCCTCAGGCAATTCTGAGAGAGGTTCTTGGAGTGTTTG 

G CCTAGGCCATCAG C CCC CTCTCC CCACT CTGTGACTTG CTGG AATG AAGGCACAG TT AGAATGAG CTCAG CTTTGACAGGATGGG 
GTGAGCACTGTCCAGAAAGCCCCTAGGTTTGGTTCTGCAGGCATCAGACCT 
TCCTTACTGCCCAGCACCTATAGGGAGGTACAACCAAATGTTCAGGCCAGTC 
TCTAACTGACCCCAGAACTACATTCCTGTGTATACTGTGCT

AGGCTGGGATAACTACTAGCCGTTTCATTAAACGGGAAACTGAGGATCAGAAAGTTGAGGG 
GAGAGGCGTAATGAGAAGCAGGTCTTCTCCTATACCCTGGGCAAGCTTCACT 
GCCCCAACCCCCCACAGC(^U«5CCCCCTGCACCACCTCCCTCCTATGGCCCAGAGG 
AACAGGAGGCTTTTGCTGCTGAGCTGCTTTTAACATCCCAAGACG 

2 0 CGACAATCTGTTCCTGTCTTCACATATATATAGATCCCCCCTCCCTCAACTGTAAGCC^ 

T ATATCGTGGGAAGGAAGGCTCTGGAAGTGGCAGATCGGAG CAGCCGCGGGGGAGG CTATTTGGCAGGGGT AGGAAAGGCCTAAGC 
CCAGAGAAGCTTCCACCCTCAGGCAGGCACCTTTCTGTTTTACT^ 
GCCAGGTGTGCTAAGTCAGGCTCAGGGGGTAAACAGGATGGTGAAACAGAGA 
TCAGGCAGCTCCCTCTTCCTGCCCTCCAGCTCTTCACTGGCCAGAGAGGGAGCAGAGCT 

CCCTCX3TGTGCGGTGAACATGTTCAGGGATGGTGCAGTTCAAAACCAGACTGGCCTC 
TTGCTTTTTCTTTTATTGTTTCCTTGGAAAATTTAAA^ 

CCTCAACAGTTAGGAACTCCTGAGCAGGCTCGTTTCACGCAAACCCTCCACTCAGT^ 
AGTCTCAGACATGTCAGCTCATCCACCAGGATTTCAGTATGCAGCTCTCAGTTATAAG 

TGTCACACCTAATAAGTTGATCTACCCAGTCAGTGTCCCT^

ATTGAGTTATTAATCCTCTAAGTCTTAATCTGTTGGTTCrCACCCACTCACCCAA 

AGAAACTGAATTGGTATAGCTGGGGAGTTTCATCCTGGGCCATGTATCAGGAGGGCrGCAT 

AGACTGTAAATACCTCCCCAGGTGAATTGCATATTCCAACCAGGCAGCCTTGGAAAACT 

GTAACTGTCTCTGAAGCCAGCCGGGGAAAGAAGG CAG AATTGGACATTCTGG CCGTGGCTTTGCTTCCAGTAGGG ACAGCAGCCTT 
35 GGGTCGCTCCCTTTC CCTCTGGGCTTGTGGTTTTCCCACTTTGTAT AAGGGAGCTGATCCTG CTGAATGTAGGG CTTCACTGTGAG 

GGTCAGGTGGGCTCTGGGAGTTGTGAAGTGCTGTGCTGGGTCTTAG CAG ATGTGTGGGGTGGGAGGGGGCACCCTGGG CAGCTGGA 
ATGGCTTCTGTGCCCAGCTGCCCCTGAGCTCAGTGGACCCTTTGCCTTGTAAAACACTC 

CCAAGCC'rrTAATCACTGGGGGGGTGi"! TTGTn iTGTTTTTCAAGGTGATTAAACCCCCTCTGAi^ vi l GTCGACCCTAATCGTA 
GTCCAGCTGGCCCAGCTGCTACACCCGCACAAGCCC 

AGCTCCTCCTCCGAGAGCGAGGATGAGGACGTGATCCCCGCTACACAGTGCTTGACTCCTGGT

GGGAGGGGCTGCCAGCACTGCCCCACATTGAGGCCCAGAAGGGACAGGACIACTGGC^ 

GGGCTTGGGGTCAGGTCTCTGGGCCCTG CATTAGCCATT CTGATTAGGGCCTGGCTCAG ACCTGCAGTTCTGGGGG7ATAGTTGCC 
AGAGCTTTTTTGTGG CAGCC CCAGTCATGG AGCCCAG AC CAGTGG CTAG AAGCTGCAGGGACCAGGTTC CT ACATAGTCTAAGAAA 
GAATTCTGTATCAGTTGACrGCCTGTGGAACAAGTGGTCAGGGTGGTAGTGAGCTTCCCGTC 

AATCTGTGATTCTACCAATGACAACAGGAAAAGGGTTAATC 
ATCCTGATAGGACAATTTAAAAATGCAGACGGGTGCAGTGGCT 

TCAC CTG AG AT CAGG AG TT CAAGACCAG C CTGGGT AACATGG CGAAAC CC CGTCT CT ACT AAAAATACAAAAAATT AG C CTGG CGG 
GCGCCTGTAATCCCAGGTACTTGAGAGGGTAAGCCACGAGAATCGCTO 

5 0 CCACAGCACTC CAGC CT GGG CG ACAGTG AGACTGT CT CAAAAAAACAG AAAG CAGG CAAAAT AGAG ATT ACTG CT GGGCATGGTGG 

TTCGCACCTGTAATCCCAGCACTTTAGAAGGCCAAGGTAGGAAGATCGCTTGAGCCCA 
GTGAGAGCCCTATCTCTACC^ATCTACCTACCTACATACATATGTACZATATACATATACTTATAC^ 

TATA'l"l"iT"l 1' ' 1 AAT AAAAAT ACAG ATT AG CACAT AAG AAACT AT AT CAC CTG TAATTCTGCC C CAAATAATT ACTG TT AACATTTG 
ACAAATGCTAACTTTATAAAAACCTAAAATATACGTGTGT' r T TTAAAGCAAGACCATACTGTATCTAATGTTCAGTAACCTTCACT 

5 5 GGATATTTTATGACATCGGGTTTTTTTCATOTTTACATATTTGCAGCACAATTCCT 

ATGCTACAGTGTGTGTAGCCAGCACCATGTTGGAGGAAG CIAG AGGGTTTTGTTTCAGGGTGGAGGGGTGAGGGAGTTGG CTGTTAT 
AAACAGTGCTTTGAAGAACATGCATGTAACCAGGCCTCT 

AGAGTCGTTTGGTGCCGGTGCTCTGGCCCATTGCTCAGGAGTGGCAGCCCGACCAGTTGGGAGGGGA 
GTCACCCCCAGAGCAGCCATTTGAGCTCCCTGAGCCTTGTCAAA 

6 0 GTAAACAAGCTTCTACCTCTGTAAGCCCTGAG CACAG CT CTAGATCACCAGCACAGGCCGGTAAATTGGGTTATTGCCGCTGCTGA 

GGAGGCGATGGGGCTTTTGTGCCCTGATGTGCCCCC^TCTCGCTCCATTTCA 

C CCACC CAAG AAT AG CC CC CAAAG C CAG CATGGCTGGGG CCAGCAG CAG CAAGG AGTC C AG T CGG AT AT CAGATGG CAAG AAACAG 

GAGGGACCAGCCACTCIAGGTACCTGGTGGGCAAGGGAGGGTAATGCAGGCCACT 

TGTGCTGATGGGATGGCCTGCAATTGCTGTCACGCCCACACT 

6 5 OTACAACCCCCCAAATCACCTTTGAGTTATGAGGAAGAGTAATCTGGGGACACAGCCTCCT 

AGTCCCTCATGAGATTTTCCAGCGTAGTGGGGCTTTTGATT^ 

TTTGGTGTTTGTAATAGTGACTTCAGCATTCTTCCAAGCCTTTACCATCTTCAGTGC^ 

AAGCTGGGCATTTTAGGGTTAATC<;CTGGTGCATTGTTTAGATC^ 

ACCAGCATTCAGCAAGGCCGGAACCCGGGTCTTCTGAGACCCAGTTCATTGCT 

CTGAG^GGATGAAGTCCATGACACACTCTGACCTCTCTTCCTGAGGGGTGGGCT

GCCACCCACTCCCTGGGCTCATCACAAGCTCCAGGTCAAAAACTTCT 

CTG CATC CATGAG CATCGGTT CCTG CCAGG CC CT AGGGATACTG CAGGAACCAAG ACAGGGACAAGGAATTCCCAGGCCAGTCAGG 
GAGTGGTGAGGGGCAGCGATTAAACAAGGAAACCCACATATAAATAAAAAGT 
CTAGAGAATAGTGGGGTGACCTGCTATATGAAGGCCTCCTGAGACAGCGATC 
7 5 CTGGGCCACCCT CCAGG CAGGGAAACTGCAGGTACCAGGTCTTCGGGGG AAG CG CTTGGGAATGAGGGAAACTGAGAGGTGGGGAC 
AG CTGG ATGAGGAGCTGGGGG AG AGGAJGATG AGCCAGGCCGTGGCAAGAGCCACG CCACATTTGAAGGCTCTCAGCAAGGGGTCAT

GCCCAATTTCCATGCCCACCACTCTCCCTGACTGCTAGTGGTTATAAGAAGTAGAGATGGGGG 

CAGTGAAGGTGCAGATTGAGTAGTGGGGCCTAAGCTGCTTCCCTGCCCTGGGCT 

GATTCGCTCATAACCTGAAGCCTGCCACCAGTTTTGCCCCTTTGACTGCCCCT 
AGCATCTGACCAGGGTGTGGCACAGCTGGCTVTCCCAAGGACTTG^

AGAACCCAGCTTCCCTCCCACTGACCCAGGCTGCCCTGAAGGTCCT 

ACCCAGCCTTCAAGTGGGGTGAGCTTGGGGAGCCAGGGGAAGCAAGCC 

GGCACCAGCACTCATCTGCCCCATGACCTCTGCACTTGGTTTGCCTCCCTCGGCCT 

GTACCTTTCGTAGTTGGAAGGATCAAATGAGGCTGCATGTC 
GGACCAGGTCTTACTTGCCCTAATTTTTCCTTCCATTCCTTCr

GCAACAAGTCCCCAGAGCACCTCCGTCCAGGCCAAAGGGACCAACAAGCTCAG 

CAAAGCCCCTGAGAGCTCAGATGACAGTGAGGACAGCAGCG^ 

CCAAGTCAGCCCACACGCTGGGTGAGGGTGCCAGGGGAAAGGCAAGGGTGG^ 

ATAGCTCTGGCCTCAGCTCCATATCTCAGACTCATTCTGCACCTGTGGCTGAGCCTGG^ 
1 5 ATGAGG AAGT CAATCCAAATGATTTCAGAGACCTGACTTTG CTGTTTGACCACTCTCAG CTTTTTGGTATCAGACTCCCTTCACTG 

GCTCCCAAAAACTCCAGGGCCATGTTTCTGGAACAGTC^AAA^ 

TGGCATTCAAATGTCGCACCTAGCATGCTGTGACTAGCGATAAGTGTGCAAGGAGTGT^ 
CGGGGCAGGGGATGGGGGTGAGGGACCTGCAGAGAGACCAGGGCCTTCCT 
TGGGGCTACCAACAGGATACTGTGCTTCTCCAGTAGGTCCCACCCCCTCCAGGAC^ 
TCCAGCGAGGATGATGTGGTGGCGCCATCCCAGGTAACTGCAAGGGAGAGGACTGG
GCCAGGCTCCTGTCTACCCGATCCCTCAGGTCAGGGGTC^ 
GGCACCACGTGTGGCCGAGCCAGCAGACCACAGCOVGGCTGC^ 

GTAGAAGGGATTGACCTTGATTCTGGGTGATGTCCGGACAAAAACATTCTTTAGCAGATTTATT^ 
GCAAAAGCTAGTAATTTTCCATTTGTATTAATAAGAAATTTCCTTTTTAAATC 

TCAGGAGGCACACGCCAAGGGCTGAAGTGCTGCTCTGTCCCTrGTTGTCCACC

GGCCCCAGGCTGCTTTCCTCACAGCAGGCCATGACTCGGGCTGGGTTATGGCAGTGGGGTGGGGTGGTGG 
GGGCTTCTTCAGTCTCTCCTCTCAGGTTATATGA 

AGACTCCAGCCCCTCAGTTTCCTCTACTCTGGCCGCCAAAGATGACCCAGATGGCAAGCAG^ 
GCATGTTGTCCCCTAAAACAGGTAAGTTAAGGTCTGCAGGAG^ 

3 0 GCCCTCCTGT AGCAACAGT CCTCCCAACATGGTCCTGTCTGCCCCCAGAG CCTCTCCyU^TGAGACCTTGTGGCCTTGCATTGGAC 

CTG AAGATGCTCATCAAAGTGCAGGTGGGG CCAGGTGCAGTGGCTCATGC CCATAATCCCAGCACTTTGGGAGG CCG AGGTGGG CA 
GATTGCTCAAGCCCAGGAGTTCAAGGCCAGCCTGGGCAACACAGAGAGACCCCATCTCTAAAATTTATTT^ 

TGGTGCACGCTTGTAGT CCCAATTACTTGAGAGGCCAAGGCAGAAGAATG ACTTG AAGCCAGGAGGTTG AGG CTATAGTGAGCCAT 
GATCATGCCACTACACTCCAGCCAGGGTGGTACAGTGAGACCCTGTCTCAAAAAAAA^ 

TGGCTCCCATGGCCCTGGACAGCCTGGAAGTGGGGGTGTATC

TCAG AGG AAGCTGCG TCTTTTGG CTTGTGGAGCAAGTGTTGGCTCTCTCCAACCAT CACCATCAACTTTTCTTCCCTGT AAAT AGG 

CTCACGGTGTGGGCCTTGGGCCTCAGAGGATAGCTTAGACAATAGGCTGACAGTCACGTTT 

ACCCCAGGCAGTAGACTGGCCCCAGGCTGTTCTGACTCATTGTAAAGCCTTGGAAATCCCCTGGCCACT 

AAC CAT ACAAAAC AG CAGTTGATGT AT C CAG TCTGTGT CAAGG CAT GG CACATG T AG AAAAT AGT CACATTGT ATTTGG AGGGCAG 

ACTGGATTATTCAATTCACCTAACCTGGGGCATCCACACTCCCTG

CACTTGGGACAGCTCCAGGGCTTCCCCAGCCTTGGTGTTCTGGGTGTTTGCAGATAAGCAGC^ 
CTCGTGCCTATAATCCCAGCACTTTGGGAGGCCAGGGCGAGTGGCTCACTTGAGGTCAGGA^ 
GTGAAACCTTGTCTCrACTAAAAATACAAAAAACTAGCCAGCATGATGGCAGGCGCCT 
CAGGAGAATCX3CTTGAACCTGGGAGGTAGAGGTTGCAGTGAGCCGAGATTGCACCACCG 

CTCCCCMCTTCCCTCTCCATTGGCT^ 

CTCTGCCTAACCTGGCCTGCCCTGCCCAGTCCACCTCCCCCA^ 

GGGGCTGAGGGGCGAAGCTCACTGCATGCTGCAGGGGGCAGCCCT 

AGTATCCACCTGGGCCTTCCTCCAGGCAAACACCrCCAGGACCAAACCACCAAGCCACCT 

GTGCCCATGAGACCCAGAATGGGGCAGAGACCTCCAAGTCACTTGGAAGTTAGAGCCGCT

GCTXSCAGCTTGCCCCAGGCTGGGTCGGAAGAGGAATTGCTGCTCACTTTCrrCCCC 
GAGACATTTTAGAGGGGGCTGTXXSAAGGTTTTTTTTTT^ 

GAAAATTCTAAACAAAGGCTGCACCAGAAGTCTCTCTGGCAGGCAGTAAACCTGC^ 
CCCAGAGCTCCCCACCTTGCCTGGACCCCCTGACCAGGAAGGGCCGAGTGTGGGT 

CTTTTATTATAAATCTACAATGTATCATAGGAAGCCTTACTGT^

ACTCCATCTCCCTCCTTTCTGAATTT^CATTCCATTGATAGATTTCAGGCCC^ 
CAC C CT AACTGG AGTTTTT AGT ATAAAG AG AGAGGG AAAAGCTGT AAGTTAG AA^ 
GTGAGCTTAGGAGACCAGCTGGGGCAGAGTGACCGCCAAGCCTTGCTCTCCCCATCTGTGC 
60 CCCTGCACCCTCTT CG CTCTT AGGTACCATAAG ATCTGT CC CCCAACTCTCCCAGATCTGTGACCCCACATTCT CATAGGT 

GGAAAAGAGGCTGCTTCAGGCACCACACCTCAGAAGTCCCGGAAGCCCAAGAAAGG 
GCTGCAAAGCAACATCACCCAGTGCCTCCTGGGCCAACCCTGGCCCCTGAATGA^ 

TG ACTG AGCTG CTGGAACAGGAAAG AAAGAAGGTGGTGGACACCACCAAGGAGAG CAG CAGGAAGGGCTGGGAG AG CCGCAAGCGG 
AAGCTATCGGGAGACCAGCCAGCTGCCAGGACCCCCAGGAGCAAGAAGAAGAAGAAGC 

6 5 TGTTTCCCCAGAAAAGACCTCC!ACGACTTCCAAGGGGAAAGCAAAGAGAGACAAAG 

AGGGGTCTCTTGGCTCCCAAGGGGCCAAGGACGAGCCAGAAGAGGAGCTTCAGAAGGGGATC 
AGCAACCCAAAGAGCAAGAAGGAGAAGAAGAAATCCGACAAGAGTGAGTGACCGCTTCTCCC^ 

ACCCCACGGGGGCGGGAGGG ACCCTCAGCCAGCAC CTGGTCTCATTCCTCCCATGTAGAAAAGAGAGG CTGAGACCAGCCTGGCCA 
ACATGGCAAAACC CCGT CT CT ACT AAAAAT ACAAAAAATT AGCTGGATG TGGTGGCGGG CAC CTGT AATCTCAG CT ACATGGGAGG 

CTGAGGCAGGAGAATTGTTTGAACCCGGGAGGCAGA

CAAGACTGTCAAAAAAAAAAAAAAAAAAAAAAAAGGCTGAGAAGTGGTTTGT 
AACTCGATTCCAGAGCACTTGAGTCAG»CTGCCAGGAATG 
CAGCAGATCTTGGAGGAACCTGCTTCTCCAGCCAGCTCCT 
jtf^CCCA7CTTGTCTCTftGACCTC\GTTT^ 
AGGACCTCATTTAGGTAGAAATGAGACTCAGCAGGTAGAAAACTGCGGCTCGG^






AGAGTTTTGCTAGGTCX^CCAGAGTCAAACCTAGGC 

GAGCCAGCTCGTTCTGGAACAAAAGTOSGGCACCCTCAGAGCAGGACT^ 

TTTAOVGTAGGGATAGATAGTGTCCATTTAAAATAAATT^ 

TCCAGGTGGTCCATGGATGGGACAGAAATCATGAGCrTGGCCTCCAGGGGGCAGGTG^ 
5 GCCAGCCTGTITTATTTTTATTTTTTGAGACAGGGTCTTGCT 

CAGGCTCAGCCTCCCAAGCTCAAGCGATCCTCTTGCTTCAGCCCCCCA^ 

TAATTTTTGTATTTriTGTAGAGACAGGTCT 

CAGCCTCCCAAAGTGCTGGGATTGCAGGAATGAACCTCCACGCC 

AAAGCTGGGAGCCCTGCCCTGTGCACCTCCCAACATTGACCCCAGCACTTAGGAT^ 
TTCCCTTAGGAAAAAAAGACAAAGAAAAAAAAGAAAAGAAGAAGAAAGCAAAAAAGGCCT

CAGAAGAAAAAGAAGAAAAAGGTAGAGAGTTCCTGGGGTGTCTCAGGCCAGAAAAC^^ 

CCAGACCTGGTTCCTGCCCCCTTCCCATTTTCXSGGGCCCAGTCGGG^ 

GGCATGTTCCCTGAGCACATGGAAGCCGTAGAAGTGGCCTGCCGGAGGTAGAAGGGTG^ 

GCACAGGGAAACCAGGTAGACTGTGGCAGGTTTCCAGGTGAAAGGAAACGTTGCCTGGGCTGCGTGTGCCC 
CCCTGAACAACAGCTGTCCTCACC^CCCGGGGCCTTAGCAC

TGTATTTAGGAGAGCTGAACATCTGTTTGCCTCTGCCCTTGGAGGTCGCTGCAGACCCAGTATCTAT^ 

CCTCAGGAGGTGGGGGCAGCAGTGGGTGGGGAAAAGCTGCAGGTCTGAGAGCCTCTCGTACTTCCCT 

AGAGCAG ACTGT ATGACGAGCACCAG CAC CAGG CACAGGTACGCTTC C CAATCATTC CTG AG CATTCAGGGTGGGAGGACAGCT CT 
GGTGTCCCCTGTGGTCCCAGAGAGCCAGGCAGACCTGATAGGTGGGCTCTA^ 
2 0 CAAGG GT CCATGGGGCCACTG GGGTGACATCAG GAG CTGGCACTGCAG CATC AGAAGGATC CAGAACAGT AATGGCAGTGGTGACA 

CCX1AGTGCACCTGGGTGCAGGGGTTGTGCATGCTTTAG 

TACTCAAGATGAGCTATCACCATGGATCTATGCTTCCCTATGGAGGACAGG^ 
CTGGCTCTGCTGGCCCTGTGGTCACCCAGCCTCCTGTAAATGGG^ 

2 5 GTGAAGCTAAACTAAACACTAAAGGACTCAGCCCIAGTTCrCAGA^ 

CCCCCTCCATTGGAGATGCCAGATTTCATTTTCCTTGC^ 

TAGGGATTTCCTAGCCGAGCAGTGGCCATCCCCATGCCTCTGACCTCCACCGACCTCTGCCC^ 

ACCTTCCCTCGCTCCACAGAAGAAGACAGCCAGCTTCAGGGGTCCCTG 

AAGGAGAAAGTGGAC(2AGCTCCCATGACCTCACCCCACTCCCCCAACACA 

3 0 TTTTAAGTGACCTCCTCTCCTTCCACAGACCCCACATGCCCAAAGGCCn'CGGG 

TTTTTTGTTTTTTTTTTAATAACTCAAAAAA 

TTCTTGGAAGGGACTGTCTCGCCGACACTCGGGATTCCCTCTTGCCTGCATGTTGAGGCT 
GCAGTGAGCAGTGAAGTGCCTGAGTGCCGAATTGGTGGGGAAGGCTTCCTGGATGAGGTG^ 

AGGATGGTCATGGTTGATTTCCTGTTGCCAGAGTAAGTCATGTAAATACCCA^

GTT AGG CCTCGAG CT AGGGG CAGG CT CCCTGG AGGTGC CIGGTGAACAGCTGTGG CAGGCACAGCAGG T CTAGG AAAAT CGGAAGT 
GAGTTGCCCCTCAAATCGAGCTCACCATGGTTCTGTGGTGGTGAGGGCAA 

GGTGTGGACTGAGGGAGGGGAGGGAT AGCAAAGGAGGTGGTCTGCTG CTC CAGCCTGGAAATGCAG AAGC CCACTTCCCTGGGCCC 
AGAGCCATGTGGAACACCAGGGGTCCTGGAGATGGTCTCCAGAGCTT^ 

4 0 AGGAATGCATGGCTGGAGGTGTCGG AGACCCAGGTTGGCATCCAG CTGCACCCTTGACTCGGGGC CCGGGGCCAGACCCTCCACCA 

TCTGGCCTCT(n , CCTCTC!AAGCATGTTGCTCTAGAACAGAACTACAGAGTAGTG^ 
CTTTAAACTTAAAAATGGGGTGATTTCACTGCAGTC 
GCTTCCTCCTGGCAGCAACCAGATAGAGCTGAGTAGCAGTCAGCCCCTTA 
ACTCCCTGTCCATCCTGACACGGAGGCCCTATGTCGGTTGCCATCTCT 

GGGGACATAGGAAAATGAAAGCTGTACCAAGAAGAGCTTGTGCTTAAACTAGGCAA

TGTGGGTTACTTGGGCrGOSTTCTCCCACCTCAGTG 

CAACAGAAGAGCCAGGAGCCAGAGTGTTCTAATTACTACCTTTTATTC^ 

GGCTGAGCAGAGGGAACTAGGGGTCGGCAGAAAGGATTATGGGTGGAAAAC^TTC^ 

5 0 GGAAGAGAGTGGCTCAGCCTGCAGGTAAATAGGCTAGAAAAGC(ZAAGG CCAAAGG CTGGAGGGGAGAGGACAGTCAGCATGTCCAG 

CCTGGGGTCTGGGTGTAGGGTTATCCCTTCTCCCTGTGCCTTCCCATCTCGTCCA 

G CTG CTGTG ATGT CAGGAACGGGGATCTGT CTAG CTTTTGG CCACTTC CTGGG AC CT CACX5C CCCTG TTG ACAGATGGAGATTGGG 

CAGCAGGGCCTTGCTGCATTGTTATCTGCTGTTCCGACTTGGTTTGTCT^ 

CATGGGATGAGGTACAGGGTGGGAGATCGGGGAGGGGCTGGGGGCTGAAGG^ 

ATGTTGAAGACCGCCTCTGCTGCTCTCACATGGGGACTGGAGAGAGAAGCAGCAGGTTGAGGTTGGG^

TGC CCCITTCT AACATCCTGGACCTGCAGAGCAGTTAAGGACT C CCCCTGCTCAGGTCCAATAATGACCAT 

CATGGATTTGTGTAACTCCCCACAGCCCCCCAACCCTGTTCCTTCGTCTGTCA 
AGCGTCACACTCCTTCACCTGCCCACAAAGGAGCTGGCCCTGCTGGGGATGAGCTGCT 
TTACCTGGGCCCAGATCCTGCTTGGTCACACCC^CCCAGAAGACGG^ 

6 0 CCXSTTTGTCACTTGAAGTGCAGCCTACCTGACCCAAGATGCCTGAG^ 

AAGCAGGTACCCAGGGAAGGACAGCTGC^^CAGCAACAAAGAGGTGTAATGGG 

TGGGTGGGTCTGTAGGACTCCTGGATGCTAGAGGATGG CGCGTGG ATGG ATG AAATTCACAGATGAAAGGACTGACGGATAAGTGA 

ACAAATATGAGTGCATGTAAGAGGAGAGATGATAAATGGACAAATAGGAATGCAC^ 

TGACTACGTCACTCCCCCCAGGAGCTGCCATCCTGGCTCTCCTGGTCCCATGTCCT 

CTAGTCCTTCCTGGTCCTCTGACACTGGCACAGTGCCACTGCT^

CTCCGTGCCGTTGGGGAAGACACACCAGCAGTAGCCGATGCTCCCATAGCACT 

TGGG CCTG AATGAACCCGGGTGGACAG CAGGG AT GTGG CTG AC CT CTT C CTGG (ZACTTGGTCAGT ACTGAAG CGACAGG CATGATG 

AGGACACAGTGAGTGAGTGAGCTCTGAACCAGGGTCTGAGCAGAGCTAAA 

TCCCAAGTGGCTTTACTCACTCCAGCCATCACACGGATGGGAAAACT 

GAATGGCCATGTGTCCAATTCACCTCAGAAGTTGATGGTTTCTTCTC^CACTC^

GCTCATGCCTGTAATCTTAGCACTTTGGGAGGCCGAGGTTX^CG^ 

GGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGCACATGCCT 

GCAGGAGAATTGCTTGAACCCGGGAGGCTGAGGTrGCACTGAGCTGAGAT 

ACT CT ATCT C AAA A A C AAA C AATCAAACAAAAAAACACCATT ACAGTTTGG AATTG CACTT G CGATT CTGATTCT AG AGAAAAG CA 
7 5 ATGGGCACCTTGGTAAGTCTAAGCrTCCTGTTACTGGGAGTTCT 
GGGGTTTAAGCATCAGCTAGGAGTTCGAGATAGCATCTAAGGTGCCTTCTATC

TGGCCCTTTGACCTTCTACTTCAGGACCCTGAATCCTAGAGGGAAGGGGACTTG 

CGGGATAGAAACAGCACTCTGTCTTTGGGTCTAAAAACATCCrAGTCTGCATTGTCCATGAG^ 

GGCAGCAGCTGTGACCTTGGCGCCCTCACTGGATTGCGGGCAGCA.CTGGGG 
CGGGCAGGTTCTACCCCAGGCACCCTGTGCTGTACCATCCTCTATGAGAGAG^

GCCCTTAGGAACCCTTAGACAAGATGGGGGTGGGTCAGGGGAGGGGGACAGCCAAACCAAGA 

AGTGAGAAAGAAGTGAGAGCTGAGAGGATGGGAGATTCCGCGGTGCTGGGACCAGGGAOT 

CAGGAGAGCCCCGAGGGTCCAGAAGCCCTGGGACGCTAAAGCTCCCACTCCCTGTACCTT^ 

TCCAAGGAGTGCCTGCTCATTTCAAACAGGAGCCAATGGTGCATCCAGCTCTC 
TTCGATGGCCACCACCAAGCCCCTACACTGCCTGGCTTTGCCCCTGGGGACAGAAACATG

GGGTCCAGCTGCAGCCTGGTTCCCTCTT AAAGACCCCTGGGG CTTGAACCCCACATACrcrcCAGCATCCAGGTTTTGGAAGTG CTG 

GTAAACCCCTTCAGCTTCCIGAGAATGGCrrrCCCCAGCCCACCCTCACCCCACCT 

GAGAGAGTTCCCATGCAGGGAAACCTGCTGACCTTCCAGTCTATGGTCTCCATGGTG i rC 

GCTCCCCTTCAGTGGCGGGTACACCTTCAGGGGGTCAGCATTCTACAGGGTA^ 
TGGCAGGCAGTGTGAAATGTGTAGCCGACGTTCCCTTGGTCTTCCATTCTC

GGACITCAGCCATGATGGGAGGGGAGGGGAGGGTACGGGTAAGGGTAGTCCAGCTC 

GAGTTCCAGATTCCAGGCCAGCTCTACCAACAAGCCCCAGTGTCCTCCCTGTGGATGGCT 

CAGCAGGTGAGGACCCAACCTTGACTGCCCCCAGCTAGGTTGAGTCCTCTGCCTLTr 

TCACCTGGTCACTTGTCTGTCTCCCACACTAGCCTCGGATCCCCATGGCTTCA^ 
CAGAGCCTTGCACAGTGCTGGCATAGGACAGACACGCAGTACAAATT^

AACTTTGAGCCTCAGTTTCTTTTTTTATTTTTATTTT 

AGTGGCGTGATGATCTCGGCTCACTGCAACCTCCACTTCCCGGGTTAAAGCGATTCTCCTGCCTCAACCT 

TACAGGTGACTGCCACCACATCCGGCTAATTTTTGTATTTTTAGTAGAGAC^ 

GGCCTCAGGTGATCTGCCrGCCrAGGCCTCCCAAAGTGCTGAGATTACAGGCATC 

CAAAAATGCAGAGGCCAGACTACATATAACATTTAAGGGTCACCCTTGTATTCAGTAATCCAGAGAGCACA

ATGAGCCAGGTATACAGGCCTTGGAGCTTGGCCCGGCCAGGGTCCTGGGTGGGCAGG CAGG ACCACCCAGCTAG CTCCCTG CACTC 
ACCTGGAGCAGGTGCATCACATGGTCCTCTGTCATGTTGCCATACTTGGTGGCATTCTGCA 

GGTTACCAGAGCTGGTCCCTGGAGCAGGGGGTATGGAGCAGGAGCCGGTTGCTGGGATCTAGGTAAGAGACTGGAATC 
TTGGTCAATGGTTCCATCTCrroCCACCTCCAAACATGAGAGCTCGAG 

CAAACTGGCATAAGCTTTCCTGCCCAGGGTTTTGTCCACAGCACAGTATTGTGAATATAAATA

CTTTGAGAATGGGAGAGGACAGTGAGATGCTATAGGAAATGAGAAGCACTTTGTAAACTC 

ATGGCACCAAGAGACTCTTCACCTGTGCCCTCCACACCTCGGGAGCCCGATTC 

TCAGGACCCCTCCCCAGGTGGGCTCTCTTCCCTGCCCCT^ 

ACCACCCTGGGGCTGTCCTTACCCCCTGGGGCAGGGCTCCCATGGGCAGCGCCTGCATCAGCA^ 

3 5 CTCACAGGCTTGGG7VGCTGTGGGGACAGGAACGAGGGTAAGTGTGGCCCCGGTGCCCAGGGAGGAGGGTAT 

AAGGGCTGGAATTCCAAACCGGAGGGAGAGGACAGTGGGCCCAGCTGGGTGCAGGGATGCGGGTCTCT 

AGGTGGG^SGGT ATCAGGATGTAGG^^TGCACGCACGCTTGGG AAGCTT CATC CGC AGGTTCTCC AG CTGCAGGTTCTGGGAGGTGA 

CTGTCAGTTTCTCCAGCCGGCCCTGCTCCTGGTACAGGAAGTAGGCGCT 

GAAAAGCCTGTGTACAGGGCTCCGCX3GCTGCACTTCCTGTGGGAGGTGGGGAG 

4 0 CCCCAAGGGCTGG CTAGG AATGGGG AAATGT AT ACCCC CATCTCCATTAGACCCCTAAGGCACCCCT AATAAACAATGCATTTGAA 

GCACAAACAGTAAGCTGGCTATGGAAACACAGCrTCAAGACCTC^ 
CCTGGCAGAGACCCTGCAGCTCCCACGGGCCTCAGTTCAACAGAGAAGA^ 
GGTCCTGCCTCTGCGATGAGGGGTTGAGGTGGGGGGGTCTTCCCAGGCTCCCTTTGCCX5TGCGG 
CCTCAAGGGGAACAGGTTTGGTTTTTGTTTTGTTT^ 

4 5 GTGCAGTGGCt3CX3ATCTCXX3CTCACTGCAACCT^ 

ACTACAGGCACCCGCCACCACGCCTGGCTAATTATTTTGTATTTTTGATAGAGACGGG^ 
TCGTCATCCACCCACCTCAGCCTCCCAAAGTCCTGGGATTACAAGC^ 
CTGCTGGCAGGATCAACCAGGTATGTTATTTTTTTTAATTC 
GGGTTTTCACCCCTCTGCCCAACCTCCCACCCTCACAGTGAAATCAGATT^ 

5 0 AGAAAATGTAGCCCAC CTGGGAAG AAGTGGGGC CTGGCCATTTTAGGAGTGGGCTGGGGG AGG CTTGG ACAATGT CAGG CTG AAGC 

AGGGCCAGGCACCACCCACTCTGGGGCCATCCTCTGCCCAGGCTCTCCATC 
GTTCCCTCATTTGTCACATAC^TGCCCCCACCTCTGCGATTCTAGAATTTGGTGGGTT^ 

GCGCCAAAGTATTTCAAATGGTAATATCCATCCTGCAGGGGGTGGGCAGG AGTAT AGGTG CCTG AGCCTGTGTGG CCTGGATGAAT 
TTCTCATCCTCCCTGGACTCACCATCCAGACCCTCGGACCTCAGCAAGCC 

CATGCATATGTTACAGTACAGGGGATGTAAGGATCCTGCCCCTACACCCCAGGAGTTTCTGATCT

TGGGACTACTAAGACCCTAGTCCAGGTGTTGCCAATCTGAAG^^ 
GGGATTAGGTGCCCrrcTAAGACGGCTTGATGGAAGGAGTTGGTTCTCTCTTG 
GXjAAGCCCCTCACCAGGCTGGGTGTGGTCACTCACGCCTGTA^ 
CAC^GAGTTCGAGACCAGTCTGGCCAACATCGCGAAACCCCGTCTCTACAAA 

CACCTCTAGTCCCAGCTACTTGGGAGGCTCAGGTCGGAGAATCACTTGAACCCGGGAGG

TCACTGCACTCCAGCCTGGGTGACAGAAC^AGACTGTCTCAGAAAAAAAAAAGAAA 
CTGTAATCCCAGCACTTTAGGAAGCCAAGACAGGTAGATCACCTGAGGTCAGGAGTTCGA^ 
CCTCTCTCTACTAAAAATGCAAAAATTAGCCGGACGTGGTGGCGAGCACCTGTAA 
ATCCCTTGAACCTGGAAGGTGGAGGTTGCAGTGAGCTGAGATCATGCCATTGCACTCCAGCCTG 

TCAAAAAAAAAAGAAAAAATTAGCCGGGCGTAGTGGTGGGCACCT

TGAACCTGGGAGGCAGAGGTTGCTGTCAGCCAAGATCIACACCACTGCA^ 
AACATAACAAAAAAAAAAAAAAAAAACAAGAAAGCCCCTCCAGAACTGTGAGA 

TGTTTCTTTGTAT ATT ACGCAGT CTGTGGTATTCTGTTAAAG CGGCACAAAACGG ACT AAAACAGACCC&TTGGGAACTGCAAGG A 
TGAGGCTGCACCCTGCCTACTCCAGCAACCAGCAAGGATGGTITAGGACGCTCCAGGAGTCCCTTGTGCCT 

CACAGACTGGCTCATTCTCATTGGCTTGCTGCGTGCCAGTCTGTGCT

AAGGAAACAGCAAAAACCCGACGGTGGGOSGGGGGCGC^ 
TGGCCTTCTCAAGGAACACGGGTGXSCCTAGCCTGTGTTC^ 

ACACCCAATTCTAGCCCCAGCTCTACCTCCCACACACGAC^TGGGACAGTACCTTCCCCTCT 
HUMAN SEQUENCE - mRNA
TAAGGGCGCGAGGGAAGTGGCGGGCGGGGACTAAGGCGGGGCGTGCAGGTAGCCGGCCGGCCGGGGGTCGCGGGTATGGCCGAGGC

CAGGAAGCGGCGGGAGCTACTTCCCCTGATCTACCACCATCrGCTGOSG 

AGAGCGGCCAGAAGTGTTTCCTGGCTCAGCCCGTAACCCT^ 

AAGCGGAAGGCAGAGGAAGATGCGGCACTGCAAGCTAAGAAAACCCGTGTGTCAGACCCCAT 

GGAGGAAGAAGCAGAAGCCGAAACCGCCAAAGCCACCCCAAGACTAGCAT 

CAAGCATGAAAGAAAAAGCCAAGGCAGAGACAGAGAAAGCTGGCAAG^ 

GTGGCCAACCTTCTTTCTGGGAAGTCTCCCAGGAAGTCAGCAGAGCCCT 

GGGCAGCGTCCCGGCCTTTGGAGCTGCrrGCCAAGCCTGGGATGGTC 

GCTCCAGTGATGAGACAGACGTGGAGGTAAAGGCCTCTGAAAAAATTCTCCA^ 

CCTGGGAAAGGGGCTACCCCAGCACCCCCTGGGAAGGCAGGGGCTC 

AGAGAGCAGCAGCGAGGAGTCATCTGACAGTGAGGAGGAGACGCCAGCTGCCAAG^ 

CCrCTCAGGTCGGAGCrGCCTCAGCCCCTGCCAAGGAGTCCCCCAGGAAAG 

GCAGTTGC CAAGG CCCAGGCGGGGAAG CGGG AGGAGG ACTCGCAGAGCAG CAGCGAGGAATCGGACAGTG AGG AGG AGG CG CCTG C 

TCAGG03AAGCCTTCAGGGAAGGCCCCCCAGGTCAGAGCCGCCTCGGCCCCTGCCAAGGAGTCCCCCAGGA 

CACCTCCTAGGAAAACAGGGCCTGCAGCCGCCCAGGTCCAGGTGGGGAAGCAGGAGGAGGACTCA^ 

GACAGTGACAGAGAAGCACTGGCAGCCATGAATGCAGCTCAGGTGAAGCCCT^ 

CATGGGCATGGGGCCCTTGGGGAAAGGCGCCGGCCCAGTGCCACCTGGGAAGGTGGGGCCTGCAACC 

AGTGGGAGGAGGACTCAGAGAGCAGTAGTGAGGAGTCATCAGA 

GAAAAGTCCTTGGGGAACATCCTCCAGGCCAAACCCACCTCCAGTCCTGCCAAGGGGCCCCCT 
CCAGGT CAAGG CTGAAAAGCCCATGGACAACTCGGAG AG CAGCGAGGAGTCGTCGGAC^ 
CCATGACTGCAGCTCAGGCAAAACCAGCTCTGAAAATTCCTCAGACCAAGGCCTGCCC 
AAGGTCGCCCCTGTGCGAGTGGGCACCCAACCCCCCCGGAAAGCAGGAACTGCGACTTCTCCAGCAGGCT 

TGGGGG CAC C CAG AGACCAG CAG AGG ATTCTT CAAG CAG TG AGGAATCAGAT AG TG AGGAAG AGAAGACAGGTCTTG CAGT AAC CG 

TGGGACAGGCAAAGTCTGTGGGGAAAGGCCTCCAGGTGAAAGCAGCCT 

CCAGTACTCCCTGGGAAGACGGGGCCTACAGTCACCCAGGTGAAAGCTGAAAAGCAGGAA 

AGACAGTGAGGAAGGAGCTGCATCTCCAGCACAGGTGAAAACCT 

GAGCACCTTCAGCAAAAGGGACAATTTCAGCCCCTCGAAAAGT^ 

GTGAAGCCACCAGTGAGAAACCCCCAGAACMTACCGTCTTGGCGA« 

TACAGCAGCTCAGGCCCAGACAGGGCCAGAGGAGGACTCAGGGAGCAGTGAGGAGGAGTC^ 

TGGCTCAGGC^aAAGCCTTCAGGGAAGACCCACCAGATCIAGAGCT 

CCAACACCT C CTGGGAAG ACAGGG CCTTCGGCTGCCCAGGCAGGGAAG CAGG ATG ACTCAGGGAG CAG CAG CG AGGAAT CAG ACAG 
TGATGGGGAGGGACCGGCAGCTGTGACCTCTGCCCAGGTGA 

GCCCAGCTGCTACACCCGCACAAGCCCAGGCTGCAAGCACCCCGAGGAAGGCCCGAGCCT 

TCCGAGAGCGAGGATGAGGACGTGATCCCCGCTACACAATGC^ 

CCACCCAAGAATAGCCCCCAAAGCGAGCATGGCT^ 

AGGG ACCAGCCACTCAGGTGTCAAAGAAGAACC CAG CTTCCCTCCGACTGACCCAGGCTGCCCTGAAGGTCCTCGCCCAGAAAG CC 
AGTG AGGCTCAG CCTCCnXSTTGCCAGGACC CAGCCTTCAAGTGGGGTTGAC AGTGCTGTGGG AACACT 
GAGCACCTCCGTCCAGGCCAAAGGGACC1AACAAGCTCAGAAAACCTAAGCTTCCTG 
GCTCAGATGACAGTGAGGACAGCAGCGACAGTTCTTCAGGGAGTGA 

ACGCTGGGTCCCACCCCCTCCAGGACAGAGACCCTGGTGGAGGAGACCGCAGCAGAGTCCAGCGAGGAT 
CC^GTCTCTCCTCrCAGGTTATATGACCCCTGGA 

GCCCCTCAGTTTCCTCTACTCTGGCCX3CCAAAGATGACCCAGATGGCAAGCAGGA 

TCCCCTAAAACAGGTGGAAAAGAGGCTGCTTCAGGCACCACACCTCAGAAGTCCCGGAA 

AGCCTCAACCCTGGCGCTCCAAAGCAACATCACCCAGTGCCT 

CAGTGGTGAAGGTCCTGACTGAGCTGCTGGAACAGGAAAGAAAGAAGG^ 

GAGAGCCGCAAGCGGAAGCTATCGGGAGACCAGCCAGCTGCCAGGACCCCC^GGAGCAAGAAGAA 

AGGTGGGGAGGCCTCTGTTTCCCCAGAAAAGACCTCCACGACTTCCAAGGGGAAAGCAAAGAG 

AGGAG AAGAAAGGG AAGGGGTCTCTTGGCTC CCAAGGGG CCAAGGACGAGCCAGAAG AGGAGCTTCAGAAGGGGATGGGGACGGTT 

GAAGG TGG AG ATCAAAGCAACCCAAAGAG GAAG AAGG AG AAG AAG AAATC CG AGAAG AG AAAAAAAGAGAAAG AAAAAAAAG A 

GAAGAAGAAAGCAAAAAAGGCCTCAACCAAAGATTCTGAGTCACCGT^ 

AGACTGTATGACGAGCACCAGCACCAGGCACAGGGATTTCCT 

CTGCCCACCATGGGTTGGAACTAAACTGTTACCTTCCCTCXSCrc^ 

AAGCCAGTGAGCCTGCGGGGAGGCTGGTCCAAGGAGAAAGTGGACCAGCTrc^ 

TTCATATAGATGTGTAC^GTATATGTATTTTTTTAAGTGACCTCCTCTCCTTCCACAGACCCCA 

TCCC&CCACCTTGCTCCACAGATCCAGCTAGGCCTGACCTCT 

GTCTTCCTCTCGTCAGTTCTTTTGGTTGTGTTTTITGTTTITTTTTTAATAACT 

AAAAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGCCGAC^CCAGGAAGCGGCGC^AGCTACTTCCCCTGATCT 

AGCTTGGTCGG AAGCGGAAGGCAGAGG AAG ATGCGG CACTG CAAGCT AAG AAAACC CGTGTGTCAG ACCCCATCAGCACCTCGGAG 

AGCTCGGAAGAGG AGGAAGAAG CAGAAGCCG AAACCGCCAAAGCCACCCCAAGACTAGCATCTACCAACTCCTCAGTC CTGGGGGC 

GGACTTGCCATCAAGCATGAAAGAAAAAGCCAAGGCAGAGAC!AGAGAAAGCTGGCAAGAC^ 

CTGGGAAGACGGTGGCCAACCTTCTTTCTGGGAAGTCTCCCAGGAAGTCAGCA 

ACTGAGGAGGAGGGCAGCGTCCCGGCCTTTGGAGCTGCTGCCAAGCCTGGGATGGTGTCAGC 

GGACACCTCCAGCTCCAGTGATGAGACAGACGTGGAGGTAAAGGCCTCTGAAAAAATTCTCCA^ 

CCAAGGGGACCCCTGGGAAAGGGGCTACCCCAGCACCCCCTGGGAAGGCAGGGGCTGTAGC 

GAGGAGGACTCAGAGAGCAGCAGCGvAGGAGTCATCTGACAGTGAGGAGGAGAC^ 

CTCAGGAAAAACCTCTCAGGTCGGAGCTGCCTCAGCCCCTGCCAAGGAGTCCCCCAG^ 

AGACAGGGCCTGCAGTTGCCAAGGCCCAGGCGGGGAAGCGGGAGGAGGACTCG(^ 

GAGGCGCCTGCTOVGGCGAAGCCTTCAGGGAAGGCCCCCCAGGTCAGAGCCGCCTCGGCCC 

GGCTGCCCCAGCACCTCCTAGGAAAACAGGGCCTGCAGCCGCCCAi3GTCCA^ 

GCGAGGAGTCAGACAGTGACAa^G^JGCACTGGa^GCC 

CCTGCCTCTACCATGG<3CATGG^GCCCTrGGGGAAAGG CG CCGGCCCJIGTGCCACCTGGGAAGGTGGGGCCTG CAACCCCCTCAGC 
CCAGGTGGGGAAGTGGGAGGAGGACTCAGAGAGCAGTAGTGAGGAGTCAT^
CCCCGGCTCAGGAAAAGTCCTTGGGGAACATCCTCCAGGCCA 
CCTGTAGCCGTCCAGGTOIAGGCTGAAAAGCCCATGGACA 
GGCACCAGCAGCCATGACTGCAGCTCAGGCAAAACCAGCTCTGAAAATTCCT 
CTGCATCTGCCAAGGTCGCCCCTGTGCGAGTGGGCACCCAAC^

CCAGCTGTGGCTGGGGGCACCCAGAGACCAGCAGAGGATTCTTCAAGC^ 

TGCAGTAACCGTGGGACAGGCAAACTCTGTGGGGAAAGGCCTC 

AAGGGACTGCTCCAGTACTCCCTGGGAAGACGXjGGCCTACAGTCACCCAGGTGAA 

GAGGAGGAATCAG ACAGTGAGGAAG CAGCTGCATCTC CAG CACAGGTGAAAAC CT CAGT AAAG AAAACCCAGGC C AAAG CCAACC C 
AGCTGCCGCCAGAGCACCTTCAGCAAAAGGGACAATTTCAG^

CTCCATCCAAGGTGAAGCCACCAGTGAGAAACCCCCAGAACAGTACCGTCTTGGCGAG 

AAGGCCGTGGCTACAGCAGCTCAGGCCCAGACAGGGCCAGAGGAGGACTCA 

GGCGGAGACGCTGGCTCAGGCX3AAGCCTTCAGGGAAGACCCACCAGAT 

AAGGGGCTGCCCCAACACCTCCTGGGAAGACAGGGCCTTCG^ 
GAATCAGACAGTGATGGGGAGGCACCGGCAGCTGTGACCTCT

TAGTCCAGCTGGCCCAGCTGCTACACCCGCACAAGCC^ 

GGAGCTCCTCCTCCGAGAGCGAGGATGAGGACGTGATCC^ 

ATGCCCACTGCCCACCCAAG AAT AGCCCCCAAAGCCAGCATGGCTGGGGC CAGCAGCAG CAAGGAGTCCAGTCGGAT ATCAGATGG 
CAAGAAACAGGAGGGACCAGCCACTCAGGTGTCAAAGAAGAACCCAGCT^ 

CCCAGAAAGCCAGTGAGGCTCAGCCTCCTGTTGCCAGGAC^

ACAAGTCCCCAGAGCACCTCCGTCCAGGCCAAAGGGACCAACAAGCTCAGA 
AGCCCCTGAGAGCTCAGATGACAGTGAGGACAGCAGCGACAGTTCTTCAGGG 

AGTCAGCCCACACGCTGGGTCCCACCCCCTCCAGGACAGAGACCCTGGTGGAGGAGACCGCAGCAGAGT 
GTGGCGCCATCCCAGTCTCTCCTCTCAGGTTATAT^ 
25 GCTAGATTCCAGCCCCTCAGTTTCCrCTACTCTG^ 

GGGAACCCCCAAGCCTCAACCCTGGTOCTGCAAAGCAACATCACCCAGTGCCT 
GGTGCAGGCCTCAGTGGTGAAGGTCCTGACTGAGCTGCT^ 

GG AAGGGCTGGG AG AGCCGCAAG CGGAAGCTATCGGGAGACCAG CCAGCTGCCAGGACCCCCAGGAGCAAGAAGAAGAAGAAG CTG 

GGGGCCGGGGAAGGTGGGGAGGCCTCTGTTTCCCCAGAAA

TGGTGATGTCAAGGAGAAGAAAGGGAAGGGGTCTCTTGGCTCCCAAGGGGCCAAGGA 

AAAAAAGAAAAGAAGAAGAAAGCAAAAAAGGCCTCAACCAAAGATTCTGAGT 
GACAGCAGAGCAGACTGTATGA 
Table 21

MOUSE NOMENCLATURE 
ICSGNM Pim1
Celera mCG21141 


HUMAN NOMENCLATURE 
HGNC PIM1 
Celera hCG33220 

MOUSE SEQUENCE - GENOMIC

T AGGGAACAC CAATTGAGAAAACACCT CCAT AAGATG CAGTTAT AAGACATTITCTITTTC'TT'I 1 CI"!"!"!' LTTT CT C CTGGAAATC 

ACHTTGTAGACCAAGCTGGCCTCAAACTCAGAAATCTGCCTGCCTCT 

CTGGGCAAGACATTTTCTTAATTAGTGATTGGTAGG^ 

TTCTGTAATTAAGCAAACTGAACACACCATGAAGAGCAAGCC^ 
1 5 AGGTTCCrrGCCCIGCTTGAGTTCCTGTCCTGACTTTTTGATAAACA 

AAGTTCTGGCCATGGTGGTTCATC CGGGCAATTGTAATGCTAACTCCGAGTCTTTCTCTGGGTT AAA^ CCTC 

AAACTTGCT ATGT ATCTGGAGACAAC CTTG AACCCCTGATCCTG CATGACAGACATG TGCC C AT AACCAG CTTATGTGGTGCTGGG 

ATTGGAATGAAGGCTTTGATGGTTCCACGCCTGCTAGGCAGTATTCTACCAACT^ 

2 0 CCCTGGAATGTGAGTGTATGCCCCCGCGCTACTGGACTTTCGCTGTITC^ 
GCCCTTTAAACTCCCTA6TCOTAGCTGQ6CGT66TGGCGCACGCCTITAA 
GGAAACGCCCTACATAGACCAGAATGCTCGACCCCTGCCCCGCACCCTGTC 
ACACACACACACACACACCCTTTTAAATTTTGACACAG 
GGGCCTGGACCTCrCAGAGGTCCGCTGCGCCrCrTCCrrc 

AAGTCCCTACCATTTTTAACATTTATTTTCAAAGGTCACT^

GAACATCT^ATTTTTGTAATTTTAGGGAGAGGTGTCrCCTCTAAACCC CTGGCCC^CAGGGCTCTAGAGTGGATG CCTTGAAGTAA 

CACATATGGTGTAATCTCTGCTTCTGAGCCCTCCTGGGGGAAAGGGCAGCCC 

CATGCCTGGGAAGTTGCCAGGTCATGTGGGATGGGCAGCTCTCrG 

AAAT ATCTTCTCCCGT CCAGGG C CTTGGCACTT ACCCT AAATGT ATCT CTTAC CATTGGTTGCAAGGTG ATGTGGGAGG CCAATTG 

3 0 TTTACCAAGTTTGGCCAGAGAGTCCCACTAGCTCTO^GCGATGCTTCTTGCAT 

GACAGTTCCTTTTAATTAGGGACTTGATTCTTCAATGTGAGAAATCTGGGGTGATGGCCACT 
TCTTAAGTTACCTTCTGACTGATCACAAGACCAGTCCAGGAGATGGAAA 

GCCATGATGGCCTGGG AGTCTTTAATCC CAGCACTCAGGAG ACAG AGGCAGGT AGGTCT CTGTCTCTATAAGG TCAA GGCTAGCCT 
ATAGTGAGTTCCAGGGTTGTAGTGAGACTTTATCTCAAAACATACAAACA^ 

3 5 ATTTGGGAGGCaGAGACAGGCGGATTTCTGAGTTCAAGGCCAGCCTGG 

AGAAACCCTGTCTGGAAAAAAGAAAAACAAAACAAAACCGAAACAAACAAACAAAAAACT^ 
AAAATAAGTAAAAAACAAAAACAAAAAACAAACTCACCACAGAA 

TTACTTGAGGTGCCACAGCGAACAGGTCGGTT AATTAC CIAGGCTAAGTCACCATGT CCOVGGTACCTGTGATGTCCTTGGCAATTA 
CAGAATTGTGTCTAGGAGGGAGGGAGCTATTATTAGCCTCAAATTACAGGTAAGGAA^ 

4 0 CAACCTCCAAATGGCTGAGTCAGGACAGAGGTAAAGGCAGCACACATGGGAAACCTG^ 

ACAACTATGACTTACCCCGAGTCIATCrCAAAAGGAAAGGACCTTGTCCTG CCATAG CCTTGATGGGGCCTGGATCAGAGTCAGGTT 

TGGGTTTTGTTTTGTTATTTTTCTCTCTAAGGCCTTGCGTC 

TTAAAAAAAAAAAAAAGTATTTTCAACTGAACAGGCAGGGX3 

ATCTCCCAGTTCAAGACTAGCCTGGTCGACAGAGCGAGTTCTAG^ 
4 5 TTTCTAGCCTGGTCTACaGAGCGAGTTCTAAGACAGCCAGGGC^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNW 
55 HNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN 

NNIONNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNN1*NNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNjjNNNNl^NNNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNHNNNN^^ 
NIOWJNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 
5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNMWNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNKACCTTCTTAGACAAAA 
1 0 CCTTGAACTCACAGAGATCTAAGAGTCTATGCCrCTCTCTGCn^ 

TATCAGCCTGCCCTATGCTCTATTAGTGCTTCATCATGGAGGCAGATCTCAGCCCTC 

AACAGACAGCAACACCCCAGGCATCTCCCAGGAATGAATGATCGCTTGCATCTA 

TTTTCTAGGTATGCTAGCAATTAAGAGAATTAGAGAAAAATATGCACATATGGGCCATATC 
TATGTGCCGTAGAAAAGGATTAATTAACACACAGCTAA

AGTGACAGGGGAGCCTAAGGAAGGAGGCCTGCATGCAGTAGCrT 

GAATCAGGTGAAACCCTGTTTCAACTGCTCACCCTCCGAAAAGCTAGATTTTATG 

TCTCATGTTGTTCAGGCTGTGGTTGAACTCCCTTTGTAGCCCTAGATAATC1"!" 

AGACAAGGTTTCTCTGTGTAGCCCTrGGCTGTCCTGGAACTCACTT^ 
2 0 CTGTCCCCCAAGTGCTGGGCTTAAAGGCGTGCACCACTGCTGTTCAGCTAGCCAAAGATAA^ 

CACCT CTG AGG C CTGGGGTGACACTCAGGTTC CACC CTGCCTGG CAT AG AT ATTATT CTTCATTGG ATCACACCTCACAC CTT CTT 

CACTGGGCCTCTGCTTTAGGAAACTGGTTATCTTTTTATTCATT 

GACTirGGAAATGTTATGGAAGAGCTCTCTGCCACTGAGATCGGCCCCCAGG 

TCTCTGTCTGTCTCTGTCTCTGTCTCTGTCTCTCTCTCTGTCTCT 

CTATTGAACTCAGATTCACCTGCCTCTGCCTCCGGAGTGCTGGGATCGAA

AAGAAGGCAAAATTTTGCTT CAAAAATTTGTTTTTGGTG CCAAGAAGTG AACT AGG CAAG ATCTCTCAC CAAGCT ACACAC CC AG C 
CCCGGAATTTGCTTGTTTTATTTTACTTTTTTGA 

CCTTGTCCTCACTTCCTTCCTITTTCAGTCATCGGAGTCCATGGGGTTAGAATC^ 
TTTATGAGACTGATCAATTACGTTTGTACCAGTGAGGCACGTATGTTGTGGCTGTGTrc 

3 0 TCATT ATAATTT ATCAAGACGGTGACTGGGACAG AGGGAGAG CCAGGGAG CAACCACATCGTCTCX3TGTGTGTGGCCT CAGATCAC 

ATTTGATTCTTTTTTTAGAGTTGGCAACACAGT^CAGC^ 

GACCTGTGTGGTGGAAGGAGAGAGCTGACCTTCTGCAAGCTGCCCTTTGACCTTTATGC^ 

CTTGCTGGCTTTTTCCTATTTACTAATTGGTTATTTATATTTACATA 

3 5 ACACACACACACACACACACTCTTGGTGCTGGAATTATAAGCACCAG^ 

AACTCAGGCTCTGATTCTTGCTTCGACACCACT 
GTTTTCCAGACTGGTCTCGAAATCAAGATCCTCTAGGGTCAATATC 

TT AAAT AATGTGG CGGTTT AAAT AG T AAAT ATGGGCCGGGTGGTTGGG CACG CCTTTG AACTGAAC ACTTG^ 
TGGAATTCTGCG AGTGTGAGAC CAGACTCGTCT AGAGAGTGAGTTCCAGAATAGC CAGGGCTAC ACAGAGAAACCCTG 
AAAGGAAAAACAAAACAACAACAACAACAACAACAACAACAAACTAGTGGC

GTTTGG AAAG AGAC AGG ATG AACTCAGGACT AGCTGTTT GC CAG CTCGG CTTC CATT CATAAG AAGGG TTTC AGG TGCGG CTCTGA 

CCrATAAAAGGATGACCTTCrcACCTAGTTAGTGTGTGAGTGTCTTTTTCTGG 

GCCCCATTTCTCCCGCTTCAGCGTGTCCCAOSGTGCCCCAGCCAAGTGGT^ 

AGAAAGGTGATACCAGTGCTGTGTGTGGGCAGCACAGCTCGCTAAGTTCCCGGGATCAGCCAGTT^ 

CTCTCACTTCTGAGCACCTGGCACATACTGTTGAGGTCT

GGTTTGAATCCCATCTCAAAAATGTAGCAAAGCTAAGACTCCAGTCTAA 
GAGAGACAACTCAGCAGTTAAGAGCACTGGCAGCTCTTCGGAGGACCCACGTT^ 
ATATGTAACTCCAGCTCCAGTCTCAGAGGACCTGGCGCCCTCCTGTGGCCTTCACAGGCACT 
TACATACATACATACATACATACATACATAGCACGCAGACACAGAAATTAAATTAAATGGAGCT 

CAGAAAGATCCCTTTAAGATGGAAGTGTCTATCTCCGAGCTAGAGTGAGTTTTAAGCCATG

AACAAATAACATCAACCAGAGCAATAACAATAATAACGAGCATAGAGAGAAAACAAA 
TAGGCAGAAGANNNNNNNNNNNNNNNNNNNNCCTGGGCGCG 

TGAGTGTCCCGTGCCTCTTTCCTAGGCAAAGAGAAGGAGCCCCCTGGAGTCGCAGTACCAGGTGGGCC 
GCTTCGGCTCGGTCTACTCTGGCATCCGCGTCGCCGACAACTTGCCGGTO 

GATGCTTTGAGCAGGAGGGTCTGATGGAGACTCGGGGGTCTTCTTC

ATTGGGGAGAACTGGTGAGTGAGCCGGAGCCAGAGACCCTCTCATGGGTGGGTGGGTGGCGACGG 
AACCCTCGCCCTCACTCTGCAGCCCAATGGCACCCGAGTGCCCATGGAAGTGGTCCTGTTGAAGAAGGTGAGCT 
GCGTCATTAGACTTCTGGACTGGTTCGAGAGGCCCGATAG 
GACTTTATCACCGAACGAGCAGCCCTACAGGAGGACCTGGCCCGAG 

CAACTGCGGGGTTCTCCACCGCGACATCAAGGACGAGAACATCTTAATCGACCTGAGCCGC

GGTCGGGGGCGCTGCTCAAGGACACAGTCTACACGGACT^ 

CGAGCTTCCCGGCATAGTACAGCCGGGGATTTCGAGCTGGTTGAACGAACCTGCAACATTTCT 
GTTGTTGTTGTTCTTTTTGTTGTTGCTGCTGGTGTTTGCTT 

TGAATAAGAGGTCGTGGGTAGCCGGCTGTAGCCCAGATTTCTAAAGTATGACCCACTTTTCCCT

GTGGCGGGAGACATTGGTGCTGATTCTTTTCTTTTTC 
GTTCTTAGAACTGCAGGGTTTTGTTTTGTTTTGTTTTGCTTT^ 
TGTTGTGGGGTGAGGAGGCCTGGAATGAATGTTGTGAAATAAGATCTC 
GGATGAAAACCTATCTACCATAGGGAGGGGCTTGGATCGCGTTCTTTCAGCAGTGGAGGGC^ 

GTGGGTCACGATCCGAGTGGCTCACCCGGACAAACTATGGCCTTTCAAAACTTGTGACT

GGATGCCGGGCTG<3GCTGCGTTTTTTGTATTTAA(^GTTAATG<3CTGGCTGAG 
TTTGCTGCCCCCTGTTGAGCACCACGAGGCTAGCACCrrTT^ 
ACGTCXSTCTGAAACCAGCCCTGCAACTGTACCTTTCCAGTCT^ 
CCGGAGGAAAGCACACCCTGAGTTAGTTGTGTA^ 
GGGTCCTTGTAGTTAAAGGCTAACCCACTGAGTGGGGCCTCTAGACTTGAG






G TAAGGG ATTTGGGTTT AGGGTTTTTGGG CT CCCAGAGAG CTAC CAT AT CATTTCTTCATTTT CTGT CTGG ACCAGAGAGG CAGAG 

ATGGAGAGAAAAAAAAAATATGGTTGTTTTAAAGTCAAATCTG 

AAACCCGAGCCAC^CAGTGTCTTTATTAAAAAATATTTGAAAA 

AGGTAGAGGTAGGCAGATCTCCGAGTTCAAGGCCAGCCTGGCCTACAGAGTCACTTCCAGAG 
AAAAAAAAAATTAAAAGAAGCGGCTCAGACTAGCTGCT

TGGTAGGGACCCGAGTGTACAGTCCTCC^GAGTGGATTCGCTACCATCGCTACCACGGCAGGTCG^ 

ATCCTGCTCTATGACATGGTCTGCGGAGATATTCCGTTTGAGCACGATC 

TGTCTCTTCAGGTAACTTCCCTCAGACACCCCCAATCCTTCCCAAACCCT 

AGTATTCCTATCCACCTTTTTATTTTTATTTATTCATTITC^ 
TGGGACTCAAGAGATGGTGGGTTTTCACTGGCT

CAACCCCCAGACCCTGTAAAATCTAAGTACCAGCAGAGACCCAGTCTCTAA 

COX1ATAGTGTCAGTGGAAGGCGTGAGGTAGGTAGGTGAATTG 

CATCCTTCCTATTTCTAGTGAGTGGGTGTCATGGGAAAGGCCCGAGCTATTGAAGTC 

TTTTTTAACTGAAAGCTGATTGCTGGAGGCTTTCGGTATCCGTTCTTT^ 
CTTTAAAAACCCCAGACTTGTAGGTCTGGGGAAGAAGACCTTTGCATTACAAAAACA

CCGCCTTCTGCAGAGTGTCAGCACCTTATTAAATGGTGCCTGTCCCTGAGA^ 

CCATCCGTGGATGCAGGGTGACCTCCTGCCCCAGGCAGCTTCTGAGATCCATCTC 

AGCCTTTCTGCTGCTGTCTGTCAGACCCTCCAGGGAGGGAGAGCT^ 

GACAGCAATGACAACTCATTCCAGACTCCAGGTCCCTGGAGGAGC 

GGGATAAAAACTAGCACCATATTTAAGTCCCTGTCACCTCTTCCAACT 
GAAATACTTGAACTTTACCTCTTACCTGCTGCTTCTCAAACTCTGCCT 

CCACTTCCT ATGAAAGGTGCCATGGGAGAGGCT ACAGGG CCAACTG CTG AGCCACCTGCCCCTTTTCTGTCTCCTTTAGTAAAATT 

2 5 CCCAAGCAAACTGGT CTTC C l'lX rTl GTTTCGACTT AACTGTTT CAAAAC GAAGACCTCACACAGAACGCACAACCAATG CAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAACCAG ACAAG CT GT AAATGTGTGTATAGTTCC GG CATGGT AGC CTG CAAAAGGACTGTAGATG 

ATAACAATTTTAAAGAAGAAAAAAAAATGCCTTTAAGTTATTTTATGTGTTGGG^ 

TTCITTTTTATTCTITTT T GCrrTGTTTTTGAAA 

G TTGCCTT C CTGT CCAAGCTTCCACTGACTGTTGC CC CCGGTTT ATTGTCATGTT CT GC CTTTTT riT IT u m CCTT CCTCCTG A 

3 0 CTTGGGGACCTTTTAGGGGAGGGCTGTGACGCTTGCTGGGGTGTTGGGGCTCCC^ 

GGCCCCCTTGGCrGCTTGCTGGAGTGGGTCTAGGGTCTGCrGACTGTACATAA CGGATCCT 
CTCTGGGGCTGTGCTTTGAGCAGCX5TX3CAGCCCGCrGGTTTTATCT 
TTTTTTTTTTTATACTTGGCATTTTTTGAATAAA^ 
NNNNNNNNNNNNCTGACGTGTCATGTCAC^^ 
GTAGCACTTGAGAGTTTGATTTGGTTTTTGTTTTAAAGATTTTATGTATG

GGTCCCATCACAGATGGCTGTGAGCCACCATGGGGTTGCTGGGAATTGAACTC^ 
CCATTGAGCCIATCTCTCCAGCACCCTGGTTCTTGGTACCTCATAGCAGG^ 
TGAGGCTGGCirCAAGCTCAGACCTACCTGCCCGGCTTGGTGAGTGACCA^ 
TGTAGCCrrcGAACTTCATATGTATAGACCAGGCTGGCCTCAAACrCCT 

4 0 AGGGCATGTATAGTTCCACTC^3AAAAGATTTGGGAGCCACGAGTCCTAGCATCTTCCTAGCAGA 

GCACTTGGGAACTAAGAGGATGAGAGTCAGGAGTTCAAGGTCAGTCTCAGTCTGGGATACTCGTGACT 

GATGCACTCTGGTAGACAGGACC^AAGGAATTCCTATACATTCAAGGCTGAGCCTGGGCTGGAGTTCCA 

TAAAAGGGGATGAGGATGGGCAGGGGAAGTGGAGAGATGGCTC^GCAGTTAAGAATTCTTGCAGAG 

CACCCACATGGTGGCTCACAACCATCTGTAACTT CAGTTCCAG AGGATC CAATGCG CTCTTCTGGCCTCCAAGGGCACTAAG CGTG 

4 5 CATTC^TACAAAT ACACRGATAACACACCTATTAAGAAAACATTTCAGC CGGGCAGTGGTGGCGCATG CCTTT AATCCCAGCACTC 

GGGAGGCAGAGGCAGGCAGATTTCTGAGTTCAAGGCCAGCCT 

ACCCTGTCTCAAAAAACCAAACAAGAAAGAAAGAAAGAAGAAAGGAATAA 

TCTTGCAGGGGACCCAGAATTGGTTCCCAGCACCTGTATGACCT 

TGGTCTGAGTATGTCCTAATAGTTGAACCACTGCTGATCCAGGTCCTAAAAGTAATTCTTTCT 

5 0 TTT Crn 'L M l ' lTCTl T Cri TLTTTCTrrCrTCCTTCCrTC(nTCCITCCT 

AGGCAATTTCTCCTGTAGCCTAGGCTTAAATAGCCCAGGAGAG CCTTGAACTCC CGAT CTGTATCTAGGATCTGGG^CTTGCTGCC 
AC CACAC CCAC^TGTAAACATTG CTTTCTTGTTG CTTC CTG CC CATT AG TAACAGTT CT AACATTCTT CC CT C CTG CAGGGG CATC 
TTGACCTCTTCCTCCCCTCTTGCCATAATCTCnTAGGAACTrGGCCCCATT 
CTCCCTTCATCGAGAGATCTCCTCTACCACCACATCTTGGACTAGGTCCCAAACCCC^ 

AGGAAGGATTTCTCAGCTCTTGGGGGCTGATTTTCT

CACTGCCCGGCAGCTGGGTGTCTAAGCACACTGTGGGACCTTTCCAGTCAAGT^ 
TATTTATTTATTTCATGTATGTGAGTACACTGTAGCTGTCTTCAGACACACCAGAAGGGTGC^ 
TGAGCCACCATGTGGTTGCTAGGAATTGAAATCCCCTGCTAGAAATGAAATCCGGCAGAGCCTTGCCCTTCTTGC 
GTTGGGGGGGGCCTTTCTGTGGCTGTTGAGACTGCTGGGGGCCATACAGTGTG 

6 0 AGGGCTCAGTGGTTAAGAGCACCGACTGCTTTTCCAGAGGTCCTGAGTTCAAT^ 

GTAATGGGATCTGATGCCCTTTTCTGGTGTGTCTGAAAATAGCAACAGTGTACTCATATACATGAAGTAAATAAA 
AAAAAAAGAAAGAAAGAAAGTAGGGTCCTCGCTAAGTAGAAGAGGTCAGAGGTTGTAGGAGATGCTT^ 
TTTGTTTTTTGTTCTTTATG^AAACCAGGTTGGTC^ 
AGGTCTGTGCCACCACACCCTGCATTATTTATTTTTTATTTTATTT^ 

6 5 TCCTGCATTCTAAGCTAGACTTGAACCCTGGCATGATAGGTGTGTGTGAGCTGAAACTAGGC^ 

TTGATCTCTCTCTCTCTCTCrCT^ 

CTGGAACTCACTTTGTAGATCAGGCTC^SCCrrCGAACTCAGAAATCT 

G CCACCACCGCCCGG CGTGX3TTCTCACTTCTAAGGAGTAAG AAG AGGGTTGTGAT 

CATTAGTTCCCATCCAAGAGATGGAGACAGATCTGTTTATGTCCCTGTTGATT^ 

7 0 TGGACAGACCTGTGTTTCCAGGAGGCAGAGATGGGGATGGATGTCCTTGGCGGAAGAGATTGGGAAATC^ 

AGAAAACAACTTGNNNNNNNNN^ 
NNNNNNNNJINNNNNNNNNNNNN^^ 

75 NNNNN1 nqNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNOT

NNNNNNNNNNIWNMNNNNNNN^^ 
5 nnnnnnnnnnnnnnnnNI^ 

NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNIWNI^^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
1 0 TTCTTGGAAACCCTCACCACAAGGACCTATCGAGAGCAGTGTT^ 

TCATTTCCGCAATGGCCTGTGCTACTAACCCCGATTGCCAACAGC 

AAACGAAGCZAGAACTCAGGCCAAGATGAGGAGACCCCGTATTATTTTAATTTAC^ 

GTGACCTTAGGTCTGGGATACAGTACAAGACCGGCTGTTATTGAATTGTATCCTGTTGTTO 

AAGTTGGTGGTGTGGCTCAGTGGGTAGAACGCTTACCAGGCATGCACGAAGCC 
1 5 TTTTTT TACTTATTATATGTAAGTACACTGTAGCTGTCT 

AGCCACCATGTGGTTGCTGGGATTTTGAACTCAGGACCTCCX3G 

CCCTGGGTAGGGTTTGTAATCCCTGTGCTGGGGAGGTAGAAGCAGGAGGCCCAGATGTTCAAGGTCATG 
GCCACCCTGGACTACAGGCAGGACAGTAACAGTCACGTCTATGTGACCCGATGGTTTTCT^ 
AGACCCTGCTGTCCTGAATGCCAGGTGTAGCTTTTTTTTTACCT^ 
2 0 ATCCCAGCACTTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTTGAGGCCAGCCTGGTCT 
CTGCACAGAGTAACCCTGTTTTGAAAACAACAACAACAAAGAAATA 
AGATTTATTTTCAGTTTGTAATTATTTGTGCACTTGTGTGTGTCrGTGCGG 

AATCAGAAGGGGACACTGGACCCTCTGG AATTGGAGTTG CAGGCAGTTGTG AG CTGTCAGATCTGAATGGTGGGAACCAAACTTGG 
ACCTTCTGTAAGACTGGTATGTATCTTAAGCCCT^ 

2 5 CATCAGCACCACCATCATCATCACCATCTCTGGCTAGCCT 

TG ACTGTCTTTGAACT CACTCTGT AGACCAAGCTGGACTCAAATAGTGCTGGGGACAGAACC C AGGGTTCCTGCAA TG 
ACCCTCTTATCAACTGAGCCACATTGCCGCCAGCCCATGAGTTTTGAATAAACAATTAATT^ 
GTATATAAAATACCCTTTTAATCTAACTCCCTGGAACCATGAGGGCCCACATGAAATCAGGGGCAG 
GACCCTTCCTTGACTTGGAGACATATTGTCACAGTTGAATTCCGCTCCAACTTTGGTCTTCACG 

3 0 TCTGAGTTTTG T TTTTTT 1'CTTCTCATCCACGACAGAGAAATGGTCCGGAGTTTGTGAAGCCTGTGCCACCCAGCCAACGGTCATT 

GGTACAAATGAGGCACAGCTTAAACAGGACACCGTCGAAAGGAGATTTTAGAATGAGCTAGTCTGA^ 
GTGGGGTCCAGAGATAGACTGAAAAGGGCGCCAGGAACTTCAGGGGAGGGAGTGCGGACTTGTTTTTC^ 
TCGGCTGTGAAAGGCTGTCAGAAACACACCCTACTGAGCTAATGCTTAAAATGC^TGCAGTG 
GGTAAAATGACTTAAAAAAAAAAAAGTTAATTTCTTTTCCTTTTCCCC^TCA^ 

3 5 aaGTCTCCTTTCCGGTTAGATTATTCATGTGAGTGCGAACATTCTCCCCGTGGATTGATATAAATAAATC 

CCCA.CCTTACTTTCTGACTGTAGTGCAAAATTAGAGAGATGTGTCATCAGGGGGTGGAGGGGGCCG 
CGGAGATAAACTTAAACGCCGCATTTTTTTTTTTCTTCAAAGTCA^ 

CTGGCGAAGCACTAAAGCAAAGTGATGGTTGCTCGCGGTGGCGGAACATGTCTGGGAACCTCGCACCTTGGGAG 
AGGTCAGGCAG CCAACACTTGGTGAGTTC CACGCCAGCCTGGG CTATGTGAGACCCAGTTGCTAAACAGCAAAGCGAGATGGTGCG 
GTGCGGCACAGGCTGACAGCTCTGCAGTAGCCAAAGGAATCACGTGATCCATCTG

CCCTTTGGGGTGAAATGTACACATGTATCTTTGAAGGAATTTTTTTCTTTAATATCT 
GGCTTTTATTGTCTCCACACCTAGTCAAAGAGAAACTAGACATTTTAATCTATT^ 

ATG AG AACGG ATTTT G CTG T AG ATGT CT ATCCATCTCTGTG TCTTTGGGTTTTTT AAG ATTT ACTTT ATT ATAT AT ACAC^ 
CTGCCTGCATGTATGCCTGCAGGCCAGAAGAGGGCGGCAGATCACCTTATA 

4 5 GAACTCAGGACCACTGX3AAGAGCAGCCAGAGCGCTTAACCTCTGAGACAT 

AGGTTAGGAGTAGTGCAAGGTCAGAGCTCAAGGTCCTAAGCAGGTGGTGCACACTTGTCAGCCTAACTCTAGGGAAAA 
GAGAATCAGGAGTTCAAGGTTAACCTTGGCTATGTAACAAGTCTGCAGCCAGTTT 
AAGCTGATAAGCACACTTGCTGCTCATCCAGAGGACCCAAGTTCAAGTCCCAGCACCCACACAA^ 
AATTATTTACTATCTGTTTGTGCTATTGGGATTrAGAACTGAGCTAGGCAAGTC 

5 0 CI^ACATATCCAGGTGAGCACGTGGAGGTCAAAGGACAACTGGAGAATTGCT 

TCTCAAGTTCAATTCCCTCAC CCATATAGTGGCTCACAACC ATTTGTAATGTGAGCCAGGTCTCTCCTGG CTCAGGGGTGCAGGGT 

GCAGGAGGTCTAACTTAGCCCTCrCGTTTACITTGTGCCACAGGGTCAGTTGTCCT 

TTTTGATCACCTGGGATTCCTTCAGGCTGGGTTCTCTGCTCT 

TCAGGGGAGTGGGAGTCATTTCTTCTAATCATTTTGGTCCCGGGGATGTAATTCAGGTCCT 
CTCTCTCTTTGAGACAAGGTATCCCTGGGTGATGTGAAACTTG 

CTTAGCTTCCCAGTG CTGGGGTTCAAGATGT ACAAGGCCTGG CTGG AG AG ATGGCTCAGTGGTTAAAAGCACTGACTGCTCTCCCA 
GAGGTCCTGAGTTCAAGTC CC AG CAACCACATGGTGGCTCACAACCATCTGTAATGGG CTCTG ATGCCCTCTTCTGGTGTGTCTGA 
AG ACAGCTACAGTGT AC T CAT AT 


MOUSE SEQUENCE - mRNA 

TCTAGAGTCGACCTGCAGCCCAAGCTTCCTTAATGGAATCTTCTGGTTTATTTATTTCTCACOT 
TCACTTAATATCCTGGTTGCTGTTGTTGTTGTTGTTTTGTCTG 

GCATGCTAGATGACTATCTGATGATTGAGCTACATCCTCAGCGATAAGGCTTTTCCTTT^ 

6 5 T cTGGCTTCACTATACTGCTCAGTTTGGCAGTCCTCCTGTTTCAGCOT 

GGTTCATGAATTGCTGTTAAGTGTTCTTTACGTTGGGTCATCTCCCATCAGAAACG 
TGGGCAAGAGGGAAGGGAGAAGCAAACGATCGATGAATTTGAGAACTGTCCTGGCTTTGGCATGC^ 

AAG CAGAAAAAGG AAACAAAG CAGG T AGACAG T AT AAATCAACG AAG AG AT AG CAT ACAG AAAGT CTTCTTGTTTT AAGGCAGGAT 
CTTTTGTGTCTCTGACTGACTTTGAACTTTGACCTGACTCTAC 

GGATCAAACCTAGGATTTTCTTCTGTATGCCAAGCAA

TTGTAAGAACCCCACATGACGATCAGATGGGCGGCAGTGAACGAAGGCTTAGA 

GCCTTGTTCAGAACGCAAGGCTTTGAATAAGCGACAGGACTGGAGTGTATCT 

ATTTCCCAACATCAGACACATTCCGAGAAGAGGACAGGGCACTTTGGA 

ctgctcccagattgaaaacaagtggaaat^ 
7 5 gt^aacacatatttXtttat^ 
CCACTCTAGGCATACCTGTTTTCTGATTGCCTGCT

CCCTCAGGAGCAGAATTTGGGCCATTCTCTCTGCTGTCGTATGACCCTGTGTCA 
AAAGGAAGTAATTCTCCTTCCCCTGGGTTTTTCTCCTTTCGTCATCAGAAATGAATG 
AATATGTAATGTTTTGTAAGAGGCATCCATAAAAGTTATGGAGTCTCTCGGC^ 
AGCATTTAAACGCGTGCGCTAATGACTACATCCACAACAGGCTCGTTCCT
AAATAACGAACAAGGGGTGTAGCTGGCTGTCCATGACTCTG^ 

TTCCACGTGOlTGTGaSGACTTTCCTTTGAGACAAGGCTGGGG 

GCTGATGAGAAGGTAACTACCX3CAACAACCTCTGATTAAACCCAGCGGCACCCTCCCA 
CTCCTCCCGCCCTCTCAGAAGGGGCTGAGGCAGCATTTGGCAAGAGAAAGCT
CGGACATGGACCGAGGGGATCAGCTACCCCGGGTCCCTrCTAATTCTCTCTC 
TCGCATCCCAGGTCTCCCCCACTTTTCCTCGCAGGCCTGTGACTGGCCTCG^ 
CCTTCGCCCCCAAAAACACAATCATCACCCCCAGGATCTAGCCCACATAGCATCGTCA^ 
GGGCTGAGTCACACCAGAGGACCACCCTGGTCGGCATCCACACACACGCGGT 
GCCCTTCCATTTCTCACAACTTGCTACGCTCCAGCTCCAATCCTCGACCCCCCT
GGCGGAGCTGGCCTCGGGCGGCCTTTGATGGCTTTGTTATTC 

CCGCGCGCGCCACCCAGCTCCCGCCTCCCTGCACGCCCAGGCCCCTCCCCCTCCGTTTTGGC^ 



TCCGCCCTCCGCGCCGAGCCAGCCCGGCTCC^

GCGGCGCGTCCGCCCCTTTACrrCCTGGCGGCGGGGCGAGCCGGGCGTCTGCAGC^ 
AGTCGGCAGTGCCGGCGGCGGGATCGGCAGTAGGAGCAGCAGCAGCAGTCACAGTC^ 
CGCCCTGCCGCCTCCCGCACTGCCTGACCCAGCCGGCGAA^ 
CGCCCTCGGCCTTGCGCGCCAGCCGCAGCC^GAGC^^ 

TCCGCCGCCGCTGCTGCTCGTCCCGGCACCGGCX3CCGCACCGACACCCTGGAGGTGGGGATGCTCCTG

GCCCACCTGCGCGCCCGCCCCTGCAACGACCTGCACGCC^CCAAGCrGGCGCCGGG 
GGTCGTTGGGACGCCGGGAGACCCCCGCGTGGGTCGAGCGCGCTGAGTG 

TGGAGTCGCAGTACCAGGTGGGCCCGCTGTTGGGCAGCGGTGGCTTCGGCTCGGTCTACTCTGG 
CCGGTGAGTGGGCAACGCCGCCTCGGGGAGGGCGCGCGGAGGGATGCTTTGAGCAGGAGGGTCTGATGGAGA 

TCCAGGTGGCCATTAAGCACGTGGAGAAGGACCGGATTTCCGATTGGGGAGAACTGGTGAGTGAGCCGGAGCCA

TGGGTGGGTGGGTGGCGACGGGGGTGCCGGAACCGGCGCGCTAACCCTCGCCCTCACrrCTGCAGC 
TGGAAGTGGTCCTGTTGAAGAAGGTGAGCTCGGACTTCTCGGGCGTCATTAGACTTCTG 

GTGCTGATCCTGGAGAGG C CCGAACCGGTGCAAGACCTCTTCGACTTTATCACCGAACG AGGAGCCCTACAGGAGGACCTGGCCCG 
AGGATTCTTCTGGCAGGTGCTGGAGGCCGTGCGGCATTGCCACAACTGCGGGGTTCTC 

TAATCGACCTGAGCOSCGGCGAAATCAAACTCATCGACTTCGGGTCGGGGGCGCTGCrrCAA

GGTGAGCCAGGCCCCGGGGAGGGAGCTGCCCAAGGTGCCGGCCTGGAAGCCT 

ATTGGGCCGCCTGGCTCAGAGCTGGCAGGGGTGGGGCAGAGGCGAGCTTCC CGG CAT AGTACAGCCG^GGATTTCGAGCTGGTTGA 

ACGAACCTGCAACATTTCTGGCGTGATTGGGGTTTTGTTG^ 

CTTTTCAGCAGGAATTTATTAAATTCCATGCTrrTCrc 

ATGACCCACTTTTCCCTCCAGTAGGAGGTGGAGCCGTTGGTGGCGGGAGACATTGGTGCT

TlT r'lT TT T l'i 1 1 T T TTGTAGTTTTGTTTTGTGGGGGAGTTTTAAGGTAGTTCTTAGAACTGCAGGGT 

GCTTTTGAGGGTAAAGGAATAAGAGCCGCGGTTTCACCCCACTTGGGGGTGTTGTGGGGTGAGGAGG 

AATAAGATCTGTTGCTGGTTTGAAAATTAGTTGGGAGTCTCCGCTGGGAGGA 

CGCGTTCTTTCAGCAGTGGAGGGCAGGAATAGTAGATCAAAGCGCGAGGGTGGGTCACGATCCGAGTGGCT 

TGGCCTTTCAAAACTTGTGACrrCTGGCTT

TTAATGGCTGX3CTGAGTCCTCCCCTGTGTTTTCCCCCCCTGGGCAAGTCTTTGCT 
Tn T Tl ' TTTl T lTTriT rTTTTACT^ 

AGTCTAGGGAGTGGGAGAGTGCAGTGTCTATGCCTTGCTTGCATAGAACCCGGAGGAA^ 
GGCCCTTACCTGGGCCCGGTCTAGATGAGACTGGTGTGGCAGGGCCTGGGGTCCTTGTAGTTA 

CTAGACTTGAGCTTTTCGGTGGCTTTATTTGTCTrTGGGCCG

CTACCATATCATTTCTTCATTTTCTGTCTGGACCAGAGAGGCAGAGATGGAGAGAAAAA 
TGCCTGTCTGCCTCCTGAGAGCTGGGATTAAAGGCATGTGCCACCAAACCCGAGCCAGGCAGTGTCTTTA 
AACCTAGTGGAGGTGGTGGTTCTTGCTTTTAATCCAGCCCTTGGGAGGTAGAGGT^ 
CCTACAGAGTCACTTCOVGAGCTACACAGAGAAACTTTGGAGGAGAAAAAAAAAATTAAAAG 

TTAACTTTTTTTTTGCTAATGCTGGTTT

TACCATCGCTACCACGGCAGGTCGGCAGCTGTCTGGTCCCnTGGGATCCTGCTCT 
GCACGATGAAGAGATCATCAAGGGCCAAGTGTTTCTTCA 

C CAAACCCTGGTT CAGGGTT GGTTT GGGGATT ACAAG AGTG AG AG AGTATTCCT AT CCAC CTTTTT ATTT TT ATTT ATT CATTTTC 
TTTGGGGGAGAGTAAGGCCTTAGCCTGCCTCTGACCCCCAACTGCTGGGACTCAAGAGATG^ 

TCCATTTGGGCTCAAATTGGAACAAGTTTTATATACATCCCACCCCAAC

CAGTCTCTAAACAAAGTTTAAAAAAAAAAAAAAAAAATGAAGGTTCCGGATAGTGTCAGTGGAAGGCGTGAGGTAGGTAGGTGA^ 
TGAATCACCTCATCITGCTCACTGGTGGCTCATCAAAACCCTTGTCATCCT 

C CG AG CT ATTG AAG T CTG AAACCACAGGTTT AAG AGGGG AGG CACTTTTTT AACTG AAAG CTG ATT G CTGGAGG CTTTCGGT AT CC 
GTTCTTTCCCACCGGGCAGTGTTTGGTATAAAAAGTACCGTGCCTCTTTAAAAACCCCAGACTT^ 

TTTGCATTACAAAAACAAGTTGAGTCATCTTTAACCTCTGTCTG

TGTCCCTGAGACCGTCAGATCGGCCCTCCTTTGAAGAAATCCGGAACCATCCGTGGATGCAGGGTGACCTCCT 
TCTGAGATCCATCTGCACAGTCTGT(^CCGGGATCCAGCAAGTAGCAGCCT 
GAGCTTGTCTTCTGGCCTCCAACACK3ACCCTGCGTGACGATGCAGGGACAGCAATGACAACT 
GGAGCCTCCCACAAGGGAAAGAGACTACTTCACTGGTCCTC 

G cTGGGCTCCCCTGAATCCATCCCrGGTTa3GGGTGGGAGTGGATCAGAGCCCT

GTCACCAAGATGGGCCGGGGGGGAGAGGAAAACAGTCTGGGGGGTGGGATAAAAACTAGCAC 
TCCAACTCCTG AGTG CCTTCTGTGGGGACTCCG CCTGTGCTGGG AGAAATACTTGA^ 
TCTGCCTGGGTTTTGCTTCCTTCCTCTCCTCT 

a & rrnrrn ar^cc ACCTGCCCCTTTTCTGTCTCCTTTAGT AAAATT CCCAAGCJ^ACrGGTCTTCCTCTrTGGTTTCGACTTAACTG 
mCAAAACCAAGACCTCACACAGAACGCACAACCAATGCAAAACAAA
atgtgtgtatagttccggcatggtagcctgcaaaaggactgtagatgataacaattt^

ATTTTATGTGTTGGaxrriTGGTTTQG rm^ 1 1 1' 1 ATTC TT IT 1T GCTTGTTTTTGAAAGATG 

GCACGTTCTAGCCCGGAAGTTTTAATGTATTTATTTATTTATTTATTTG^ 

CCGGTTTATTGTCATGTTCTGC LTTTTl ' l 1 1 1 1 CTT i'CCTTCCTCCTGACTTGGGGACCTTTTAGGGGAGGGCTGTGACGCTTGCT 
GGGGTGTTGGGGCTCCGGTGGGACAGCTCCTGTAGCCCCCrrCTCCAGGGGCCC

TGCTGACTGTACATAAGACAGTTCCGGTGTGGTGTGCCTTCCGGATCCTCTCTGGGGCTGTGC^ 

GTTTTATCTGAGTAAAGTATACTGTACAGGGGAATAAAAGAGATCT^ 

CTTTTGTCTTAAACCTTTGAGTTTTCCATT 

ACTCTACCAAGGAAACTCCCTGACTATGACCAATTCCCTTTAACAAAAGT^

AATTACAGCACCAGGAACATGGGGACAAGACTTGGCTAGGACTAAATTTTTCT 

GAGTTCCATCAGACAATGTGGTTGCAGGAATTACTACAGGTAGCATTTC 

ACACTGTAGCTGTCATCAGAACAGGGCATCAGGTCCCATCACAGATGGCTGTGAG 

GACCTCTGGAAGAAGCAGTCAGTGCTCTTAACCATTGAGCCATCTCTCC^ 
TGAGCTCTAGCTGTACTAGGGCTGTAGATGAGGCTGGCTTCAGCTCAGACCTA

GTTTTTGACACAAGGTTTGAAGTCTGTGTAGCCTTGGAACTTGA 

CCITTGCCTCCTAAGTGCTGGGATTAAGGGCATGTATAG 

CAGAGTGGTGTAATCCTATAATCCCAGCACTTGGGAACTAAGAGGATGAGAGTCA 
TCGTGACTGGTCTAATCAGCTGGAAAGATGCACTCTGGTAGACAGGACCAAAGGAATTC 




MOUSE SEQUENCE - CODING 




TCGCCGACAACrTTGCCGGTGGCCATTAAGt^CGTGGAGAAGGACrcGATTTCCGATTGGGGAGAA 

CCCATGGAAGTGGTCCTGTTGAAGAAGGTGAGCTCGGACrrTCTCGGGCGTCATTAGACTT

TTTCGTGCTGATCCTGGAGAGGCCCGAACCGGTGCAAGACCTCTTCGA 
CCCGAGGATTCTTCTGGCAGGTGCTGGAGGCCGTGCGGCATTGCCACAACrGCGG 
ATCTTAATCX3ACCTGAGCCGCGGCGAAATCAAACTCATCGACTTCXKM 
TGATGGGACCCGAGTGTACAGTCCTCCAGAGTGGATTCGCTACGATCGCT 

TCCTGCTCTATGACATGGTCTGCGGAGATATTCCGTTTGAGCACGATGAAGAGATCAT

GTCTCTTCAGAGTGTCAGCACCTTATTAAATGGTGCCTGTCCCT 
TCCGTGGATGCAGGGTGACCTCCTGCCCCAGGCAGCTTCTGAGATCC^ 

HUMAN SEQUENCE - GENOMIC 

GGCTAATTTTGTATITTTAGTAGAGATGGGGTTTCTCCACATTGGTCAGGCT

CCTCGGCCTCCCAAAGTGCTGGGATT ACAGGCATGAG CCACCGTGCCTTGCCAACCCGTCTCTTTAAAAAAAATTTTTTTTACGG C 
CAGGCACGGTGGCTCACGCCTGTAATCCCAGCACTITGGAAGGCCGAAGCAAGTGG^ 
CTGGCCAACGTGGTGAAACCCTGTCrCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGTATGCGCCTC 
GGGAGGCTGAGGCAGAAGAATTGCTTGAACCAGGGAGGCGGGGGTTGCAGTGAGCCAA 

GACAGAGAAGACTCCATCTGAAAAAAAAAATTTTTTTTTTTTACAA

AAACATGAGTCCTTTCAAAGTGGGTATTAAATTAATGCTTGCTGCTCCriTAA 

CAAAAGAATATGATTGTATCAGATTAGTATTACAAGTAACTGCTGTGAGTTTAAAAACTATTGGCTG 
TAAGAAGTCCAAATGGCCTTCAGTTGGTCTGAAAGGACAGACAGAAT 

TGTGTTGTCTTTAAG CCCTGCATCAGAGGACCAGCTATTTGCCAACATAACT CCCATTTACAAGAGTAACTTCAG ATAAAACCCTG 

ATGACTG CCAAGTCTG ACTTTCTCAT CT AG TG AGTGGGTGTCTCTCT AAT AAAGT AG CATCACTGAACACAG AAG CAAG CTT AAC C

TATCCAGGGAAAAGTGACACCATTTCTATAAAGGGAACAGCACAGCTCCCTAGTTCT 
GCAAAGAATCTCTGACAAAGTAATACATTGAAGGCCATAAAAAACAGCTCTGTATCT^ 
GTGAGCTGTCATGTTTGTTITAGC^GCATGCCACCrAAAAAGAGAAGTAAAACAGAG 
GTAAGTTTCTCCAAATTATTGAGTAGATTTCGTCCAATAATGTTACTACATGTC^ 

GTATTAACCACGATATAAGCAAATATTCCTAAACTATAAGCIT

GTAGACTGAAATAGCCAGGCACAGTGGCTGACGCCTGTAATCCCAGCACTTTTGGAGGCCGAGCT 

GTTCGAGAC CAGCCTGG CCAACATAGTGAAACCCTGTCTCTACTAAAATCACAAAAAGT AGCTGGG CATGGTGGCGCGTGCCTGTA 

CTCCAGCOTGGGCAACAGAGGGAGACTCTCTCTa^AAAAAAAAAAA 

ACATCAGGAGCTCCTACTAATTCTCTCTGGCTTCTTGCT

ATCCCAGCATTTTGGGAGGCTGAGGTGGGTGGATCACAAAGTGAGGAGTTCAA 

CTACCAAAAATACAACAATTAGCTGGGTGTGGTGGTGCACACCTGTAATCCCAGCTACT 

GAACCCATGAGGCAGATGTTGCAGTGAGCCGAGATCGCGCCACTGC^ 

AAAAAAAAAAAAAAAAG AAACTTTT ATTCATT CATTT CATG CGT ATTGGGGAAT ACAAAAAAACAACACAAAG CAT CTTTTGCTTC 

ACTTCCTGACTCTACTCACCGTGTTCTCTCTCTCCTTCCTACTTACTCGTA

CTCGATGGAGCCTCCCCGGACCTCTTGAGCCTCCTCTGAACCTTGACT 
TTGTTAAATCTCTTTTTTTGTTTGTATGTTAGAGACAGGGTCT 
CACAGCAACCTCTGCTACCTGGGCTCAAGCAGATCCTCCTCCCGAGCTCAGCCT 
CACATCCAGCTAATTTTTGTATTTTTTGTAGAGACGGGGTTTCACCATATT^ 

TCCACCAGCCTCAGCCTCCCAAAGTGCTGGGATTAGAGTCAT

TTCCTTGGAAAGGCAGAATGGCATGGTGGGTAGGAGATTGCTCrGCCACTTAAC^ 
TCCGTTTATTTATCTGTAAACCAATGAAGATAGTAAAAATGCCCACCT 
AGCACATCATTGATGCTCAATATTGGCTTTTAGTACCCTATCTCTTTTGCCCCTC 
GTTCAATTAAAGGATGATGGTTTCTGCCTTCAATATATTTATAGTCT 

ATACTATTAGATGACAAATTAGATACACAATAAAGAAATGTTTTCATCACTTTCTTTGTAG

CT T T T TT GCACTT T TT TT ATTTTACCTroATGAACT^ 
GATGGTAGAGAATCCCCTCACCAGCTCCTAGATAGAAGACATTCr CTCT 
ACACACGCACAGACACGATGGCCCTTT T C TTT CT 
GAGGGATACATTTTCrrTAGTCITCTAATAAGGTATTTTATAACAGTCT 
CITTTTATATTCATCTATCGAGCAATACTCAGGAACT
TTGTCAGTATCTAATGTGCAAAATTCCAAAAATAA
AATTTAAATATACTTTATTGTAAAATAGCCCTTTCTTTTCCCAACCT^ 

GCGCAGTGGCrCATGCCTGTGATCCCAGAACTTTGGGAGGCC^AGGCGAGCGGATCACGAGGTO 
CAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGCGCATGCCT 
GCTGAGGCAGGAAAATCCCTTGAGCCCAGGGGCCAGAGGTTGCAGTG^

GACTCCGTCTTAAAAAAAAAATGAAATAAATATCZATATCCTTATCAAGGACAAGAACTAGCT 
TCCCAGCACTTTGGGAGGCCAAGGCGGATGGATCACGAGGT 

TACTAAAAATACAAAAAATTGGCTGGGCTTGGTGACACACGCCTATAGTCCCAGCTACT 
GAACCCXjGGAGGTGGACGTTGCAGTGAGCCGAGATCGCACCACTACACTCCAGCT^ 

aaaaaaaaaaaaaaaaaaaaaaaggco^gaactagttctagtgactacatagattc

gatttcctaagtgtctcccactatcacaatgatttcaatttaaaaaggtttt 

gcattttcggccgggcgcgccggttcac^cctgtaatctctgtaatctcagca 

tcaggagtttgagggcagcctgaccaacatggtgaaaacccgtctatactaaaaatac^ 

ccagt aatcccag ctactctggaggctaagg cagggagaatggcttg catccgggaggcagaggttgcagtgagccaagattgcgc 
cactgcactccagcctggtagacagagcaagactccatctcagacaacaaca^
tgatggatcacttgaacccaggagtrtgggaccagccttggcaaaatggta 

ccgggtgcggtggctcaggc ctgtaatcccagcactttgggagg ccgaggcgggcggatcacgaggtcaggagatcgagaccatcc 

tggctaacatggtgaaaccccgtctctactaaaaatacaaaaaattagcixksgot^ 

gggaggccgaggcaggagaatggcgtgaacccgggaggcggagcitgcagtgagccgagatca 

agcagagcgaaac cccgtcacacacac aaaaaaagaacaagtgaaaatt agcccgg cgtggtggctcaccc ctgtaattccagcta

ggtgggaggctcaggcagcaggatcgcttgggcgcaggtctaggctgcagtgcgccgtgatcc^ 

CTCGGTGACCAGAGTGAGACTTTGTCTCAAAAATAAATAAATAAAAATTAAAACA 

AGACAGGGTTTCCCTATGTTGCCCAGGCTGCrcrCCTGAGCTCAAGCAATCCGCCT 

GGCGTGAGCCACCATGCCTGGTCTTTTTAAATTTAGAATGTTTCCT 

CTTTGGGAGGCTGAGGCCGCCGGATCACTTGAGGTCAGGAGTT^

AAT ACAAAAATTAGT CGGGTGTTGTGGCGG ACG CCTGT AATC C CAG CT ACTTG-GGAGG CTG AGGCAGG AG AATCGCTTGAACCCGG 
GAAGGGGAGGCTGCAGTGAC CCGAGAT CATG C CA CTG CACT C CAG CCTGGGCG AGAGTGGG ACTCTG TCT CAAAAACAAATAAACT 
AAATAAAATAAATAAAAAACTGAGTAAATTTTTTCT^ 

TAAATAGCGTATAGAGTCTTAGTTATACAATCCTAGTTATATAAATGCATTGACTAGACTT 

CG AT ATCACTTT CAAACTTTCTTCT AAACTTG TTTTTT AAAAAAAT TT CGTG ACT ACAGAACTTG CTG AAG TCATTGTTTTTTTGT

TTTTTTTTTTTGCATAAGCATATAGTCCCTGGATGAAGTCA 

AAAACCACCTAAGGATTAATTTXMATATTGTTCAAATTAATATTAATAAATGCTCCT 
GCCITTTATCCTTCCAAAAGCACTTGTCATTTAAT^ 

TAGTGCTAATTTTATTAAGACTATATTTCGATAAATCAAAACATCAAGTTCCCTACT 

ATCTGAATGTTTACTAATTATCrTTTAAAGCAGTTTrAAGTA

AGCTGTGAAGCCCCCACTATACAAAGTAGATATATTAAATGTTAATCACTTTAGCTCTAATTA 
AGAAAAAGTGAAAAAGGTTGAAGTTGATACATATTTTTTTTTCTAAAAGGCCTAAAAG 
CAT CTCT CCCCAAATGGTTAAAAATCG AACTATTAATCCAGTAGTTT^ 

GCG CGGTGGCTC ACG CCTGTAATCCC AGCACTCTGGGAGGCCAAGG CAGGTGG ATCACAAGGTCAGGAGATCGGGACCACCATGGC 

TAACACGGTGAAACCCCGTCTCTACTAAAAATATAAAAAATTAGCCAGG

GGCTGAGG CAGG AGAATGGCGTGAACCTGGGAGGCAG AGCTTG CAGTGAGCCAAGATTGCGCCACTGT ACTCCAGCCTGGGAGACA 

GAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAGGCCGTTTTAAAAAACAT 

GCCATC^TGATTGCTAATATTTATTAGAAGTTTACTCTG^ 

ACAACCCTATGAGGTAGATGCTGT/TATTATCTCCATTCTGTCTAAAGCTTAATA^ 

AGCTGCC AGATC CCGGAATTGAAACCAGGCTAACTTAGTTT AAAGCCCTTGCT CCCAACTTTT ATGTTCCCGAAGCCCACTTAATT

CATTTAAATTTGGTATAAATTGACTTTAATTATGGCCTTTT 

TAAAGTATG CJVTTCAGGG AGGCTGGGCATGGTGGCTCACACCTGTAATCCAAGGAGG CTGGGCATGGTGGTTCACACCTGT AATCC 

CAGCACTTTGGGAGGCCGAGGCAGGCGGATCATAAGGTCAC^AGTTCGAGACCAGCCTGGCCAA 

TAAAAATACAAAAATTAGCTGGGCATGGTGGCGGGCGCTTGTAATCCCAGCT^ 

GAACCCGGGAGATGG AGGTTGCAGTG AGCCGAG ATAGTG CCATTGCZACTCT AGCCTGGGT AACAGGG CGAGATTCCGTCTCAAAAA

AAAAAAAAAAAAAGGG CATTCAAGGAATAGG AAG ACAAAGT AT AATGG AAAAG T AAATGAACAAAAAGGATTTAAAAAAACAAAT C 
CCAACTGTCAGATTAGTGGACAGTGAACCAATGCTCTGAGTTTGCT 

GGAAAGGCTTTGAATAAAAGTGGAGAGGTAGAGAGGGCTCTTCCCAGGACAGGTTGATTGCrTCC 
GTGCTAGGCGAGTGGGAACAAGTGGGAATAGTTCTGTTrGACTACATTTGAG 

GAAGTGGGAATCACATATGTTTAATGATGTCTTTGTTCGGTGCCAAGGACAGT^

CAGGAGGTGAATTTGGGTCATTTCATCTCCTCTCTTAGCTGCTGTATGACCCrGTGCCATCCT 
GCAAAAAGGGAGTAATTCTTGTTCCCTTGGGTTTTCTCCTAGTCATAA 
AfcAATATGTAATATTTTGTAGGATAAATTCATAAAAGTTGTGGAGG 
CAAAGGAAACATTTAAATTCACTAATGACAGCTATAGGGATCCTGATTT^ 

GAATGTAGTTAGCTTTCAATTACTTAGGCCACTGACAATTCAATTTATGGATTAT^

GTTATCAACTCTACATGAAGTTTAAAAGGACAAAGGATGCAAGTGGAATTGGTCA^ 
GGTGGGGGGTTTCCTTTGAGACAAGACTTGGGGTTG^ 

GTGACTACAACGTGAAAACTGGTTAAATCAAGCGCACCCTCCCACCCTCGTTTTAG^ 
CTGAGGCAGCATCnXSCATCACAACACTAACATTTG 
AATC^GGATTOSCTGGTCCTrTrGCATTrCTAAAATGGGAATCCC^

TCCCTTTCCCGCAG^lAGCGGGGCrCCCGCCrrCCAGTTCTGTTCCAG<3CCTGACT 
ATCCTTCGCCCCCGACGCGCCCCC<^CA<^CAAACCCCC^GAATCCGCCrc 

ACTTGGAGGTCTCGGGTCTGAGTCACACAGAAAGACCACCCTCGTCX3GC^TCCCCACACACAGTCCG 
CCCCGCCTGACACACTAACGCCCGTCCTCTCCXSCGC^C^ 
GGAG AGCCGCAGAGC CGGCCTCGGGCGGCCTTTGATGG CTTTGTTATTGTTTGGGTTTGAATCGATACGCCCCTCCCCATCCTTCC

TCCCTCGCGGCCCTACACCCttGCTCCCGCCTCCCCTCA^ 
CGTCX3TGGCGGCGTAGAGACCATTCTGACCGCGAGAGCTGGGCGG^ 

TCTGGTTGGCTGGAGTAGCXSCrGGCAGGGGCGGGGCCGGGGCGCGGCCACAGAGCGCGCGG 
GTCCCGCCGCTTCCCCACCCCCTCTCCTCCCTCGGCa^ 
cGTCCCCGCCCGCasasCCCTCCCCGCGaSCCCTCCCCGCCGGCGCGCTCCrCCCCTTTACTCCTC
TCTGCTGCAGCGGCCGCGGTGGCTGAGGAGGCCC
AGCAGCAGCAACC^CTAGCCTCCTGCCCCGCGGCGCTCCCG 

GACCCCGCTGGCGCGCCCTCCCGCCGCCAGTCCCGGCAGCGCCCTCAGTTGTCCTCCGACT^ 
C^GCCACAGCCGCAACGCCACCCGCAGCCACAGCCACAGCCAC^^ 
G CGGCAGCTCCTCTGGGCACCGTCCCTGCGCCGACATCCTGSGAGGTTGGGATG CTCTTGTCCAAAATCAACrCGCTTGCCCACCTG

CGCGCCGCGCCCHXIAACGACCTGCIACGCCACCAAGCT 

GCGGCGGGATCTCCTGGGTGGGGAG CTGGCGGCTCGCGGGCCGGCACTGAGTCCCCGTGCTT CCCCCTTTCCTAGGCAAGG AG AAG 

CAACTTGCCGGTGAGTGGGCGCCCCGCGGTGGGGAGGGCGCG CCGGG CGGGGGGCG CACGGG CGTGCTTTAG CCCGGACGAGGGAA 
CCTGACGGAGACCCTGGGCTTCCAGGTGGCCATCAAACACGTGGAGAAGGACCGGA

TGCAGGAGCGACCCCCAGGATGAGTGGGTGGGGTGAGGGGCGCCCCCGACTCCCGCCCTAACGCGC 
AATGGCACTCGAGTGCCCATGGAAGTGGTCCTGCTGAAGAAGGTGAGCTCGGGTTTCT 

TGCAAGAGGAGCTGGCCCGCAGCTTCTTCTGGCAGGTGCTGG^ 
ATCAAGGACGAAAACATCCTTATCGACCTCAATCGCGGCGAGCrCAAGCTC^
CGTCTACACGGACTTCGATGGTGAGCCAGGCCCGGGAGGG 

GGCCAGTCrrCCCGCGCCAGCCTTTTGTAAAGGTCATTGGGCCGCCTGGCTCGATGCT 
AGCGTAGTAAAGCCGGGGATTTTCAGCCAGCTGAACCTGTAATGTTTCTGGCAT 

CCAGG CTTT CCCGATGAAT AAGAGG TTG TGGGCAAC CGG CGGT AG CC CAGATTTTTCT AAAGT CTGAC C CAGTTT C CC CGC CAGT A 
AAAGGATGGGGGCGGGGGAAGAGGTGGAAATTGATGCCGGTTTTGTA
GGTAGTTTTAGAGCTGCAGTTCTTAACTCCrrTCTGATCTAGGGAAG 

TTTTTCTTTAGTGTTAAGGGGAGAGAGAGTCAGGAATGAATGTTGTGAAATAAGATCTGTCGCT 

CTCCGCAGAGAGGATGAAAACCTATCCTAGGGAGGGGCTTGGAGCGGGTTCTTT 

AGCTGCGGAGGGTGGGTCATCATCTGAGCGGCTTAACCTAACAAACGACAGCCrrTTCAAAACT 

GTTTATTTGCCCTTGGAGGACGCTGGGTTGGGCTGCATTTTTTGTAT^

CITTCAAGTCTTTGCTGCCCCCTGGTGAGCACC^ 

CACGTGGTCTGAAACCAACCCTGCAGCTCTGTCTACCTTTCCAGTCTAGGGAGGAAAGGG 

GGGTGGGGAGGAAGGCCTCCCAAAGGGCACCCTGACTTAGGATCTTGTGCAAGC^ 

CTCACCCGGGCTCCTGGGCAGTGGTGAATGCAATTTAAGGATAGTCTATGAGATACCri'TCTTGGTT 

CATCCTTCGCAGGCGGTCCTGCCTCACAGGGCCrCAAGTTTTGGGTCTGC

G AATTT CAG TTT ATGGTTTGGGCT AGCAG AG AGG TGGGT AAT G CTTTGGGTTGG AGAG ATG C CGT AAG GTG CGC CT CCACT CTCCT 
TAGCCCAGAGGGAAAAATGGAGTTCACCTAGCTCCTGAGAGAAGGGATTTT^ 
CAGCTTCTTTGTGCTTGTTTTTGCTAAAAGTGTGTTTTCTCT^ 
AGTGGATCCGCTACCATCGCTACCATGGCAGGTCGGCGGCAGTCT^ 

ATTCCTTTCGAGCATGACGAAGAGATCATCAGGG

GGCCATGGGGTTATTGGTCTTAATGGGGCTATTAGTCTTCATGGG^ 
TCTGTCACCCTTGGCTTAGAATTGTAGGTGAGTGATTTACACTTGAGCTGGCCTCATAA^ 
TTGGGAGGTCAGAGGAAGGCATGTGTGAGCATATTAAGAAGAAAAGACAATCTGGCT^ 
AGAAACCTGATAATTCCTGGCTGTTTTGCCAGGGAGTAAAAAGTTAAA^ 

TATTTTGGTACCAGTGACCTCACTGTGGGTGGTGTTCGTGTITGTAAGTTGGTAGGTGAATTGAA^

TGTCTCATCAAAATCTCTTGTCATCATCCTTCCrATTTCrGGTGAGTGGGTC 
ACCAC^^TTTAAGAGGGGAGTCAGTTTTTAGC!TGAAAGCAGACrrGGAGGACCCAGA 

ACCCAG CACAGTGTTCTAGAAAATGCTTGGCCTCCCrGGGACCCCAGACTTGTGGGCCT CTG AGAAG CAAATGGGGAAGACCTTTG 
CAGTGTAAAAACAAGTTGAGTCATTCATAACCTCGTCTATCCT^ 

CTGAGACCATCAGATAGGCCAACCTTCGAAGAAATCCAGAACCATCCATGGATGCAAGATGTTCT

GATCCACCTCCACAGCCTGTCGCCGGGGCCCAGCAAATAGCAGCCTTTCTGGCAGGTCC 
AGGGGAAGCTTCTGTCTCCAGCTTCCCGAGTACCAGTGAC^ 

AACTCATTCCAGATCCCAGGCCCCrGGAGGCTGCCTCCCAACAGTGGGGAAGAGTGACTCT 
CCCATAGATACTCTCITCrrCTCATAGGTGTCCAGCATTGCTGGACT 

ACCCTGCCATGGAACTGTTTCCTTCATCATGAGTTCrrGCTGAATGCCGCGATGGGTCAGGTAGGGGGGAAA

TAGGACTAGCACCATTTTAAGTCCCTGTCACCTCTTCCGACTCTTTCTGAGT^ 
ATACTTGAACTTGCCTCTTTTACCTGCTGCTTCTCCAAAAATCTG 

CCC CCT CCTT CAT ATG AAAGGTG CCATGGAAC AGGCT ACAGGG C CAAACG CTG AGC CAC CTG C CCTTTTTT CTGCCTCCTTT AGT A 
AAACTCCGAGTGAACTGGTCITCCTTTTTGGTTTTTACTTAACTGTTTCA 

AATGCAATCAACAGAAAAGCTGTAAATGTGTGTACAGTTGGCATGGTAGTATACAAAAAGATTGTAGTGGATCT

ATTTTGC CTTTAAGTT ATTTTAC CTGTTTTTGTTTCTTGTTTTGAAAGATGCGCATTCTAACCTGGAGGTCAATGTTATG^ 

TTTATTTATTTATTTGGTTCCCTTCCTATTCCAAGCTTCCATAGCTG 

GGGACCTTTTGGGGGAGGGCTGCGACGCTTGCTCrGTTTGTGGGGTGACGGGACTCAGGC^ 

CTGTGGGGCCCCTCACCT ACTTACCCAGGTGGGT CC CGGCTCTGTGGGTGATGGGGAGGGGCATTGCTGACTGTGT ATAT AGGATA 

ATTATGAAAAGCAGTTCTGGATGGTGTGCCTTCCAGATCCTCTCTGGGGCTGTGTTTTC

TGAGTGAAATACTGTACAGGGGAATAAAAGAGATCTTATTTT 
ACTCGTGGCTTCTAATCGTCTGTGCGGAGGCATTGCTAACCTGCATTTATTGAG 

AGG AATT CT CAG C CT AAAT CTGG CTTCCTT CAACATTG GG ATTTCT AGG AT AGTCATCTTCCTG AAAC^ G 
TA0K5ACCAGTATTGTCAAAATTTGGGCAATATCTGGCATGGGAGGCCT 

TGCAAATAGGATAGATGTGACAATTTTCAAGTGTTCAATAGCCATTTATGGCT

AGTTCTATTGGATGGGGCTGATCTAGAGATAACAAATACATACCTATTACCAAACTAACCCT 

ACTGATAGTAAAACATTTTC^CATGTCCCCTTGGTACACGTGAAGTTTAT^ 

ATTCAAGTTTCCTTCACCTTTAGTGTGAGGTTTAGTTT 

CACTGCCAGTGGCACGATCTCGGCTCACTGCAACTTTCACTTCCCGGGTTCAAGCAGTTCT 

TTACAC^CGCCCGCTGCCACGCCCAGCTAATTTTTGTATTTTAGTAGAGACAGGGTTTT

TCCTTGACCTTATG ATCCGCCTGCTTCAG CCACCCAAAGTGCTAGG ATTACAGGTGTGAGCCATCGTGCTCCGCAG CCTTTTTTTT 
TTTTTTTTTGAGATGGAGTGTCGCTCTGTTGCCCAC^CTGG 
GTTTTAAGTGATTATCCTGTCrCAGCCTCCCGAGAAGTTGGGATTACAGGCAT 
ATACC>GGGTTTCGCCATGTTGGCCAGGCTGGTTTGGAACTCC^ 
ATTACAGGCGTGGGCCACCGCGCCTGGCCTAGTCTTTAACAATAGTATAATTTATTATGC^
TGGTGACACTAAAGTCGTGACCAC^TAAGACCCAGTCTACAAAAAAT^
TCCTACCTATGCAGGAGGCTGAGGCAGGAGGATCGCTTGAACC 

CTCCAGCCTGGGCAACAGGTAACCCCACTTAAAACACACACACACACACACACACACAC^ 

AGTTTGACTTAAGGTTTCCCCCTCTGATGTGTTTGACAAAGGG CTAATATTGGGAGT AGGTGAAATGTTAGCTCTG CCATTTTCTT 
ACTGCTTATTCriTGAATAATTAGTATTTTTAGTCCATGGTTTCTTTGC^

AGTTTAGTTAAACTGGATGGCAAGACCCATGAATCCACTTTGTACGTGTGGCCT 
AGTGCCATTGTCAATTCATATATTAAGTATTGGTAAATGTTGGTTTCTTGTrrc 
CCTGGGGGTCATCATGCTCCCAACCCTCAATGCATCCAC^^ 

GCCCCCATGTGGCTATAAAGAGGAAGGAAGGATGTCTTGGTTCTGGGGGGCTGATTTTCT

CCATCCCCCACACAACATACACACACTACCAACATCTTGGAGCCTGAGGGTTGCATGCTTCGTC 

GGATAATTAAAAGTTGCCGCAAACTATTAACGAAATCAGGCAAAGCCCCACCCTTCT 

CCCATCTAATGAGACTGCTGCGGGGGAATAGAAGAAAGTATCTCTGAATAAAGACAGA 

AGAGTTTTAGGGGGAGGGTGAAAGGTCATATTTAATCTCTTCTCAGGGGCAGTTTGA 
CAATAAGTCTGCGGGGGAT CAATGTCATGGCCAGAGGAATAT AGAG AGCAATGG1TAAAAGCATTG ACTTGGGAAT CAG CCTTGAG

CTTACTTATCTAGAAACCTTGGGCAAGTTATTACACCCT 

AGACTGGGTTAAATAACAGATATACTTACATACAGAAAAACTACTATCTCCCGCTTACGGATC 
AAGAAGTACAGGGTGGGGTAGGGTGTGGTAGTTCACACCCATAATCCCAGCACTTTGGGAAGAC^ 

CCAGGT^GTTTG AG ACT AG CCTGG C CAACATGACAAAACCC CAT CT CT ACAAAAAAAT AG AAGCATT AG CCGGACGTGG TGGTGCGT 
GCCGGTAGTCCCAGCTACTTGGGAGGCTGAGGTGGGAG^

CACAACATTCTAGCTTGGGTGAAAGAGCGAGACCCTCT'CTCA 
AATAAGTTTCCAGAGTCAGGGAAGTGGTTATTACCITTGGGGAGAAGGGAGGGAG^ 
TTTTGGCAATGCTTTATTCCATAAACTAGATGGTAT^ 
TTTATGATAAACTTTATTTTTAAAAATTTTTCTCTGGCTG 

GCAGGCGGATCACAAGGTCAGGAGATCGACACCATCCTGGCTAACACAGTGAAACCCCATCT

CAGGCATGGTGGCATGTGCCCATAGTCCCAGCTACCCAGGAGGCTGAGGCAA 

GTGAGCCAAGATAGCTAGCCACTGCACTCC^CCTGGGCGACAGAGTGAGACTCCGTCT 

GTTC rTT CT TT TTAAACATTTATTTATTTATTTTAGCTG 

CX3GGTGGATCATCTGAGGTCAGCAGTTCAAGACCAGCCTGACCAACATGGTGAAA 

GGGCATCGTGGTGCATGTCTATAGTCCCAGCTACTC^GGAGGCTGAGGCAGGA

TGAGCCACGATCACGTATAGCCrGGGCGAOVGAGCGAAACT 

TATATTATATATTATATTATATATTATACATAAATTATATATTATATATATAATATATAATATATAATATATATTATATATTAT 

ATTATATATTATACATAAAATATATATAATATATATTATATATTATATTATATATTATATATAAAATATATATATAATATATATAA 

TATTATTTATTTATTTTGAGACAG<5GTCTCACTCTGTC^ 

CTTCCTGAGTTCAAGCAATTCTCGTGCCTCAGCCT^

TATTGTTAGTAGAGATGGGGTTTCACCATGTTTGCCAGGCTAGTCTTGAACrCCTGATCTCAAGTAATCC^ 
CAAAGTGCTGGGATTACAGGTGTGAGCCACTGCGCCCGGTCCAl'l"I'riTTTl 
CAGGCTGGAGTGCAGAGGCATGAACTTGGCTCACGGCACCCAGGT^ 
TACAGGGACCTGCCGCCATCCTTGGCTAATTTTTCT 

TCCTGACCTCAGGTGATCCGTCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCACGAGCCACTGTCCTC^

TCITTTAATTGAGACAGGGTCTCACTACGTTGCTCAGGCTAA 

AAAGTGCTGTGATTACAGGTGTGGAATATCATGCCCAGCCTATGATAAGCTTTTAAAAG 

AGGAGG CAG CAG ATGG AGTTATT ACTG CC CTTGGCTG AGGGG ATCTGGGAAGG CTTCAAAG AGGAGGTGG CATTT CAC CTGG ATTT 
TG AAGG ATGG AGC CT T AGC CTG CTTCCCTG TAAAAT AGG CAC CAAACATG CAG ATGGAG CAG AG AAAT CTGGATG TG CT AAATG TT 

TTTCCCCCACTAGACCAGG AATCAGCAAACTATGG CCTG CTGCCTGAACTGGTCCC CTAA

GCCTAGATTTTACATTTTTAAGTAGTTGAAAATCAATCAAATGAAGAATAATATGTTGTGATATG 
CTGGGTATAGTGGCTCATGCTTATAATCCCAGCACTTTGGGAGGCC 

CTTGGG CAACCTAGOVAG ACTTCCATCTCT ACAAAAAATAAAT AAAAATT AGTCAGG C ATGATGG C 
GACAGGAGGATCACGTGAGCCCAGGAGTTTGAGGTTACAGTGAGCTATGATCTCACCACTGC^ 

CAGAGCAAGACTCTATCTCTAAAAGAAAAAGAAAAAAAAAGAGGCC^

GGCTTAGGTGGGCGGATC^CCTGAGATCAGGAGTTTGAGACCAGC 

ATTAGC CCAGTGTGTTGGTGCCTGCCTGT AATCCCAGCT ACCAGGG AGGCTGAGG CAGGAG AATTGCTT AAACCCATAAGGTGGAG 
CTTGCAGTGAGCCAAGATTTCTCCATTGCACGCCAGCCTGGGCAA 

ATATATATATATATATATATATACACACACACACACATTTGTACACACATATGCAACTC^ 

GTCCATTTACATATTGTCTATC^CTGCATTTGTGATACGATGGCAGAGTTGAGTAGTTAAGACACTCAG

AAAACATTTACTATCTTGTCCTTTACAGAGAAATTTGCTAATCTCTACACTAGATTATGGT 
TTGTTTTTTTGGGGTTTTTTTTTTTGAGACAGGGTCTTACTCCG 
AGCCTTGACTTTCCAGGCTTAAGCAATCCTCCCAC 
AATTTGTTTTTTTGTTTGTTTGTCTGTTTGTCTTGAGACAGGG 
ACACACTGCAACCTCAACCTCCTGAGCTCAAGTAATCCTCTTGCCACTGCCTCCCAAAGTGCT
TGCCTGGTCTGAACTACATCTTTGATAACTCTTTTTTTrrTTCT^ 
TGGGATGATCTCGGCTCCACCTCCTGGTTCAAACATTCT 
CGCCTGGCTAATTTTTGTTTTTGTTTTGTTTTGTTTTG 
CCATCTCGGCTCACTGCAAGTTCIAGCCrCCCGGGTTCATGCCATTCTCCT 

6 5 CACCACCACGCCCGGCTAA' rTT 'ri " ! 1 T T T T T T T TTTT T TGGTATTTTTAGTAG AG ACGGAGTTTCACCATGTTGTCCAGGATGGTC 

TCAATCTTTTGATCTTGTGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGA 
CTCTTAACCTAGTGCCAGGCATGTAGTAGTTCCTTAGTGATAATAGCTAACATAC^ 
ATGCTTTG CATG CATTAACTT ATTT ATCCTTAAAACAACTCTATGACACTATTGG AAA 
TATTTTATTTTATTTTATTTTGTATTTTTATTTATTTATTTTTTT 

GCGCTGATCTCAGCTCACTGAAATTTCTGCCTCCTGGGTTCAA

ATGTG CT AT CACG C CCAG CT AATTTTTGT ATriTrTTl' 1 AGT AGAG ACTGGG TTTTGCCATGTTGGC 
TGACCTCC^GTGATCCACCCGCCTCAGCCTCCCAAAGTGCTAGGATTACAGGCATGAGCCACT 
TAGAGATGAGGGTCTTGTCATGTTCCCCAGGCTGGCCTCAAAATCTTGGCCTCAAG 
TGACT^TACI^TACATGCCACCTAGCCCGGCTTGATC 
AGTAAATTTCATGTCAAAGGCCACATATTAGGTAGTGXSTCAGGTCAGA^
GACTCCGCGATCTTGGCTCACTGCAAACATTTGTGTGGATT^
TGGTTCTTCCCCAGACTCCAGAGTTGGGTAAGCCATTCTGTTCro 
TGGGCCTGGCTGGGGAGGAAGAGTTGGAGGCTGAAAAGGGAGA^^ 
TCTTGTAAATGTTGCrCAGCTCCTTCAACTGCTGTGCTGGTTCTTG^ 

AAAAGATATGGCCTCATCCCCCAGTTCATTTCCTGCCCAGCCTTGCACTACTAACCCCGATTGC 

CCTCAGCAGTGCTGTGGGAGGAAGTGCAAAAATAAGCAGAACTAGGGGCAAGA 

CAGG ACATCCGATGT ACAGAAGTGGATTCATCAT AATCCTTGGCT CATCG T AT ATTCT 

TTATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATA1TTTTAAATCGGAGTCTTGTC 



AAGTAGCTGGGATTACAGGCGCCTGCCACCACGCCCGGCTAATTTTTTG^
GCTGATCTCGAACTCCTGACCTCAGGTGATCCACCTGCCTCGGCCTCCCAA 
GCCCGTTATTATTATATTTTATCATACTGGGTA 

TCTATATGATCCTCTCrTAGTCCATTrCTTACCTCCCCCAGATACAACCCACTGTCCTA 
TATATTTTATTTATTTATTTTGAGACAGAGTCTCACTCTTGC 
ATCTTCCTGGGTTCAAGCGATTCTCCTGCCTCGGTCTCCGAAGTAGCTGAGAT^
TAGTAGAGACAGGGTTTCACCATGTTGGTCAGGCTGGTCTTGA^ 
GCTGGGATTACAGGCATAAGCCACCATGCCCGGCTTATTTATTTATTT 
TTGAGGCTCACTGCAGCCTCAAATTCCTGGGCTCAAGTGAT^ 

CACCACACCTTGCTAA' n " l 1"! 1 1 1 1 rTTTGAGATGGAGTCTTGCTCTGTCGCCTCGGCTGGAGTGCAGTGGCACAATCTAGGCTCA 
CTGCAAGCTCCACCTCCTGGGTTCACGCCATTCTCCTGCCTCAGCCTCC

GGCT AATTTTTTTGTATTTT CT AGTACAGACAG GGTTT CAC CATGTT AG CCAGG ATGGTCT CGATCTC CTGACCTCGTGAT C CGC C 
CGCCTCAGCCTCCCAAAGTGCIX3GGATTACAGGCGTGAGCCA 
TGGGTGTCTCACTATGTTGACCAAGTTGGTCrTGAATTCCTGGCCT 
TACAGGCGTGACCACTGTGCCCGGCTAACTTTTAAAATTTTTTATAG 

TTCTGGGCTCAAGTGATTCACCCGCCTCGGCCTCCCAAATTGTTGGGATTA

CAG CATT CTTTCACTTT AAAAT AACATTTTATTTT AT AGG T ATGT AT CCCTAAATT ATGGAG T ATTT AGT ATTTTT ATTTTGTCTT 

ATTTTATTTCATTTCATTTTTAGATGGAGTCTCGTTCrCT 

CTG CCTCCCGGGTTCAAGTG ATT 

HUMAN SEQUENCE - mRNA

GAGGAGGCCCGAGAGGAGTCGGTGGCAGCGGTOGCGGCGGGACCGGCAGCAGCAGCAGCAGCAGCAGCAGCAAC^ 

GCCCCGCGGCCTTGCGACGAGCCCCACGAGC03CTCACCCCGCCGTTCT 

AGTCCCGGCAGCGCCTCAGTTGTCCTCCGACTCGCCCT 

GCACAGCCCAGGCAT AG CTTCGGCACAGCCCCGGCTCCGG CTCCTGCGGCAGCTCCTCTGGCACGTCCCTGCGCCGACATTCTGGA 

GGTTGGATGCTCTTGTCCAAAATCAACTCGCTTGCCCACCTGCG CG CCGCGCCCTGCAACG ACCTGCACXSCCACCAAGCTGGCGCC

CGG CAAGGAGAAGG AGCCC CTGG AGTCG CAGTACCAGGTGGGCCCGCTACTGGGCAG CGGCGGCTTCGGCTCGGTCTACTCAGGCA 
TCCGCGTCTCCGACAACTTGCCGGTGGCCATCAAACACGTGGAGAAGGACCGGATTTCCGACTGGGGAGAG 
CGAGTGCCCATGGAAGTGGTCCTGCTGAAGAAGGTGAGCTCGGGTTTCTCCGG CGTCATTAGG CTCCTGGACTGGTTCGAG AGGCC 
CGACAGTTTCGTCCTGATCCTGGAGAGGCCCGAGCCGGTGCAAGATCTCTTCGACTTCATCACGGA 

AGCTGGCCCGCAGCTTCTTCTGGCAGGTGCTGGAGGCCGTGCGGCACTGCCACAACTGCGGGGTGCT

GAAAACATCCTTATCGACCTCAATCGCGGCGAGCTCAAGCTCATCGA 

GGACTTCGATGGGACCCGAGTGTATAGCCCTCCAGAGTGGATCCGCTACCATCGCTACCATGGCAGGTCGGCGGCAGTCT 

TGGGGATCCTGCTGTATGATATGGTGTGTGGAGATATTCCTTTCGAGCA^ 

CAGAGGGTCTCTTCAGAATGTCAGCATCTCATTAGATGGTGCTTGGCCCTGAGACCATCA 

GAACGATCCATGGATCCAAGATGTT CTCCTGCCCCAGGAAACTGCTG AGATCCACCT CCACAG CCTGTCGCCGGGGCC CAGCAAAT

AGCAGCCTTTCTGGCAGGTCCTCCCCTCTCTTGTCAGATGCCCGAGGGAGGGGAAGCTTCTG 
ACACGTCTCGCCAAGCAGGACAGTGCITGATACAGGAAC 

CAACAGTGGGGAAGAGTGACTCTCCAGGGGTCCTAGGCCTCAACTCCTCCCATAGAT 

TGCTGG ACTCTGAAATATCCCGGGGGTGGGGGGTGGGGGTGGGC AGAACCCTG CCAATGGAACTCTTT CTTCATCATG AGTTCTGC 

TGAATGCCGCGATGGGTCAGGTAGGGGGGAAACAGGTTGGGATGGGATAGGACTAGCACATTTTAAGTCCCTGT

CTCTTTCTGAGTGCCTTCTGTGGGGACTCCGGCTGTGCTGGGAGAAATACTTGAACTTG 
ATCTGCCTGGGTTTTGTTCCCT ATTTTT CTCTCCTGTCCTCCCT 

GGGCCAAACGCTGAGCCACCTGCC C r iT ' m 'CTGCCTCCTTTAGTAAAACTCCGAGTGAACT 
AACTGTTTCAAAGCCAAGACCTCACACACACAAAAAAATGCACAAACCAAGCAATCAAC^ 

GGCATGGTAGTATACAAAAAGATTGTAGTGGATCTAATTTTTAAGAAATTTTGCCTTTA^

CATAGCTGCTGCCCTAGTTTTCTTTCCTCCTTTCCT 

GTGGGGTGACGGGACTCAGGCGGGACAGTGCTGCAGCTCCCTGGCTTCTGTGGGGCCCCTCACCTACT 
CTCTGTGGGTGATGGGAGGGGCCATTGCTGACTGTGTATATAGGATAATTATGAAACACAGTTCTGGATGGTGTGCC 

CTCTCTGGGGCTGTGTTTTGAGCAGCAGGTAGCCTGCTGGTTTTATCTGAGTGA

TTITTTTTT A TACTTGCGTTTGGAATAAAAACCCTTTGGCTTT 

HUMAN SEQUENCE - CODING 

ATGCTCTTGTCCAAAATCAACTCGCTTGCCCACCrGCGCGCCGCGCCCrGCAACGACCT 

GGAG AAGGAGCCCCTGGAGTCGCAGTACCAGGTGGG CCCGCTACTGGGCAGCGGCGGCITCGGCTCGGTCTACTCAGGCATCCG CG

TCTCCGACAACTTGCCGGTGGCCATCAAACACGTGGAGAAGGACCGGATTTCCGACT 
CCCATGGAAGTXX5TCCTGCTGAAGAAGGTGAGCTCGGGTTTCT 

TTTCGTCCTGATCCTGGAGAGGCCCGAGCCGGTGCAAGATCTCTTCGACTTCAT CACGGAAAGGGG AGCC CTGCAAGAGGAG CTGG 
CCCGCAGCTTCTTCTGGCAGGTGCTGGAGGCCGTGCGGCACTGCCACAACTGCGGGGTGCTAC^ 

ATCCTTATCGACCTCAATCGCGGCGAGCTCAAGCTCATCGACTTCGGGTCGGGGGCGCrGCT

CGATGGGACCCGAGTGTATAGCCCTCCAGAGTGGATCCG CTACCATCG CT ACC ATGGCAGGTCGGCGGC!AGTCTGGTCCCTGGGGA 

TCCTGCrGTATGATATGGTGTGTGGAGATATTCCTTTCGAGCATGACGAAG 

GTCTCTTGAGAATGTCAGCATCTCATTAGATGGTGCTT^ 

XCCATGGATGCAAGATGTTCTCCTGCCC 
Table 22

MOUSE NOMENCLATURE 
ICSGNM Wnt3a 
Celera mCG11700 


HUMAN NOMENCLATURE 
HGNC WNT3A 
Celera hCG42253 

MOUSE SEQUENCE - GENOMIC

TGAGCTGTTGATCACATGCTGCATTTAGAGAGAGT^ 

TAGCTATGTGGGCCTTGCCACCTCTACCCAGGAGCACACC^ 

AGCAGACCCAGTCAGTACAGCTTCCTTTCTCTCCCCTGGCCTTGCTCT^ 

GATGCCCGAACAGAGTCTTTCGAGTGGACAATAGGrcCTC 
CTTTCCCCCATCCCACCATGTGACAACCTTCCAGCTCTG^

GGCTCCCCAGGTGACACCCTCCAGGTGTCTGATACATTAGTG 

GAGGCTGGCAAACACATGGCCGCCGGCX3CGGGAGCCTCCCGCCTGCCAGGACTCGTTCAAAGTATC 

CGGCGGGGCAGG ACAGGCGCTGGCTGGG AAGG CTTCAGGCAGGGC CCTGTG CTGACTGGAGT AAGG ACTTGGG ACCCCTCCCCAAC 
TCAGAAAGTGC!TGGATCCTCAAGGAGGGCTTCCATTTTCTGGGGATGCTGT 
CAGGAACACAGGTACCGTXSGGGACAGCTCTCCATCTGAAT

TGAACAGAGTGAGTCCCATACCCTGGGTCTCAGGTATGJ^TGTGGAAGCCTGGTGA 
AAAGGCCTGAGTGGGACCCAGAGCATTCTGGCCC^^ 

GGGCCTGGGCAGAGCCTTAGGAACCCTACCTGCTAGGATCCAACTCAGTGCT^ 
AGAAGTCAGGCAAGCAAGGCACTTCCTGCACCAGATATAGATGC^ 

CTCGCIACACACACACAAACACACACACACACACACANN^

NKNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
NNNNNNNNIINNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNIWNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

AGCCTTGGCCCAGGGAGGCCAGTGCTGGCCCAGGCTGGGCTGGCCTGCTGC 

CTGCATGTCACAGCCAGCGTCTCTG CTATATCCCTGGCCAGGTGCCCGG CCTTTCAACAGG AGCAAGGGAAGGAGGAGGG AG AAAA

ACCATTAGTGTTTGCCGGCTGCAGG CGTGGCTCACGTGGCTGCAG CCACAGGATGAAACTCCTAGTCTAAGGATGGGGCTATGTTC 
CAAAACATAGCGAGCTCTAAGTAGACGGCGATGCCTACCCCTTTAAAGACAAACT 
GTGAAAATGAAAGCAAGAGGAGTCTGAGATCTGATTGGCATAACACGGGCATTTTCACT 
ACAC^CACACTICACTVCACACACACACACACAC^ 

TTTCTAAGG CAT AGTTT AAACATCT G C ATTTTG GG AT CATTTTG AGGG CGGCCT CAGCAGTT ACCTG AGGTCTG AAGTG CTT AGGG

TCCACTGCCTAACI^GGCTGCCAGTCCTTCTGCTAGGACCCAAG 
AAGGAATGGCAGGAGAACACTAATTTTATTTGGTGGGGGTTTGTCCAGAGGACCT 

CTCCCACCGTGAATCACAAAGGCCAGGTGCAGAGAAATGGGTGCAGCGAGGGTGTC 

AGGGCCCTCCTGCCCTAGCTGTTTCCATCACCCCAAGGCAGGGTTCAGACCTTACGG

GATTGGAAGTGACTGGTTACAGGGTCTTCCATTCTTAATGCTCTTCCTGG^ 
GGTCCTTTGAGTAACCTGAGGGTTCTCTGTCTTTGTCTGCACATGGGTTTACACACGTTAG 
AAGAGTTCATACAACACTCCTGGAGGCrrCAGGGCGGGCCCAGGGGCTCAGGAAAGACTAGGTGCGTTCTGG^ 
CCGTTTGTCTGG CTGGT AGCTGTCTCTTGGGCCTTGTAAGCAGGCAAGGGCGACTGGCT AACTGCCAGGG CT 

CCCTGTCCCTTCTCGAACTCCTGATATCTCTTGGGGGTTG^

GCAGTGGAATTTCAGATGTCTAGGGGAAATTATGTCTATCATTAAGTTCTGGGAGAATCAGGTGGAGAAAGCA^ 
GCAAAGGCCAGGTCAGGGCCCAGGAAGG CACTC!AGGTGTTTCGAGTTTGGACTCAAGCCTTAG CAAAATG CTTGTCTTTGGAGT AT 
CTTTACAGTTATGGAATGTCAGGOTAGAAGAGACCTCAGATCAGCAGCT^ 
GTTGGTCACCTACGGTTCAGCTAGGGACACTGAGGCTCCAAGAGTCT 

GATGAGATGAGTCCTCACAGTTAAGATCATTGCTCTCATCTCCCTTGGTGACT

CTCCCTCITTTGTAACACTGTCrCATTTTG 

AATGTTACTTTGCTTAATACATTTTAAGACTATTACAGTGTATCAGGGCCA 

ACACAAGCACAAAGACCTGTGTTTGATCCTTAGGACACACATAAAAAAATTATCTGGTAG 

GATCGAAAGACAGGCAGGCCCCCGAAGTTCACTGACCAGCC!AGCCTAGCCTAAA^ 

ACAAGACATAGTTCCTAAGGAAGAACATACAGGATTGATCCCTGCCCATCTAGTGCATGTG

GGGGGTGCCCTTGGGAGCCAGTGTGCCTTGGGGACCATTGTTTTTGCCATATCAACCTGAAOT 
AGCCTTGGTTTCTTGTCCCAGGTGGTCTTGGGTCGCACCTGTCCT 

CTCCACCCAGGGGAACTGGAGCTGAGGGTGGGCAGATATGGGAATGGGACAATGATGAGAGATCT 
GATGATCTGTGGAAAGGTCCTGAGTAGCATCAGGAAGGGGCAGCTGTCC^ 

TCATGTGCCCCrcGAGCrTGGACACAAGGTGCCTAAAAGGTGGCATCrrCCCAG^

GTAGGGGTAGAAGAATCCAGTGCAGTGAGAAGGAC^VGTGCTC 
AGAACTTTTGCAGCTCAAGGTGTCAGCCAGACCAGGAAGCCACTTCT 
TAGGACTGAGTTCTCCAGGACTCGGTGGCTGTGGCTTAGGGTAGCCCCTGAGAGTCCC 
GGCAAACTC CATG CTTTGG ATCCAACTTCTGAAAGCACACATG AGTAGCAT7UVACTT AGAACT 

GTCCTGACATTCTGGGCATAGTTTTGGACCTCAGAACTCTGGGAATACCTCAGAATT^

AGGCrTTCTGGGGATAGGGGATGGAGTCAGGCAGCTGTGTGTGCTAGAGAGCAGCTGT^ 

GCATAGGGACCGCTGAATCATACTCCTGTGCAGTGTCTCAGTGCTAATCTCGTGCCCATGCT 

CTCACCCCCASCTCTGCTCATTGOQlTGGGACCTCa 

AGGGAAGACAGCTATCATC^TAGTGGGTGAGAACTGGAGAAGAAGTTACAGA 
AGCACACACAATAGCAGGACCTCCCCCCGCCCCTAAAAAATTCCACCTTGCTCGCCTTCCTATAGCTCAGCT
TAGCTGTGCTGTCTCCACTTTCCTTCTGGGCTGTTTTGTTTAGTTTTTC^ 
ATTTGCACTTGTGCAGAGGCCACAGCATGCTTGGGGTGGGGGGTT 
AGATTCTAGGGAATCAATGATGTTTGTCAGGCTTGGCAATAGGCACCTITTAGTCACT 
TTGATGACTGTACAAAGGGCTCCTGGCACTCACCTTGGAGACAGTGTCCCCT

ACTTGTGGTCCCCGGTGAGGGAGGTTCAGTCTAAAGAATCATGAGTTTGAGGCT 
CTAGGCTGAGCTTGTTGCTTAGTAGTAGTTGCCTC 

AATGGTAAGTTAGCCAGJ^GTTTAGGAGAACACTTGAGAAAGTCCTGGGGAGGGTCAG 
GACTGGGATATATGGGACTCTGTCTCAAAAACAGAAAAATAAATAAATGAATAAATGTTTGACTT^ 
TCTGTAGGCACAGGCTTAGCCAGCTGAAAATTGAACATGTTTGGAAAAACTGGGCA
TGGAGGAGGCAGAGGCAGGTGGATCTCTGTGAGTTTGAG^ 

GAAltSCTTGCCTCACACACACTTGGGAAGTAGAGACAGGGGGAGGAGGATTTTAA 
TTAGTCTGCACTACATGAGAGACAGACAGGCAGGCAGGCAGACAGJVCAGACAGAACr 
ATCACAGCCCAGCTGAAGTCCAGAAGTTTATCGAACCTCCTCA 
GTCTTCAGCATTCCTTCCCACAAGGCCTCTGTGCCTGGGCTTTCCTGATGGTCCAGCT
CGATTCAAGGCACTTCAGAGGCTCTTGGGTCCAGAGAATC 

TGCTGGTGCTGCCCACACCCTGGCCTCCTAGCTAGGGGAGGTGGCGAGCAGGGACCTGCCCAGA 
ACTCAGCATCTCCAGAGGAAGGGGGCTGGGGCGCTGCCATGGACCTGGGAAGTCCCTTCTCTGCCTTCCTTG 
CAAGGTGGGTGGGGAGCTGAGGTCTGACCACTCCATGCCATTGGAAGGAGCCCCCTCTT 
CATCATCAGCACCCAGGATGAGTGCTGGTTTCTGGCATCTAACAAAGGAGACCACCTA
AGTTCCTCCTGGGAAGACAGAAATGGCCAGGTTCTGGCCAGGTTCTCTTTTTCTGCTA 
TCTAAGTCTCCTAGTTAACTGCATTCACAGAGTGAAGATTGTCTTCTTTTTGGAT^ 
GAGCTATGAATGTGGCTCAGTGGGTAGAATGAATGCCTAGCATCTGTGAAGTCCTATGTTTGATCCCTG^ 
GGATCTCTGTGAATTTCAGGCCAGCCTGATCTACATAGTGAGCTCCAAAACAGCCAGGGCT 

AACCACAAGAGGCTGGGCGATGGTGGCACACACCTTTAATCCCAGCACTTGGGAGG

CAGTCTGGTCTACAAAGTGAGTTCCAGGACAGCTAGGGCTACACAGAGAAACTCTGTCTCAAAAAACGAAA^ 
AAAATAGAGGAGGAAGAGGAGGAGGAGAAGGAAGACGAGGAGGAAAGGAGTAAGGTAGGGGGAGG 
GGAAAATCTTGCTTTGCCACAGGTAGGAGAGTCTTGGTGAGTGTGGGTGC^ 
GAGCTGAAGTACTTCTGGATGTGGGTAACAGGCAGAGAGCAGATGAGCCT 

TAGCCAAAAGTGACAAGGTACAGCAGTGGCTTTCCACCTTGGTCTAGTTTCTTCCTCTCTA

CTCTCTGCCATCCTTGAGGG CAAGGAGCACAGATCCCAGTCCACTCCATATGGACGTGTCTCAG CTCTCAAGCCAGGACCTTGGAC 
AGTGATTTCTTCATTTCCCACCAAGTGTCTGCTCTCCCCATACT 

CAG ACAG C CATGGTGGTCTGT AC CT AGAGAAG AAAGTG CTG ATT C CG C CTG C AGGCAGGG AGGAGGG CATCTCTGAGCCAGACT AA 
CCATGG AGGT CAGGGG CCTGG AGAGGAGAG ACAG CTGCTG AGAGTGG AAG CAAT C CC CAG C CTACACAGGGCTGG ACTT CTT ATGA 

ACGTGGGGCAAAGAGAGGAAGGATAAAGGAGCAAAGAGCAGGCTCTGTCATGGTGTGCACAGGGATGCA

GCATGTGATGATGTGTGACGCAGGTCTCTGGAGTATTTCAGGAGATAAGAGAGGCCAGCTCTATGGGAAGTAGACCACA 
ATGTGCTGGGGACAGCCAAATCTCTGCATGTGAGGGTTCTACAGTCACAGACTTAGCTCCAGAAAAGTCCTAAGTG 
CCGTGTACAGCAACACAGACCTGTAATCCCAGGATG CTGAGACTACACAGTG AG AAC CAGGTCAGCCAGTGGTACATAACAAGACC 
TTGTCTCAAAGAAAGGAAAGAGG AGAGATGGGTGTTGGAAGCCAGGCCCAGGGGCTCGTGCCTC 

AGATCAAGGTCGGCACTGGCTACACAGCAAGTTTGAAGCTAGTCTAGGCTACATGAGACCCTGTCTCAAAAATAAGAAAAACAAGG

GCTGGAGAGATGACTCAGCAGTTAAAGATCGCTAACTGCTCTTCCAG 

ACAACCATCTGTAATGGGATCTGATGCCCTCTTCTGGTGTGTCTGAAGAGAGCAAT^ 

AAATCTCTTTAAAAAAAATAAGAAAAACATGAAAATGAACATGGTGGGGGGAGAGGGAGA 

ATCCTTGTCATCTAGGTTTGGCATGGACAGAAAGGTGTGTGGACTA^

TCTGCTGTGAGCGTGTGGGTACGTGTGTGTGTGTGTAGGCCAAAAGTTGATGTTGGATGTCTTCCGTCACTCCCC^ 
CGGTACAGGGTCTGTCACroAACCCAGAACTCCCCTGGCTGCCTTTGCTCCC 

CTGGTTTTT ACATGGGTG CTAAGGATTGTATCTCAGGTCCTCACACTCACCTGGCAGGCATTTG ACCAACACCTTCCTCCCC AT CC 
CCTCCTCACATGTTTTTTTAGAGGACATTTGCTTCT 

TCAGCAGCGA.TCCTCATCCTATGTTCTCCGTCAGTGTCTGTATCCTCCAATGCCTATAGA

AGG AATGC AGGT CTGGTTCTT AC ACACCGTG AGT CATGACAGTG TG ACATGTGGTCT ACATC CTGGG T ACATGGTCAACATG CTGT 
CTCTAGGTG CCCTTCCTGTCGGAAGCAGCTCCGGGCCGATCACATGCACAT AG CCC AG AGTCTGTCCGTTTCCAGAGATGGT CTCA 
ATCACCTCCTTGTCTTTCCACCAGGTCCTTGGCTGTGGGACCCCAGTACTC 
TCCCAGGCCTGGTGCCCAAGGAGCTGCGCTTCTGCAGGAACTACGTGGAGATCATGCCCAG^ 
ATCCWKSAGTGCCAGCIACCAGTTCCGAGGCCCGGCGTTGGAACrrGCIACCACCGT

TGGAGAAAGGTACCCTGGGGTGGTTAGGGTCATCTCTATATGGACATCATAAGGCCTTCCAAACTCTA^ 



ATGTTGGGGAGGTGGTTCACGAGCCAGCACCCCTGGGCTGGTTTGGAAAGGAGAGACCGGC^ 

CTGGCAAAG AGCAAATT ATCCT AAAGTGTAGTGGTTAAGGC AGTAATTACCATGTGTTGACCTCCTGC^ CCGGGGGTCAGGGG CT 
GGGTTGGCTCTGCTTCCCATCCACAGGGAGGGAGGTCCATGGGAGGCTGTC^
GCAAAGGCTTCTTTCTGTGGTCTCTGGGACCCAGGTCTGAGACAGAGAGGG^ 
AGAAATAGAAGAGAGAO^AAGATGGCAGAGACTGGGTGGGGGGGAGGGGGCOSAAA 
TCATG CCTTGAACAAGAAGGGAAAAT CATATAATTTAAATCACTTTAGAACATACIAATCTCAGCAGACT^ 

TGCG ACACT AC CACT CCT AATTCCAAAATATTTTCATCCT CT CG AAAGG AAAT C CCAAAC CC ATT ATGCAGTG C CCCTGG CAAC CA 

CTGGTTTGCTTCCTACCCCTGTGGTTTTCTTATTCTGGA

CTCCTTTCACTGAGTGTGAGGTTTACAAAGTTCATCTATGTTGCCACACATATCA^ 
TTCCATTTCAGTGTGTGTGTGTGTGTGTGTGCACGTACTCGTGTGCACCT^ 
AAAGATGATGCCATCTGTGGATTTC^TTTCrTTTATTGGTATATGTTTATTGTATTGGGT^ 
GAATGTATTTTGACCTGACTTAATTCTGCTGCCCAGGACATCAGTCCCCrc 

AGCAGTGGATGTCCX3TGTTTCCTGAGTCACCCAGCCTTTGTGTTTGCTGCAGAGGTCAAAATACAGCACCTGTGGCTTTGCAAGCC

TGGCTAGGAAGCATGGACACTGAACTGTCCCCAAGGAGAACAGCATTGTGGAGA^ 
CCrGTGACCCTGAAGGTGTTTAAATTCTTTCTTCTGGTGAGGTC^ 
GTTCCCTGCAGTCTGACCCrCCCCTGACTTTCACTTCCAGCAGTC 
CACAAGCCAGCTCTTCACTTCATATAATSAC^ 
ATGACCCACCATTGATTGTCCTTCCTGGTGGCATTCACCAGCrCACT
TAATTGCAGCTTGCAACTTGAGAGCTCTGAGTCAGGCTAATTACACAGTTGCTCCCCACAGAACAGCACTGCTGGAG

GATCAAGTCCCTAGGCCGGTACAGTTTACCAAGCATGCCTTATAGTATGTGCACGGGTGGTCCATCCTGTGTCTTCAT 

GATCTGTGTACGTGCAGCAGTAAAAATGCTAGTGTGTACACATGTTTTCCTACA 

TTGCT1TTT Gtt rr IG l"i"ri"l GT CT ATTTGTG'I~ri~i"GTTG CTATT AC AATGT AC CTG AT ACT AATAATTTGAAG AGGGAAG ATCG A 
5 TTTTGGCrCITGGTTTCCGAGGGTTCAGCCC^ 

GTTACCAAAGGTTTCTTCCTTCTVCTGGAACCAGAAAGCACAGAAAGC^ 
ACCCCAAGCACGCTTCTACCTTCriTCCCCCATGAGCTCCCTGTCT 
ATTAGTTCCAAGCTC^GGTTCCAATCACCTCAACA^ 
GGTATGTACATACGTGTGCACACATGGCTTCAGAGTGGGGCTGCACT 
1 0 GAGTTCTA<^TCTTCATCTGAAGGCTGTCAGCAGAATACT 

GACACACCTACTCCAACAAGG CCGG CCATGCCTTCTAATAGTGTCACTGCCCAGG CCAAGCATATT CAAACCACGACAGTTGAGG A 

GCCTTTAGAGATCCTGAGAAGCTGGCCTTTGAAGGTTTC^ 

AGACAGAGCCCCTCCAACCTCCTGCTTCTCTGTGATTGACTG^ 

TT AT AACT AAT ACATC CAC CT CCATGG GGG AG TCTGTGGGAGCATGCACACACAC CACCT CCCT CACAATG CT AAAGTG AGT CTCT 
1 5 CACAGGAGCAAGGAGGGCACCCCATGATTCGCCAACTA 

AGAGCGGGTAGCKSCTCAGTTTCTCCTCTTCACCAGCACTC^^ 

TTCTTGAATTT ATCAGTT ATGTG TGCCCTTT AT AAAAAAAT CTATCTATT CAG TT CTTCT ATTCATG TTl'T AAT AAGGTT AT1TCT 
ATTTAGTTTGAGTTTTGATTATTTGTGTTTTCCrTTAATCA 
GTGTGTGTGTGTGTATGTGAATGCTTATGTCTGCCTTGGT^ 
2 0 TGTGGATCCCAGGGATTGAACTCAGATCGAATTGGGTTCANNNNW 
ACATGCAGAOITTTAAAGTATTTTTTTATTTT^ 

GTACCTGTGTGAAGGTGCCCTTGGAGGCCAAACAGGGTCCAGACCCTCCCT 

TTTAACTTTGATGACAAGTCCCCAACTCTCTCTTTGGCT^ 

2 5 CTCCTCCCrGAGCCCCCCACGCCCGAGACAGGGTTTTTCTGTGTAGCrCTGTCTGTCCT 

GTCGAACTCACAAAGATCCACCTGCCTCTGCCCCTCGAGTGATGGGATTAAAGCG 
CTTTTTCTX^TTTCCTCGAAAGAGTCTTATAGCTTCAAGCCAG 
TGAGCTCTATACATTTGAGAACAACCTGGTCTATATCCTGGTCTGTATGGTGAC^ 
TTTTACATCAAAAAAAAAAAAAAAGAAAAGTCCTATAGCTTTTA^ 

3 0 AATTTATATTTTGTGTGT ATGGG TGTTTG CATACATGT ATG TTT ATGTAC CACATG C CTGGTG CTCACAG AGG TT AG AAG AT CCC C 

TGGGACTGGAGTTATATACAGTTGTTGCCATGTGGCTGCTGAAAATTGAACCC^ 

CCCTGAACCATCTCTCCAGCCCTGCTTGATATATATATATAT 

ATCTTTTTTTCTTTTTTAACTTTGGCTTTGTTTG 

ACCATGTAGACCAGGCTGGACTCCAACTTAAAAGAGATCCATCTGCCrCT 

3 5 CCCACCTCCAACTTCATTCTTTTrTTTATTTTC^ 

GGGGAGCTAAGAGAGTAGTAGAGGCAGAGAAAGGCGGAGAGAAGGAGAGA^ 

GAGGGGGGACAAGGGG ACAAG AGG CAAGAGCATAAGCAAG CAAAG CTCCT^CTTCATTCrTTACTGTGTTGAAATGCAGTTATTCA 

AAGAGCGTTTGTTATAAAGAGTATTGTTCCTTCTTGAATGGCCTTAGCTCACTC^ 

TGATGAACATATAGGCTGTITGCTCCATTCCACTGGTCACTGAGTATTATCATAGC^ 

4 0 TGTCCTGTGTTTTTTTTTTAAACTCAAAGrrGCTTC 

GGTAATGTCCGTTTTTTTAGACTTATTATTTTTAGTGTGTACGAGTATTTItS 
GGTG CTCACAG AGATCAAAAGAGAACATTGGATCTGCTGGAACT^ 
TGAACCCAGGTCCTCTGGAAAAGCAGCCAGTGCTCTCGATTCCTAAACCATCTCT 
CTTAGGGACTGTGTGGGAGAAGGGGGGGGGGAGAAGAAGGGGGAAGAAGGAGAAGGGGAAGGAAGAG^ 

4 5 AGGAGGAAGATGAGGAGGAAGGGAAAGAGGAGGAAGAGAAAGAAAAGGAGGAGGAAGAG 

GAGAGGAGAAGGAGGAAGAGGAAGCAGGAGAAAGAAGAGAGGAGAAGGAGGAAGAGGNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

5 0 CAGTGACACTTGGCGGCTCCCATCTCTGAGTCTTGTACATTGTTTTCTAAA 

TGACATTCTTTTCTTTTTAAAAATTAATTAATTT^ 

CAGAAGAAGGTGTCGGTTTTCCCGGAACTGGAGTTACAGATAGTTGTGAACT 

CrCCAGGAGCAACTAGTCTCrCTCCTGGGATAACCATCTCTTAGCCCCATCTTGC'r 

AACTCACAGAGATCACCTGCCTCTGTCATCCAAGTGCTGGGATTAAAGGAGC^ 

5 5 CGTTTTGTATTGTGTTTGGATTGTTTGTTGCTAGGAGGCAGAGATTTGTTGAGCTCAATGAGG 

CCGTGACCTGGCTG AAGCTATTGATAGGCTC CAACAG^TTGTTTGTTGTCTACTAT^GGTTCCTTGGG CTGCTCTACGCATAGGGT 
GATCTCACCTCTTCCTCCCATGCTGGATGCTCTTTATTCCTre 

AAGTG ACTG CTTTCTG C CTT CC CCAT AGACAGTG AT AG CTGTGGG TCC ATCCAGG TTTT AATTTTGT AGATTCAG AAG T AAG AAC C 
CTTTACTCGTGTATAAGTAGATAAAGGATTTTTTTAAAAAAATATTTTATT^ 
60 ATATACACCTATGTGCCACATTCATGCAGTGT^CACAGTGTCCAG 

GCCCTTAAAGTTAGCAGTGTTTTAAATTTTATTTGTTTTA 

TGTGCACATGTGTGAACATGTATGCAAGTGTCTATGGAGACCAGAAAGGA^ 

ATGAAGAGCCATTTGGGTGTTCGATCTTCTACAAGAGAAATACACACTCT^ 

6 5 TTTTCAATAATCCATTCTTTTGATCTTTTGGAATTATACCAACATTTTAATC^ 

TAGTGGTATACATCTTTAATCCCAGTACTTGGAAGTCAGAAATAGATGGATCTTTG 
CATGTCTCCAAAATAAAACAAGTTCrCCAAATTCATGAGC^CACIAACT 

AG CATCTTACT ATGTTGTCITGGTTCTTATTG AATGCCCCAG CTA 
CAGGCATGGGCCACCCATTCATTTTCCl'T rL M XU'AAGTTCGRCGGCAAACATAACTG<]AGGTGAGCAGAATTTCA 

7 0 ACTAGCGGCAAGCACCTTCTTTCTTGATTTCTAGGACAAATT 

ACCAGAGCTGGGACACGAGAGGCTCTCCAGTGGATATGTATATGATGGCAAGCGCTCCATCCCACGCT 
TGTGCTTTCAGTCAGCACTCTGCTGGGCAGCCCCGGTGCTCT 

GGGAGCCCAGGGGTGAGAGAAAAAGGAGCAGGGGAAGGCAGTGGGGCTCAGC^ 
7 5 GGAAGAGAGAGGATGGCTGTGCCTGAGGCTAGAGCCCCGTCACATAATTTTGGGCGAGGTTAGGG^ 
CTCCCAGTATGCTCTTGGGCAAACACAGGCTGTCTAGGGACAAGCAGCTGCCACAGGTCCTGCCCT

AGCCTTGGGGCTGAGGGCCGCGCCTGCCCACATTCCTCTGCCT 

TAGGCTGTCTGGCCTCAGCATCCCATCCCTCTTTCCC^ 

ACTGTCCTCCCTGCAAACATGACTTAGGCCTCTCTCAGTTTGCTGCTCCCrrCTTCA 
5 ATCCTCCTCCTCCTGCnTATGCTCCCCTCCTCAGTCCTTGTTCTCCTT 

GCCCCCAGAAAGGCTGGCTCCAGAAGTCCTTGTGCCCCCCACCCCCTGCTGCCTGCTGGCCT 
AGGGTCAGGGCAACCCTAATCATGTCAAAGGTTACCCTAGG 
TGGCCTCTGTTGAACAGAGTTTAACATCTACCAGCCAGCTGGGAGGAAG 
AGCTTGTGCAGCCATGTGTTATAGAAAAGCGTGTGTGCAT^ 
1 0 GCACTTGTGAATATGAATGTGTTTGTGCCCATGTTAGTGAATATGTCTGTCCGAATGTGT 
CATTTGTGTGGATGTGCCTTACGTGGGTATGTGTGTAAGTATATGACTGCATGTG 
TGTCAGTGCACATATACATTTGTGCACCTGTGTATGTACTCACAGGTATTGATTATATAT 

CACAAATGCTCGTACATGTGTGCTCAGGTGTCTGTCTCrGTGTGGGTGTGACCATGTACACGGTACAGGCGTGTG 

ATGCATTGAGCATGCATGTGTGAATGGGTGCCTGTGTGTATACAAGGCAAGTGTTTGCT 
1 5 TGTGTC^GCCTAAAACTCAGGATGCTAACCTGAGCGTGAGGAAGCTCTAGCCCTTG 

TCITGCACCCAAAAGTAGCAGGGAGCGACCTACCTACCTACTCGG^ 

TGAACCGTTTATATCCCTTGAGGCAAGCCTCAGACACTTGGAGGCAGGTAC^ 

AAATGGGTGGGGTGGTTGCAGGGCCAGCCTGGGCCAGAGGTAGGGGTCACAGGACGA 

AGGGGACAAGTGGGAGGCTCACAGGATTGGGGGGTGGGGGCAGAGTCCAGGAAGCAGGTT 
2 0 GGACAGCTTTCTTTTGGCTGTGTGTGACATTTGCATGTAAGACA 

TCTCTACCTGCCTGTGTACGGGGGTGGCTGTGCCTGGAGTC 

TACCCACCTTGCAACACTCCACCCCACTTGGGGGTGGTCTCCCCAAGTCCCT 

GGTCCCTTCCTCCAGCCTGTGACATACCTCCAGGCC^GCCATACCCTCAACCCATAGTGGCTG 

2 5 TCTGCCACAACTATTAGCAGAGTTGGTTGCTTTTGAACATCAGACTCTCAGTATCTGCT 

CTGGGCAGGGAACAACAGC^GGCCTCTTCCCCTCCCCCAGGCAAAACCCCTTCCTGATGACTGGCC^ 

GGGGGTGGTCCTC CACCTCTCTGTGTGCTCAGCCT ATGGAAGG ATCGACTC CAATGAGACTAGCCTGCAAGACAGCCGGAGCTT AG 
GGGTTCCGGAGGGACAGAAAGGGCAGCTTTTTAGGGA 

GTGTTACCATTTACGATGCGTTGGGGGTGGGGGGGCTCCCCrGGCTCCGGGGGCCTCT 

3 0 GAGGGGGGACCCCGGCCGATATCCCACTGCGCTCGGCTCGGAGCACAATGTCCCTGCGAGGACGACAAGATTAAGC^ 

GCGGCGGGCIAGCGCATACTTCCAGGTTGCTATGACTGG CCGGCGATCGC CT AATCACCGGCCAGGGGGAAATTCATATAAAATATC 

GCTGGCCATTGTGGGGCTAACGCGGCGTGACAGTGTCGCACAAAGCCGGATTCATCAGCCCCAGCAGGT 

GCGCGGGCCCGGGCGX3TGTCAGGGGCCGCGGCCACGGTG 

3 5 CTCCGTGTGATTACCAAATGCCACCGCGACACCCCCT 

GACGG TGGG CAGCCTTG G AGGGG AGG CG CCAGGG CGGGG CCTGGG CTGCAAGCT ACCAG CTGGGGCTG CT CGGTTCTCT CG TCT G C 

CCTCTGCCTGCTGTGCTGAACACGGGTGGGGGAAGGGATGCCCACCCACTGAATGTGTTATGGAA 

GGAATGGGATAGATAGATGGTTTGTCTTTCCTCAGCGTGACAGGAGAGTGACACTCCC 

GCTCTGGCCAGTCTG AGCAATTATGG CTATGCCATAGTTGGGCCTTCTTCCCACCAGTGGT CGGATAGTGATCTCCCAGTTGGAGT 

4 0 CTGGAAATTGAGAGGACAGAGACCATCTGGAGGTTGTTGCCTGACAAACTCAG 

CCAGGAGGGTGTGGTGGATGTAATGTGCAGACATAGGTAAGGGTGAACCTTCTGTCTGCTGGGATTC 

AGGGAAGCTTCAACAGAGATAGAGGGTCCTAAATAAACGCAra 

ACCQVATTTCAAAAATAAATAAATAAATAAATA 

TAATAAAAATATAAAAATAAAAATAAAAATAAAAAGGTACTACTGGATTTAAAGGCACAGATTAACCTCAACTCAT 

4 5 CAATTTCAATAAGCCCACACACACATGCACACAC 

ATTCATTTCACTTCTAGTTCTGAGCCTAGTGCCTAGCTCATTATGGACT 

GAATGGATGAGTGGGTGGATCGATGATAAATGGGTAGTAGATGGATAATGAGTGCGTGATAGATGGACAGTAGACAGATAATTGAG 

TGGGTGGTATTTTGGTGGAGGATGGGCGTATGATAAATAATGAACTAGATGAAGGGAAAGGAAG 

CCTTAAACCCAGTAAGAGCGCAGCAAAGOTGAGATTGTTTCTGCAGGGGGACTTTG 

5 0 ACAGTGGCCTCAATTCATAATCCTCCCACGTCAGrc 

TGCITCTri T ATAGAACAACTTCTTCAGGTAGlTTCriTTCTTTATCTTTA 
CCAAAAGCTCTTCTTAGCTCTGCCCCCAGATGACTGGCCCAGATGGGCAACA 
CCCCTCCTCATTTGATGATCAATAGCCATTTGAAGGGTCTC 
CTCACCTTTATGCTGGCCATTrCTGCTGAGTCAAC^GAAACTGAAAGGTGCTGTG 

AAACCCTATGGGACTCTGATGGAAGCAAAGGGTTTCCTCTCTGAGA 
CCTGGGTAGACCCAATCTTGTTGGATTAAAATCCCTGCCATCCTA 
TCCTGCAGACATGTTCTCAGTTATTATTAAGTATCTTAAGTATAGCGAAGCA 
TTCCTCTCCACACACACAGCCTTGCACACTGCTGTGGTACATGGTAATTGATCATGAAAGC 

6 0 TTTGTGCCTCCATGTCCCTTGAGTTCCACCATCCTTCCCTTTTGTACT^ 

GGCTTTAGCAGCTGCCTCCAATTCATTTAAGTAGAGACTATACATTTTT^ 

GAGGTAGATGGTATCTATTCACTGCTAGGAATGTGAACTCC^ 

GGGGTGGGGAACTGCTTAAGACTTTTGGCTGCAGCITGCCT 

AAATGTGGGTTTGCAGCCCACAGTTCCTCCCCTGGCTTCTCAGATGGAGTTTCTGGACT 

6 5 ATTTCITCCCCAGCCTGTTGAAAGCCAGCTCCAGAACTTGCCCTGCCTTGACTT 

AG AGGAAACTGAGGTTCAACAGAAAGGG AGGACTTCCCAG CCTG CTGGTGACAG AG CTG AG CAAG AATGAACTCTTG CCT ACTT CA 

AGTTCTATGTTGTTCTCTCCTTCTCCCAAGGACCCTTCCCCTCGAGTGTGCCTACCAGCT 

AGCAGGGAGCCTGAAGGCCTTGTTCTGGGCAGGGCAAGGCCT^ 

GAGGAGCGTTTGGGTGACACTTGAGCTGGATAAGGTAACCAAGGCTGCCTTGAGACTTGGAATGTGA 

7 0 GACTTCTGAGTCGCACCCAAGAGAAGGTAGCTCAAGGCCTCTGGCCCCAGACTCAGGTTCTATGCAGAAAGAA^ 

CTT AGGGG CACGAGGAGGAGTGCCTTGGCTCCT^ 
GGCATGGACAAAATCCCCJ^GGTTTCATTTC 

GAGTTCAAGGTCCTTCTAGGCTACACAGTAAGCTCCAGGCAAGCTTGGGCTGA 
AOajLajL&TO.ACGGeTCGAAG 
7 5 TCTGGACTGGAAGTGGATTCCAGAAGACAAGTCTTCCCGAGGAATAGGCACAGCTGACA 
GGCTGCCATGCTACAGAGTGAAGCTGGTAACACTGAGGCTGGGGTAGAGCTTGAGCCCAGGGAAGAATGAAGGCTGAGTGTGACCA

GAGGCCAGTGAGTGCTGAGAAGAGGAGGAAGACAAGGCTGGCGGCAGGAAACCTTTAGAGAAC^ 

TGGCTCTGGGCACXX3AGCTGGGCCTGCAGTGGGTGGCTGTGTC 

GAGAGCTTCTCTGTGGTTCTGGTAGTTCCTTGTCCATCGTAATGTGGGAAGAGTGGGA^ 
5 AGTTGTCCAGGCTCTGACAGTGCCCCAGGTTGCGGGCAGAACAGCTGGAGTCT 
TTCTATGTGTGGACGCTGACTGAACTTTGTGGGTCAGTCTGAGTGTC 
TTGGGACACACAGAGTCCCGAGAGTCTAACCTCrrAGATGCAGCTTGAGG^ 
AGGCCACAACTTTTTATGAAAAGTCTCACAATCCCAAGA^ 

TCTGTCTGGAATGGTTT CT AG CCGGG TGTGTAAAATCT CTGCT AGTCAACAAG T CAGATTTGT ATC CAT CTT ACTCAGAACAC CGT 
1 0 GCTTATCATACAGCAAGCACTTGGCACACCITTATCTGGTGTGAAG 

CACAATTGGTGTGTGTGTGGTGGGGGGTGTGAATCAATGCAACCACTATAGAAATGTT^ 
ACTACCACAAGACTCTATTTCACTTCTGGACCTATATCCAGAGAACTCTATACT 

ATGCTGCTTTATTCACTATAG CAAAGAATTGGAATTAACCTAGTTGTCCATTAGCAGATGAATCGATAGTGAACATTTTG CAT ATA 

TAT AAAAATGGAT ACT ATT CA.TCT CTTGATTT ATTTTTTG CTG C CTTTTGT AGTCAAT TT ATCACAATATT ATCA 
1 5 TTTCAAGATCCGTCCACATGATTTGCCCTCTCAATCTGAGGCCCCTGGTCTAGACCTA 

CAAAGTCTGGACTGTTGTTTTTCCTGTTTATGCAGACCGGTGAT^ 

TCTCCAAACAGAACAGTTACAGTACAGCTGTGGTANNNNNNNNNN^ 

GTTCTGGGACCTTAATGTAGTCCTCCrTGGGACTCTCCAGGA 

TAGTAGAGTCGCTTACAAGCTGTGATCTGGATAGTACAATG^ 
2 0 CTGGATCTCTCAGCAGCTCTTCAGTATACACTGGAATCC 

AGTGAGGGCAAGCAGGCCAAAAGCAGAGGTGCTTCCATGTCC^ 

GGTTTTCTGACCTCAAATGATCCAATCAAGAAAAATCCCTCrrCAGTGTGTC 

GAGACAGGCAGATCTCTGTGTGACCTAAGGTAGCTTGGTCTACATAGTGAGTTGC^ 

TTAAAGCAAAAAAGCCAGACAATGTTGACACACACCTTTAATTCGAGCACTT^ 

2 5 GACAACCAGAGCTACATAGAGAAACCCTGACTCAAAACAAA 

TT AACTCTAGATATACTCAAGTTG ACAACCAAGAAT AGCTGT CCTGGTTGG ACTTGAACT CAAAGCAATCCTCTTGCCTCAGTTTT 
CCACATGCAGGTATTACAGGTGTGTAACATAAGTTCTGGTTTAAA 
TAATTTTATGTCAAAGATCTAATITTCCTTTTGTTGTTCTGAG 
CTAAAGATGAACCTGAAGCTCTGCTCTTCAGCrTCCACCTCTGGAGTC 

3 0 GCAGGGCTGGGGTTTGATCTCTATTCCTTTGGCATCTCTCT 

TGAGACAGCCTCATGTAGCCC!AAGCTGGCCTCTATGACCTTGAATTTTTTATCTTCCCGC^ 
AGAGTGTACAGCTATGCCCCAGTTCATGTGCTGCCAGGGATGTGAACACAGGGCTCCATTC^ 
GAGCTGCACCCAAGGCCCTACTTCTGTTAGCATAATTGTAAGTATTTTTAAGGTGCT 
TTTAAATGACCCAATAAGGGTTCrTAACATGTCCATCAAGAAC^^ 

3 5 GTCCACTGTCCTCACATGACTGTTCAGTGGCACACCAAGACCCCTTAACCATCATCT 

CCACCACCGTCCCAGGCCCTGCCTCCGTAAGTAACCCATTCCACTTCCTGCTAGGAACGGAGTGC^ 
TAGAACATGAAGTTCTTGTCirCCTGTGCCrGGCTCATTCACTTATGTCCTCTGGGTCT 
TCATTCrTTTGTAGTGTAGCCrTGGCTGCCCTGGAACrCACTCT 
GCCTCCCAAGTGCTGGAATTAAAGGCATGCACCACCACTGCCTGGCTAG 

4 0 GTGCACACTACAGAGCTATCGTAACAAAGCACATACATACAGTGCT 

AGTTCGAGAGCTAACTCCACATAGTCCCGCCCAATTGTTTTTTCAGCAAAAGGTA 

ACGAATGGAGCTGGGAAATTGGAT ATCT CTATCT ACGT AAGAACAAATTCAGT CCCTCCTGTTTCCTATG CACAAAACCCAACTCC 
GTGTAAGACCTGAAACTACTGGGACAAAAAGCATAGGCCTGGGCATGGTGATGATGCCTTTGATCCCTTTTGTGAGTTCGA 
GGCTGGT CTGCAT AG AGCATCCCAGGCCAGCTTGAGAGATAGAATGATT CCCT ACCTCAATAACAACAACAACAACAG CAACAACA 

4 5 ACACACACACACACACACACACACACACA^ 

CTGGTTTTGGATTCCTGCCTITTGGACT AC AACCCTCCT CIACCTGGCCTCCAT AACCCCCTATCTCCTTCCTCTGCAG CCACCCGG 
GAGTCAGCCTTTGTCCATGCCATCGCCrcaSCIGGA^ 

TGGGTGCAGCAGCCGCCTCCAGGGCTCCCCAGGCGAGGGCTGGAAGTGGGGCGGC^ 

5 0 TCTCTCGGGAGTTTGCCGATGCCAGGG AG AACCGGCCGGATG CCCGCTCTGCCATGAACCGTCACAACAATGAGGCTGGGCGC CAG 

GTATGTGCACAGAAGCCCAGGGCTGTAGAGCAAGGGCTGCTTTTTCACCGGGATGTCCAGTGGGGCAGGA 

GGTGCTCTTGGTGACCAGGCTGCTCTGCTTGCCTTCTATCAAGGGAC^ 

AGAGAATGCCAGACTCACCTGTGGACATAGGGTTCCTGGAAGTAGACAGGTC^ 

ATTCCGCGATGAGGAGATGTCCATGAGAGGACAAGACGTGAAGGGAGGAGAAGAGGGATGGAGGGCAGACACGTATA 

5 5 GATTGAACTGAGATAGAGGGTTAGAGTGATAGAGATGATTGATACAGGCrACT 

GGGGAGAGAGAGGCAGACAGACAGACAGACAGACAGAGAGAGAGGAGAGAGACAGAGACAGACAAATA^ 
GAGAGATAGAAAGAGACAGTCAGGCAGAGAGAGAGAGAGAGAAGGGAGACAGAGATAGACA 

AGAG ACAAAG AT AT AGG AAG ACAGAGAG AG AGGAGG G AG AGAGACAG AG ACAGAAAG AC AGAC^ C AG ACAGAG AG ACA 
CTCACAGAGAGAGATTATCACAAGGATCTGGAAATACAGAAGGGAGCCATGCTA^ 

6 0 CCAGGTTATACCTXSGATCTCIAGGAGCrAAC^CTGGGATTTCAGGAGAAGO 

CACAGCAGCTCATCTTCAAATAC^CTTAGCCTCCCTC^ 

CAGCAGCATAGCAGAGCCAAGAACTGAGTAATGGGCTGGAGAGCGACCTCGAACCACTCAGGAATGCATATCAT 

ATGGAGAGAAACAGAGCTGACTGTGGGGGTCAAAACAGAGAGACTTCTTCA 

TGGTGCAGGCCTGTAATCCGGACGCACAACIAAAGGCAGGGCAGGAGGA 

6 5 CAGTCTCATACTTAAGACCTAGTCTCAAAAATAAAGAAGAGAAAGAAAAGG 

AAAGGAAAGNNNNNNNNNNNNNNNNNNNNGAAAGGAAAGGAAA 

TTCTCTGTAGCAGACATTCAGAGATGTGCTTTTCTGAGGACAAACTGTCACAGTGAAACTTACT^ 
CCTTTGACCTCTTCCTAAAGCCCAAC^Cn'AGCTCAAAGGCTCTGAC 

GCTCTTCCAGCATGAACAGAGGGGTGCCTAGAACAGGGATCACTTTACAGTGAGTGTGTTC 
70 AGGAATTT CCAGAAAGCAATGTGGGGTGCAGGGGTAGTGGTGCAAGTCTTrAATCTT AG CACTC CAGAGGCAG ACTAGTGGTTTT C 

TGGGTTCAAGGCCAGCCTGGCCTATCTAGAGAGTTCTAGGACAGCCAGGATCTCTGAAT^ 
GTTCCAGGACAGCTGGGCACAAATTTAATAATGT 

AGACAAGGGAGGACATGAGGAGAAAGGCAAAGACATAAGGCAGGAGCTTTC 
GCIAAGGATGGGGATAGAGAGAGAGATGACTTGGGTCAAGA^ 

7 5 TACACCAAGGGACTCATAACTGATTGTAACTCCAGCTCCTGCTACCTTCTGG 
CACACACACACATAAAACATTTTTAATCTACAAAAATAGGGAAGCCATC^

AAAGAAACCCTCCAACCTTCAAAGAGACCACITGATGTCCCTGCAAA 

GGAAGCTAGAAGAAATTTGAGCAACACTTTGGGATTCCGAGtGATAAAACGATTTTAATC 

TG AAAG ATTTGCTTTTCT ACT ACATT AG AAAAGGTTTG ACTTT AGACAG T AT CATTTCCCAAGG TTT AC CAGGTGGCTGTGAG T AC 
5 GTAGGGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNIWNNNNKNNNNNNNNNNNNNNN^^ 

NNNNNNNIWGTTTCTCTGTATAGCCCTG^CTGTCCTGGAACT 

CTCTGCCTCCCAAGTGCrGGGATTAAAJGGCGTGCGCCACCACCXSCCCTGCA^ 

TGATAGGACACGGGTAACTAGCTTAGTGGTAGATCTTTACCCACT^ 
1 5 GAGACAAGGTTTCATGTCTTCCAAGCTGACTTGAATTT^ 

GACTGCAGACATCTACTACCACITTTAGCTTATGAGATGC^ 

CAACTGAGCTTCATCTCCAACCCTCG(X3ATTGGACA 

ACGAGTGTGTTAATCACATAACAGAAAGTTGTATGGGAAAGGAAATGTCACC^ 
GGCAGTGAGTACTGAAGGGAGAGATCACGCTACTGTTTTCAGGGCGGGAACGGGGAG^ 
2 0 GCGTAATAATTTTCACACTTCATGGTAAGAAAGCAACAGAAAAT 

GACAGCCCGCATACCACAAGGCACTTCCITGCGTGGTCAGGAAACAC^ 
GCAGGTCTCAGTCX3CTATTCAGATTCTTGACCAAGTAAAACAAAACCAA 
TCAGAAAGCCTGGCTGCAAGAGAGAGGGATGTGGGCGCTGCACT 
GGCCAGTTTGATAGGCAGGAAGGGTAAAGAAGCAGTGGAAAGTCAGGTGAGGTGCAA 

2 5 TTAAGCCCATGCTGTCCAATATGAATCCTTGCAAGCCACAACCTGAGCT 

AAGGAT AG CT AGAG CTGGACGGTGGTGGCACITG CCTTT CAT CC CAG CACT CTG AGACAG AGGCAGGTGGATCTCAGTGAGTT AGA 

GGCCAGTGGTCTACAAAGTTCTGAGACAGCCAGGGCTGTTACAC^ 

CACTGAGAGTTTGGAGAGATGGCTCAGTGGTTAAGAGCACTGA 

GGTGGCTCACAACCATCTGTAATGGGATCTGATGCCTACTTCTGGTGTGTCTGAAGTCAG 

3 0 AAAAAAAATCTTTTTTAAAAAAAGAATACACAGTTAAAATACACTTGGAG 

GCCTCCAGCCCTCAGT ACCCCAGCCTT AGCATCTCCAGTGCCACTGTGATTGTTC^ CTCCTCCAG CACGCTGGGCTGG 

CTTTGATOTTCATCACTCTCTCCTCCTTCTGTCCATTT^ 

GTGGATATAACCCATGTCTCAGACTCATGGTAGTTTTACAGAGCAGGTGGGAAGCAGGCA^ 

AACCT AAT AGTTTC CTGCCT CGGTTT AAAATGTTTT AGCTGT TAATGG AACAG ACTT ACTGGGTAAAATGAGAC CTT CAAT 

3 5 ATGTGAACATAATGCACTGTGGCCAAGCAGGCTAGGAAGCATCCTTATCGCCTGAAAAAT 

ATCTGCCATTCCAGCACTTGGAGGCAGAGACAGGAGA^ 

CATCCAGGGG CT ACAT AGTGAG AAC CT GT CT CAAAATGAATTTTTrTri' CTG TTTGAATG TTCAAAACCT ACT CGTCATTTTG AAG 
CAGT ACTTTGTCT ACTG TGGTGG ACGC1TTGTGCT AT AG AAT AC CAACAGATAT CTATTC CTCC CTT C CAATGATG AC CTCTTCAA 
GCTCCTACTACTTCITAGCATTGCCACACTTGGGACCCAGCATCCAGCATC 
40 AACCCCATGGCCAGCTTGTTATTTCAGTAGTGCAGTGGATC 

G CTT AAAAG ACT CAGTGTGTG CTTTGCACATAAC GG TGCTTG TTGG AATGT AAG AGG AAAGACACCAGGAG CTGT CCAGAGCAG AT 

GCTTACAGCCTGGGCAGGAAGGCTACATGTCTGGAGCAGAGGAACTGG^ 

CAATGGACAAGGCCCCCTTGTCCCCACGCATTTCCCTACAGAAAAGG 

CCTGTAAAGGGAGCAAGGTCT ACATTGTTCATAGGGAAAG CTGCTGTCCATCCCGGC CCTGTGAGTATAGGGAGCACTCTAAGCAG 

4 5 CCCAGCTCAGTCTTCACACCGTAGCATCTGCTCTCCACATTGTCCCTCCAGC^^ 

CGTCCTAGACACTCCTGCAAAGAGAATTAGCTAGACCCACAGGGAGAAAGCAGCTCC 

GAAGACCCAGGTCACCATAGAGAAGATAGTAGCACTAGGTGACCAAAGAGAGGAGTCTGGGG 

CTGCAGAGAGCAGACCCCCTGGTTGAGCCGCGCTAGGGACCAGTGCAGACACACTTCTGTC 

CACATG CAC CTCAAGTGCAAATGCCACGCGCTATCTGGCAG CTGTGAAGTGAAGACCTGCTGGTGGTCGt^GCCGG ACTTCCG CAC 

5 0 CATCGGGGATTTCCTCAAGGACAAGTATGACAGCGCCTCGGAGATC 

CCCTGAGGCCACGTTACACGTACTTCAAGGTGCCGACAGAGCGCGACCTGGTCTACTACGAGGCCT 

AACCCCG AAACCGG CTCCTTCGGGACGCGTGACCGCACCTG CAATGTX1AGCTCGCATGGCATAGATGGGTCCGACCTGTTGTG CTG 

CGGGCGCGGGCATAACGCGCGCACTGAGCGACGGAGGGAGAAATGCCACTGTGTTTTCCAT^ 

AGTGCACACGTGTCTATGACGTGC^CACCTGCAAGTAGGAGAGCTCCTAACACGGGAGCAGGGTTCATTCCGAGGG 

5 5 TACCTGGGGGCGGGGTTCCTACTTGGAGGGGTCTCTTACTTGGGGACT 

TCTCATACCTAAGGACCCGGTT^CTGCCTTCAGCCTGGGCTCCTATTTGGGATCTG 
GGGATACGGGTTTCTGCCCGAGGGTGGGGCTCCACITG^ 

GAGAGATTAGGGTCCCTCCTCCCAGAGGCACTGCTCTATCTAGATACATGAGA 

6 0 GGATCCAGTGGGGGCGGGGCTTCACCCCGACTGGGTGGAACTTT^ 

ATGGGATGGAGCTCCACGGAAGGAGGAGTTCC!TGAGCGAGCCTGGGCT 
GTCCTGGTGTAAGGTTCAACCTGCAAGCCTCATCTGCAC^GAGCAGGATCTCCTGGCAGAAT 
TGATACCAAGACCTAACAAACCCCGTGCCTGGGTACCTCrrTTTAAAGC 
GCAGAGCTTTCCTGAGGAAGATTTGCAGTCCCCCAGAGTTCAAGTGAACACCCATAG AACAG 

6 5 GGGTTCTCTAGGAACCTCTATGGGGGCTGCTAGGAAGGATCCTGGGCATC 

TTAATACTCAGATCTCCCGGGAAACCCAGCTCATCCGGTCCGTGATGTCCATGCCCC^ 
AGTTGTATGAACTTCGGAGACATGGGGACACAGTCAAGCCGCAGAGC 
CTGTTGAGGCAATGGTCACCAGATCCGTTGGCCACCAGCCTGTCCC^ 
ACATACAGGTCATCTGCCACAAGAGCrTCCTGATTGCTACCACTGTGAACCGTC 

7 0 CCATACAGGAGTGTGCCTGGAGAGCGCGGAAAGAGGAAGAGAGGCTGCAAATGCGTGGTGACT 

G CGTT CG CGCTTGT AACTTT ATTTTCAATG CTG CT AT AT C CA C CCAC CACTGG ATTT AG ACAAAAGTG ATTTTCTTTTITTTTTCT 
rJTCTTTCTATGAAAGAAATTATTTTAGTTTATAGTATGTTTGTTTCAAATAATGGGGAAAGTA 
TGTGGTTTGTTCTAGATTTTCCCTTTTATTAACIAAGCATT 
TCCCCACCCCCAAGGGTAATTCACACCTCCCTGTCCCTT 
7 5 AGGTTCAGGGTCCCAGAGCCAAGTAGAGACTGTCGACAAGCCTCCAGGGAGGGAGGTGGGG 
AAATAGCTAAGTCAAGAAAAGAGCCX5GTCAA.CTGTAGAAAAATCTGGAACAGGGTAGATCTGAGAGCTTGAATAGAGGGGTGCAGG
TGGGTTAAGTTTAAGCAAGTATGGGAGCGGGGTAAGCATAC^GTAGTT 
ACAGGAGAAGAGGCTTAACCAAGACCCCCTCACACGCTGGGGAT^ 
TTCTCACAGCCCATTCCCCAGAATCATGTATTTAGG^ 
5 CTTCTTATGTGTAAAGTCAACAGATCTTAGACTTTAGTCCTCAGACCCTGTCACCAGGAATGT^ 

CAATGGGAGGGAGGGGTCTTTGCCTCAGGAGCTGCCATGTTCCCAC^ 

AAGGCTAATCATCCTGTTGGCAAACCAAATGATGGGGCAGGTGTCAGGGCGCCCTGGGAGTGAGGCTAGAGATCT 
ATACTTCTTTTAAGGCCACACATCTGCCCAAGCAAGCCTACCATGTCTCTGCT 

TGTCCCCACCTCTGAGTCCTATTCTGATGTCCAGATGTCAGGGCAAAGGATGCCCTCCAAAGCTGAGA^ 
1 0 CCTCTCAAAACCCAAGCTCAAGCCGAGTATAGCAGTGCACGCCTTTAATCCCAGCACT 
GTTACAGGACAGCCTGGTCTACAGAGTGAGTTCCAGGACAGCCAG 
AAAAAAATTAAAATTAAAATTAAAAAAAAACAGGGGGAAAAAAGCCCAAGT^ 

CCCAACCCAGATCCAAGTGATTTGAAAGCCAGTTAGAAGATGGCCATAAGAATAAGCCTAGGAAGTGTCTTATC 

CTTTAGCTGTCGTGATACATCATGACCTCATGAACCrTAGAACTAGGAAACCCTGCTGGCCT 
1 5 GAAATATAGTCACTGTTGACATAGGGGAATGCTGGTTAGGACAACCAGGAAGATCATTAAGTGTCCTCT 

AGTGGGACTGCCTGTCCCAGAAGAGGGGAAGATAATTCCCAGACCCrAAGAGCT 

AGGCAAGGCCCTTCTAGACCCATTTGATTCAACACTCTAGAGTTGGTCGGTGATGACAGCTAAAGCCG 

CCTGTGAATTTGTCCACAGGCCTGGTAACAGAATACTATGTCACACTGCCTGAAA 

ACTt^TTTTATGGCAGGACCACTTAAGGGGGTCCTTAAACCTTCAAAGCAATCA 
2 0 GTATCCTT AAAG AAGAAT ATTGAGGAGCCTCCAGG TCTTGGGACATAAAGATGAGACCAGGCATGG CT CCACTG GATGCTC CAGG C 

ACATCTTTTGTTCACCAACCCACTGGCTAAGCrTTCTGATCT^ 

GTCAGCTCAGTCTACCACACTAAGATTCTAGACCCTCCGTGGTCACACTGTCATACCTA 

GGGTACAAAAGGAATAGCCAGCCTGGCTTGAAGGTATTGGC 

CTCATGATTGCCTGGGATACAGAACAAGGTGAAGATCAGCTAAAGCGATTGAGTC 

2 5 AACTGGGGTGTACCTCAGTGACAGAGGACTTTCCTAATGTGTG^ 

AAAATCACAGACTGTGAGGGATGCTGACAGCTAAGTGGCCTTGGGAAAGGAGTGCCATCCTGGGGCTCTCT 
CTGTTTGGGGTTTGTGGAGGAAGAGGCCCAGGTTCTGGATTCTCAGTACCT 
GCAGGCAGGCACTCCAATGACATCTATGCCTTCAC^CGGCTCCCTCTTCTCATTCTCAGGGCT 
TGACCTACTATTGGCAGGACTACCTAGGATTTCCCAGTAAGGGCAAGGGTCAGTG 

3 0 G ACACTCCAAAAAG AGGATG CCATGGTCCAGGGCCAGCCAAGGCAGAAGG 

CAGGTGAGTGGGCAGGTTCAGGAAGAGTCCAATTAGCCCGCTGGGCCATCTTGGGTGCT 

GGCAGTGATGAGAGTCCTCAGTTAAGCCTGTCCGACAGATGAGTAGTCCCTCTAAGTT 

GCTACCTGCTGCCCTAGTGCCCAGATGAAAGGGACAGACTCGACTGGGAGGCGGCAGGGCAGTGGC 

G CCTGG AAAGTCAGAGGCCAATGGGT ATGGGGTGCCAGGCTGTCTCAAG ACCCCAGTA CTTTTGGGG A 

3 5 T GCAGAGCCCATATGCCCAGGCAGAAGA CAGAT CACTTTAGAAGGTCAAAGGCT AGG AAG CATATAGC CTACAG AGGAGGCAG CAT 

TGGCCTGAGGGCITGTCAGGGACACAGGTGGGCrCTGGTGTCTGG 

AGACTACCCTAGAGATTGTCTTGGCrATCCACAGGAGATACACGACTA 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
nnnnnnnnnnn} ^^ 

4 0 GAGGAGTGGGAAATCCCAGGCCCCACTCAGACCCAAGCTCGCAGCTGTGGGCCTCAGAAGC^ 

CTCCCGCTGCCCTGGAGAAGGAGGAAATGAGCGGGAGGCCTGTTCACAGGTAGTTGAGACAGCCCCCGAATAA 
CCGGAGGCTGGGGGC CCCACCCAG CCAGG ACAGGCJTTGGCC ATCCCCTTGCAAATCTCACACT 
GGGTGTGGCCAACCCCCACCITAGGGCCCCACn'GACCCTCCCTT^ 
GGGGCTCAGTT^CCTCTCITCAGAGACAGACTGGAGGCTGTCACCATGGCCTGGCCT 
45 GTGCAGCCAGCTGTGGTTCCCACCCCTGGCTTGGGACTGAGAAAATGGA 

GACCAGAAGCCAGAGTCCCCAGCCCAAGCAGTCTTGGTAGTAGGATCTAGCACTGCCTT^TCCAGGGA 
C^GCCAGGAGACAATTTTGAGAAGTCCCTAGGACCTTCCTGAAAGCTGACCACCCACTGTTGGG^ 

TGTTGCTGTGTAGCTGGTGATCAATAGCTGGGCCAACT CTGAGTC CTAGCAAACCTGGCCCAGCCATAATG ATCTACCTTG AAGGG 
ACTTTGGGATACATTACTGCCCTGGAGTCCAGCGAGOSTCCTGGGACAGATTCTTCTCCCCCTGGC^ 

5 0 AGGCCCCAGGTCCCACCTCTCTTC^CCAGTrc 

TCTCTTGGAGGGGGGGGTACCCAGAACAACACACTCTGAGATTATAGCC^ 

CCAGAGCCAGAGTAG AAACCAGGTAT ACACACTT ACTATCCCTACAGCCCTAAGGGG CCAGCGTGCCATC CT ACTGG AGT CTAGTA 

TTGGACAGATATATGAGTCTGGGATAGTCCAGTTGGCTAGAG 

GGGCrATGTGGGAAGGCTTCCAAOUWKSTGCTGCTTCTGCCTTCT 

5 5 AATAGTGCAGGGCAGCTAGGAAGCCCTGGAAAGGGCCCCCAA 

TTTGCCTGGTCGGGCAGCTGGGACGGAGCTGCAAGCTGTCAGAAGGTGCTAAT^ 
GTCTTTTTT CC ACCCT CTCT CTCTTTTTTATTTG AGCGTCT AG AACITCATGCT 

TGGTGTTTATGGTGTGAAGATGAGGAGGACCCAGGACTAGTCCACCAAAACACAGGCCAGTGAAGAGG^ 

GC AT AGGCCCAGCTGGGACTGGGCATCCTCCACGTCCCJVTCCAGGCCTAGCTCTCTG CTG CTTG ATCCAGGAGAGCTT CC AGGAAG 

6 0 AGGCTCCTCCGTCATCCITATCAGTGGGGAGACTGGAGATTGAAGAACTAGAGaVG 

^NNKNNNNNNNNKNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNN^ 

AG AG AGACAGAAAAAT CCAAGT ACATGGG C C CTT CTT ACCCAGTGG C CTCC CAG ATTTGAG ACAGGAATGT ATGGC TT CTGT ATG A 

6 5 TGCTTGTGGGGTGCAGGGTGTGGAAGGCAGGTCCCAGAGGAATTCCAAGCCCG 

GGACTTCCCCAGTTGTTCGCCTCTCAGTGCAGAGAGCAGGC^ 

AGCCTCCCTTCCCCATCCCCCAGGCTCCTCAGCTCCCTCTACTGCAGG^ 

TCCATTCCTGGC^CAAGCTTGAGACACTCTGCCACCCCACAGTGCCAGACTT^ 

ACACAC AG C CCTGGAGGGG CGAG T C CCAAAGCTCT AAG AAGGC C ATTTG ACGT CTGT AG CTTGG AG T AG ACTC CAGGGATT ACACA 

7 0 TGCTTCTTCCCa^CTGTTTTTATTGTCCAGGTCrGGGT^ 

TAATCTGGTGGGAGT CGTGACCC AGGCAACAGTCCAT ATGCCATGGACCTG AT CCTACACCTCCTCTGCTT AAGGATATTCCATGG 
TTTCCCTGTGTCCACCAGGTGAAGTTTAACTrCTCAAGCCTACGTACCT 
GTGAGAGACTGAGAAACTGTCCTGCACAGGAGGTGTATACGGACATTACrrCT^ 
TTCCCTCAGATGCn'CTCGGACCTCCATGCCAGCCTTCTCACCA 
7 5 CAGGTCCTGAACTTTCAATGGGTTCrrCTGACTCTGGAACTGGTTGC!AA 






WO 03/008583 



PCT/US01/51291



TCACTGTCTCTGGGGGGACACTGGAAGTACCAGCre 

AGTGTAACCTTG ATCATTCCTGGGGCTTGGAAG CAT ACCCAG CTGAACCAAGAGCTTGAGCTTCCAAGTGACCCATCAACCAGAGG 
ACCCCACACCTGTGAGAGCAGGGAGGGCGGCCTCAGAACCGAGGCCTTGTCTGGTCCT 

CAGATG CAATGG ATG AAC CATG CAGAAT CCAAG CATGACAGGG ATGAGG CCACAGCGCAGGTGGCGAAGGGCCATGTGAGGTCTGG 
GAAGGTGGAAAGGATGCAGGAGAGGACCTTACAGGGGAGGAGGATC

TGAGGAGAC^GTGCAGATGAGGTGTCAGTAAGGAGGATGCAAGCAAGAAAGATGCAGGAGATGATGGAGAAGCT 
GAAGAAGGCACAGGGAAGAAGAGTGCATGTTAGGATGTTCAAGCAGAGAC^ 
ATGAGGATGGTACAGGTGGGATCCTGCATATGCTATCAGGCAGGTGGCCCAGAACMTGAGC 
ACACACTGGGAATGGTACCTACAAGGTTCTGAAAGTGCAGGCTTTTC 
TCTCCAGTTCTTCCCTTGTGCTTTAACCAAAAGCAAGTCTTGTGCACTGTC

AGAGTAG AAAGGCTGCTGTCTTTTCTAACCTGTACCCTGAGC CAACTAGGTCAG AGTCAGAGGCCT CACATACGTGAGTAGCCCTG 

GTTGGAGGGATCTCTAAACCCCACCACAGTCCTGGGACAGCCTGCTGTTTCCT 

CCTAGTCAGCTTTGCTTCTGTTACTTGGGGATTTTTTCCA^ 

AGAGGTGCTGACATCATTAGGCCTCCTCAAGGGCACAGGCCCACCCTTCTGCrrCTGGCT 
TCTGCCAGGGTCCAGGCCTGGCTGGCAGCCCGC^GACTACCCAGA

TAACAGTGTCCTTTGCCAACTCTCCACTGAAGCCGATGCTGGCCre 

ACCACACTCAGGAATCATGGGACCCTCTGGTCrrTTC 

CATACATGTGGCrGTCGACCCCTGTCTCTGCAATAAG 

GGAACACAGTGGCCAACATATGTAAATGCTGGAAACCCCTCT 
AGAGGGCAGAGTAGTGATGGTCTACGGGCCACATCTGGTCCCTAATGGTGGGAGTCAGGACCTGCCTGGTTTCCT

GGACAGCTCCAGGCTCTGGAGACCACCTGAGGAGATATGGACGGGCCAGCCCCTGGCACAGTC 
TGGGGCACAGGTGTTAGGGGTGX3GGCTCATGTTGGCGGG 

TGGCTGCTCCAAAGGGCCCCAATTAAGTGCAGAATGCGATGGCAGAGTGGGGTCAAA 

TGGCCTGGGCAGGAGCGCCCCCGCCTGACCGCCTCCTCAACAGGCCCTGCTCAGCTCAAAGGCGGTCACATGACGGGTAGATGACA

GAGAACTCTGCAGTCCTGTCTGGCTGAGATCCCACCAGATAACAGCAGGACCAAGGG 
CAGGCAGCCCCTCTrCGAGCCACTTTGTCCTCT 

AATAAGCAGGTCCCTGGTATAATAAGCATCTGTGGTATGTCTACTTCAGAAGCAGCAAAAG 
TGTGCATGGGCAATGGGATTAAGCAAAATGGGGGGG^ 

GCAGATAGCAGO^AAGGGGAGGGAGCTCACCTGGAGAGGTAAGGGCCTGGT

TGCTGAGGTACCAGCAAGGCCAAAGCCTGCAAGCAAAAATAAGAGAGAAAGCCAC^ 

GGACTCTCAAGATAATTAGCTTTCCAGGAAATGTAATGAAAAACCCTGGGCG 

CACAGTAAAAGACACAGCTGACTTCTGCACACACACAAATGCTT 

GTGGAGGCAGCAGCATTAGAAGTTCAAGGTTACCCTCAGTTACATAGGGATTCTGGGCCTG 

AAAACAACATCAGCAACAACAAAATTGGATCCTAAGGTGACAGTAGAGTGCCAGCCTCCCT

TCCCCA.CCTATTACAATGGTCCCAAGTAATTTTTTTTAATGTTATGGGCATTT^ 
CACACACACACACACACACAGAGAGAGAGAGAGAGAGAGJU3CATGCGTGCAAGGGGATAAGGCC 

CAGGTGTTT CATGAGTTTCTTGTGAGG AGAAACATTTG CTAGG CAT CAT AG C C CT CT AGG AATCTCACTGGCT AGG AAGACAGG AT 
GTCTTCCAGCATCTTGACCCA 


MOUSE SEQUENCE - mRNA 

GAATTCATGTCTTACGGTCAAGGGAGAGGGCCCAGCGCCACTGCAGCCGCGCCACCTCCCAGGGCCGGGCCAGCCC^ 
GCTCTCGGGGTGGACT CCCCCCGCTGCGCGCTC AAGCCGGCGATGGCTCCTCTCGGATACCT CCTGAAGC A 

GGCTCTGGGCAGCTACCCGATCTGGTGGTCCTTGGCTGTGGG 

GC^TCCCAGGCCTGGTACCGAAGCAGCTGCGCTTCTC

GGCATCCAGGAGTGCCAGCACCAGTTCCX1AGGCCGGCGTTGGAACTGCACCACCGTCAGCAACAGCCT 
TCTGGACAAAGCCACCCGGGAGTCAGCCTTTGTCC^TGCCATCGCCrCCGCTGGACT 
AGGGATCAGCTGCTATCTGTGGGTGCAGCAGCCGCCTCCAGGGCTCCCCAGGCGAGGGCTG^ 
ATTGAATTTGGAGGAATGGTCTCTCGGGAGriTGCCGATGCCAGGGAGAACCGGCCGGATGCCC 

CAATGAGGCTGGGCGCCIAGGCCATCGCCAGTCACATGCACCT

CCITGCTGGTGGTCGCAGCCGGACTTCCGCACCATCGX3GGATTTCCT 

AAACACCGAGAGTCTCGTGGCTGGGTGGAGACCCTGAGG>CCACGTTACACGTACTTCAAG^ 

CTACGAGGCCTCACCC7ACTTCTGCGAACCTAACCCCGAAACCGGCTCCTTCGGGACGCGTGACCGCACCT 

ATGGCATAGATGGGTGCGACCTGTTGTGCTGCGGGCGCGGGCATAACGCGCGCACTGAGCGACGGAGGGAGAAATGCCAC 

TTCCATTGGTGCTGCTACGTCAGCTGCCIAGGAGTGCACACGTGTCTATGACGTGCAC

GAGCAGGGTTCATTCCGAGGGGCAAGGTTCCTACCTGGGGGCGGGGTTCCT 

TACTITGAGX3GCGGAG ATCCT ACCTGTGAGGGTCT CAT ACCT AAGGACCCGGTTT CTGCCTT CAGCCTGGGCTCCTATTTGGGATCT 

GGGTTCCTTTTTAGGGGAGAAGCTCCTGTCTGGGATACGGGTTTCTGCCCGAGGGTGGGGCT 

TGGGCCGGAAGTCCTACCTCAATGGCITGGACTCCTCTCTTGAC 

TAGGTGGGGTTCGTGCC<3ATGGGTGGGAGGGGAGAGATTAGGGTCCCT

GTG CTTCAGGGTGGGCCCTATTTGGG CTTG AGGATCCCGTGGGGG CGGGG CTT CACCCCG ACTGGGTGGAACTTTTGG AG ACCCCC 
TTCCACTGGGGCAAGXSCTTCACTGAAGACTCATGGGATGGA^ 

GGCCATCC^GCTCCCATCTGGCCCCTTTCCAGTCCTGGTGTAAGGTTCAACCTGCAAGCCT 
GCAGAATGAGGCATGGAGAAGAACTCAGGGGTGATACCAAGACCTAACA 

CCCTTCTTCAAGGGCTTTCCTAGTCTCCrTGGCAGAGCTTTCCTGAGGAAGATTTGCAGTCCCC

AGAACAGAACAGACTCTATCCTGAGTAGAGAGGGTTCrCTAGGAATCTCTATGGGGACT 
CTCGTATGATAGCCTGCATCCGCTCTGACACTTAATACTCAGATCTC^ 
CCAAATGCCTGAGAGATGTTGCCTCACTTTGAGTTGTATGAACTT 
TTTCAGGACCCATCTGATTCCCCAGAGCCTGCTGTTGAGG^ 

TAGTGTCTGTCTGGCCTGGAAGTGAGGTGCTACATACAGCCCATCTC

GGTGACTGACTGTCTTCTGCCTGGAACTTTGCGTTCGCGCT 
TAGAC!AAAAGTGATTTTCTTTTTTTTTTTTTCTTTTCTTTCTATGAAAGAAA 
MOUSE SEQUENCE - CODING

ATGGCrrCCTCTCGGATACCTCTTAGTGCTCTGCAGCCT 

ACCCCAGTACTCCrrCTCTGAGCACTCAGCCCATTCTCTGTGCCAGCATCCCAG^ 

ACTACGTGGAGATCATGCCCAGCGTGGCrrGAGGGTGTCAAAGCGGGCATCCAGGAGTGCCAGCACCAGTTCCG^ 
AACTGCACCACCGTCAGCAACAGCCTGGCCATCTTTGGCCCTGTTCTG^
CGCCTCCGCrGGAGTAGCTTTCGCAGTGACACGCTCCTGTGC^ 
GCTCCCC^GGOSAGGGCTGGAAGTGGGGCG^CT^ 

AGGGAGAACCGGCCGGATGCCCGCTCTGCCATGAAC(XTCACAACAATGAGGCT 
CAAGTGCAAATGCCACGGGCTATCTGGCAGCTCTGAAGTGAA 
TCCTCAAGGACAAGTATGACAGTGCCTCGGAGATGGTGGT^
CGTTACACGTACTTCAAGGTGCCGAGAGAACGCGACCT 

CGGCTCCTTCGGGACGCGTGACCGCACCTGCAATGTGAGCTCGCATGGCATAGATGGGTGCGACCTG^
ATAACGCGCGCACTGAGCGACGGAGGGAGAAATGCCACTGTGTTTTCCATTGGTGCTGCT

GTCTATGACGTGCACACCTGCAAGTAG 


HUMAN SEQUENCE - GENOMIC 

CTATGGGAAAACTTCAAGCAGCCCAACATAGATGTAAGTGGAGTCC^ 
CATTTGTAGAGGCTGAACATGTCCAAAATTCATAGGCCAGGAACrCAGGA^ 
TCTATCCAGACAACCCAGGAAAAAGAGAGAOSACAATGACTAATCTa^TAA 
ATTAAAAAATAAACAGGAATAAACATCn'ACATGTGTATAAATTCAACT^

CCAAAGCTCX5CACAAGAAATAAATAAACAAAATGGTTCTATAGCTATTTAAAGAA 
AAACTTAAAGGCCAGATGGCTTCATTGGTGAATTCTACCAAACATTTAAGCAAGACA 
AAATTAAAGAGTTGGAAATACTCATGAGTATCrCATGAGGCCAGAATTACCCTAATGCC 
AAAAAATTCTAGATCAGTACITCTCAGGAACATAGATGCAAAAGTTCTTA^ 

AGAGGATGATACACC^TGAGCAAGTGGGATTTATCCCAGGAATGG

TATATTAATTGGCTAAATAAAAACATATAATTGTCTCAATAGATGCAGAAAAATCATTTAATAAATTC 
TCTCAGCAGATTAGAACTAGCTGTGAACTGCCTAAACCTGATAAGTGGCATCITATGAAAAACT 
AAACACTGTGTGTTTCCrrCCCrrAAACATCAAGAACAACACA^ 
TCTAGCTAGTGCAATAAGACAAGAAAAAGAAATGAAAGGCAT 
CACAATTGTTTATATGGAAAACGCCAACAAATCTACAAAAAA^

GG AAACAACAT CAATGT ACT AAAAGT C AGCACTTTG CAACAAAACATGGAAATGGAAAGAAAT AAT ACCACTTT A CTG AT AG 
CTTCTGGACTAAGAGGAAGCTGCAGGTAGAGGGGGCAG^ 
AAGAACCTCGACAGAACAGGAACAAATGCTTTGCCGAAGCTTCTC 
AAGGAAC^GCAAGGAAGCTTCTGGACCCXSTTGGGTTGGAG^ACAACCT 

GCAGAGACTGCAGGTTGTGACACAGAAGCT

AAGGGTGTXjCAGGAGGTGGATGCTAGGGAGGAAGCCCACAAAGG 
CrTGGAGCCATGTCTCCTGTGCCTCGCCAGGGACAGTGGGCCCCAGCTT(nTCGACCA 

ATGGCTTTTTCTCTTCACCTGTTCCCCCCACT CTCCTGTTCCCT AGAACCATGTTC CCACATAAACCAC CTGCACTCACT ACTTGT 
ATCATGTTCTAAAAATCCCACCATGTCCCTGGAGCTCAGG^^ 

CCAGAGTGAAGGAGAGCATCTCCAGTGATGAGGGAGTCCAAGACCCATGATCCTGGGTGATGGGGG

GTCAAGCTCCCCAGGCCCAGGCCCATGTGTGAACAGCAGCCCTGTGAAT^ 

CTCCAAAGATGGTG ACACTGCATCTCCTGTGGTCTCAGGGATCAGCTCGGT CAGGGGTTGTGGTCACCACT A 

TTCCCATGAGAGAGOTCATACCC^CAATGTAAAATCCCACTTGGGCTGCA^ 

TCACCCACAACCTTGCCCAGAAGGGGCACTGCCTCTAGG^ 

CTCCACTTGGTGGCAGTATTGGCCCAGAAGCGGAACTCCAGCAAGCAGCCACCAACCGGTCCCTAAGAGCTGCTGTGGCAGCAAAA

CTTAGCTCTTTCCCAACAGGAAAGTCTGGGGAGTCTATTC 
TGGATAATTAATCTGCAGGATTCCAGAACTCCATGATACTGTGATTTTTTTTT^ 
CCCAAGCTGGAGTGCAGTGGCACAGTCATGGCTCACTGCAGCCTCGAACTTGTC 
AATAGCTGGGACTACAGGTGCATACCACCATACTCAGCTAATTTTTTATAG 

AAACTCCTGGGTTCAAGAGATCTGCTGGCCCCAGCCTCCCAAAGCACTAGAATTACAG^C^^

ATTTTTTTTTAATTCTCAAATTTGATTTTC^ 

AAGATCAGAGGGTTCTACTGACTCAGTTCCCCTGCATCCTATAATTCTGTGTTTGGATGATTCCAG 

GGATCAACCAGGATTAACC^CATGGCTAAAGGGAGAGGCrrGATGGCCCTACTTCCATGGT^ 

TAGGGAGGTCCACAATTCCCCACATATCACCAGGCCACCTGQ^ 

CGTGTGCCTCCTACATCATGCCCTAAGGGAGAGAGGACAGCATGGAG

TGCCTGTTGGACCCCAGAGTTGACTTTTATGGCATCCAAGAGCTGCCTGTGAGAACACACATGGTTT 
ACACAACCATCATGCGTCAGCCITCTCAAAGGTGGCGTTCGT^ 

AGAAAG<2AGTGTGGGGGTATGGAAGGATCCCrGGAGGCTGGGACCTGGAGACCAGGGTCTT^ 
CTTTGCCTTGAGCCCACTCTTTCCATACAAGGAGAGGCTGACCTAAGGTTCCAGT^^ 

CAAAATG<3AACTCTAAGATGGGTAGTTGAAGGGGTGACACCC^

GAGGTGCCCCAAGCCCCTGCAAATGCACrCTCTAGGAAGTCTCGCC^ 

GGCAAGCTCATGAGGACCGGCTCGTGTCCACCTACCCCCCTCATTGCCTACAAGGACT 

TCTTCTGCCCTGGAGCCAGGAAAGGATGGCCTTCTGAGTTGGAGT 

T ACAAATGGTG AGCCTCGGAG AG AGCCCAGCCCTGCTGTGCTG CCTCCAGGAGG CCTCCT CATTCAGCCTCA 

CTGAGCCTTTTCCACTTCCAGAACATGGTGAAATTCACTTCCCTCTGACCACACAGTGC^

CCTACAGTGGGCATGGTAGTCGCTGTCATCACTCCGCrCCAGGAG 

AAGGACCTTTTGGAGGCCAGGCG CAGGGGCTCACCCCTGT AACCCC!ACAGTTTG^GAGGCCAAGGCAAG AGG ATTG CTTG AGCCCA 

GGAGTTAGAAACTACAGTGAACTATGATGGCACCACTGCACT 

AAGAACCTTCCGGAGAACTGAAGGAAAGTGGAAATTTGATAAGA 

TAACGGCAAGCTCAGAAAGATAGATACAGGGCAATGGTAAGAAGC^GACAGGTAACCAGATTGG^

GAC^GCAGTGCAGTGGTGTGCCACTGTGAGGGGGAAACGGGGATGGTGCCCCCGCCATCCTGCT^ 
AGGGGGCAGAGCAACAACTTCTGTGGGCTGGACCTGCCCTGGGC^^ 
ACAGCAGGAACCCAGAGCTGTGGATATGTAGACCCCIACAGACAGCCT 
rr^r-r^CACa^CCaTArcTGCAAAGCCCTGCCAGGACATACCCACATC^ 
CGGAGGAGGTAATTCCCTIGAGT^
AGGGGCTGCAGACAGGAAGAGCAACTCACAACTGTGCAGGAGGGCTCATCGGCTGTGACAGGAAAACACA
GACAGGACAGCGATAGACAGGAGGTGGGAGGCAGACAGGTCT 
GGTGGCCACGTCATTTTCCTTGTCTCTCTTTGGCT 
CCCAAACTGTGTGCTAAGGCACCCTGGGACGCCACAGCAAATTCC 

AATTTGTTTTAAAGAAAGTACAATTGAGTCTTGGACAAC^ 

TAACTTTTGACTCCXrrGAAAACGAAACTACTAACAGCCTCCTGTTGA^ 

ATACGCTGCATGCCACATGTGTTATCCCCTGTGTTCCTACAATAAACTCAGCT 

AAAGAAGCCAGACTCACAGTTCATGAGCCGGCAGTGGGTCCTCATAAAGGTCTT 
GAGGAAGACAGAAGCGGGGCTTGGTCTTGCTGTGTCAGGGTGGCAGAGGCRX^

AACCTGTGTTGTTTGAGGGTCACCTGTCCTGCATGAACATC^ 

TGTTGCTTCTGGAGGGCCAGTAACTGCAGTTATGTGAGGATGAAATAAAA^ 

TTCAAATAGCTACTGATCATTAAGTGGTTGGACCTAATTACTTAATAAACAAAGCTC^ 

GGCAAAGAATGACTATAAGTTGTTTTCTTGATTAATTTAATTTAAT^ 
CCCAGGTTAGAGTGCAGTGATGCAATCATAGCTCACTGCAGCCTTGAAC^

AAGTG CCGGGATT ACAGGTGTGC AC CAT CACACCTGGCCAATTGAAAGTTTTGATCT CAAG AACCCACTG CTTATGGAGAAGGCAT 

CCAAAGAGTGACCCCOVCAAGGAAGGTCAGGGAAGCTTCCAGAAGGGACCAAGTTC 

TCTTCTGACCCTGGACCCCTACCCTGTCCTGCTGCTGTTCCTTCOIGGG 

TCTGCCCAAACCCCTTCTCTGTTTT CCCCC AACTTATTTTTG CTCAAATGAGAAAAAAAGGAAGCAAAAG ATGACCAGTTAGTGTC 
TGAAATCCTCGCTTGCCCCACCCATAACAGGGACTCACACCCGCTGTAGACAGGG
CAGAGATGCCCCTCCCACCTGGCTTACCCCACCCAAGCAGGTATCTTACAGAGCIT 
GGGCCTATTCItXjAAGGAGGGAGCACXJCTCCCTGGG^ 
GGAGTTTTGTTTTATGAAACACATGGCTGGCTGCTCAGGGC^ 
TCTCTGAGGACGATTAGCCCTTTCCTGACGCTGGCTTGCAAA 

TGTCCTGGCCAACAAGGAGGTCTGTGGAGACAGGACCGGGTCTCACT

CCCTATGCCTTGACCTCAACCTACGCC^TCCCAGTCCCCAGCGCAATGCACA 
ACAGCTCCCTTTAGGCAACAGCAGAGGGCGCITCCAGGGAGGCCCCrCACCTTC 
GTAGGCAACATCACCCGAGCCTGGAAGACACTCAGGAGAAGCCCCTCAGGCCCCT 
CCACTCTGCTCTCCACCCCACCCCAGCCCAGCCCC^CrcACTTGGCrGCT 
GAGGAGCCCCACATGGTACAGGCAGGTACCGAGGAGCTAGTTCCAGAA

CTGGTATCCAGAGTCTTCAGAAACAGGAGCCCTCGCTTGAGAGCCTGGCCTGGGCAGGCT 
TGCTCCTCCCITCATCTTCTCCAGCCTGGGAAGCGGCCGG 

CTGATGGGGGAGAAGAAAG(ZAGCCAGGAGGAGGGGCTTGGGAAACCAAGGGGCAA 
GCCCCCTTCAGAGAGGTCACGAGGCCCATCCTGTCGGTGGGCAGAGCT 

CTGGGGAGGTGGGGGTAACAAGAGTCTGCGTGCCTCAGGCTGGGCTCAGGCGCCCACCCCACCT

CCACAATTGGGTGGG ATTTTTGCTGCAAGGCCAAGTC CAGAGGAGC ACAAACACGCTCCTCCCCTCCCC CTCCCTGAC CCCACCGC 
CGCTTCATTCCAGAGCTCTTCCCrCTTTCCCTCTGAGGGTGGGGGCCCCG 
CGTAGCGGAAACTGGATTCCCAAGGTAGTGAGGAGCAAAGGGAACTTT^ 
CCTGACATAGTCACCAGGTGACCCCTAGGGCCGGTGTCCACAGGA 

GGGCACACTGTGGGAAGCAGCTGCCTCCATCAGCCAGCCAGTGACAC^

CACAGAAGCGGCCACATGCCCATCCCTGCCTCTCCATCC^CCTCC^ 

GCTGGGCAGTGCAGTATCCCCIAGAGAACACACC^CGGATGAGTCTGGAGTTGCTTGGAGCTCT 

CTCAATTCCAACGCGGTAACC!AAGATTCCTGCnt5AGCTTAGGTCACAGTGTGACTGC^ 

TCTTAACCATCCCAGTTATCAGGTCCCCCGGGGGAGGCTGGGGACACTTCC 

GGGCAACACATGGATATGOUVATGACTGGGAGCTGGAGAG

GTCTTCCCCTCTCCATCAGCAAACTCCTAAAGCrrAACTAAAO^TCA 
AGACCTGGAGGTTGGTGGTCCCAGAACCTGGGGGCAGCACrGaSCrrCCCCACra 
TTCCCTGGTGGAGTGATGGGGAGAAAAATGGAGGGGTCAATCCCTGCGCCACAGAGAGA^ 
CAGACGCGTGGCTCCACCTTGGAGGTCCACCTGGCCGCTGTCCCCATTGCAGGCT 

AGGCGCAGGCAGCTCTGGTCTCAGCGGAAGGAGTCCGGCCGCAGGC^

CCTCCCCAAACCCCCGCCCCGAGAAGGTTGATGGATGAAGGGGAGCCCTAGCCCAGCCACCTCGGGGGCAAGAGAC 
ACAGACCAGGAGCGAGAGGGGGACTGCGGGTCCCACCGTGAGCGCAGGGCGCGCGGGGCTGGAAACCCAGGACGCGGCCCGGGAGC 
TCGAGGAGGGCGCCTGCGGATCGCGCGGCCCCGGGCCGGCCGCCTGCCCATCT 
CCCGCACACCAGC^CGTCCTC^GACACACCGACA^ 

GCC^CAGGAGGGCCCAGCGACGCCGCCGCGCCAGCTCCCAGGGCCCGGCCCCCCCCGGCC 
CCCTCCGCGCCCTCTCGCGCGGCGATGGCCCCACTCGGATACTTCrTACTCCTCTGCAGCCT 
GATCTCGTGGTGAGTGAGCCTCCTCGCGTTCGCCCCT^ 
CGCGGTGGCCCGAGCCCG(X3CCCTTCTGCTC<^GCCCCGCGTGCG

TGTTCAGGC05CTGGGACGCGTGGGTGGCGGAGTTTCCGGAGACATTGATTCCCGAGCGGGGGAGTAGGGGA 

TAGCTG CCCCTGGCGTCGGACAGGGCGAG C^GTGGG CTGGGG AGAGGCrGAAAGAAGGTCGGCACGCACGGTCACCGAGGGGACAG 

GCAGCTCGAGGCCAGGGGACAGGCGAGGGAAGAGGGCCAGCGAGAGAGAAT^^ 

CACAAACACACACACACGTTTTAAAACAAAGTCGATGAGACAAGACTGAACAGCTCGGAGCAGCGTG 

GACCCGGCGGGGAGTGGCGCAGAGCCGGCGGCGGGGCGCACTGGGGGCCGGCCCTGGGCCC

GCACGGGGGTTTCCAGGGGCCCTCTCTGCGAGCCCTCCGCATGGGTC 
TGAGTTAAGCACGGGGTGTGAGGGGCTCCAGGACCCTCAATCAGAAAGCGCTGTC 
CGTGAGACCCTCCCCAGCCTGG CGATGG AAGTG CAGATAAACCAAAGGAAGGGTCCCGAAAGGTCT 
ACTGCACATATCCTGTGGGAGGGGGAACGGTGGCCACACTTTCGCCAGGGCTTGTGATCCCT 

ACCCGAGTTCCGAATTAAGGGCGCTCTGAGATGCCCAAGATTGAGGA^

AGCGCTGGTAGGACCCACAGTTGGAGAGAAGCCTCCCCACAAGCATGCACACT 

T AT AATAATT ACG CCGAGGGGAAGGGGGG AG ACCCAGCGAG C CGAGGT ACAT CT AAT CCG AT AAT AATTTTTCT CTT CGAGATGG T 
TCAGGAGCGGGGGCTCC CGCGGTAGGGGCGCCGGGCCAGGAGCAG CGGGGAGGGG AAGGGGCAGCCCT AGGCAGCTGA^ 
nAAGAGGCCACGAGGCCG(X5GGGACGCGCTTCGGGGACCCCCGC^CGCCCCTATTTC 
CTGCGCCCAGTGGCCCACCTGAGGGTCAGGCTCGGGGCTCGCCTTGGGGTGCGGGGATACT
AGCCAGGGGCAGCGCGTCCGTCCGAGAGAGCCCCGAGGGCTGCCGGCTTACGCACCAGGCTTCTAATTAGAACCCGCCGCCGCGTT
CCTGAGCX5CAAAAGGAAGCTCCTTCGCCCCGCGCCCTTCCCATAAATGTCGCCAGCCTCCTGGC 

GTCTTGTCGG CCCTGAGGGTTCCTAGCCTCGGTCCCCATCCACCTCAT ATG CATGTATACCTCCTTTTCACCTGGGCCCCCTGTGT 

TCTACCCTGAATGGACCCTGAGACACCCTCGGGGACAGTCAGGCATTCTGCAG 

GTACCCTCCGCACCCTCCTATCAAGAGTCCAGGACA 

CCGGGGTGGGTGGTGAGGACCCCTCCCCCATCCCCACCACCACTTTGTTCTCAGTAG 
CTGAGAAGCAGCCTCCCCTTGGGGGAGGCTGGCTCCCTGGGGACAAGTCT 



GGGGACCTTGGCCCCTGCCAGGTGGGCTCCTGGTGAGCCCCAGTGGAACAGAGCAGGCTGGGGTGG 
CACCTGTGCCCCCAGCCCCTCCCTAACATGAAGCCGCT 
AGTGAAGGGGAATCTCCTCACCTGGAGAAACAAACCAGGAAACAGCCCAAATCAGGGTAATT 
GAGGCAAAGGTGTCCCTGTGGCCAGCAGCCCTGTGAGAGTGTGAAGGGCCACCTGCCT 
TTATACCCACGCAGCAGATGCCAGTGGTCCCAAGGCTGAGCGGGTCTGAAATGCAGAGAGACGGG
CCAGAGCTGCACAGATGTCCAGGGGCATCCATTCTGAGCTCATCCA 

ATGCATCTGCCCACCTGCGTCCCATGAGGCCACAGACGGCAGGAACCTAGTGGCCCCAGCCT 
GTTCTCAGGAACAAGAATTCTTGGAAGAGCCCACA 

GAACAATCCCCAGGCACACCCGCACTCCAGGGCACTCTGAGCACCTCTCCCTT 
CTCCCCAGGGACCCCACCAGGACCCTAGAGGTGGGCCTGTGCCTCCCGCCACCGCCTCCCCAGGG

GCAACTGGATTTTGTTC^TCTTTCAAGTCCCTCTG CTGCCTTT ACCAGACTGGGCTGTGGCACTCCCGCCACGGCCACCTC CTGCT 
TCCGTTCTC^GTCCTGGCCCTCCCCTGGGACCCACACCCT 

CTGCCTAGTGGAGCACAGCAAG CTCCACCCCTT ACCCAGCCCT CACCAGC 
TCCTGCTGGACTATGGGATCCCAAGCACAGAGGCCTGGCTGCTTGCAGATGGCCACCCCAGTTGTC 

CTGTGTCCAGCCTTTTGCCTGCCCCTGGCCTGCTCCACAGCCTCGCTCATGCT

TCGAGCTTGTCACTATGACAGAAACACATCATGACCTCCATTTTTATAGGGCACCT 
TTTCAGGCCCCCACTCTCCTACACACTCATGGGTGCAAACAACCTCCTGGGCACAA^ 
GGCTGGAGGGCCAGCACACTTGGAGGCCCTGCTCA^ 
TCCTGGTGGGCCTCATTGTGGTGGGGAGACAGCCTGACT 

AAAGGGTGCAGGAGAGAAGGAGCTCGGCTGGAATGATGGGTCAGGCATACGTGTGTGGTGTGTC

GTTTCGGGGACTGAGAGGGGCCAGTCGAGCAGGGCAGGGACATGGCCCAGGGTACCCAGAAGCT 

CTAGCCCCTTCTTCCTTCrCCCTTCGCCCGCATCTGTCTGTGTCTCTCTGTCT 

TCTGTGTCTGTCTCTCTCTTTOVGTCTITGTCCTGTCT

GTCTTTTGCAGTCTCTGCCTCTCGTCTCTGCCTCTGGTGCTGAGGACAG CAGTCACTAGCAAATAAAACTCGTGAAACG AG CTGAG 
TGG AGCCACCAGGCCAGCCCTCCCAGAG CTGTGGGATCTCAGGG CCCTGGGGGTGGCATTG CTTCTCAGAGCrcCCCACACACAAA 
CCCCTCCTGAGGGTGACGGTCAGGAGTGGCCACACGTCAAGTGTCTGCAACCAGACATTGGCCCTGCATAACT 
GCCGCCTCAGGGACGCACAGCACCCTTCCAGAGTGC 
CCTTGAGGCCAGCTGCCCCTCTGACTCACGGGGCTTGTTCTAACGATGTTTGCTTCGCATAAGCCTCCTAGAATCTTTTGTAGCAG
GG AGAGCAGGATGAGTAGATACAGTCAT AGCAGCC CAGTCTTAGGGGTGGTGACTGCCCGGTGGC CACTT CTTCCACTGGGTCC CT 
TGTGCACATCACCGTACTGTGAGGAGGGAAGCTCCGGTGTCC^ 

ATCCAGGGACCCCGTTGGGGCTTCTCAATGTGGGGCCTACCATGCAGAGCCCAAATGTGAATC 
TCAGAACAGAAATGGGAATTTAAAAAGGATTTGGTGACATTGTAGCGGTTTGAC^ 

AATGGAAATTTAAAAACAAACCAACCACGTCCCTCTGTTGGGACT

GGGGGTGGGGGGGCTCTCTCAATGTGGACCACTGCCTGTAGCCCT 
GCCCCCnTTAAGAGTAGTTGAAGGGCTGGTTGCTGTTTCTGTTCAGAAGACCC 
GAAGGATTAGACTGTGGCTCAAGGGGCAGGGGCTGGGGCAGAGGTTGCAGGTGCCAAGGAGGCTGG 
AAGCTCCCTAGGGATGAGAGCCTGTGCCCCAAGTGGTACCAGGGAGGTGCT 

CATTCACAAAGGAGACCTGTACAGCCCCACAGACAGGGTCTCGCTCTGTCACCCAGGCTGG

CTGCAGCCTCGACCCTCATGGGCTCAAGTGATCCTCCCATCTCAGCCTCCTGAGTAGCTGGGAC 
CAGCTAATTTTTGTATATTTGTAGAGATGGGTTTCTCTCTCTACAAG 

CTGGTCTCGACCTCCTGGGCTCCAGCAGTCCTCCC^TCrrGGCCTTGTGAGTAGCTGGGATTAC^ 
CTAGTATTTTTCATTTTTTGTTGAGATGGGGTCTTGCTCCGTGGCCCAGGC1X3 

CCCTCACTGGTTACAGCCCTGAGGCTGGTGAGGGGCACATGGCCTCTTCCTTGGCCTCCCTACT

GAGGTGTCTCTCACGTTGCTGCTTCCACATTCCT^^ 
GCGGGCACAGGAGCCGCTGTGCCTGCTGGGACTCGTTCAAAG 

CGGGTTGGGAGAGCTCCAGGCAGGGCTGGGCAGCCTGGACTGGGAAGGGTTGCCGACACCCAGGA 

CTGGGG ACCCTGG CCGGCTCACAAGGTGCTTGAGCTG CCGAG AGTGGTCTTCTAGGCAGCTGCAGTTCCACCACTGGGTGTGAGAA 
GAAACCAGAACCTTGGGGGTCTCCCAGGCAGGTGGGTGCCCTCGGGCAGTCTCAGCCTGGGCGA
GGGAAACACITCGAGCCTCAGGTCTGAGGAAGGAAGCCT^ 

GAACAGGATGTTTGCTCCAGAGGCCCCACAGAGGGGAAGCCCACCTGCCCAAGCAGGACCCTCAATGG 

CTGGTCCTCCACACCTGCCTGCATTGCAGCTGTGCTGGAGCCAGAGGGGTCCCAGGAGGCGACCAG 

GCATCCTGGAGGGGTGGCCCTGGCAGGTCCGCGAGTGGGGTGCACTGGGTCCCTCTCAGGCCAGGAGGAGGCTGAGATC 

GCAGGGCAGCAGGG CCTGAG ATCAGTGTCCCC ATTCCATGCACTGCTCCC CTCCTCTCT

TT ACC CCATTTGACAG ATGGGG AAACTG AGG C C CAGG AGGGG AAAGAGAGAAAGAG CAAACAAACAAG CGAGTAACAGGGC CCGAG 
TTGG AGCCCATGCCCACCTTCCAGGCCTGTCCACACG CCTGCATGGAGCATT ACTGTGAGGTCCCCACGCCCC CAGCAAACATGCA 
AGCAGAGAGCCGTGCCCTTGGTAGATGGGTGCTCTGAGTTCGCT 

CT AAGAGC CTG CCAGCCTCAGAAGG ACTTG CACAGGAACAGTGG AGGCTGG ATATGGTGGCCACTCAGG ACCCCCACAGCTG 

GCTGCGAGAAGGCCCCCACGCTITGCCCTGAGCTTCCATCCAC

ATTGCACATGGGACCAAACCAAGCCGGGGAGGCCAGCCT 
CCTCCTTCrTCCCCTGCATGTCACAGCCAGCATCTCTGCC^TACCT 
GGGAAGAAAAACCATTAGTGCITGTGGGCTGGAGGCATGGCIGCTC 
r^rCATTTTATAAAACTACATTTGTGTGTGTTCTGAGTGAATATC 
CAAAACAAAACAAAACAAAACCTAACAGCATTAGTGAAAATCTG^
TTTGTTTTTCTAAGGAT AGAAACAG TTTGTGTTCTGTTTTTGAAAACTCACTAAG CATCATCTCCATTTCTGGGGCCTTCTGGATG
ATCACACGATGGTTCCACTAGGTCrrTGTGGTGCTGGTACTGCGATCTGCTAG 

CAGGG AAAGGGGACAGAGCAAGAGAGGCTGGAGATATTGGAAGAGCAGGGTCACAT^ CAATTTTGCACAGCCAG 
AGGGATATCTGGTGTCCCATCCAGTTGGGCAGTTAGAATTCCAGCCACTCAGCTC 
TTCCTGGGACX3CTGTGGTGTTGGCAGTGAGAGGGGAGAGGGAATGAATTCCTCATCCGG
TGGGGGCAGGGAGGGCAGGCTGGCCTCAGCCTCTCCCCGGGGACTGGGCCCAGCC^ 
GCAGGGCCCAGACCTCACTGTGATCCAGATCGTTGGCATCCAGGAAGTGAGTAGGGAGTG^ 
CTCCCCCCCACCCCCGCCGCCCAAGTGCTTACCTGTGGTTCCTTCTGTGAGTGATCCT 
CACTTGTITGCATGCCTACAAAGCGTGGGAGAGCT 
CTCCTGGCCCTGCCTGCTGGCTGGGGCCACGCTTGTCTTAG^

CCATGAGCAGGCAGGAGCCACGGACTGACAGCCAGTGCTCAGGGTCTGTCTGTGCCCGTCAC 
TTTCTGTCCCCCGCATGTGCTTTAAGCGCCTGCTA 

TGACCCCCCAGGGTGATTATCAGCAAGGCTGACTCCCCAGGGACTCAACCCAAGGGAGCC 
AGACAGACAGGGGCCATGGAGCAAGTTTAGACTCAAGGAAGCCATAG^ 
AGCCTGGGTGAACCCAG CACACATTCCCTGTGGCACTCACGCTCCTGGGGCCACCCTGGGAGG ACCCTGTTCT C

CCAGTTCAGTTGGGGAAACTGAGGTTCCAACAGTCCCAAAGCTGCAGGTGCAACT 
GCCTCCTGGGGTGGAAAGGGGGCTGCATTGGGGACAGGAAGCCCTATGATGTC^ 
TCTGTGTGACTTGCGCCATTTATCTCTGACTCATTC 

CTGCATTGCTATAAAGGAATGCCTGAGACTGGAAAGAAAAAAGGTTTAAT^ 
CGGCATCTGCTGAGGCCTCAGGGAGCTTCCACTCATGGTGGAAGTCAAAGG^
GGGAGGGTGCCAGGCrCCTTCTAACAACCAGCTCCTGCATGAA 

CCATTCACGAGTGATCTGCGTCTGAGACCTAAATGGGTGGGGCCCACTAGGCCCCACCTCCAA 

GAGATTTGGAGGGGACACACATCCAAACTATAT CAACACATTTGAAGGG CATTGT AGGATCTT AGAAAG CAGTGGCCCCTGGAGGC 
CAACATGCGGGAAGCCCTGTTCTGCTGTATTAGCCXSGTGGCTGTGGCAAAGCTCC^ 

GGTCCCAGTGCCACCTATCCAGGTCCTGTGGGTTTCAGGGAGCCACATAACTAGAGCCCGTGGGGAGG

CAGCAGGTGGAGGCCAGGTAAGGGGCTGGGTGGTGGTAGTAAGAGGAGCGATGCTGGGTAT^ 
GCAAGCCTCCCTTTCAGAGACAAGAGTGCTGGAAAGGTGGAGAGGCCGCAGGAG 
CACAGTGGCCAGGCGCGGTGGCTCACACCTGTAATCTCAGCATTTTGGAAGGC 
AAGACCAGCCTGGGCAACATAGACAGACACCATCTCTACAAAAAATACAAAAGTTAACCGGG 

CAACTACTTGGG AGG CTGAGG CT AAGATGGG AGG ATCCCTTGAG C CCTGT AGGTCAACG CTG CAGTGAG CC CTCAGCCT AG ACG AC

AGAGCAAGACTTCATOTCTAAAAAAGTAAATTAATAAAATTTAACAAATAATAAAGG 



GCTTAGGTGTTGGAACTCAGAGGTGGTCAGGAAATGCCAGCATGAGCA 
CAAGACCCTATCTCTACAAAAAATACAAAGCAAGAGAACAGCACGGGGGCTTTGCCATTG^ 

TGGGCATGAACATCCTTTTCACGTTCCAAGGTGGGATCCGGGAGTCCAGGTTGCATTGT

GAG AACAGG ACCCTGGGGACACTCGATCTCG AAGACGGGGTTCTCATTCACTCTCCA CCCCACTCACTGCA 
GCCAATACTCCCTGGCGAGACCCTGAGACAGGCCTGTCTGTCCCTTGTCCCAGGCAAGCCTGGCCC^ 

GTT ACAGGT CATAG TTT C CCAAG CACT G ATGG ATTC CGGG CAT CTGCATT CT CCTT GGATT AG AAATTTGGATCCT AGG AG CTTTT 
GGGGGCKKjTGTGGGGATAGTCCCTGTGCAATTTAGATTTT^ 

ATGAGAGCTCACTGTGGCCTTGTCCTCCTGGGCTCAAGTGATCCT

CCACCACACCCAGCTAACCirTGTATTTTTGTAGAGATGGG 
CAATCCACCTGCCTCAGCCTTCCAAAGTGCTGAGATTACA 

GACCCAACTGATCCTTCAAGGCCCAGCCAAAATCTGGTGGCCCAGTGAGCCTCCAGGGCCCT 
CAGGAGCTCCCCGCTGCCTGCCAGCTTTCCAC 


AATCCACAAGGGCCCGTCAAGGGAGGGGTCCCATCAGGGCTGTGAGAGGCCA 
CCACCTGCACCCTGGCCIATACTCAGGAAGGTGGTGGGCCAGGGCCCACCC 

CAG CACCCTG AG AATGG AGG CTT CAG ACTTGGG AGTCC ACCT CT CTG CTTT AT CT AG AG ACCAGGGACAAGG CCGCC C CTC CCAG C 
CCCGAGCCTGCX3AGCTGAGGTGGTCCCGGCTCCATGCTGGT 

CCAGCATCTAGCCTCGGAGAAGGGATGAGCCITGGAAATGGCCGCATCCCC

AGTGGATTCCTTTGTTTCTCCTCATTCTAAGTCTCCAA^ 
CTTGCTCACTTCCCTGGCTGTAAGCT AGGCTGGCCCTGGAGCCTGGGT ACCCGGCCCT^ CCAGGCCTCAGCCA
TGACACTCAAGTGTTTGTCCTCTACAACATTCCATTAGGAAAATTTTCAAAC^ 

CTTGTGTGTCCACAGCrCACACTGAAGTCAG CCTGCTTTATCTCACACCCACCACCCTCTC CATTCTCTATCTATCCGTCTTATCT 

TTTTATGCTTACGTCAGTCTGCTCTGTCCCAAATACTGTAGC^

CACATGGAGTCACACCATAGAATGGGTGAAGGGGAAGCATCTGG^ 
CCTGGCTTAAAAATGGAAGCCCCTTGCCTGGGCACGGTGGCT 
CACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCAAAACCCCGT 
GTGCACACCTGTAACCTCAGCTACTCGGGAGGGTGAGGCAGGAGAATCGCTTGAGCCAAGA 

CACACCACTG CACTCC AGCCTGGGCAACAGAGCG AG ACTC CGTCTCAAAAC!AAACAAA CAGTCCTGGGCGAA

AAAACCACGAGTCCAGCCCCTACGGTCACCTCCCTCrGCCTGGTTCAAC^ 

AGGTCACAGACAGCTGACCTTGGTCACCACCACTTC^CCCTGGGCAGTGGCAGTGC^ 

TCCAGTTGCCGGGACTCAAAGGAAGAGAGAGGATGAGGAAGGAGACATGGAAAAAGA 

ACCCAC^CTGCATCAGC!ACTTCCTGGAAGTGCTGGGATCTACAAAATGGTGGA

TGCGTGTAGTGAATGAATAAAGAAGTGAACAAGGGAGCCGACTGTGC 

GGGGATGCACGTGGGGTCTGGCGCGAGGTTATGCACTCAAATGAGGGACAGACAGGAGGGAGGGTGACT 

ATCTCCAGGGGGTAGACATATTCtGGGACCCTGAACACTGGGAGTCCGGTCCGGAGGCCAGACT 

AGAGGTGGTCATAAGCATAAGGAACTCTCTCCCTCCACCCACCTCC^^ 

GTGTATTTTGTACAGCAAAATGGAAGACCCTAAGTGCATGGCT

TCAAACGCAGAGCACTTCC^CCAGGCTC^CCTGGCAGAGCTTCCCTGATGA 

TAGCCACC^GCCACACGTGAAGCATGGCTGGTGCCACCGAAGGGCTGAGTTT^^ 

ATTACATTAATITAAACGGCACAGGTGTCGAATGCCTTTTGCCAW 

AGGAGCATCTCACAGTTGAGCCC
GGCAGTCTTCATTTGTTTATTCACTCTCCTATTGACGG^

AAGATTCCTGAACACGTCTCATTGTGGACACGTGTTTACATCTAGGGGTACATAGCCA^ 

GTCTGAGTrCCACAGTGATTTCTGCGCTGGGTCACGCCTACATCCCGGAAGCCAGG 

TCCCTTCTCTGGGCAGCCCCCACCTGACCCGGTCCCCAGCCTGAGGTTCTCAGGTTCT 
T CAACCTGGTGTGAACGGGGCCCATGCAGTTGTGCTGAGCAGG AATGCGGGGCTG C^TCATTTCCTGTTACAGATCTGG ACGACTT

GAATTCTGAGCTGAGTTTTGGGGATAGGGCAGGTCAGGGTCCT 

ACATAGATGATATCAAAACATCTAGAAAAAATTAAATAGCCAGCATTGA^ 

CTGTATAACATACAGGCTTAAGAGCTGGGCAAGGTGGTTCATGCCTGTAATCC 

TGAGGCCAGGAGTTTGGGACCAGCCTGGGCAACATAGCAJGGACCTTGTCTCT 
TGGTGTGTGCCTGTAGTCCCAGCTAATCTGGAGGCTGCAGTGA

GATTCTGTCTCAAAAAAAGAAAAATTAGAAGGCATGGTTTAT^ 

AAGTAAAAATTCTCTTACTTGCAAAATAAAAAATCAGATTTCCTTCATGGTGG^ 

CCCCAGAGCAAAGGGTCTGTAGCCTGCTCATCTGTGTCGCTGTCCCAGCCCCACACTCACC^^ 

CTGGCTGTTGGGCCACAGTATTCCTCCCTGGGCTCGCAGCCCATCCT 
CTTCTGCAGGAACTACGTGGAGATCATGCCCAGCGTGGCCGAGGGCATCAAGATTG^

GCCGCCGGTGGAACTGCACCACCGTCCACGACAGCCTGGCCATCTTCGGGCCCGTGCTGG^ 

AGM3GCAGATGAGTCTGGAGTGGGACAGAGTCTCCCGCCTAGCGCTGC 

CACGCGGACGCTGGGGTCCTTCCCGCTTACCTCCAGGAGGACGGTCTGGGGCTGAAGCJVGGA 
ATCCCTCCTCCTCCTGCTCCCTGGAGAGGAAAGCGGGCATTCTAAGGTGACT 
GCCAGGGCACAGGCCAGGCTTGCAGAGGAAATACACAAACTTA

AAAGAGAGACAGAGAGAAAGACAGAGAGAGACAGAGAGAGAGAAAGATAGAGACAGACAG^ 
ACACAGAGAGAGAGGAACAGTGAGACGGAGAAAGAGACAGAAAGATGCAGAGAGAGACAGAGAAAGACACACA 
AGACAGAGAGAGAGACAGAGAGGGACAGAGCGACCAAAAGAGAAAGACACAGAGGAATAGAAAGACAGAGACAT 
AGACAGAGAGAGACAGAAATAGAGAGAGACAAAGAGGGACAGAGAGAGACACACAGAGAGAAACTGAG 

GGAGAGAACATGGACAAGATTAAAACTTTAGCATCTTTTTTTAAA

AAAGTATTCAATTCAGTAGCATTTGGTATACTC ACAG AATTGTGCAACCAT CACCTGCCTAGTTC CAAAAT ATCTTTATCACCCAA 
AAAGGAAATCTGTGCtSCATTAAGCAGCCATTCCCGTTCTTCTCCCTTCCATAGCCCCTGG 
TGATTTAGCTACTCTGCCTGTTTAAGAAACGGCATCACT 
GCACTGTGGATAGTTTCAGCACTTCATTCCTGTTCATGGGCTGAATAATAT^ 

TCTTCAGGTTGACGCATATGAGCGTGGTCTCCACCTTTT

TCACACCTTTTGGTTTCTGTATCTAGGAGTGGAATTGCTGGGTCACAGGGTAATT 
GGCTGCAATGGGCTGGAACCATCACACTATTTTCCACAGCAGTTGCACCATTT^ 
ATTCATCCTCACTGATACACTGTTTTCTGGTGTTTGTATTTCTTACTTCTGGCC 
TTTCAGTTTGCACTTCTCTGATGACCGATGATGCCAAGCACCATTTCATGGGCTTTGGGGCCA7TTGT 

TTAGTCAAGTCCTTTGCCCATTTTTAAATTGGATTCTTTGTCTTTTTGTTGTTGAGT

TGGATGTGCATCAGATGTATGATTGGTGAATATTTCTCTCTCATCGTGTAGGTTTTTTTCATTTTTGTGAGA^ 
ACAAAATTTTTTAATTTTGGTGAAATCCAGTTTATATAGATGTCATACTGAAGCAAGAATTAG 

ACATCTATGTTTT CTTCAAAAAAC CT AAT AGTTT AT CTTTT ACTTT G AAGT ATTT AATCTATTTTG AG CTCATCTTTTTAT ATGGT 
ATGAGGTAGGAATCTAACCACATTTTTTTGCATATGGAAATTCAGTTA 

ATCAAATGGTCTTGGCACCCITGTTGAAAATCAATTGGT

TCTGTATGTCTGTCCTGTCACCAATACCACATTATTTTGATTGTTGTAGCTTTCT 
TCCAGCTTTCTT C rrr c rTTTCCAGATTGTTTGGGGGGGACAGGAAAA 
TGTAAATTTTAGGAGCAGTTTTTCCATTTGTGCA^ 
CCTTGGGAAGCATTGCCTCCTTAACAATATTAAGTTATTTA^ 

AATTTTGTTTGTTTGTTTGTrTTTGTTTITC

ATTGCAGCCTTGACCTTCCAGGTTCAAGCGATCCTCCTGCTTCAGCCTCCCAAGTAG 
CTCCCAGTGTTTGTATTTTTTGTAGAGATGGGATTTTGCTGTGTTT 

TG CCCTGGCCTCCCAAAGTGTTGGGATTACAGGGCATG AG CCACTGTGCCCAGCTTCTGTAATTTCTTTCAGCCATGTTTTGTAGT 
TTTCAGGGTATGAGTTGTGTACCTCCATGGTTAATGTATTCCTAAGTACTATT^ 

TTTCTTTTCTTCTTTTTGTTTTGAGACAG

TCCATCTCCCAGGCTCAAGTAATTCTCCTGCCTCAGCCTCCTGAGTAGC 
GTAGAGATGGAGGTCTCCCTATGTTGCCCAGGCTGGTCTTGAACTTTTGGGCT 

CTGGGGTT ACAGG CGTGAG CCACTGTGG CC AGCTTGTAATTGTTTTCTTAATTTTTTTT C AG ATTTT C CATTGTT AAT ATATAG AA 
ATTTAACTTTTTAATCAAGAAATACAT T '' 1 "1 T ' l 'l'l 1 1 AGTTTTGATCCTCAACTGTAACCCTGCTTAATT TGTTT GTTAGCTTTGAG 

AGGGTTTTGGTtjAATTGTGTAGAATTTTCTACATATAAATTGATGTC^

TGTCTTTT ATTTCTTTTTCTTTCCTGATTG CT CTGGCTAGG ACTTCCAGTGCTATGTTKSAACAG AAATGGTAA AAGT AGG CAT CCT 

TCCCTCGCTCCTGATCTTAGGGTGAAAACTTTGAGCTGCCTGCATATATGGTGTTGGCT 

TATTTATTTTTAATTTCTATTTCAATAACTTTTGGGA 

TTTTAGTGCACCCCTCACCTAAATAATGTACATTGTAACCAATATGTAGTGTITTATCCCT 

TGAGTCTCCAGTGTCCATGAT ATCACTCTGTATG CCTTTGTGTTCTCATAGCTTAGCTC CCACTTATAAGTGAGAACAT AGGATAT

TGGTTTTCCACTCCTGAGTTACTTCAGTTAGAATAATGGCTTCCAGATCCATCC 

TTAATGTCTGAGTAGTATTCCATGGTGTATATATACCACATTTTCTTTATTTTATT 

AGCCTGGAGTGCAGCGGTGCGATCTCGGCTCACTGCAGCCTCCACCTCCTGGTTTCA^ 

AGCTGGGACTACAGGCACCAGCCACCAAGCCCCACTAATT'r r i G l'rVr 1' 1 CAGTACAGACGGGGTTTCACCACTTTGGCTGGGCTG 
GTCCTGACCTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCTAAGTGCTGGGA
TATACACCACGTTTTCTTTATCCACTTGTGAGTTGATGGGCACrTAGGCTGGTT 
TGAACATATGTGTGCAAGTGTCTTCTTCATATAATGAC ri ' Cri ' 1 ' l 'CCTTTG 
GATAGAACTACTTTCAGCTCTTTAAGGAATCACCATACTGTTTT^ 
AAGCATCCTCTTTTCCCCACATCCATACCAACATCTATTATTTTTTGATTTTTCA^ 

T CTCGTTGTGX3TTTTAATTTGTGTTTC CCTCATG ATTAGTGATCTTGA^

ATTTTGAGAATGTCCATTCATGTCCCTTACCCACTTTTTAATGGGATTATTTGTT^ 
AATTCTGG ATATTCG CCTTTGTCAAATGTATGCTTTGTG AATATTTTCT CCCACTCTC 
TTCTTTCTGTGTGCAGAAGCTTTTTAATTCAATTAGGTCCCATTTAm 
7CA7SAACTATTTGCCTAGGTTGATGTCT™ 
AAGGCTTTGATCCATCTGGAGTTGAT^
TCCCCAGCACCAATTATTAAGTAGGGTGACCTTTTTCCA
TTGGCTTTATTTCTGGGTTCTCTATTCTGTTCC^ 

GCCTTGTAGTATAATTTGAAGTTTGCTTCAAATTAGTATAGTAGTATAATTAGTAGTATAAT 
GCTATTTGGGATCrTTTTTGGTTACATATAATTTTTTC^^ 
CAGTGGAGTGATCTCGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCC^
TGGGATTACAGGCATGTGCCACAATACCTGGCTAATTTTTrTTGTTGTTC 

TG CCATTTT AGTCAGG CTGGTCT CAAACTCCTGACCTCAAGTG ATCCAC C CAC CTTGG CCTCT CAAAGTGCTGGG ATT ACAGG CTT 
AAGCAACCACACCrGGCTAGTTCCGTATAAATTTTAGGAlTG"i M ri"l"lCCTAATTCT 
ATTGCCTTGAjUSCTCTAGA TTGCTT rGGGCAGTA 
CTGTTTGTTTGTGTAGTCTATGATTTTCTTCAGC

GCAGTGGCTTTATTTAACITTCAAGGTTTTCAAAGAC 

AAAACGAGTTTTTTTTTTACCCAGACAGGTTAAAACAGACAAACAATTCCTTGGGGA 
TCCACCCACCrTGGGGAACGCCGGCCTTCACTGTCCCAGGCAGGTTTGTAGGCACCAC 
CAAGTCAAGGTCATCATTTCCTCAAGTTGGCATTCAGTGGGTTGCGGTGACCCTTTGACTAC 
TACTAACCGTTTGTGTCTAATTTTTTGGAGGGCCCTTGGAGCT^

TCCCACCACTCGATGGAGCATCAAGGTCACCTTGGGAGACCCACATGTAGGGCGGAATGTGCATAGTA 

CTCAGCACAGTCCTGCCGGCAGAGGCGGGGCCCAGGGAACCACAGCTGTGCTC 

GACCATGGTGCCGAAGGAAGGCTACTCAGACAAAGCGAGACAGGTCACGCTGGGAAAGTCCT 

ACAACCCCAGTTCTCTGTTGCTTAGGTCAAAAACCCTGGGCCCCTGGCTGCCACCTT
ATGAATGCTTGTGCCCACCCCCCCCCCAATTCCTAC^TTGAAATCCT 

AGGTGGT CAGG TCATG AGG ATAAAGC CTCATGAGTGGG ATCAGTG C CACTG AGGCTGAGGTGGG AGG ATTG CTTGAGG CCAGG AGT 

TCAAGACCAGCCTGGGAAACATAGCAAGACCCCATCTCTGCAAA 

CCTA3CTACTCAGGAAGTGTCACTTGAGTCCAGGTTGAGGCTACAATGAGCCATC 

GAATGAGACCCTGTCTCTAAAAAAATATAAAAATAAAAATAAATAGGCCAGGCGTGGTGGCTCACGCCTG

GGAGGCGGAGGCGGGGAGATCACTTGAGGTCAGGAGTTCGAGACCAGCCT 

AAAAAAATT AG C CAAGGGGGGTGG TGGG CATCTGT AAT CCCAG CT AGTCGG G AGGCTG AG G CACG AGAATCACTTG AACCTGGGAG 
GCGGAGGTTTTAGTGAGCCGAGATCGCTTCACTGTACTCCAGCCTGGGCAACAGAGTGAG 
AAAATATAAATAAATAAACAAATAATAAATAAATAAAAAGGJ^CCTCT 
AAGAGGTCTGACCCAGTGAGAAAGAGAGGTCTATGAACCAGGATGTGTCCTCGCC
GACTTGTGCTGTTTATGAGCTGCCTGGCTATGGTGTTTTTTTAGAG 
CCACTGGCCTCrrGGCTGCTTCTCAAACACTGAGCATGATGCCCCCTCCCrCC 
CACCCTTTCTCCTGTAGTAATTCCCACAGGCCCTCT 
TCTGGTCAGCCCCATGAGCTGATGATTCTGTGGAAGGAC^ 

AAGTCCTTTGGAGGGGAGGTGCCTGGCCCTGGGGGGCCTGGCATGTGCACTGACGAGGAGAGCT

CCTCCAGGGCTGGTGGGGCCTGGCTGAGGCCATCCATGGGC^GAGCCCTCC^ 

ATCACGGGCCTGTGTTCTTCAGGCGTGGGGAAGGCACCGTGGGGCAGAGAGTGGGGACCCGAGGGTCAGGCT 
TGGTCACTGTGGGTCTGGGATGAGGAGGACGTCCCTGAAGGACAA 

TTGGATGGTGGCCTTCCTGTGATCTTGAGTGAGAGGCAGCTCACTTCCTGGAGCATCAGCCTCC 

TTACAGCTTGTCAGACCTCACAGAGCrcCTGGCTGGGGTAATACACCACGCAGCTCCCCT

GTCAGGTCCCAGGGCCGTGCAGCTCATCGAGCGTGTCCAAGGGTGTGTGTGGCGTGTGATGCATTGTGTGTA 
TGTGTGGGAATGTGGTGTGATGTGTCATGTGTGCACATGCATGTGGTGTGTGGGGTGTGTGCCTGTGCATX3TGTG 
GTGTCAGGTGTAGTGAATGTGTATGTGGTTTGGTGGATACATGTGCCT 
TGTGGCTGTCX3CATGCATGTGCATGCCTATGTGTGCATGTGATC 

TGGTGGGATGGAGGGAGGCATGGGCAACATATACACTGCAAAAGGCCAAGAGCACCAGACTGGGCCC

CTCCAGAGCCAGGATGGGGTGCTTAGGGCrrGCCACTCCCCCTCTGAGAAGGAAGCT^CGATTTTAG 
CCTGCTTGGAGCGATACCAGGTGATGGAGGAAGGCTGTGCTGCAT^ 
CCTCGGGAATCTTTAGGATATTTCATTGCCCTCC^GAATGTCAGGGTGTCTCTC^ 
CAAAGCGGGGAGGGCTCCTGCCCATGGACGGCATCAGCTCACT^ 

AAGCTAGCCrCATGAGTCACACCATTCAGGAACCCTTGGGTCTGTC CT CCAAGCTGAGCCTTGGGG AGCCTGTGCATCACTGTCTG

G ATTGGACGATGCCCATGGAAAGGCACAGATCTGCACAGATGGG AATC CCAG CCAGCAG ATTGGCTGGGGGCCCAGCCAGCTGAAT 
GCAAAGCCCCAAAGTCGGTGAATAGCACTGGGTCCTAAACC^ 

CTGCTCTGGAGTCCCCAAATCTATGGTCTAGGAGCCTGGTCATATACCAGAACTTCCriTCC 
TGTATGGATAGACCATATTTTGTTTATCCATTCACTCACT 

CCTTTTGTGTGAATGTGTATTTTCAGTTCTCTCTGGCATATACCTAGGAG

TTTGAGGAACTGT CAAACT ATTCCAGAGTGTCTGC ACCATTTTACATTCCCATCAG CAGGATCTG AGGATTCTGATTTCT GCAC AT 
CTTCATGAGCACTTGTGATTTrCTGCTTGTTGTTTTTGTTCT 

TTTGCATTTCCCTGATATCTAAGGATGTTGAGCATCTTTTCATGTGCTCATTGGCCATT^ 

TCCATTCTTCCACCCACATTTTAATTGGGTT ATTTGTCTTTTTGT C CTTTATATAGTCTGCAT ACAAGT 

C C CTT AT CCAAT ACATGACTTGCAAATATTTTC C CC CATTCTGTGG GTTG TCTTTT CACTTGCTTG ATGGTGTCCTTTGAT ACATG

CAAGATTTTAATTTTGACGAATTCCAAGTTATCAG 
TAACCCAAGCTCAOU^GATCTACTCTTCATTAGGTTTTTTTTA 
ATTTT'CATCGACGATGTGAGGCAGGGATCCAATCACATTCITTTGCATGTGGAA^ 
TGTTCTTTCCCCTTTGAATAGT CTTGG CAACTTTACTGCAAATCAGTTAGC 

CTATTCCATTGTTCCATATGTCTGTCTTTATGCC^

ATGTGAGTCCTCCTGCTCGTCTTTCTTTTCAAGGTTATTTTGGCTATTTGGAGACAATTACAGTT 
CTGRTCAATTTCTGCAAAAAAAGCTTAGATTTTTAGAAATTGTGTTGAATC 
TTATGTCTTCTGGTCCATGAACATAGGATATCTTCCTATTTAGGTCTTTAATTTCTT^ 
AATCTTGCACTTATTTTGTTAAATGTATC^ 

ATTTTTTAATTATTTGTC^CTAGTGTATAGAAACACAAATGAGGCCAGGTGCA

GGCCAAGAGAGGGAGATCATTTGAGGTCAGGGGTTCAAGAACACCCTGGCCAACATGAC^ 
AAATTAGCCAGGCATGGTGGCACATGCCTGTAGTCCCAGCT 
AGGTTGCAATGAGCTGAGATC^CACCACTACACTCCAGCCTGGGCAACAGA^ 
^CA CA CAC ^(1AT ACACACACAAATGATTTTTAATATAGATCTTGTACTCTC 
TTTTTTAATTTGCAGATTtlATTAGGGCTTTCrrACXj
TTCTTCCTTAATTGCCCTGGCCAGAGCCTCC^
CACTGGATAGGATGTTAGCTGCAGGTAGGCCCCTCTTGCTTT^ 
ATAAATAGATAAAAATGGGAGCACTTAGTTACCATGCATCTCTCA^ 
ATGGTGGCCTATCATCAAAATGTGAAAAAATCCTATCAGCCCA.TGACTCAGTTAGGTCCT 
TTAAAGGTGCCTGATTTTCACAAGTCCCAGTCATTATCAATGGCTCCCACTCTTGCTTTCT
CACTCCAGGAAGATCCAGGAGTCTCCTGGAACAAGAGCTGGGACCCT 

ATAGCATATG CCACACAGGTATACATGTGGCG CAAATG T ACACAGTCGGTCCC CTGGGAAAGCACAGAGATG CACTCTG CCACACA 
GCATATGCTTAGAAATGATGTATTAAGCCAGTGAC 
AGGTGGCCCCTGGGCTGAATCTTGAAGGATTTTCrTCCTTTA 
CGACACACCATGGCCAGTTGAATCCTCATX3CCCATCTGTTGAGGC
TCACTGTGACAAAGCAGAGACATGAACCTTGGCAGC 

CTGACGTGGGGGTGAGCAGAGCCCTCAGGCAGGCCCAGCAGTGACAGAGCCTAGCTGGAGCAT^ 
ACCCCAAGCCCACAGGGAGGGCAGGTGCAGCCCGGGAGGAAGCAGAGAAGGTAGGACT 
ATCCCTCCGGGCAGGGCGGGGATGGATCTGAGCGGTCACTGA^ 
GGGGCAAGTGGGCCTCACAGATGCCCAGGTCCTGCCCTAAGGTGCCAGCGTTGGGTTGGGGCG
TTCCTGCTGCTATGCCX^CCTCTCTGCAAAGGGCTGAGCTCCTGCTGTGCTTCT 
GACCCTCTCCTTCTCCGCTTCTCTGATCCTACTACCCCCriTCTCCTCCCCTTTCTGA 
CCTCCTCATGGCTTAAATAGGTTTAAGTAGGAAAGGGGA 

GGAGTCCTGGTGGCCTGGGAGGAAGGAGCCCGCCGGGTGCCTCAGAGAGAGGCCTGCTCAGG 
CGCTCTCCATAATCATATGATTGGATTAGGGTCAGGGCAACCCTAATCATGTCAi^
AGGGTGCCTGACCACCCCTGCGAAGATGGCCITGTTGAACACAGTTAAAAACCCT 

TTGTGTGTGTGTGCACATGTGAGTGTACACCCTAGCGTGTGCATTGTGAGGGGTGCATGTGTGTGCCTATATGTGC^ 

TATATGTGCATGCCTGTGTGTATGCATGTGCATGTGTGTGTGTGTC 

CCTCACACCTCCTGCTTTCATCCCCATCTGCaSTCCACCTGGCCTGGGGCT 

CCATCAGTCCTTGGAACATGGGGCTCTGTCACAGAAGCTGAGCCTGGTCCCTGAAGAGCCCT

GGACATTCAACCTGCCrGGGGTAGAAAGCATTTGAGGAATTTTAAGT^ 
TGCCGCAGGACCGAGGCACTGGGAGGGCTCACTTCCCTGCAGAGCTGGAGACA 

CAAGGACACAGGT ACAGAGAAAGGG CAGGGACGGCTGC AGTGGATG CAG CAG CC CAGGG CC CAGCAG CT GG ATCTCAGG CAGCACG 
TGGCTCTGCAGAGAGCCTGGCCTGCTCTAGGCCAGGTGGGGCATGTGCTCCTTAGAGA 

CCTGGAGCCTAGGGATGCCACCTGTGCCCACATGCT

CCTGCCAGATAGGTCGGGGGGAGTGGAGTGGCATGGGGAGGTATGCAATCT^ 

CCCAGTGGACACTTCCCAGAGCCCTGGGGTTTGCTTCTTCGTACCCAGTGTGGGTGGGGCCT 

CITCACACACCACCCCCTCTTCAAGCATTTCCAGTCCCCACCCCCT 

CACGGCCAGATTCCTGGTTGGTGCCCCAGGACCCCTCCAGCCCCAAAGGGTGGGGAGAGTAATGAAAGGGTCCTGCGCACC^ 

TCAGTAAGCCTTGTTGGGTGCCCCACAAGTCCCCCTTCATCTCTCCAAC^

GGCCTGCCTGCTAGGACATGGGGACACACAGGG CAGCTCTTTAGGGAACCAG CGTCCCCAAAGCCAGGTTGCGACCCCTGACCCCC 

CCCATCGGAAAGTGTTGCCGTTTAAGATGCGTTGGGGTGGGGGGCTCCC 

CCCCAGTGGGAGGGGAACCCCGGCCGACTCCCCGCCGCGTCCGGCT 

AAGGGGAGAGTGCAAATGCGCGGGCGGTTCACACGCCCTTCGCGGTGCTGAATTAAAGCGTCT

GGACGGCCTGGCGCGCGGCGCATACTTCC^GGTTGCTA^ 

AAATATCGCCGGCCATTGTGGGCCTAACTCGGCGTGACAGTGGCGCACAAAGCCGGATTCAAGCGCCC 

CCCGGCCGCGCGGGCCGCCCGGCGGTGTCAGGGGCCGTGGCCGCG^^ 

CCTTTTGTGCAGGCCATGATTACCAAATGCCACCGCGACACCCCCTCCCGGCCGCCTGGCT 

AAGGTGGGC^GCCCCCGAGGGGAGGCGCCCGGGCGTCGGCTCCXSGCGGGCTCCGGCGGGGACCGGGGCGCGGG

TGTGCCCCACAGCAAAGGGTCTTCrCCTrCCCCrGTGTTCAGGCCT 

CCTGTTGTGGCCTCCCGATGGGCACGAGAAGGACTTTGCTTGTCCTTTTTCAGGGGCGATGGTCCC 

GCCACAGTG AC CTGCAGGCCCCTGGCCAGTCTGGTCAGGTGGTCCTAGCTGGG ATCCAAGG CACCCCCCTCACTCAGCCAG CCCCT 
TCCAGCAGCTGTGGCGTGAGCAGGCAGGAGATGGGGGCAGCACCAGGAGGGAGGGACC 

ACAACTCTATGGGTCCCAGGTATGTGGTGGGGGTGTC

CTCCATCCCCCAGGATGCCTGAAGGCCAGTCCCTGGAGTACAGGGAGACTGCTCCTCACTG 

GGCAGGGGGAAGGCCTX3TGGGGTGGGGCATTTGCCCTGCAGACCTGGGGAG 

GAGGGCATTTTTTTGTCACAGTGGAGGTGAGAACTTCCCGGTGGA^ 

GAGTCTGGGCCCCCAGOVCCCGGCTGGACTGGCCACCATGGAGTCCTGCCCTCGGCCAGTCCATCCC 
CAGCCAGACAGTCAGCATTGCCAGGGGT^AAGAGACGAAGCCACAGGTTTCCAGGGGCT
CATGGCCAGGTGAGCATGGGTCAACATCAAGAAGCAGACTGGGGAA 

CCCCAGGATGGGGCCACAGAGG CAGAAC CATAGCCCC CTTCCCAGACT AAGCAGCCCAGCCACCAGCTGCC AGCAGCCAGGGCCCT 
GAATCCCAGAGCCTGTGACCAAGGTACCCACCTCCAGGAAG 

TGGCCATCAGCATATCAGAGACACCCTCCCAACCCCCTTCTCCTGGCAGGCTCCTGTTCAG 
TGGGAGATCTCATCTCAGTTGATGCATCAGAAT 

CCCAGGGCGGCCCTGCAGTGATTCATTGGCCCCCCAGCCATGACCATTGTTGGAAGCCTCCAGTCCTAATTCCCTGATGG^ 
CAGAGGGGCCAGCCTGGGTGAAGCCCTTCAGACTGAGCACTCT^ 
AACAACAATCCCCACCCCCTGACCCAGAACTTTCCCTC 
ACCAACAGGCATGTCCCAGGAGCCCAGGAGTCCCAGGCT^^

TAGGCATGTCCCCCTCAGCCATGCAGTGGGGGCCCAGCCAGGCAGGTGGGCTCCCT^ 
CAGCCATGGGAACACCCATTGGAGCTGCAGGAGCCCATAATGATGAGTTCAGGGACCT 

AAGGATCXKSCTTGGACCTGCTTTTCCCACTGTC^ 
GCAAACACCTGTCCCTGCCGAAATGCTTCTTGGGTCACCTCCCCATGGTAAGGTCAG
TTGCCACTTAAAAGACAGTCACTGCGTAATAAATAGTGGCAACTGAGCAACT 
ATTTGGGCAAATGTTGTTCAACGTTTAAGAATTTGGAAACATGGCCGGGCGCAG 
GGCCGAGATGGGCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCTT^ 
AAAAATTACCTGGGCATGGTGGTGGGCA.CCTGTAGTCCCAGCT 

ggagcttg'cagtgagccgagatcgcgc
ATTTGGAAACATATCCTTCCTGTTTGATGTTCAAACGTCCTTTTCTGAAGTTATGATGGTAGATA
TATATAATATCTACTACATATCTATATCTATCTATCTATCTATCT 
TGTCTCAACAAAATAAAAGCACTTGGCAGTCCTCTGTTAGTCCCCAGATGCTCTCTC 
TAGAAACGAAGGATTTGTTAAGATCCCTCTAGTCCAAAAGTTCTGAGATGCAAGAGACA^ 
TTCCCTGCATCTGACAACTGCACTGCCCTCAGCTGTGATGGAGCCTTGCCTGGCA
TTACCACCTAGAACTCCCTCACCTTCTCCGACTTTATCCATCCATCCAT 
AAITCATCCATCCATCCATCCACCATCCAACATCCAT 

TCCATCAATTC^TCCATCCATCCATCCATCCATCCATCCATCCACCATCCATC^ 
AATCCATCATCTATCCACCCACCCATCCATCTArc 
AGCCTTGCTATGTGTCCTTGTCCTCTCTACTGCCTCCCTTGTGAACTGTAAGCTCC
CAAAAAACCTAACACACACACCAACAAAGCCTCCCACCAGAGATA^ 
ACTCTCTTCTCCTGAAAGTGGGTCATCCAACCTCTGTTTTATTATC^ 
TGTTAGAAATTGTATCATATTGAGCAGCAGTCCACTTTCTGGAGGATT^ 

TGGAGTGCAGTGGTGTGACCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGATTCTCCCACCT 
GGCACTACAGGTGCACGCCACCATGCCCCGCTGATTTTTGTATTTTTAGTAGAGACX;

CAATCTCTTGACCTCGTCATCTGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGCTGTGAGCCACCGC^ 
GATTTT T T TT ACTGTGGTTATTGAGAGTGGGCACCTTGCTTCTrATTCATCT 

AGGGACTCTCAGATCCTGGATGGATGGATGGATGATGGATGGATGGTGGGTGGTGGATAGATGGATGAATGG^ 
ATGGATGAATGGTGATGGATGAATGGTGATGGATGGATTAGAGATGGACAGATGATGGGC7ACAGATA 
GATGAAGGGGATGATGGATGGATGATGGTTGAATGGTGGATGGGTGATGAATTGATGATGGATGCATG^

AATGGATGG ATG AGTGGTGG TAG ATGGTGGATGATGGGT AATGGAT GGACAATGG ATGG ATTGTGGATGG ATGGTGG ATG ATGG AT 
GATAGATGTATGGATAATGTATGGATATGGATGATGAATGGATGAATGGTGGATGATGGATGATAGATGGATGATGGATAGATGTG 
GATGATGGATGGATGGATAATGGATGATGGATGGATGTGGATGACAGATGATGGGTGATGGGTGGATGGATAGATGGGTGGTGGAT 
GGTGGATGATTGTACATGATGGATAGTAGATATATGGATGATGGATGGATGGATGGTGAATGGATAGATGAATT^ 

CATGGTAGGTGGTGGTAGAGGATGTATGATAGGTAATGGATAGATGGATGGAT^

TGG AT AGATGGAT AGATGTGG ATGACGG ATGG ATG AATG AACGAAAGG AAGG AAAGAT AGAT AATTC^ AATTCAG 
TAAATAACTGCTAAATGAGACCACAACAAGTTTCTGGGAAAGATGGTTTTCCCAGGTGGAAGATACTTCC 
GTTATGCAGTTTCCTTTCTTCTTCCTTTCTCCAACAACTGATTGCTGTGTTA 
TAGGTGGCTCAAAAAGTTCAGCCAGGCTCCTTCCCAGGTGACTTGGCCAA 
GTAGCCTGTGTCCCATTCCAGTGGGAACTCCX3ACAGTCCACT

TCTGGGATGGCTGGCACITCCCACGGGCAAGATGCCTCCGTTTCACCCCCATAGCTGACT 
CCTGACAAAGGTGTTTGTGGTAGAGAACTTGGTTTCCGTGGAGGCTGGCTGCCAGAACTCC 
CAGGGGAGCCCTGAGTO^CTCCTCTGTAAACAGAGGTGACT 
ATCTGATCAAAGCAATGGATTCTCTCTCCAGAAAGATGCCC^TGCTCCCAG^ 

GTGAACCTGCTGCAGATTCCCTCTAGAACCTTGGGTGGAATCCAACCCATAGGCT CTGTGCCTGCCACTTGG

ACATTCTGGGGGGTCTTCCTTCTGCATGGTTTCAGGACATTTTT^ 
TGAGGTCTCATTCTGCCTCCCAGGCTGGAGTACAATGGTGT 

TCCTGCCTCAGCGTCCCAAGTAGCTAGGACTACAGATGCACACCACCATGCCGAGCTC^ 

GGGGTCTTGCTATGTTG CCCAGGTTTAACTTTTTATTTTGAAATAATTT CAG ACTTT C AT AAACAT CGC AGGAGTGG CACTGAG AG 

TC CTGGGTCCCCTCGTC CAACCCAGTGGTGCATAACTATGCCAATTGTT CAAAACTAAATGTGTTCTATTCACCACACTCTGGACG

TCACTCAGCTTTCCTTCGTTTTTCTATCAAGATCCCTTTTCTGTCCCAG 
CATGCCTCTGGCTTCTCCT^CTCrrTCCCTGTCCCTCATGCCCCTGAGACCTCTGAG 
TCAATTTGGGTTGGTCTGATGTCTCCCGACAATTAGATGGAGGCTGTG 
TTCATGCATCATCCCTGGAGCCACAAGGTATTGTGTGTCCCATTTCAGGCAAGGT^ 

CCAGGCTTCTCCTGTGCACAGTTAAGGTTTTTCCCATTGTC

TGGCTTCAACTTTAATG CAGX3 CTCTCTTTGCCAGAACAT AGGGTCTCACGCTTTTCTGGCCTGTGG 
ACTTCAGCCTCGGATGGGGGTGTCTGTCCTGGGATCTGAGCCTCTCTGCAGAGACTTCTT 
AGGCCTGCTCCAGGGTTTTCTCAGCCTGGCAGGGAGCTCCACCTCCACATGT^ 
GACCAAACTGAGCAAGAATGAGCTCTTTTCTGTCCAAGTGGAAGCCCTCAGCACCACCCGCCCTTC 

TTGGGGCTGTTTGCCTGAGTTTTGATAAGCAGGGATCTCGAGGCCCTTGTTATGGGCAG

GCCTCAGGCTCTGGTTCTCCTCTTACCAGAAGCTCAGGAGCAAGAGGGAGGGGCTGGCCTTGGAG 
GCCGTGAGGTGGGAAGGCAGACTCAGCTGGGGTGGGGTATTCAGGGTGTGGGATGCCCAGGC 
CAGGAGGGCAGGTGGAGCTGTGTGCCATGCGTGAGCTCCAGGAGCACCCAGGACACTCCTCAACTCTACCT 
AAGGGAAGCAGGGCAGATGGCAGGGGAAGGGGCGCTGACTAGGGAGGGCACGACCTGAGTCCTTTAAA 
GGCCTGGAGGCTTAGAGAGGAAGTGCTCTGCATCCAGCCACACGTGTATTCACTCAGA

GGCCTGTCTGTGAACTGATCAGTTGTGTTAGGGAAGTAACATGGGGTCAGGGGTGGGGGCTTTAAGTTTCT 

GAGGTGGCTGCCTAGCTGCTCCCTGGGAAAGGAAACAGCGGAGGTTGCATGATGGGCCTGGGTC 

CTGCTCTGGGTAGGCrCAGACCCTGCTCATTCCTTGACTCAGCCCACGGCTCCTGCCTTTCATTCTGG 

GGCTGGGCCTTGTGCTCCAGATTGCAAATAAAAGGTG AGCCAAACAGTC CCTG CCCCTAGGC AG CCGCGGCTAGGG AGGGAGTCAG 

CAGCAGGTCC AGCCTG AGCCAAGGAAGGAACCGG AAAGAGCC CG AGGGGAGGAC AAAGGG CTGAGTGCGGCCTGAGGCCAGGGAGT 

GGCATTGAGGTCAGGAGAGGAGATAGATCCAGAGGGGGCCTGGCATGGCCAAACTGGCATTGGGCGAAGACCTCAGCCTGCAGATA 

GTGTGTTGCGAGGGAGAGGTGTGCATGTGTGTGCATGCATGTGTTTGCATGTGTGTGGTGCACATGTGGAGGGGTGTGTGTG 

TGCATCTGTGTGCATCCATCTGTTTGTGTGTGGTGGGTTGTGAATGTA^ 
TATGGTGTGCATGCATGCATATGTATGTGTGTGCATGTGTCTGTGTGTGCATTGCACGAGGCGTGCATGTGGGGGTTGTGTC

TGTATGTGTATGGTATTCATGTATGTGTATGTGTTGTGTGGTGTGTGTGTGTATTTGTGTGTGTGCATGTGGATGGGG 

GTATATGTGGGGTATGTGCATGCATGTGTGTGCATGTGTATGGTATGTGTGGTATGCCTGCATGGTGTGCATGTGTGGGGGGG^ 

TGCATGTATGTGAGTGCATGTGTATGCTGTGCTTGCATGCATGTGTAT^ 

GTGTGTGGTGGGGTGTGCATGTGTGTGCATGCGTGTGATGTGTGCATGT^ 
70 GTGTGTGCATGTGCATCAGGTGGGGTGTGCATGTGTGTGTGGTATGCATGTGTCTGTGTGCGC^ 

CATGCATGTGGTGGAAGGAGGAAGCTGGAGGCTGGGGCCTCCTAGGGGCCT 

GAACCAGGAGGGGCAAATTXX5CTGAGAGGCCCCAGGGCCACCTGGGAAGGCCA 

TGGACAGTCTCCAGCGAGCAAGCAGAGCCCTTCCTGAGCTGGTGGTGGGAAG 

7 5 GCTGTTGGCCAATCCACAGAAAAGTCAACCAAAGC^ 
AAAGCCATAGCCTCAAATACCTTCCACITTGTATGTT CTTGTTATGGGAGCAACGTG OGTTCTTCTGTGCTGGCTGTGTGTGGATG

TGTGTTGGGCCGTGTGGGAGCGTCTGAGTGTGTGCACACTCACTCGTGCCTTTCT 

GAGCCCCAGGGGCrTCCTGTCTCGGTCCTCTCCTAGGCAGCCTC 

CTATAAAGAAACACTTGAGGCTGGTAATTTATAAGAAAAGATGTTTAATTGGCTCAGGGT^ 
5 TCTTGCTTCTGCrTCTAGGGAGGCTAGGAAGCTTCCAATCA 

GGAGCGGGATAAAGAGTGGGGGAAGAGGTGCCACACACTTTTAAACAACC^ 

GCACCAAGCCATGGGACCCACCCCCCATGACCCAGTCACCTCTACCAGGCCCCACCCCAGCCCT 

ATCTAGGCAGGGTTAAATATCC^VGACTACATCAGCCCCATCCACCCGAGGATGAAAGCTCC 

TCACAGCCCCCTATGATGACTCTGTTGTGACCCTAAGGGGTCCATTGCAGGCCTCTGCC^ 
1 0 GTAGCCCTGTATCCCCGCCTTGTCCCATGGGAGGACCCT 

CATCGGAGCTCCCTTCTCCCTCCTGCTCTCCCAGGGCCT03AGGGGAGAGAGAAGCT 

CTCTCCCACAACAGAAGCAGAGACATGGAGGAACCTGCCCCAGGGCACACCTAGCCCCCAGAGGC 

OUUVGCCCAGGTGTCTTCITGCTAGGTCTX5TAAGATCCCTC 

CCCCAGCTCAGACTGCAATGCTTGGCAAACAGCAGGCACTCTGTTCCTCACCT 
1 5 CTCTTTTTATTTTGTTTTTTTGAGACAGGTTCTCACTCT 

TTGAACTCCTGAACTCAAGTGATCTTCCTG CCTCAGCCTCCTG AGTAG CTGGGATTATAGGCCCCCAGCC CnGAGTGTCTTTTATC 

AATGGAAGCAGGGCCTGGCCAGGCCCACCCCCTCTCCTCATGCCCTTGCAGAAGTGTCT 

CATTCTTTCAACCTGTCTGGGGAAGCCTGGCTGGGGTCCTGGCCTCTGCAAA 

CTAGCCTCCAGAGCTTTCTCCTAAATTCCAGCCCTCCCCTACCACCCTTCTGCACAAGC^ 
2 0 TTTCCCCATTTGGGGAAGAAGGAAGCTGTGTAC^AGG 

AGCTGGAGGCAGGGG CATTTGGGGCACTGCiGGCTGG CCCTGGGG AGGCGG 

GAGGCAGCTGTACCGCTGAGCCATGCAAGTGTCGGTGCTTGGGGTGGAGGAAGGCAATGGCAAGTGGC^ 

GTGTGGGGTGGAGCCACACTCTGGGGTCTGCACCCCAGGCTATCTGGGCAGCAGAAGGAATT 

TTCGTTCAGTTGTATGATTATTTTGAGATCCTTTGTGCTATTTTTCT^ 

2 5 GCTTTTTAACCATCCCTGTTAATGGACATTGGGGTACTTCCAGTTTGTGGCTTGTGTGA 

CACGGGTTTCCTCTGGGGCAAACCTAACGGCTGGTGGACGTTTACCCTTG 
CTTTCCCCTCCATCCAGCTGTGAACAAGGGTCAGGGTCTGATTTCTCCACATCCTTC 
TTGCAGCCATCCCCGTAGGTGTGAGGCGGCCCCTTATTGGGTTTTTTGATTTGCACA.TCC 
TCCAGGTGCATGCTGGCCTTCCTTGTGCCTTTTTTTTAAAGACAGGCT 

3 0 ACTGCAGCCTCAAACTC CTGGGCTCTAGCAATCCT CTGGGACTACATG CACACACCACTGCACC 

TTGCTAATTTTAACATTTTTTTGTAGAGATGGGGTC^ 

CCACCTCTGCCTCCCAAAGGGCTGGGATTACAGATGTAAGCCACCTTGCCAGCTGTATGTTTT^ 

GGTGCTATCCTGGCTCACTGCAGCCTCCACTTCCTGGGCTCAAGTGATCCTCCCGCCTCAGCCT 

CAGTTAATATTTTTAGTTTTTGTAGAGATGGGGTCTTGCTATGTTGAAAAGGCT 

3 5 TCT CAAAGTG CTGGG ATT ACAGG CATG AG C CACTGT ACTT AG C CAAAG TTG CTTGTT AAAAAGAGC CTGGT AC CAACCTCCCCT CT 

CTCACTTCTGTCACCATGTAATTTCTTGCACACACTGGTTCCCCTTTACG 
AGGCCCTTGCC^GAGGCAGTTGCTGGAGCCATGCTTCTGTGTAGCCTGTAGATCCGTGAACCAA 
CCACCCAACCTCACGAGTTCCTTTATAACAATGCAAATGGGCTAAGACCCCTGACCrrGCCCAAGG 
GTTAACCCTGCATCTCTCCTCTCTCTACAGCTACCAGGGAGTCGGCCTTTGTCCACGCCATTGCCTCAG 

4 0 GTGACACGCTCATGTGCAGAAGGCACGGGCGCCATCTGTGGCTG CAG CAG CCGCCACCAGGGCTCACCAGGCAAGGGCTGGAAGTG 

GGGTGGCTGTAGCGAGGACATCGAGTTTGGTGGGATGGTGTCTCGGGAGTTCGCCGACGCCCGGGAGAACCGGCCAGATG 
CAGCCATGAACCGCCACAACAACGAGGCTGGGCGCCAGGTAGGTTCGCCGCCC^ 

CAGGGTGTGTGCCCTGGTTCCTTGGGGCATATGGCCCGGTGAGGCAGGATGGTGGCCAGGCTGAGGGTCTTCT 
GGGTGTGCGAAGCTTAGCACCATCCAGCTACACCACAG CATCCAGTG CGCTCCTGGGGATGGCGAGCCCACCTGTGGACTGGGGTA 

4 5 GCCAAGAAGGAGTGATGAGGCAAAGGCCTGGTCAACAGGAGCCCCGGCAGAGCCCAGGGAGGGAGACAGCCAGGCAG 

CGGGGAGGAATTCTGCAATGAGGAAATGCAGGCACACAAGGTGACCCGATGTCAAGGA 
GCTGGGAGAGCCTCCCAGGCAGCCTGGAGGTGAAAGACAACAAGCGACTTGGTGGGCAGAGCACAGAG 

AT CTTGATCTGCAGAGGCCAAC CCAAACC AAG CCX5TGTAAAGCTTCGTGAGCCTGTGTCTT ATTCTGTCTGTGCTGCTACAACAAA 
ATACCTTAGACTGGATAATTTATAAACAATGGGAGTTTAGTACTCACAGTTCTGGAG 
50 AGATACCTTGTCCGGTGAGGGCCTGCTCCTCATGGATGATGCCITCTGTGTCCTCACCCAGGGTGTGGATCGCTCT 
TTATAAGGGCATTGTTTTAGGCCATTCTTGCATTGCTACCCGAGACTGGGTAATTTATGAAGAA 
ATCTGCAGGCTGCACAAGCTTGGTAACAGCATCTGCTCAGCTTCT^ 

TGGT AGCAGGCACGTCACATGGGCCAGAG CAGGAGCAAGGAAG ACAGAGTGGGGGAGAGGT CGC CACX5C7VCAGATCTTGTGTG AAC 
TCACTCATTATCATGAGGACAGCACAAGCCATTCATGAGGGATCTGCCCCCATAACCCAAACACCTC 

5 5 ACACTGGGGATTACACTTCAACATGAAATTTGGAGGGAACACAGATCCAA^ 

CCCTCATGACTTGACTTGATCACCTTCTAAAGGCCCTACCTCTTAATGCTGTCACCTTC 
GGGGACACAGGGTCTTGCTGGATACAGTGGCACCATCTCAGCTCACTGC^ 

GAGCCTTCTGAGTAGCTGGGACTATAGGCGCATGCCACCACACCTGGCTAATTTTTGTATTTTTAGTGGAGA 
GTTGGCCAGGCTGGTCTTGAGCTTTTGGCCTCAAGTGATCCACCCACCTCGGCC 

6 0 CCATGCCCAGCCGAGGGTAAAGATTTTAACCCATGAATT^ 

GTGGTCAGAAGATCAGTGGATGGCCAAGGGAGGAAAGATGGCTGATGGGCAGTGACGGGCGTAAGGACCAAOT 

TTTAACAACATGAAGATGGCAGCTGACCTTGGCTGTAGAATGAGGGGACCACAGAGGGGTGACGAGTG 

TTTAACAGTGAGGAACTAGATAACTCAGACTAATCTTCCTGCAGAAGACAACTrCAGATGCAG^ 

TTAAAAGTGTGGAGTCCCCATAATTACCCAATTCATGATAGATGAGGCAGTGCAGTGCCATGGGAGAAATGGGCTTTTCAATAAAT 

6 5 GGTGCTCAGTCAACCGGGTGGTGCTATGGTCTGAATGTGTCTCCCAAAATTCGTATGTTGTAAGCT 

TGG AGAGGTGGGACCTAAT AAGAGGTGTTTAGGTGATGAGGGCCTTG CC CT CATAAAGGGACCAATGCTCCTGTAAAAGGGCTTGC 
AG AAG AGGGTCTCTCTCTC T CTCTCTTT CT ATTACTTTCCT G ACATG ATGTT C CTCC (HTCTGG AGG ACATGGCAT 

TCATTCAGGGTGCCCTCTTGGAAGCAGATAAACCAAGCCCTAACCTGTC^ 
TGAAAGAATACATTTATGAATTACTCAGTCTCAGGTGTTACGTTGTAGCAATACAGCAT 

7 0 AAAAATAACCCTGX3CTTGCTACCTCACACCATCTGTAAAAAT 

CAAGAAAATATAGAATATCTTTGTGACCTTAGAGTAGGCAAAGGTGTCTTAAAC^GTC 

TGGAAT AACATAGACTAT ATT CAAATT AAGAACTTT CATC CAT AAAAAGACAC CACT ATGG G ACTG AAAAGG CAACC CAT AGAGGA 
CACATTCACAT ACAT AC AT CCACACT CACAT ATG CAGCAAATGT AGAG AAG CT AC AG T AGGAAAAGAACTCTGC AGACT AAG AAAA 
AAJ\AG AT AG AC AA AC C A A.TGOG AAAATTGG CCAAAAAATT AAAAG CCATGTTAT G AAAG AAAATAT AGAAATT C CACT AC CT AT AC 
7 5 AAAAGGTGCTCAATGTTATGAGTCATCAAGAAAATC 
ACACTGGTGTTCTGGGGCAGGTAATTTGTAAAATTTTGTATATTTTGCACTT^^

TTCCATAAAATGATTTTTAAGTGTCAAATAACTGACCATCTAGTGAGGAATTAGCAGCCCCTGAT 

GGTACCTGGGGTAGCTTTTGTCTAGGGGCCATTTGCT 

GAGCTAAGGGGCCCACCAAGGGTGCCATCTCTGGTAAATCCTTCCTCAGTGTAGGCTGTG 
5 AGTAGTCTTGTCCTGTTCACATGGGGAGACTAGAAATCTCT^^ 

CCATACACCTGGAAAAAG CAGACAAAATCCTCTITGGAG AAGG ATGATATCCCTCCTCTAAG CCTATAAATGTTTCTATAATAATT 
TTTT AGAAAACACAATGTTGGCTGGG CGTGGTGG CT CACAC CT ATAATC CCAGCACTTTGGGAGGC CGAGG TGGGCGGATCGTGAG 
GTCAGGAGATTGAAACCATCCTGGCTAACATGGTGAAACCCGGTCT 
CAGGTGCCTGTAGTCCCAGCTACTTGGGAGGC^ 
1 0 CGCACCACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCTGTCTCAAAAAAAAA 
CAATCAAAGATAATCTATCACATGATCTGATGACACAGTGAGT^ 
AGACAATGGAATTATTAGACACAGACTrTGAGGGGAAAAATGCTTACTTAGGTTC^ 

TCAGGAAACTAGAAACTATAAAAATTATATAATAGG CTGGG CACGGTGG CTCGCACCTGTG ATCCCAGCACTTTGGGAGGCCGAGG 

CGAGTTCGAGACTAGCCTGACCAACATGGAGAATTCCCGTCTCT^ 
1 5 TAATCCCJVGCTACTTGGGAGGCrcAGCCAGGAGAA 

CACTCCAGGCTGGG<^ T ' rri " lTl ' ri H l t3GAAACTGGGAG l " rr^ ^ 

ATATATATATATATATATATATATATATATATATATACACAC^^ 

AAGAACAAATAAGGACTTCCAGTTAAAGATGAGAAGCTGAACACTTCCGTCTGC^ 

TTAATTTTAAAATAAATAGATATGTAGAGATAAATATAAATAAAATATAAATAGATACAGGTAGGGACA 
2 0 ATGATAGATATAAATGGATATAAATAGTGCATAGAGATTATACATAGATATAGATGCTAATACAGATAGGATGGA 

ATTAGATAGATGGATGAATAGATAGAGATGGATACTCATGAGAATACAGAAAATAAAG 

GCT AAAAGCAGAT AGATG AAT ACT CAAG CAG ACTCAAG AAAACTG AG TCTTAAG CCAGT AG AGAAGAAATTTGAG AAC CAGT CT AG 

GAATTCCTACTGTCTGATATCnTGAGAAGTTGAGATAAATATGGGGCTGAA 

ACTTTATAGCCCACTCACTCCACTACCCAACTCTTTGTAGCCAGGCAGCAGCCT 

2 5 GTTTTCTGCAAAGCACAGGTCTCTGATCTGAGGGArcCC^G 

ATATGGAAGTTCATGCCTTTCTCTCTCCTCCTGATTTCCTAATGCTAGCTGTG^ 
GATTCTGCTCTGGGGACTCTGACCAACTCAAAAGAGGAGACTTCAAGACAGTT 
AGATCACTCTATAGAGAGGACCAAAGTTGACAAGTCTCTCCCATGTACAGGAACT 
ATACGAGCAGACAGCCAAGGATCATCAGACCTCTGAGGAAGGACTCTT^ 

3 0 ACAAAATTTATGGGAAAGAGACATTGTAAAGAGAGATGAAACTTCGAAAGATGA 

TCCATGAAATTGGAACAGGATGGTAATTTTAAACATTCAGGTTACAAAAGAGCTCTTAGAAAT^ 
AATCCTCAAATTGAAAGATAGTTGAGATACCCTCCCACA^ 

AGAACTGTGGAGCACAGGTTGCAGACCAAACAGGCCACTAAGTGACG^ 
35 CTGTGAAGCTCAGAGCCTACTGCACAGAACAGCCCTCCAAGCTGCCAAGGAGAATX3TTT 
TACCAGACTTTTCGGGAGGAATAATGAAAGCTAGAAGAAAACAGTAATAATACTTTTTG 
GCCAAGAATTCACTTCCCAGCAAAACTATCAATTAAATGAGATAGTAGAAAAGA 

GTAGATAAGTAAGGGAGAATTTGAGGCTGAATCIAGTTACAGGTACATTGATAACACCAACTA 

4 0 AGTCATCATAATATATGACACITAAAGTACTCCCAATACCACCA 

G AGGGGTTGGTGCTTGGG AAGTGTGCAT CAGTGTGGTGG AGT AAGGAG AG AAAACT AACTGC CAC CCAC CATGG CT AG AGG CC AAT 

AAATAATGCCTAATGCTGAAAAATCAAGAGGTGTCAAGACAAGTGTGTTAACAGAGTTGTAGACATTGCAAGGGGCT 

ACCAGAATTCCTGTTTATCTAAACAGGTCTTGGAAACTGTTGGACTCTTCGAAG 



4 5 GAAACAGTCAATATCAGAGGGAGGAGTTCAAGAAAGAGCCAGAAGGGGACGAGGAGG 

AAGCCTTAG ATTAGAACTGTC CAAT AGAAAT AG AATGGG AGC CAC AAATGG CT CAT AGATTATT ATAATTTG TC CCACAAATGACA 

CAAAGATCATTATAAATTTTCTAGTAGCCACATTTAAAGAGTAAACAGGTAAAACAAG 

GGTTTGAGACAGACTCTCGCCCTGTAGCCCAGGCTGGATGGAGTC 

CTCAAGCAATTCTGCCTCTG CCTCCCAAGTAG CTGGGATCACAGGTTCCCG CCACCACGC CTAATTTTTGTATTTTTAATAGACAC 

5 0 GTGGCTTTGCCTCAAGTGAGCCACCCGCCTTGGCCTCCCAAAGTG CTGGG ATTACAGGCATGAGCCACCATGCC CGGCCAGAAACA 

AATTTTAATACC!AAATTTGATTTAACTCAGGATAGCCAACA 






CCTCATGGX5CTCCTGTCTGTCCAGTGGCTGCCACAGTGCAGTGCCACCTAGACTGTTCTG 
CCCTACCCCTTCCCCAAAGAAAGAAGGGAAAAGAAGCTGAGTTCTACACTGCCCATAGGGGAAACCGTT 



CTCTCTTTACTCATGCTCAGGACTCGTGCCTTTCCACCCAGCCGTATACACCTCT 

GGAACGCCTATCCCCCTGCATCCCTGCACTCT'GACTGCTGTGGCTCTTCGTGTATGG 

CGCTCCACCCTAGACAGTGTAGAAGTACTCAGCTAGGAGACGGGAAGAGGGGAGCCCCCCAAGCCCT 

CTGGCAAGGGAGCAGGCACCAGTATCTGCACCATAGGAAGGGGGTGG CCCTGGGGCCCTG CCTGGGAGTCT 

GGGCTTTATGGAGCAGGTAG£CTGCAGGCGACATGTAATGCT 

CCTGCGCGTGCCCCGCAGGCCATCGCGAGCCACATGCACCTCAAGTC 

ATGCTGGTGGTCGCAACCCGACTTCCGCGCCATCGGTGACrTCCTCA 

AGCJVCCGGGAGTCCCGCGGCTGGGTGGAGACCCTGCGGCCGC^ 

TACGAGGCCTCGCCCAACTTCTGCGAGCCCAACCCTGAGACGGGCTCCT 

OXSCATCGACGGCTGCGACCTGCTGTGCTGCGGCCGCGGCCACAArc 

TCCACTGGTGCTGCTACGTCAGCTGCCAGGAGTGCACGCGCGTCTACGACGTGCACAC 

CCTGGACG GG G CGGGCCCTG C CTGAGGG TGGG CTTTTC C CTGG GTGG AG CAGGACTC CCACCT AAACGGGG CAGT ACT C CT CC CTG 



CTC CTCTCTGGTGG CTGGG CTG C TC CTG AATG AGG CGG AG CT C CAGG ATGGGGAGGGGCTCTGCGTTGG CTTCTCCCTGGGG ACGG 
7 0 GGCTCCCCTGGACAGAGGCGGGGCTACAGATTGGGCGGGGCTTCTCTTGGGTGGGACA 

CCCAGTAAGGGCGTGGCTCTGGGTGGGCGGGG CACTAGGTAGGCTTCTACCTGCAGG CGGGG CTCCTCCTGAAGGAGGCGGGGCTC 

TAGGATGGGGCACGGCTCTGGGGTAGGCTGCTCCCTGAGGGCGGAGCGCCrCCTT 

TTCCTGGATGGGGCAGAGCTTCTCCTC^CCAGGGCAAGGCCCCTTCCACGGGGGCT 

7 5 GCCCTCCTGATTAAGGCGTGGCT^CTGCAGGAATC^ 
ctgtaaggttccatccacccctccgtcgagctgggaaggttccat^
ggcccctctccaagcxacctggctttggaatgctccaggcgcgccgacgcctgtgccaccccrtc 
cacctgaccaggggccctacctggggaaagcctgaagggcctcccagcccccaaccc 
agggacttcgcagaggcaagcgaccgaggccctcccaaagaggcccgccctgcccgg^ 
5 ggaactggccit3ctgcgccccaggccccgcccgtctctgctctgctcagct 

cctgccctcgggtctccccaccrcovctccatccagctacaggagagatagaag 

GTCCACAGCCCCTTAAGGGAAAGGTAGGAAGAGAGGTCCAGCCCC 
GGAGGTTTGGGGGGCATCAACCCCCCGACTGTGCTGCTCGCGAAGGTCCCACAGC 
TC^TGGOSGGACTGGAGAAATGGTCCGCTTTCCTGGAGCCAATGGCCCGGCC 
1 0 GGGGAGGCCGCTGAACCCACCCX3GCCCATATCCCTGGTTGCCTCATGGCCAG 

CACCCTCAAGGTGCGGGGAGAAGAAGCGGCCAGGCGGXSGCGCCCCAAGAGCCCA^ 
TTCTGCGTTTTTG^TTTTAATGTTATATCTGATGCTGCTA 

TTT CC CTTTT CTTT CT ATGAAAGAAATTG TTTT AGTCAT AG CTTG TGGGTTTCAAACAATGGGAAAAGTGAAAAAAG AGAAACAAG 
CCATCAAGGCTGTGGTCTGGGCTGGATGTTTGGTGGTGGGTCAGCACGGTGTTGAAAGAGTAG 
1 5 AGGGCCTGGGTCCGGGGGCAGGGG^ATGCCCCAGGGCCGTGATGGGCAGGGGACCTCCAGAGCTGGA^ 
AGGATAGCCAAGCTGAGGAAATGAGGGAACGGCCACTGGCACGAAGGCCTGGAGCAGAGTAGAGCCT 
AGGCACCTGGAGGCCCCAGGTAGGCAGCCCTAAGGTAAGGGGGTTGGCCGAGCCCCCAACAA^ 
CCACCTCCCCTCCCGCAGAGCCTGTACTCACTCAGCTGATACCTTAGCACCA^ 

AGTGGCCTGGTGGTATAGGATCAGCTGGATCCCAGCAGGACCTTGGGATGGGACTGGCCATCAGGAGTTCCT 
2 0 TCGTGGGGCTGTGCACAGGGGTGCCCCCTGTGCGAGCCCCCATCTCCTTTCCCCAGCCAAGGAAGGCA^ 
GGTGAGCCATCCTGCCGACAGGAGCCCTCATGAGGCCTTACTAGGGGTGCTGGGTGTGCCTCTGG^ 

GGGCTGTCTCAAGGCCATGCCTTCCCCGTCCCAGCCCCACGRAGGATCTCTGCTCCACTCTCTGCCCTCCCATGCCTGTCCAGTCC 

CCATGCCCATCTCTGGGCCCTGACTGGATGCTCAGCCAGACCTTCAGGGCAGTGAATG 

AGCCAGGAGCAGCCAGTGTACCTCCCTGCAGGCCX-^ 

2 5 GGCCTTCATAGAACAGGAATGGCTGGCCATCCCCCACT 

GGTGCACCAGCGTCCTAGGCATGGAGGTCATGGGAACACAATCCCCAGGCCTCAAGAGCCAGAGAAGGGCCAGCCCCAGGCAGGAG 
CATAACCCCTCCAGGAACAGTGTCCCCTGACCCTGCCACG CACCTGCTGTC CTGGAC CCTGAGGGCACAGGCCCAGGGCATGCAGC 
TAAGGCTGGGCAGAGGGGCCTGAGGGACTGCTGACCTCCCTCCCCACACCCTCACCCCAAGGTGCCCTC^ 

CACCCTCTCTAGG CCACACTGGGCCTGG CCACCCTCTCTGGGCACCATTTT CAAGCTAGAGCCCTTGCTCCTGACCTTTGTGACGG 
30 GTCCACTAGAAGCCTGGCCTTCCCAAACCCCCCACCTGCCCCTCACTATCCACTTCCTAGGTCCCCCTGGTCCTGTGCCT 
CCTCCCTAAGGCTACAGATCC^GCCCGGCCCCCACCAGAGCCCTGCTCCT 

GCCCAGGGCCTCCTCACAGTGCTGGCATAGTGAAGCCTCACAGCA.CCCACGGAGCAAGAGAGCATACCGCCCTCTACCATACCCTG 
CTGGCTGCACAGGGCTTCTGCCCAGGTCTGGCTGCTGCCCAACCCCTG 

CCCACCCGAGGACAGTCAAGGAGGCCAGGCCCAGGAGGGGATGAGGGCTGAAGTGTGGCTGCT 

3 5 TTTCTCCAGAGCCACCAGCCATGGGAGCACAGAGCTACCCAGGGCCTGTCAGGGCCAAGGGAGAAGGG 

GAGGTCAGGCACCCCTCTCCTTTGCCrCCCAAAACTGTGCCrCAGCT^ 
CTCCCAGCCATGGCCTCATTGGCTGGGGCCGTGAGAC^ 

AGGCAGGGCCACTGGAATTCCCGGGGGGCACCAAGGGCAGCTGTGCTACCGTCT 

ACCCCTGTGGTGCATCACCAGTGGGGGGACCCCAAGCCCGCTCAACCCGCCAGGCCTTCCTGAGCAATAGCCATAAAGGGTCAGTC 

4 0 CTTGGGGGAGGGCAGTGCCCCACAGGAGCCCCTGGTCAGCCCCAGCCCATGGGCACAGTGCCTCTTAGTCGGGACCCAGGTCACAG 

CCACCCTAATTAG CACCT ACTGGG CCCCAGTGCCAGGGGCATTGCAG ATGACAGACACTGACCCAG CCGOT 

CAGAGGGCACGAGATCATGGGCATGGCTGCCAGGTTGCAGGGACCTGAGCAGTCAGCATAGGCCGGGGC^GAGCCCACACCCAGA 
TCCAGGCCAGGGCCAACTCGGGGCTCAAAG CTGAGGTAGG CC CCAGAGAGATCCC CAGCACCC AGGCTCTATCTCCGCTGCTGGGG 
CCCAGAGAGGAGATGGTGCTGGCCTGAGGACACACAGTCTGCTGGAGACCCAGGGGGCCATCTGGGTGGAGAGCCGAAGGTCCCCT 
45 TGCAGGCCAGGTGTGAAGGACCACTGTGAAGATGGTCTCAGCTGTCCCCATCCCCACTGTGGCCCTAGCACAGACAGAGGGTGACC 
CCACAAGAGGTTCTGGATGTGTTTCTGAGGAATGGGGGTCGGCTGGTCrrGCAG 

GGCAATGGGGGCTGGG AGGTGAAACX3CT CAACTCACCCTGTGT AGGAGCAAGG CCCACTCCTG CCTGGGCCrCAGTTTCTTTTCTA 

TGTAAAAGGCAGTGAGCCAAGGGAGGTCTTCCCTCCCACGGGCCTGCTGGCTTGGGCTGCTC 

CTGCO^AGGGCAGGCATCACCCCGGGAGCCCCAGCCCCC^ 

5 0 cCCCAGGGCGTTTATAGCCCATCCTCCCCACCCGCCCTGCAGAGGGAGGAAATGCGCCrrGGTGGGC 

CAGGACAGGAATAACCATATAAAAGGCCCGGAGGTAGTGGGGCCCCACCCCACCAGGGCTCCCCTGC 
ACCCTCCATGCTTTGAAGTCCTGGAGGGGTGTGGCCAGCCCCCAACTTGG 
CTACTTTCAGGATCCTGGGGCCTCAGTTTCCTCCCTTCAAGGAGGATCCT 
CCGCAGCAGGO^TGCAAGGCCAGCTCOIGGTCCCC^TGCCC^ 
55 CCCX5TCCCCAGTGCTGAGCCTGGGCTTGGTGTGCCCCGGGG 

CTGGACCATGCTGGGTCGGTGGGGGCCTGAGAGGCTCATCAGTGGAGACGGCCCCTCCCCT 
AACTCCCCTAGTGTGGGCCTGGCGTCACCZATCCTCCAGTGGGTGGGAGAGGAGAGGGiCAGAGGGCAGGACCCCT 

GTCCAAGGTTGGAATGTGGTCTGTCACGGCCTCCAGGGAACTCCAGGACAC^ 

6 0 CGCAGTCCCTTGGGCTATCCTCAAGAAGGAACATTCTAG AAG CTTCTTGTTGGGGGAAGGCGAGGCCTGGGG AAAGATG CCCCTGA 

AGAGGAAGAGAAGCTGGGACTTGGGGGTCCCACACCCCAGTCCTTGGAGGGGGTCCTCC^ 
TCAGGCCCAGCTTGCCGTCCAGGCCrCAATACGCCTGTTTGTGTAT 

CTAAGCCCTAGCAAGGCTGCCCTGGCCACAGGTCCCCCGATAAGGTGACCCTGTAACTTCATTCCCTGG 
GCCCTCCCCGATGTTTCAGCCGGATGGATACCTCCGCCCITTCCCCTGCAGA 

6 5 CACTCAGAGCCTGGCATCATCAGAGTGGGGGACCCAGGTCCCCGAGCCCAGGGCGG 

TGAAGCCTGGAGTGACAAGCCATGAGAGAAAGTCCACGGGGAGAACCAGAGGGGGACTCCTGGGATC 
CTGCCAGGCTCTGTCCTGAGACCCrGCCCrTCCATCTGTAGGCCCTGCCAT^ 

GAGCACCCTGGGCTGCGGGCCCAGGTAAAACAAGTACAGACCCTGAGCCTGGACTGAGCTGAGGATGTGCCCCATCCTGGAATATA 
AGAAGACACGGCCCTCCCCTGGGGTCTGGGAG ACACAGCCrCAACCTGGGTC CTGGGGGAAGCAGTCCCACC CTGGGGTCCAG CTG 
70 TCGACCTGGGATGAGCCCTAGTTCCCTCTGCCCrACAGTC 

GCCCTCTTTCCCAGTCTCCTCCCCTCATGGATCCCCCTTTCTGCCT 
GGTCCCTGTCGGAGTCITGCTTGGATCTGGAGCCCTGGATCCCAGGGGCATGACCrCCT 

ATC^GACACTTCCTAGGGCTCCAAC!ATCCTCCTGTAAT^ 

7 5 GGCAGCCGGGGAGCCCTCAGAAGGGGCCCCACTGAAGCCTGGGAGCC^ 
GGTGCTAATTGCTGTTTAGCTCCAAGCTGACTCCTTGCT^ I 1 1 1 1 T TTT TACTTGAGC

CTCTAGAGCACAGAGTAAAGAGTGATTTGGCTCrrTGC^ 

CCCAAACACAGGCTGGCTGGOAGGGAGGAAGCAGGGGTGGGACATCCCATCCAGGCCCA^ 
GAGCTGTGGTCTCACAAGCAGGGGACTGTGATTAACCCAGGAGAGCTTCTAGGAAGAG^ 
5 TCCAGGGGAAGCTGCAAAGGCTCCACCATACCCTCAGGTGC^ 

TTGGCTGGGTCACAGCCTGGGTCTCCTTCCAAC^CACACCTCACCTGCCC^ 
GCCAAGGCCCGGTCTCCTGGCAGGAGCAGGAAACACAGAAGTCCAGGCGCAGGCATATGGG 
ATTTCCCCAXKaTATCTCCCCCTCGTTCACGAACCTGAGGTGGGCAGACrATGCC 
TGCTGCCCAAACTCACCCCACCATGGCAGAAGGCCAAGAGGCCAGACTGGCATTGCC 
1 0 GAGGCACCGGCCCTGCAGTGCCACTCCTCAATGGAGCAGGCAGC^ 

GGACATGGGAACCATCAGGAGTGTGCAGGAGGGGCAGTGTTGGCGGGTAGTGTAGACCAGAATGTA 

ATAGTGTAGACAGGGTAGTGCAGAGGAATAGCACAGGTGGGATAGTGTAGGGGGATAGTGTC 

GTAGAGGGATAGTGTAGGGGGATAGTGTGGGGGGATAATGTAGGGGGTATAGTGTAGAG^ 

GGGATAGTGTAGAGGGATAGTGTAGGGGATAGTGTATGGGGATAGTGTAAGGAGGATAGCATGGGGGATAGTGTANNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

OTJNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNlfNNNNNNNNNNNNNNTGCAG^ 

AGAGTGTAGGGTGACATTGCAGGGGTGATAGTGTAGAGGGATAGTGTAGAAGGATAGAGTACGGGGG^ 
2 5 ATAGGGGCATAGTGTAGGGGAGATAGTGTAGGGCAATAGCGTAGGGTGATATC^ 

GATAGTGTAGGGGAGGAAAGTGTAGGGGGGATGGTGTAGAGCGATAGTATAGGTGATAGTATAAAGGGATAGTGTAGTGGGGATAG 
TGTAGGGGAACAGTGCAGGGGGGATACTGTACAGGGATAGTATGGGGGAGAGTGTAGGTGGATAGTC 

ATAGTGCAGGTAGATAGTGTAGGACAATAGTGTAGGTGATAGTGTAGAGGGATAGTGTAGTGGGGATAGTGGAAGGGAATAGTG 
GGTGGATAGTGAAGGGGAATAGTGTGGGGCATAGTGTAGAGGGATAGTGTAGGGGATAGTGCAGGGGTCACAGTGCTG 
30 CCAAGGCACCGTGCTCAGTGTCCATGATGGTTTGTGTGCTTTGTGTCTGGAGCATCACGTGGGGA 

GTCACCCTCCCTGTTTCTGTGTCACTTGGCCTGCCTCAGGTGGAGGGTCAGAGGTCAGTGTGTTCCCCGTTAGTG 

CTGGGCTGTGCCTAACCATAGGTGGCTCTGCTCAGGCCATCCTGGCCCAGTGCTGCC^ 

CTGAAT 

HUMAN SEQUENCE - mRNA

CGGCGATGGCCCCACTCGGATACTTCTTACTCCTCTGCAGCCTGAAGCAGGCTCTGGGC^ 
GTTGGGCCACAGTATTCCTCCCTGGGCTCGCAGCCCATCCTGTGTGCCAGCATCCCGG^ 
C^GGAACTACGTGGAGATCATGCCCAGCXTTGGCCGAGGGCATCAAGATTG^ 
GGTGGAACreCACCACCGTCCACGAC^GCCTGGCCATCTTCGGGCCCGTGCTGGACAAAGCT 

4 0 GCCATTGCCTCAGCCGGTGTGGCCTTTGCAGTGACACGCT 

CCAGGGCTCACCAGGCAAGGGCTGGAAGTGGGGTGGCTGTAGCGAGGACAT CGAGTTTGGTGGG ATGGTGTCTCGGGAGTTCG CCG 
ACGCCCGGGAGAACCGGCCAGATGCCCGCTCAGCCATGAACCGCCACAACAACGAGGCTGGGCGCCAGGCCA 
CACCTCAAGTGCAAGTGCCACGGGCTGTCGGG CAG CTG CGAGGTG AAGACATGCTGGTGGTCGCAACCCGACTT CCGCG CCATCGG 
TGACTTCCTCAAGGACIAAGTACGACAGCGCCTCGGAGATGG 
45 GGCCGCGCTACACCTACTTCAAGGTGCCC ACGG AG CGCG AC CTGGTCTACTACGAGG CCTCGCCCAACTTCTGCG AG CCCAACCCT 

GAGACGGGCTCCTTCGGCACGCGCX5ACCGGACCTGGAACGTCAGCT 
CGGCCACAACGCGCGAGCGGAGCGGCGCCGGGAGAAGTGCCGCTGCGTGTTCCACT 
CGCGCGTCTACGACGTGCACACCTGCAAGTAXK5CACC 

TCCCTGGGTGGAGCAGGACTCCCACCTAAACGGGGCAGTACTCCTCCCTGGGGGCGGGACTCCTCCCTGGGGGTGGGGCTCCTACC 

5 0 TGGGGG CAGAACTCCTACCTG AAGGCAGGG CTCCTC CCTGG AGCT AGTGTCTCCTCT CTGGTGGCTGGGCTGCTCCTGAATGAGGC 

GG AGCTCCAGGATGGGGAGGGGCTCTGCGTTGGCTTCr CCCTGGGGACGGGGCTCCC CTGGACAGAGGCGGGGCTACAG ATTGGG C 
GGGGCTTCTCTTGGGTGGGACAGGGCTTCTCCTGCGGGGGCGAG 

AGGTAGGCTTCTACCTGC^GGCGGG^CTCCTCCTGAAGGAGGCGGGGCTCTAGGATGGGGCACGGCTCTG 
GAGGGCG 


HUMAN SEQUENCE - CODING 

ATGGCCCCACTCGGATACTTCTTACTCCTCTGCAGCCTGAAGCAGGCTCT 
GCCAGAGTATTCCTCCCTGGGCTCGCAGCCCATCCTGTGTC 

ACTACGTGGAGATCATGCCGAGCGTGtfSCCGAGGGCATCAAGATTGGCATCCAGGAGT^ 
60 AACTGCAC CACCGTCCACGACAGCCTGGCC AT CTTCGGGCCCGTG CTGGACAAAGCTACCAGGG AGTCGG CCTTTGTCCACGC CAT 

TGCCTCAG CCGGTGTGGCCTTTGCAGTG ACACGCTCATGTG CAGAAGGCACGGCCGCCATCTGTGGCTGCAG CAGCCGCCACCAGG 
GCTCACCAGGCAAGGGCTGGAAGTGGGGTGGCTGTAGCGAGGACATCGAGTTTGGTGGGATGGTGTCTCGGGAGTTCG 
CGGGAGAACCGGCCAG ATGCCCGCTCAGCCATGJUVCCG CCACAACAACGAGGCTGGGCGCCAGG CCATCG CCAG CCACATGCACCT 
GAAGTGCAAGTGCCACGGGCTGTCGGGCAGCTGCGAGGTGAAGACATG 

6 5 TCCTC!AAGGACAAGTACGACAGCGCCTCGGAGATGGTGGTGGAGAAGCACCGGGAGTCCCGCGGCT 

CGCTACACCTACTTCAAGGTGCCCACGGAGC^CGACCTGGTCrACTACGA^ 
GGGCTCCTTCGGCACGCGCGACCGCACCTGCAACGTC^GCTCGCACGGCATCGACGGCrGC 

ACAACGCGCG AGCGG AGCGG CGCCGGGAGAAGTGCCGCTGCGTGTTCCACTGGTG CTGCTACGTCAGCTGCCAGGAGTGCACGCGC 
GTCT ACG ACGTG CACACCTG CAAGT AG 

Table 23

MOUSE NOMENCLATURE 
ICSGNM Ly6e 
Celera mCG2785 


HUMAN NOMENCLATURE 
HGNC LY6E 
Celera hCG1765592 

MOUSE SEQUENCE - GENOMIC

TCTCACTTCACCTGGGTCTTCCTCAGGACy^GGGGCCTGCTrc 

GA(IAGCCCTGCCTGGGAACACCCTAGGAGCGAGCrTAGCAGACTGCAGATAAG<K3 

GGGAGGGACTTAAGGATTTTCTGACTGCACTAATCTTGT'CCT 

AG^TACAGGAATGACTAAGGCTGAGTTGAGAAACTCXrrCACGGCTAGG^ 
1 5 AACTTTCAGTCATCCTCATGAGATAACATTCTGTCTGGGAAGG 

CTCAGTG^TGTTCTATTGCTGTCAATAGACAC 

AGGGGTGTTGTCTATTATCACCATAGCGAGGAGCACGGCAACIAC^ 

CATCTGG ACCCACAGGCAAG CAGG AAGAGAGTGACACTGGGTCTCGCTTGXSACCTTTG ATGCCT CTCAGTGACA 
CACTTCTTTGAACTGAAAGAACCTAGTG^^AAAACC^ 
2 0 ATCTITGATGTAAAAGCATGAGGTAGTTTAATGACGGAGCTC 

TGGTGGAGCCATAATGGCGGAGCTCCGGTTCAAAACGTATCTCACACAGGAGACAGTGGTT^ 
GGCTTTTATAGAAAAGGGGGTAGGGCTGGGGGAGGAA^ 

GAAGGGCGGGAGATAGGGAGGCACTAGGCCAGTCAGGACATGTAAGGCCTGGGCATGTCCT^ 
CCAGGTTTTAAAGCTCACAAACAACTCTTTGGGCrATTTT 

2 5 AGAACAAGXSCTANNNNNNNNNNNNNNNNNNNN^ 

TGTAACrTCCCTGGGGGGCAGCCGAAGTAGGTGTTTCTATACAT 

TTCACCCTCACTGAACACATGTCTAGAATCCCACCAAGGTGGGACAGGCTCAG 

TGGCACAAGCCTAAGTCACACCATGGATGTCACTGTGAGCCCATGAGA^ 

GGTTCTTAGCATGTATGAAGCCGTAGGTTTGATCCTTAGTACGCATAGCCCTGGTATGGCGGTGCACAGC 
30 . AAGCAGAGGCAGGAGGTAGGAGGATTAGATGTTCATCCTTAGCTACTCAGTAAGT^ 
GTTTGTTTGTTTGTTTGAATTTCTTGITACCATTGOT 

AGGGGCCTGTGGTGTACCTGTTO^CGGCACTAGGATCTGAACATCTAGT 
GAGGTCACTGGCCACGCAAGTAAGATCTGAGTCCXITGCT 

3 5 GATGGCCTAGGAATGAATGAGTGAACAGGAAGTGCTGAGCCACCAGTGGAGTGGGATCACAGATTT^ 

TCCCTCTCATGTGGTGTCACAAGTGGACTGACTAAATTAAGGAATTGA^ 
GTAGAAACTCCTGGAATCACTGCCTGGGTCCTGAGATCACTGCCTTAAGGAGGGTTCTCCT^ 

CAT AACTCCTAG AAG CTT ATCACT CACTCTCAG ATT CCT CTTAG TTGCACAGCATT AAGGGAG CCAG CCAGACTCCTCT ACACAAT 
GCTTCAAAGGAGACTAGGCTTTGTGCCTGAGTAATGACAGAG^ 

4 0 GTCTCTTCTGATGTAACTTGTCTTCCAGGTCCAAAGGAGGGGCCTGT^ 

ATG AGCCACACGTGGTGCTAGACACAAGATGAAGTTCTCTG CCCTCAGGTCCTTCATAGTG ACAG CTTGACAGG AATTCAGATCAT 
TGGAGCAGCCCAAGAGCAAGTATGAAAAAGATATTCTAACTACAAGGAAGACATCT 
TAAGACC^GGGAGGAAAGAGuAAAGCTAAGGCCTCAGATACTGAGCAAC!A 
AACACCCCAAACTGTGGATTCCCAGACCACCAAATGTTACCATTGTTGTGTTGAGAGAG 

4 5 GAGAGAGAGAGAATATGATGTTTGGTGGAAGTGCAGTCAGGTGTGGGTGGAAGAT 

CTTCCCCATGAGGGTCCAGCCATAGATGGTATAGTATAGTACAGTA^ 
GGGAGGGGGGGTTAAGAGGGTAGTNNNNNNNNNNNNNNNNNNN^ 
GGCTGGTGAGGCATCTACACATGCTTTAATATAATCITTCAGGC 
GTATTCACGTATTCCCAAGCCAATTGTTTTATTAATATATCIAGT'C^ 

5 0 TCTAGCCACAAAACTCTCGTAAGATTC^TCATGCTTTTGTCTAATTCCTGAAG^ 

TCTGCCAGCTAGATACAGCTATCTGACTAATTTTCTCCAACACTGCTTTTG 
TTTTGCCCAAGAAGCATAGGCACTGTGGGTTTAGTTCCATTTCTATTTTCCTCA 
CCAATATTCTAATGTCAAGCTTCCCTCCTCrrAGAAACCAGGGACrGATTTTTTG 
GGCTTCTTCCAGACAACATGTGCTTAAACACTGACAGAAACAATCITTTCC 
55 TCGTCTCTTAACGGATGTACAGATCCTCTCATCTGCAGCACGGTCAC 

AAG AAAAAAG AAAGAGAAAAAAG ACTGATCTGCCCGGGTGTCCCTGTTTG AAGG CACCAACTG CTGC AG ACCCTCTGGG CT AATGT 
TTGTCTGCGTG AAAAG AGT CT CTCGAGCIAGATGGGCAAAGAGTCAGATG^ AGTG ACAAACAG ATACACAC^ 
TTGGATCTGAATGTAATGTTCCAGTCGAGCCTCAGACATTTTATAATACAGAGTAACAGAAT^ 
GCGAGAGGAGTGACTTO^AAAGGAACC^GATAAATGTTTACTTT^^ 

6 0 GAAATTTACACTTCAGTGfTTAATAGCACCAGGGGGTTC 

ATTCTCTAGGCCTTGTTAAATTATATCTAGTCTGGAGAATCCTCATTATCAGGATAGTATAT^ 

AGCTTGTGGTAAAATTTTTATCTCAGTGAGGATTTTTGTCTCTrCCT 

TGTAGTGCTTAATTAGTTACTGAATAGGTTAAGCTCCITGCTGCT 

AG AATACAAT CTT AAAAGXSCATTTACAATAAG^TAATACT AAAT AG AGAGCACGTGGATC CATACACTAGACTAACTCGGG AATGA 
6 5 AAAAATTAATTTTATGGTTTATATAGATTAGGGAGAGAGTGACAGACCCCAGGAAGGGAGTTTCC 

ACCAGCTTCCCAGCTTTCGCTTTGTCAAGCTGACrrCCACCAGAGTGCCATGG 

GCTAAATACCTTCTTCACCATAAGGGACTCTTATCCCACCA 

TCCTGGTGTTTTATAACTGCAACAGAAAATTAATGGGATACATCCTGTCTTA^ 

AAAAGTCATTCAAATAAAGCATTTTGGAAGCAAAACTAG^ 
70 GTTTCTCTGTGTAGCCCTGGATGTCCTGGAACTCACTCrcTAGACCAAGCT 

CCAAGTGCTGGGATTAAAG&CATGCACC^CrACTGCCTGT^ 

TTTDU^AACGGGGCATAGGGGACAGTGTTAGATTTGGTCCTGTGTCT^ 

tttpatataCAAGGTCTCTTTTCATCCTAGCTTGACCTGGAC^ 

CAAATGCTTAGATGACAAGCACGCACrACCAGTCCTGGTTTATATGGAGCTGGG 
GTACTCCAGCAAAGGAGCTGCATCTCTAGGCTCTCAAT^

GTTCTTCTTTGCAGAGGAAGATAGTTTATTCTGGAACCA^ 

TGTTCCCAATGTGGCAACAGTTTGAGGAGNNNNNNNNNNNNNN^ 

GAAATAGAGGCTGGCCATAAGCATGTGGAGAGGGGGAAGGGAATGGGAAGAGAAGGGGAATGAGTO 
5 GAGGXSCTAGAGAGAGCAGAAGCGATGGCAAGAAGCCCCTTTTATAGTGAGTCGGGC^ 
TGGAGCCTAGACAAAACACTAACAGCCATGCTGTAGCCT 

TGAGCACCAAAAGCATGTGTGTCCCCAACCCTTATCTGTATGGGATGATCCTGGGCATGGCTGGAG 

ACACAACAACAAGCTGTTTCTTCGCATCACTAGCAGCAAACACGGACTATC 

CTACAGGGC^CTTGGCCTCGGCCCC^CCCrGGC 
1 0 CCTAGGACCAGCCCCTGTCACCACCATCAACTCCTTAGACTGTGAGGAAGAT^ 

AAGAATGGAAGTGTGACTCTCAGAAAAGCAGGTGGAGGAAACTCAGCAGCCCAC 

TGGGGGTGAGGAGGGAAGGGAAGGAGAAGGAACAAACAGGACTGCCAGGCAAA 

TCACCCAGAAGCAGGACGTGGGTCCCAGGGTTCTGCTAGAGGGATAACACTCAGCAGGTGGTO^ 

GCACTTCCTGAGCGGTAGGAATAGAATACTGGAGGACGCTTAGGTAAAAGCAGAAGG 
1 5 TTCAGCACTTAAGATGJVGAACCAAAACCGGATGCCTGGGCTGGATCCTCT 

GGCACCTAGAGGAAGTGGGAAGTGAGAAACAGATCTGTTCAAACCAGTGCTGCAATGACCTC 

CGCCTCATCCCCAAAAGAGGGATGGCCCAGACGGGCTCAGCCCAGAGGGACCTGACAACCTGAAGGGC 

GGTCCAGGACTTCCTCAGCCTTACAGCAGCAGAGATTCCCATGAGGGTTTCAGGA 

AAGCTAAGCCTGCX5CACAGGTCTTAGTATCGGTGATTGCCAGAAGCCCTGGCCTTTCCACCCT 
2 0 AGGTAGACATGCTACACOIGAAGCCACGCCAGGGCTCTGATGCCACATTC 

TTCTGGGTACCACCGGGGACCGCCGCCGTGGGCG CAT AGCTTGGGACCCCTGT CGTAGTCTCAGAGGAGCAGTTGTGCAGCCAAGC 

CAAGGGCACAGCCGGAGTTGCCAGAGGCCAGCAGCCAGGGTCGTTCTCCCCACCAGCCAGAGACCCG 

TTTCGGTATTATCTTCGGGGCCTCTTCACCCGGAGGTGAGTCTGCTGAGCCCGACCTCAGGT 

2 5 GCCACGGCGGGGAAGCACTGGGAGAGAGCGGGAGGCCTTGACCGCAGCG 

CAGAGTGGTTGGAGCATGCGAGGCCCTCTGCAGCGTGCrCTGTTCCTACCATCA 
TCTGGAGAGGCAAAGGCCCCTGCTGAGCCTTCACTGGGTACCCCTATCTCGACTCC^ 
TACACCCTCGTAAACTAGAGCAGGAGGCTGGGGGGTGGGGGTAGGGGT^ 
GAGTGAGGAGCGGGAAGCTCTTTGTCTTATCCAGGATTTTGCATATCAATGCTGAGGTTCTG 

3 0 G CGGGGGTAACCTGAACAACCTTCCTCCGCTGGCCX3 CTAG CCAATAGAC^CCTGGCTTCCGCGC ATG AGCCCTTTGAGTTTGTGGA 

GGTGTTCCTCAGGTCCCCAGGGCCCTGCCCCGTTGCCTGCAGATGTGTGGACAGACAAAACATTGGA 

CAGCTCAGGGCTCCATCGTTCCTAAATCCCACTGTGTAACGAAAAAAA 

AAACCAG AAATGAGG ACAGGCCCAGCTGGGCTTGGCACTG AACTTGGC CACCTGGAG^ CACTACAAGATGTCAGCA 

3 5 AGTCTCAGGATAGAAGAGGCCTAAGGCCAGTCCTGGAAGATACCAAAACTTCACCTCCTCT 

TAAGCCACCTTTGTCTCTGGGACGCCCCTCTCCCACTCAACAGATGGCACCAGTC^ 
AAATAGGATATTGAAACCAAGCAAGCTGGTCCTCTCCT 

AGCCATCn^CTGGCTAGTGTGCTGACTGCTACAGGTGTCAGTGTCCCCAGCGAGCAGGTA 
AAATGGGTCCTGAGCCTCTACCTTGTACAGAAAGGAATTAGTGAAGCCCAGAGGCCTCCAGCCC^ 

4 0 GGGGTGGGGGTGGGGCTGTATTTGGTTCCCAGGGCTCAGGGAAGGTTTTCGGTTCATGCATGCT 

ACC CCAGGACAACCACTT AACTGT CCACTTC CAT ATCAGAGTT CTTGGTTCCCCTTTACTTCGTCTATGGAGCCCCTG AGTTI^ 
AAGGGGTATTAACTGAAAGGTACCIATTAGATGAACTTGGAGAAAGATTTGTAGGT^ 
AATAATTGGCTACAATAGCAGCAGGAGGAGGCCAGGAATTCTCAGGTAGAGATTrGATTAGCAGCA 
TGACTCCTACAGGTTTCAGAAAGTGGGGAGAGGCTCTCAGATGAATGGCTTC 

4 5 ACTGTAATTCATGTAATGTATGTTAAATAGCCCCAAGAGTTGTTTCT^ 

CCAGGACATGCCCGGGCAAGCCCATCGCCTCCCTAGCTCCCACCCCTCTGACCTAAGTTAAATGTTACAGGCT^ 
TGGACCAATCATGTGAAACCGCGCCIAATTCCTCCCCCAGCCCCACT 
GAATCCACTGTCTCCTGTTGTGTG AGATACGTTTCGACCCGGAGCTC CGC CATTAA^ 
TGTTCTATTCGCTGATTCTTGGGTCCACGCCGAATCGGGAGCTGAGTGGGGGTTTCCCCACT 

5 0 GACTAGCTTAAGGCTGTACAAATTGTCCXSTCTGATTG^ 

GTTGTCnTAATGGCTACCCCCAGGGCAGCAGGGAGTCAATACTGCTCT 

GGACATAAGGCAT ATTGGGG AGGTGTAGAGTGGGCCAGGCCCAGGGTCTG CTGGTATCT ATGCCAGGATC CTGAGTGGTGGTGGCG 
TAGCCTGTCTITTAACCAACTGCCTCrcrCACAGTAGGCTTCTTTGGCTTGCGAACCTTCA 
AGAGTCTTCCTGCCTGTGCTX3TTGGCAGCCCTTCTGGGCATGGAGCAAGGTATGGAGCT 
55 CT CCTG ATCTCTCATTCTTTCTCCTGAGTAG ATGCC CAGGG CT CCTCCT G AG C C AG CCCTCCT AGG AACG TCTGGC CTTCCCACCT 

CCTACTCTCT AGCCAACTGACCTAGTTCCC CTGATG CTGTGCTGGCCCAG CTAC ACCTTTGTCACCTCGTGTTG ACTTTAGCCACT 
CCAGTACCAAAGTCTGAAGTCAGTTGTCATTGGCTGGCTTTTTTCCCTGACCC^ 

GTTGGGGATCAAGGGCCCCACrGGAAAGTCACCTTACCCTGGAAGGCTTCCCCAGGCAATAGGCAGGTGTCACCTACAGCA 
CCTTTrcCAGTTCATTCCCTGATGTXSCTTCTCATGTACCGATCAGAAGAACAATATAAACTGCCTG 
60 GAAAGACCATTACTGTATCACGTTATCTGCCGCTGCGGGCTTTGGTGAGTAGCT 
GGGGCTTTCCCCATTTCCTGCTACCCTGTCrGTCrGTCCCCrCATCCTCACITT 
TGAACAAGGGCTGCTCCCCGATCTTGCCCCAGTGAAAATGTCAATCTCAATCTCGGTGTC 

AG CTCCTTCTGCAACTTCAGCGGAGCTGGCCTCGG ACTTCGTGC CAGTATC CTTGTT 
GGCTCrTGCTGCAGCTGAGCCCCTGACCATCCCCCATGTGTTCTCCATATCCCCCCAGCTCAGGAA 

6 5 CCAGGGGACCCTAGGAACCTTCAGCTCCTCCTGGGTGTGTCTAGTTCCCCCTCCACACTCTCT 

CTCACCCATACCTGCTCTGTTGAAGTGGTACTrGGCTACCCT^ 

GGGGTGAAGATCTCCrTGGAGTCrCAAGAGTACCAGAGTCAGCGCCGAATCTTGTGGACACACTGA 

AGATGTATATCTGTGTGCTCAGTGTATCCCTGTGTGAATGAAGCCACT^ 

TACIAGCAGAAGGCCTGTGTCGCCACCAAAACCCCTCCCCCTGGTATCATTGTACC 

7 0 GGAGGACTGCCACCCCTCCAGATGAAGGCrTCCCACTACCCGATGOVGTTGAGTCCCATCCTGCCCT 

TGCTGCTATTCTAGTGCCTCAAATAAACrcTTCACACCCTTTCCTGCTATGTC 
CCAAAGGGACCTGGGGATCCTGCyVGGCCCACAGCCCTGCITAAGCGATTCTGAAAGA 

GATG CCATCAT AGCAGAAGCAGAGGTCTCTGGGAGGCCAG CCTGGTGCATAGCAAATCCCAGGAATCT AGGCCC!ATTTAGlt3A6AC 
AGGCCTGAGAGAAAAGAGAGAGAGCAAAGGCAGAGAGAACTTGAAAGTGTGCTC 
7 5 CATAAAGGCAGGCCCATCAGGCTGGTCACACATTCCCA^ 
CTTCAT AAACTACCCTGT AGTCrAAGCCATGGGAGTCTGG CCCAGGCAAGTG ATACTTGTCCACCCTATAACTGG CTTTGCTTTGC

TTTCTGGTATTTTAAGACGGTCTCGCTATGTTAAGACTAGCTGCCCCAAACCCAGAG 

CATGCACCACCACACCTGGCCCn^AGACTTTTTTC 

TGCTACTACACCTGGCTTCCTGGGGTTCAGTGCTGTGAGTATTATGGACACTC 
5 AGCTATTTCTCAGCTGTTCCrAAGCrrCTGGGCTGTTGGGTGATACC^ 

TATCCGTGTCTGAGGAATACTGCCCAGTGGGGGTCAGACACAGGGACAAC^CAGGGGCT 
GCATTGGCTTTGTGCCATGATTCTCAAGACCAGGGTTCCCATGACGTATA 
GTAGCTCAGCTACCTGAAAAGGACAAACHTTGGGGAAATGTCACCGGGACAGC^^ 
AATATTGAAAGGCCAAATGGAGAAAGGTGGGAGCCAAATCrrTCTGCAAGGAGCT 
1 0 GGTGTTCAGGACAGATCTGAGTAAGGTTTCTAGAGACTGGCTTGTCTC 

GAAGACACCTGGGTTAGACTCCCGGGACATGGGAACCAGAAGGGCACCAGGCAGTGACAG 

CCCTATCTTGATTTAGCTTCCATATTGTGATGGAGTC^ 

CTCTAGTCTGGAGGGCGTGATGGTTTCAAGGCTAAAAACTGGGA 

CAGAAGGGGCTTGAGTTCCAGGAATGACCTGAGGAGAAGGGAGGGTGAGGATTGGAAA^ 
1 5 GAGGGG CCAGGGG CCAAGTTT CACTT C CC AAAAG CAGAATCG AAAGT AT CCCAGAGTT AAGTTTCAATTTCATGAAG CTTGG AGG C 

AGG AAGGG TG TGGAAAG AGGGAGGAAGGGTGAATGGGGAGGG AAGGGTG CACACTGCAC CT CTAAG AAGG CTGT CCAGGTTTGG AG 
CCACCAGCTTCACTCATACCAGATTCAAATGCTAGATCTGATCCCACATGATCCAATATC 
ACCTCCAGACCCTGGCCAGCTTTCCTATTATTGCTGCTAGTTAGTGATGAG 

CT AG C CTG CC CCAACCAG CAAG TG CTT AG CT AC CTCC CTTCTGT CCCAAGGG CT CCAGTG ACAG ACT AG AG ATTG AGGACTG C AAC 
2 0 CTCAGGTGGACGCCTCTCC^CCTGAGCATCATCCAACCAACrrGAGTA 

CCTACTAGGGCTCCCTCTAATAAACAGAGCATATGCAGGAAAAGCCTACTCTGTACCAAA 
GTCTaSTAAAGGGGTCTATACTTCATGGTGGATGCTTTGGGGGCTCTGTC 
GTTGCrTGTCTAAACTCTATCTTTAATTCTAACCATCTTO 
C^GAGGTGTrrTGTTTGTTTGTTT G TTTGrrr GU T- rG TTTACT 

2 5 TATTATTTAAAAAAGCGTGCAGATCAGGGGCTGGAAAGATGGCTCAGCAGT^ 

TCCATGTCCTGCACCCACATGGTGGCTCACAACTGTCTGTAATGCCAGT^ 
ACTGCATGCATGTGATACACAGACAGACAGACAGGTAGACAGACAGACAG 
TTTTTTTTAAAGCTTTCAAACAAAACCACGATACATCAAACATT 
GTTAGAACTGGCATTTGAAATGGAAGCCGTGAAGCTCAG 

3 0 AGGGTTACCGTTGCTGTGATGAAACIACCATC^CCAAAAGC!AATATGGGGAAGAGAGG 

GTGCATCACTGAAGGAAGTCAGGCGAGGCAGGAACCTGGAGGCAG 
TGCCCCTCATGGCrTGTTCAGCCrGCTTGCTTTCTTCCrCCTCCT 
TTTTTTTGAGACAGGGTTTCTCTGTATAGCCCTGGCTGTCCT 
G^CTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGTGTGCACGATCA 

3 5 G ACTGCCTG CCCAGGGATGGCCCCACCTGCAATGG CCTGGGCCTTCCCCCGTCAATCAG CAAGAAAATGCTCTTCGGGGCTTG CCT 

ACAGCGTGAAGAACGTAGGAAGGCATTTTCTTAATTGAGGTTTCCT 
TATCCAGCCCACACCCTGACCCACTTTTCTTTAACTCCTTCCT 
AAGCTTGCTTTCCCTGACGCTGTGGCTGTTTCTCTGACCACTATC 
GGTGCACGCCITTAATCCCAGCACTTGGGAGGCAGAGGCAGGCAGATTTCTGAGTTCGA 

4 0 CAGGACAGCCAGGGCTATACAGAGTAAACCCCTGTCTCAGAAAAAAA^ 

TGGCCACCCAGCCGCAGAAAAGAGCCAATGCAACCACATTGGCAAGAGAGACT 

CTT AGGG CCTT AACATGGGGGTAAGGTTTAAACAGGAGACTG ATAGCA CCCCCCCCCCA 

CCGCXSGTCTCAGTTCTCAAGTCrACAGTCTCTCACAAGTTATAAATCTTTCTCCAGG 

CTTTGGCrCTTCCAGCAGGAAGTATCTGAGGAAATTCCAGTT^ 

4 5 ACGCAGATAAATTTCTCCCAGGGAGGCIAGGTT CC!AAAGG ACTG ATTTG CTTCCCCTTGATCCTT AACTAAA 

TTAAGTTGTCCTTGCGATCTTCTTCCTTTCCTTCT 

TCCTTAGCCTC-CTTCATCAGGCCTCATGGCCACAGCACCTCAGCTTCC^^ 
CAAGCCCTTTCCTGCAACACGTTTCATGTCAACTGGAGCAC^ 

GTCCCATTAGG AAAGGCTGCCTGATTGCTGGAGGG CCTCT CGGCATACTCAGAACTCCG AGGACCACGGCTAT AGCCCAG ACCTCT 

5 0 GGAGAAATTTAGAAGATGGAATGCTGGTCTGTGTACCGGGGCATAGCCA^ 

TC1TACAGGGCATGACCAGAATGACCAAAGTGCTATAGCTCTCCAAGGGGAAGC 

CAGGTGGCAAGAAG CAGAGGG CAGCTAGGAGCCCAAGTCGGGG CCZACTGAGCAGG AAGTGGCTGTGGTCACG CGCATGCATACCGT 

ACACCACTTGTTGGGGATCTGGGGATCTAGGTCCAGCAAAGAAAGCTTCA^ 

TAAAGC^GGCACCCTCAGTCAATCTTTGGTGAAAATCACTCAGGC^ 

5 5 ATGGGGTGAGTAAGTGCTTCCTATTGGCTTGGTTTGGGATTCTCTATCTGCATGGGGAAGGACTTCAGGT 

TGATTTTCATGGTCCTCCTAGTGGGGTGTCTCAGTCACGTGCrCCAGGACAGGAATCTGGT^ 

TTCTCAATGATGAATTTTATGGAAGTTCTGAACTTGCTACAACCTTACCAG 

AACTCCTCCTTTTCCTTTTTATAAAGGAGAGGTATCATTG 

CCTGATTGGTAGTTATTTTTGCAATGCTACTAATCTCCTGCTTTAGAAATTGGAT 

6 0 AGCTCATACAGGGCCAAGGAGGAGCAGAAGCCAAGCTACCIAAGGGGTTAGAGAGC 

CCCAGCGATCCCCCTGGAGCTCTTTGGGCATTTGCAC^^ 

TCTAGCAAAACACAAGTCCCTCCCTGTTCAGCAGGTAGTAAGCCCAAGGCCCTC 
TGTCTGGCATTGAAGGGACTCAAGCGTGTCTGCAGTGGA 

GTGTTTGG AC CAGGG CTG CTCT AC CAG CTCCTGT CATG CCT AATG AG ACAAGG AAA C CCAG TG CAG CAAG CAGAGGGATG AAG AT A 

6 5 TCT CTCT CTCTCTCTCTCT CTCT CTCTCTCTCTCTCTCTCTCTCTCTCTC 

ATAGAGATAAATCTGGGAAACAATTATTATAGGA 

CCTGTAGTGCTTTGCACCAGAGAAGTGCAGGGGGTTGATGTATGTTGATTTC 
GTTTTTGAGGAGGGTCCTACAAGGCAAAAGGG&GAAGAAATGC^ 

TGGGTAAGGGT CT ATGGGTTTTATGGGAACAGCTG CTAATAG CZAGTCAGGAGAGAG AAGCACAGAAAAAAGAGG AG'riTl'l'GGG AG 

7 0 QAGTTGGGGTGGGTCATTAAAGGAAGGAGGTCAGCTGTATATTC^ 

AGACTGATTTTCCCIATTGAAGAATATTAACATGCATAAAGA 
TGGCTATGGGGTATTTGGCATAGCCATTATGGTAGAGTTTACCCATC 
GAAAGCTGAAAATGAGTTTTAGGTTGGTGAGCATGGTGCCACTGT^ 
GACATTCACCTGTCTTTCAGTAJ^GGAAAGCAG^GG 
7 5 CTGGTCCAACATGCTTCAAGTGGGC^ 
TTTGATGGTAAGTGTTGGGAAGGCTAGAGAGAGGCT
CCTCCAGGCITGAGTAGGCTAAATAGACCTTAAGTGC^ 
ATTTGCAATTTCTTACAGGAACAGACTGCCCGCTGAGCTTACT^ 
TTTCCCTATATAAACACAGAGCTGAATATTAAATTTTGAGCCTTGATCAGAAACTTTGT 
5 ATCCTTTATCATTTCTAGCCTCCCTTTCAAGTAGACCCAGTTCTCCATGGC^ 

G CACATGCCTGGG CAGCAGTCACCAGGTCATGAATGTCTTTCCAG<3AGATCITG 

CCTCCACCTCCTCTGTCCTTGATCTGCCAGTAAGAGGCAGG 

AGGAATAGGCTGATACTGTGGGAAAACACAAACGAAATCCTTCT 

ATCATGACATCGTACTAAGATGGGGGTGGGGGTGGGGGTAGGGGTGGTTAAATTTTGGTGTGTGCCAA 
1 0 TGCTTAGAGTTTTTGTAGATGTTAAATATGCTTGGTGTA 
TTTTAGAAGTTGAGCrTTTAGGATTTGATGTATTCTTTCT 

CAGTTT C CCATTGTGTG CAG AAAG TTTTGAACTGAGAACCAGTG AAG AGAGGG CCATT ATT AGT'lT CTTT T ATCTGTTG AGGAATG 
CCCTCTGAAGCTAAGTTCACCAAGAGCTGACAAAAAAAG^ 
CCTATCAGTTCTGATCAAGAAGTAGGGTCGCCCTGGAGGAGAGCT 
1 5 GTCTCCCAACCCAACCTCCATCCCAGAGAACAGGGTGTTAAAGAAGGGACTTCCTTGA 
CCTCTTGTGAACCCTGTCCTCTAACCCCAAGTCTCTGAAGACTGTTGATGA 

GGAAAACAGCTATGGAGACTCCAACTCCTGGGCAGGTTTGCCCGGATGTCTAAGTAGGGAGGAC^ 
AGTTCC^GCCAGAACTCAGCACCTGACTGTGAGGTGGGGCCTGACCCCTCTA 

2 0 ACATATACACTTCATCCCAGGCCTGGACAGAGGGCAGAGAGCCCTAAGAGGCT 

GTCCCCATAGTCCTCTCCCTCCTCTAAACTTCAGGAAATCGAAACTCAAATTCTCCGCCTCT 
ACTGAAAGCTAGCTCCCGACCACCTCTGCCAATTTACTTTCT 

GCCCTGCCCATGGTGAGTCCTAAGTAGGACCCAGAATCTAAGCAGGAAAGTGGAATGGGAGTTCT 

2 5 AAACTCCTATGGGTTGGACGGGAAGGGTCGGGCATCCTC 

TCAGCGTGTGCTTGCTATGGGGAGATTCCATTGACTTGAAAGTTGAAAT 
CGCGTGTGGCCAAGCCTGGAGCCCGGGTTTGAAAGCACCACCCTCCTGGGATAG^ 
GCCGGTGAATTAAAGGGGCAAGGGTGGCCACGGTGGCACATCCAGCTCCC^ 
GAGAACACAGCCACATGCCACAATTGAGTGTCCCCTGCCCTAGAGCAAACTCCCTTCA 

3 0 CTGGGCAGATGACTAAGGATCTTGGTCTAATAATTCCATCTTCCATTGATGAGTCTGC^ 

ACATCTCTCGTGGAGGAGAAATGCACGGGCGCTAGGCTATGTTCCTTCCTCACAGCTGCCTT 

CTGGGACCCCAACACATCACTCAGGGGAGGTGAAATGCAGTCTAAAATAG^ 

AGGGGCTAGGACAGTGGCTTGTGGTTCTCAAACTCCTGGGCACCCA^ 

TCTGTGTATCCCAAGGCCGCTGAGGACT13CAAGCAGTTGGGAGAACAAGGGCCGAGGGCCTGTGATC 
3 5 ACTTAACTCAGGTGAGTGCCAGGAGTGTGCAGGGGCCTTCTACCCGAGACrTAG^ 
TAGTGGTCCTTGATGAAG<3AGACAX3CCCTTGCTTGTCCAAACTC 
GAACCAGAGATTAAATACCTTTTGCAGAAAGGGACGCTTAGGA^ 

MOUSE SEQUENCE - mRNA 

40 CTT CACCCGGAGT AGGC1TCTTTGGCTTGCGAACCTTCAG CAG ATGTCTGCCACTTCCAACATGAG AGTCTTCCTGCCTGTGCTGT 

TGGCAGCCCTTCTGGGCATGGAGCAAGTTCATTCCCTGATGTGCT 

CCAGTTTCATGCCAGGAGAAAGACCATTACTGTATCACGTTATCTGCCGCTGCGGGCTTTGGGAATGTC 
GAACAAGGGCTGCTCCCCGATCTGCCCCAGTGAAAATGTCAATCTCAATCTCGGTGTGGC^ 

GCTCCTTCTGCAACTTCAGCG CAGCTGGCCTCGGACTTCGTGCCAGTATCCC ACT ACTGGGCCTTGGACTCCTG CTTAGCTTGTTG 
45 GCTCTGCTGCAGCTGAGCCCCTGACCATCCCCCATGTGTTCTCCATATCCCCCO 

CAGGGGACCCTAGGAACCTTCAGCTCCTCCTGGGTGTGTCTAGTTCCCCCTCCACACTCT 

TCACCCATACCTGCTCTGTTGAAGTGGTACTGGCTACCCT 

GGGTGAAGATCTCCTTGGAGTCTGAAGAGTACCAGAGTCAGCGCCGAATCT 

GATGTATATCTGTGTGCTCAGTGTATCCCTGTGTGAATGAAGCCACTTGGATTCTGGGGTGGGC^ 
50 ACAGCAGAAGGCCTGTGTCGCCACCAAAACCCCTCCCCCTGGTATCATTGTACCCACCTTGTACTCTGTTCA 

GAGGACTGCCACCCCTCCAGATGAAGGCTCCCACTACCCGATGCAGTTGAGTCCC^TCCTGCCCrCTCT 

G CTG CT ATT CT AGTG CCTCAAAT AAACCGTTCACAC CCTT 

MOUSE SEQUENCE - CODING 
ATGTCTGCCACTTCCAACATGAGAGTCTTCCTGCCTGTGCTGTTGGCAGCCCT
CTTCTCATGTACCGATCAGAAGAACAATATAAACTGCCTGTGGCCAGTTT 
CTGCCGCTGCGC^CTTTGGGAATGTCAACCTTGGCTACACCCTGAACA^ 
CTCAATCTCGGTGTGCCGTCCGTGAACAGCTACTGCTGCCAAAGCTCCTTCTGCAACT 
CAGTATCCCACrACTtXGCCrTGGACrCCrGCTTAGCTTGTTGGCT 

HUMAN SEQUENCE - GENOMIC 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNNN MWNNN>INNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

HNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

ii^NNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNNNNNNN1W
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNIOJNNNNNNNNNNNN^ 
NNNNNNNNNNN1WNNNNNKNNNNNNNNNW 
5 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNKNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNN^ 

OTWNNNNNNNNOTWNNNNN^^ 
1 0 NNNNNNNNNNNNNNNTGGCJUUUV^^ 

AGCTACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTC 
GCCnAGATAACAGAGTGGGGCTCCATCTGAAAAAAAAGAC^ 

CTGCAAC CTGG AGAGCAGTGG ACACTGAT CACAGCCTCACAAGAATATGACCCTCCCTTTTTTCCTTTCCTCCTTCCCCTG CGGCC 
AITTCITTCTTTAACTATTGAAGCCCCAAAACCCTCTTTGGAAAAGGAG^ 
1 5 CCCGGGCATGGCTTCAACCTTGGCAAAGTGAACCTCTGAACTGGTTGAGACCTG 
AGGTTGGGGAAGATGTCCCAAGAGGAGGGACAAGCAGCAACTCAGGGAGG 

TCAAGGTGGCAGTGGAAGTTACAGGAAGAGCCTAAGGCCCCAGTGGTGGGAGCAGATGCTGAAATCCCCGGGAG 
TGCCCTGGGAGAGGCGGGAACCAAGCCTGAGAGCAGAGTACTGGCAGCTGCAC^ 

20 CCACCMAGCCAGCAGGCAGGTCCCAGCCCAGCCTGCCCCGTCCCGCCCCAGCCrcCCCTGTCC^ 
GCCTCCTGGCCAGCCCCACAGTCCCTGCCTrCCAAGAG 
GGCGCTCCCCCACAGGACCCCTGAATCGTGGCCAGAGAAAGGTCAGG^ 
CTGACCCGTCCTAGGAGC^CACAGGGCAGCTGTGGCTCCTTCCCTCTTGAACCCAGA 
TCTGCCAGCCCCAGCAGTAAGGCCACCGGCTCATTCTTCATGGGGCAGCAGCATCT 

2 5 GCCAGGCCTTCAACCAGGTCCCTCCTGACAGCAGGC 

GTGCCCCAAACTCCACAGGCGCCTAACCCAGTCCTGCCAAGGGAAAGTGCTCAAGGCTCTGCC 
ACTCTCACCCGTGGGACTTCTCCCCACAGGCCCCAGCAGCCTCCTCCCX3TCA 

CTAGGGCCAAAGGCCTCTTAGTCATGGAAGTCTCCCTGAACCTATTCTGGTTCGGAGGTTGCCTGGTTTAAAAr 
AAAAACAAAAGCCTCCGCATGACAGCAGGCCCGTCCCAGTCTTATGATCTCrCT 

3 0 TGAGGAGGAGGGAGTGCCGTGTGAGGAGGAGGAGGGAGTCCTGTGTGAGGAGGAGGGAGTGCCGTGTGAGGAGGAGGGAGTCCCGT 

GTGAGGAGGAGGGAGTCCCGTGTGAGGAGGAGGGAGTGCTGTGTGAGGAGGAGGAGGGAGTCCTGTGTGAGGAGGAGGGAGTCCCA 
TGTGAGGAGGAGGGAGTCCTCTGTGAGGAGGAAGGAGTCCCGTGTGAGGAGGAG^ 

GTGTGAGGAGGAGGGAGTCCTGTGTGAGGAGGAAGGAGTCCCATGTGAGGAGGAGGAGGGAGTCCTGTGTGAGGAGGAGGGAGTGC 
CGTGTGAGGAGGAGGAGGGAGTCCTGTGTGAGGGGGAGGGAGTCCTGTGTGAGGGGGGAGGGAGTCCTGTGTGAGGAGG 

3 5 AGTGCTGTGTGAGAAGGAGTGCTGCTACGTAAGAAGGAGGTAGATG 

TGGCCCTTGTTACAGTTGTGGGTAGAAATGGAATCTCTGTGACATCTCCTGCTCCCCT 

ACTCCCTCTCCTTTCTGATCATGGGCCCAAAGAGGCTCATTCAGAGAGGATCAGTCCCATGC 

ATAGAGGTTTACCTGGAGGGGCCACGTGGGCGGCATGGAAGCTCCCCACCCCTTCCGGGAGACGGCGCCCTGTGC 

AT CCATTT AT AATTAACTG CAAATGTG TTT C CCTGAGTT C C CT GAG C CACT CTTG CAAATCAATTG AACG CAAAAC CGGGAGGGG C 

4 0 TGCAGTATTGCTGGGAAAGAGGGTCCTGATCCAGACCCCAAG 

ACAGAGCACAGTGAGAGAAGCAGGTTTATTTGAAATGGCTCCGTTCC^GAGTGGGGCGTGCTCAG 

GTCCTCTGTTAGTGCCTCTGCTTACAAGAAACTGTAAGGAGCTACAATTACACCTGGAATG^ 

GGCTGTGGGTCCCATCAGTGCCCGTTCATCCTCCAACCTCAGGCTGCT 

GGGTT AT CTGG ACAGTCTG CGGG CT TG CTGGG AG ATGCCC C AT ATGG CCACAAAT ATT CTGTAATT AT AATTGGTGTCCAG CTTGA 
4 5 GACATGGCTATTTTTAGACCATAAGCATTAACCTTATA 

TCATGTTTCATTAAACCIAGAGGCCTGGTGAGCAAGCGTTCCTCTAACAGTGGGAACCTCAACGT 

CAGAGGCTGGCCCTCGCAGCTCATGTGTGTGCATATGTGTGT 

CTGCAGGTACCGGAATGGTGTTGGAGGACACTCGGCTGGTGTC 

NNNNNNNNNNNNNNNNNNNNNJTO^ 
50 NNNNNNNNNNlWNNNlWNNNNNNNm 

jjjn^NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

HNNNNNNNNNNNNNNNNNN^^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNIWNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN1WNNNNNOT 

NNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNKNWJNNNNNNNNNNl^ 
60 NNNNNNNNNNNNNNNNNNNNNNN^ 

NJ3N1JNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNlWinmNNNNNNlWNNNNlWN 

65 nnnni^wnnnnnnnnnnSnnnnnnnnnn^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNWW 
NNNNNNNNNNNNNNNNNNNNNNNNN^
70 mWNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNWTO^^ 

NNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNKNNNNNNNlWNNNNNNNN^
ITONNNNNNNNNNNNNNNNNHNNNNNN^^ 

TGGTCCCCAGAGAGCTGAGCTTCTCCTCCGATCCCCCGCAGGCCTCCC 
ACTGCOGCCAGGAGCCTCCCGGCCGCOT 

CGTGGGCCGGGAGGGACAGCCTTGAGGACTAGGGGAGGGGGCACGGGACCTTGCAGAGCT 

CCCCCGGCATATCACCCCGGAGCACTGGAACCCGCCCCCGC'n'C'riXil'i 1 1 GGCACTGGTGGTGCTTGCGGTGAGGTCCAAGGAGC 

CCAG CCTCCCTGAGTGGACCGCCGGGCCCCTC CCCGTTCCGGGACACAGGAGAGCTCCCGCCCCTTGCTGGCTGGG CAGCCCCTAG 

AT ACCTGGCTCCCAGGGG CCAGCTTCCCTGAG CCTGGGGATGAGCCATG AGCCTGCAGCCTGGCCCCAGGGCGGCCCCCCACGGCC 

TGCCGGCTGGCTCCCTCCGGCTCCATGGCCCACCCGG CCTTCCTAATTGCCTTCGGCTCCCACCGGTGCCTTAG CTCAGCCTGGTC 

GCCCAGCGGGTTGGGTGCCACCCAGTGAGCAGGTGGCGGCACCAGCTG^ 

GACAGCTCAGGGACCCCCCCCZACCrACCAGAACCTCACTTAGGGGGT 

CTCCCTGGAGGCGGGGATAGGCCCCAGCCGGACTTGGCACTGTTm 

C CCAGGTCTCACTGTCCTGGCCrGCTGCTCTCCCCTTAGGGCCTCTGCGGG 

TACCCCCAGTGCCOUSCACTAGCTCCCCAGGCCCGGGATGTCCCT^ 

GCCTCCCCCTCCAACACCATGCACTCACAAAACAGAGAATCACGACCCCAGCTGGGTGTGTTCCAG 

G AGGGT CCTG TG CAG AGG CCGAGGAGCAGT ACAG CGAC CCAT CTGGC C CTTTCCTGCTGGTGGGACCAGTGGCACG CAGCCTTG TC 

TCTCGGAGCCCATTTCCCAGCCACAGAATGGGGAGTAGCAGATAC^ 

GGGGAGTGCTTTCCCC CTCAGCCATGCCCCACCTGGCC CCAGGTCTCCATGTG AGGG 

TCCCAAGGAAAATTCTAGATACAGATTAGCTAATATACACTGACAGATACACATAT 

AGACAGGGTCTCACTCTGTTGCCCAGGCTGCAGTGCAGTGGCATGATGATAGT^ 

TCCTCCCACCTCAGCCTCCCAAGTAGCTGACCACAGGCACGTACCACX3CCCGGCT 

CTCACTGTGTTGCCCAGGGTGGTCTCAAACCCCTGGCCTCATATGATCCT 

GTG AG CCC CTGT AC CCAG CCCCAGACT AG CT AAT ATT ATG ACTG AT C ACCATTCC C CATTCCC CATT C C CCCACCCCAGG AC CACT 

GGCAGAGGCCACrCACrCTGCCT TC TTTTCTGTGGTTCTGAGAAGGCTG 

CCAGGTTTCAGGAGGGAGATAGCCTAGAGCATGTCTGCACTGAGTGAGGAG 

TGGCAC^TCCCTTCCCCTCTGCTGTCTGTGTCCTCATCTGCAAAGTGGGGGTGCATGGTCTTG 

CCCCCTAACCAGTGTGTCTCTCCAGAGCAGGACAG^aCTGCTTTGGTT^ 

AGATCTTCTTGCC^GTGCreCTGGCrGCCCTTCTGGGTCT 

GTGCCCTGCTCCACTCCCTCTCCACCCCTCTCCCCTGAGO^CGCrCCAGGGG 

CCG^CCTGGCCACACTGTCrCACTGTGTGTTTGAGTGT^ 

GCCTTCAGCCC^GGGCCTGGTATAC^GTAATTTCrCAGTAAATGTCCA 

GAGACAC^TGTGTCCTCCTTCCGCAGCCAGCTCGCTGATGTGCT^ 

CGACCATCTGCTCCGACCAGGACAACTACTGCGTGACTGTGTCTGCTAGTGCCGGCATTGGTGAGTC 

TTCCTCCCCTGGCCATCTCCCTAGCCCGGGCCGGGGCTCAGCAGAGGCCATTGCTGTCT 

CAGCCTCATTTCCCATGCAGGGAATCTCGTGACATTTC^CCACAGCCT 

GCGTCAATGTTGGTGTGGCTTCCATGGGCATCAGCTGCTGCCAG 

GCAAGCGTCACCCTGCTGGGTGCCGGGCTGCKXraAGCCTGCT^ 

TCCCCCGATCCCCC^GCTCAGGAAGGAAAGCCCAGCCCTTTCTGGATCCCACAGTGTATGGGAGCCCCT 

ATCTGTGCCCTTGGTCCCAC^CAGGCCCACCCCCTGCACCTCCACCT^ 

CCCTC^CTTCTGGGGTGGATGATGTGACCTTCCrTGGGGGACTG 

AGGACCAAGTCCCATGGACATGCTGAOVGGGTCCCCAGGGAGACCGTGTCAGTAGGGATGTGTGCCT 

CACTGCACGTGAGAGCACGTGGCGGCTTCTGGGGGCCATGTTTGGGGA 

TAGCCCCCTGCCCTGGCACAGCTGCATGCACTTCAAGGGCAGCCTTTGGGGGTTGGGGT^ 

CCCCAAATCCAGCCAGTCCTGCCCCAGCCC^CCCCC^C^TTGGAGCCCTCCTGCTGCTTTGGT 

CCCAGCTTCCTGCTCTGAGTGTGGCTGCCCCTTGCGGGGAGAGGCAGAGCACCCCAGGTT^ 

CAGTGCTCTGCCCTTTCTTCTGCCCCAGGTTCAGCAGGTCAAGTGAGTT 
TGGAGGAAAGAAAGCGGGTGCAAGAAGGAGCACGCTCAGGCTGGGGCAG<IACCCAGGGC 
GTCnXXSACCCGCTGGTGCAGAGGGCACAAAAGCTGGGCTC 
GCAGGGGGCTCCC^CGaVGAAGCTCTGGGCAGCAGCCCTGACCC^^ 

ACAGGTGGCAGCCTCCTGACAGAGGAAGCCGGCCCAGGCCGCCGTGTCTGCAGTTGGGTGCT 

GGCTGGAGAGAGAAAACAAGAACAGACACCCAGTGGAGGTGTGTTTGTTCATT^ 

CTCAGCCACTTAGGGCCCGACCCTGAATTTTCCCGGCTGGCCAGAAACTGTGG 

AGCAGGAGGGGCCGGTGGTCAACGTGGAGGCCCGAGATTCCCACCCCTGGCAGCTGATGTGAATAGCCGGGTCCT 

GAGGTCAAGGCAGGTCAAAGACCTGTGTCCTCCTCCTGGGTCCCTGAGCTAGTGATTC 

CACCCATGAGGGGCCCATTGTAACCGAGCGAGAGGGCCTGGAAGAGGT^ 

CTGTG CAAGGACC CGGAGCCCATCAGAGCAGCCTGCAGAGACTGAG CT^G AGATTCCGTCGGGAGTTTGGATTCGCTGAGTGGGTC 

GCTCTAGCATTGAGAAATGCAAGCAGAGAACAGAGGGTGGAAACCAGATGGTCCAGGAG 

GATCCCGGTGTGCAGTGCGTTCACTGCATTGTTCACT 

AGCTGGTGAGAGGAAGCTGCACCCAAAAGCCATGGCCGGGAACCT^ 

CGATGATGGAATTTCTAACAAATGCTTTGTTTCTGCTAAAGCTAA 

AGCTGACTGGTTACTGAGCCTCACAACCACACTGAGACGGGGGCGCGGTCATCCCCTCGTT^ 
GGAAGCCGCTTGTCCTCCTAGGATCTGGGACTCGGGGCCC^GCTCrCCCTCrGCT 
TGACCTCCTGCTGGTGGTCGGTAGGGTAGTCCCGCCCGGTTCCTGCTGCCCCGCCCTCG 
TGGTTTCCAGGAGCAGCGGGAGGGAGCCCGGGTO 

TTCCTGAAGCCTGGGGGAGGGAGAGGAAGAGAGCCTCGTTTCGTTTCGGGGCTCTGGACCT 
ACCCCGCTAGGACCTGGGATTTCCCAGTCTCCACACCGGCCCGGAACTGGGCCCCCT^ 
CCCGCTTCCCCGCTTCCCAGCATCTGCCCGCGC^TGAGCCT 
CCACCGCTCAACCCCTGTGGGGAOTCCTC^ 

TTGACACCCCCGCCCTCTGCGGTGCCTTGCAGGAGTCGCCAGCCTGTGGGCCCGGGACC^ 
AACCCGGC^GCCACAGC^GGCCTGC^CCCACGGTGCCTC 

TGMTGCGAGACCCTCCAGGCTGCirCrGGACCTCGGGGCTGGGGCTGAGCCCGATC 

GCCTGGGGGAAGGCCCTCCAGTCTCTGAGCCCCACCCTGGGGAATTTGCTGGG^ 

TTAATAAGGGAACAAAGTCCTTGGGCGAACACCTGCATTTACTATCTGAGCCA^
TGGGCTGCTGGGCGGCCAGGCGGGGGCATCGGGGCAGGAGCTGAGGGCTGCACTGGGCAACCAC

AGGGGCAGTGCCAGGGCGGGCTGGGCTCACAGCAGAGCCCAGAGGAAGGGGCT 

CCAGAGGCCACGC05CACTGTGGCCGCCTTGTTACG 

T CAG TCT ATCCT CAATTTCTC*TGGGGT AAGAC C CTCCAGGG AC CACAGGATGGGTG AGGTTCAT AATCTGGG CACAG AGTGGGGCC 
5 ACAGCCCTTGCTAGTGTAGCTGAGCTTCTGCCCTGTGGGCAAGAGAGGAGGGTGTCT 

GACCCXJ CCCCAGGACCGCCCCCTTCAGGG AACCTTGAAG CCAGTGGGGACGCG^ AGGCGAGCAGAAGAGJUICGGGCX3GCGGCCCCA 
GGCCCCAGACCAAGTCTGCAGTCCATTAG^ 

AGGCAGAGCCCCTGGGATGCTGCCAGAGAGTGCCCTGCCCACCCCTCACCT 
TCGATAGTGCCAGTGACCAOVGAGGAGGGGCAGCTTTGAGCCTGCATGGGATAGGA 
1 0 ACCtSTAGCACTCGAAGCACTCAAGGCCCTGGCCTGGG^ 

CAGTCCCTCTCACCAGCACCGGCAGCAGGCCTGGGCAGCACAC^ 
GTTCCTGCTGGCATCTGGAGCCCCCAGCATCAAGAGGG 

AGCAGCATGGGGCTAGTGACCTTCATGGTCTTGGGTATGGGTGCATTGAGGGOVC^ 
GCTAGGGGAGGCCTTGCACCCCCGAGCCTGGGAACCTCTGCCCCACTTCTGGTT^ 
1 5 TGCACACAGCTGATGGCCTCATCCCATCATCT CACAG AATGGGAATT 

GAAGGCAGACATGACCTTGAGCAAGACCCCTCTCACCCCACCCCTTCT^ 

C!AGATCCGGAGGGAGGAG<X3GCTGCGGTGAGGAGAGGAGGGGCTGCGGTGAGGAGAGGAG<X5AGA 
GACGCCTTAGGAGAAGTTCTGACTCAATAGAGAGGAGGGTCCT 

CAGTGTTGGTGT CTG AAGCAGGAGAGAAAAGCAGCCGTG AGAC CTCAGGTCACACCCAGGACCAGGCCCTTCCCG CTCAGGATGAG 
2 0 GGTCTCTGGCTGAGCCCACCACTGGCAGGCCCCCGTGCAGACAGAGCCTGGGAGGG^ 
ACCCCAAGCCAGGCACCCCTTGTTTTCTGTGCAGGGAGCAGAGGCCC 
AGGGGAGGAGGGNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNinWNNNNlWNNlWNNNNNNNN^ 

2 5 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

CACCCGAATTTGTTCTTCACGGAGAATCAGAAGACACAGGAAATGGG 
ACCAGGTGCTGKK5CAGAAGGAAGAGAGGGCTGCCTGGGCTGTGGGGA 
CATAAGGATGGGTCGCCTCTCCGGTGGGTGGTCCCTTCCGAAAAGCAGA^ 
TCCATGGCATCTGAGAGGATGGGCCAGGTGAGGTGGGGGTGGCTCCTGAGAGCAGGGAGGAC^ 

3 0 CCCAGAAGACCCCCCCAGTGACTTGGGGCAGCTGTG<K3CA 

GAG C ACAGG C CAGACCTG AAGGGG CAG CG C CAGG GGGG CC CATGG T CCACGGGTCACTCTG CTGGCCCTGCGGCCACACTG CT CAG 
GTCCCAGGGCCTGATGATGGCTGAGCCAGACATCATAACACCAAGCAATGGAGAAGTTAGAATAT^ 

TTTTTG AAT CTT AAAAT AT ATG TTTT AAT CTG ACATCT ATAAG CAGTGCATGTT ACATGC CAGT CAG AG CATGAG CAT CCCCAAGA 
GGTGTGCACTCTGTGTATCTATGGATAATGAAAGCACGGCTTTCACACAGCA^ 

3 5 CTTCT ACTTT ATTTGAAAG AG T ATTG ATTGTG TC CCACT CG ACTG ATTT TACACC CT CT ATG T ACATGGT CCCCAG TTT ATGGG CA 

GGGCX5GTTATTTTGTGTTTTATTTTGTTTTTGAGACGGAGTCTTGCT 
CACTCCAACCTCCGCCTCCCGGGTTCCAGCGATTCTCCCGAGTAGCTC^ 
TTATTTTTAGTAGAGATGGGGTTTCACCATGTTAGGCTGGTCTCGAACT 
AGTGCTGGGATTACAGGCATGAGCCACCACACCTGGCCrGTTATGTTTTT^ 

4 0 GCAGTGGTACAATCACAGCACATTGCAGCCTCAACCTCCCTCCCAACCT 

CATCTCAGGGCTCTGAACCTGAGCCTCCCACCAGGG ATTTCCC^ CGCCCTCCTCTCTG CAG 

CCCCATCCAGCCCCCACTTCCCAGCATCTGCCCACGGATGAGCCTCCGGAG<?ATGGGCAGCAAGTCCT 

CCCCTTGACGCCCITGTTACAGAGGCTCTTTTCACAGC^ 

4 5 TTCTATGGGGTAAGACCCTCCAG<^CCACAGGATGGGTGAGGTT^ 

GTAGCTGAACTTCTGCCCTGTGGGTGAAAGAGGAGGGTGTCTGGCACAGGAAGCCCCACCCGACCCCCTT 
CGCCCCCTTCAGGGAACCTTGAAGCCAGTGGGGACACAGAGACAATC^^ 
AGTCCTTCCAGGCTCAAG^GATCCTCCTGCCTCATCTTCCAAGTA^ 
AAAAAAAAAATTTATAGAGATX3GGATCTCACTATGTTGCCCAGGCTGATCTTGAACT 

5 0 CCTCCCAAAATGCTGGAATTGTAAGCGTGGGCCTCTGCACCCG^ 

CAGAAAGCACCCAGATGCCTTTGGGAGAACCAGCCTCCTCCATGGAGGAAAGCTTG 
GATCTGTGGAAAATCCTTCTGACGGACTTCCCCTCAGTGCCTGATCCATACTCA^ 
GCAGATACACX^AGACAGTTCCCCAAATAGCTGAGCGAATAGCGCAGAAGCAATATTGAAGACCTA^ 
ACTGATAAAGTGCATCCAGCCACAGATCIAAGCAGCCCAGAAAATTCCAGG 
55 GAAAATGCAGAAGACCAAACAAAAAAGTCCTGTCAACAGCCAGAGAAAAA 

TGTGGAAGCCAGGAGAGACGACAGCCCTGAGTTCCACAGGCAATGGAAGAAAGACTC 
TGCACTAAAAGAAAATCTTTTTTTAATTAATTAATTAATTTATTTATTTTAC 

ACAG CTT ATTGTAGCCTTG AC CCAGGCTCAAGCGAT CCTCCCACCTCTGTCCCCG AAGT AGCT AGGATGACAGGTGTGCACC^ 
TGCC CAGCTAATTTTTGTATTTT CATAGAGGCGGGGTTTCTCCCTGTTG CCCCAGCTGGTCTTGAACTCCCTTG ATCAAGGAATGC 

6 0 TO3GACTACAGGTGTGAGCCACAGCACCGGCCAAGAAGGTCTTTTAAAAA 

AGCACTTTGGGAGGCCAAGGCGGGTGGATCACAAGGTC!AGGAGTTCAAGACCAGCCT 

AAAATTAGCTGAGTGTGGTGGTGGG CACCTGTAATCCCGG CT ACTCGGGAGGCTGAGGTAGGGAATTGCTTGAACCCGGAGGCAGA 

AGTTGCAGTGAGCCAAGATCACGCCACTGCACTCCAGCCTGGCGACAG 

AGAAAAGTGATCTTTAGGCTAGACATGGTCCATGGTC^CTCTCACCTGTAA 

6 5 CTTGAGTCCAGGAGTTCCAGACCAGCCTGACCAACATGG AGAAACCTGT CT CTACTAAAAATACAAAAAAATTAGCCGGGCGTGGT 

GGCGCATGCGGGTAGCCCCAGCTACTCAGGACGCTGAGGCAGGAGAATCGCTTGAAC 

ATTGTGCCATTGCGCTCCAGCCTGGGCAAGAAGAGCAAAACT 

GTGGCTTGTGCCTGTAATCCCAGCACTTrGGGAGGCTGAGGTGGGTGA^ 

7 0 GAAG CAC^ AGAACCACTTG AACC CAGG AGG CGGAGGTT GCAG CG AG C CAAG AT CTCG CCACTG C ACG CCAG CCTGGGCAACAGAG C 

gagactacatctcagaaaaaaaaacagtgatctttagccac^ggaaaatggtcccat^ 
agcacac^ttcgtagaaatgtggtgagtctaaatgaacatcaattctattaaaaaagataaaa 
gcctgtaatcccagcactttgggaggccgaggc<kk;cggatcacctgaggtcaggagttcaagacc^ 
actccatctctactaaaaatacaaaaaaaattagcaaggtgtggtggcacatc 
ggagaatcacttgaacccagaaggcctctcacgccattgcattccagcct
AAAAGTTCTCTCTTGTGGGAGATTTAAGTTTTTC
AATGAGGCGAACCCACTGGGCGGTTACACAG 

AGCCCAGGAGTTCAAGACCCACATGGGCAACATAGTGAGATGGAGCAGGGACC CCTC CTAGGTGCCTGCCAGGCTCTCCCAAGCTT 
GGAAATCAAGGAACACTCTTGAATCCCTTOVAGGGAAATTCC^ 
5 AAGGCG ACGGGAG CACGGGT CTCC CAAGCAGGCCACAGC CACAGGTGGT^ C CT AT AGAAACT AAAGCAT AACAT ACATT C 

TCAAGTCTTTCAGAAACCCCCACCCX5GTGGAAAATGCCGACTGCTATCACAGAGA 
TGGCCGTGTTCTCTGTCATGAATTTCTTCTTGAG 

CAGCACCCTTGGCATAACTAACTCCATCTTAGAGACTCCATTGTATATCTrACAGGG 
CTTAATAAACAAGTTAAAAATAAAGCCTGCATCCAAACAGATGAGGACGCAAGC^ 
1 0 AGGCCTCTGTGATTGCAGAAGGAAAGGCCC 

TCCTATCTCTGTTTCCTCTGGATGTT 

HUMAN SEQUENCE - mRNA 
1 5 G CTCCGGCCAGCCG CGGTCCAGAGCGCGCGAGGTTCGGGGAGCTCCGCCAGGCTGCTGGT ACCTGCGTCCGCC^ 

AGGCTGCTTTGGTTTGTGACCTCCAGGCAGGACGGCCATCCTCTCCA 

TGGGTGTGGAGCGAGCCAGCTCGCTGATGTGCTTCTCCTGCTTGAACCAG 

TCCGACCAGGACAACTACTGCGTGACTGTGTCTGCrAGTGCCGGCATTGGGAATCTCG^ 

CTGTTCCCCGGCCTGCCCCATCCCAGAAGGCGTCAATGTTGGTGTGGCITCCATGGG 
2 0 ATTTCAGTGCGGCCGATGGCGGGCTGCGGGCAAGCGTCACC 

CGGTTTGGCCCCTGACCGCCCAGACCCTGTCCCCCGATCCCCCAGCTCAGGAAGGAAAGCCCA 




TTCCCTGGAGTCTTACGGTCCAACATCAGACCAAGTCCCATGGACATGCTGACAGGGT 

2 5 TGTGCCTGGCTGTGTACGTGGGTGTGCAGTGCACGTGAGAGCACGTGGCGGCTTCTGGGGGCCATGTT^ 

AGCAGCCTGGAGAGCCTGAGTCCCTGTAGCCCCCTGCCCTGGC^ 
TCTGCCACTTCCGGGTCTAGGCCCTCCCCAAATCCAGCCAGTCC^ 
GTG CCTCAAATAAATACAGATGTCCCC 

HUMAN SEQUENCE - CODING

GAAGAGCAATCTGTACTGCCTGAAGCCGACCATCTGCTCCG^ 

GGAATCT CGTG AC ATTTGGC CACAGCCTGAGCAAGACCTGTTCC CCGGCCTGC CCCATCCCAG AAGGCGTCAATGTTGGTGTGGCT 
T C C ATGGGCAT C AG CTG CTG CCAG AG CTTTCTGTG CAATTTC AG TG CGG C CG ATG GCGGGCTG CGGGCAAG CGT CAC C CTG CTGGG 

TGCCGGGCTGCTGCTGAGCCTGCTGCCGGCCCTGCTGCGGTTTGGCCCCTGA

Table 24

MOUSE NOMENCLATURE 
ICSGNM Rasa2 
Celera mCG21645 

HUMAN NOMENCLATURE 
HGNC RASA2 
Celera hCG16062 

MOUSE SEQUENCE - GENOMIC

TGGAGCTCTGATCCTGAGGGCCCGCATGAAGGCCAGCTAGGTATAAAAATGCATCT 

GGAAACCACGAGACAAGCTTGCTACCTAGACTAGCCAAACTGGCAAGCTCCAGC^ 

GATAGAAGACAAGGAGACCTGTAACCCCCTAACCTCCACACAAATGCACACAGCT 

GATGCAGGTACCTGTGGTTGTCAAGACTCAGTGACATTTTGTC 
1 5 CAC CTGGGTTCAGG C C^TCT AACACCT ACCITTCrCAATGGGATGACGTCAGGCCTCACAAG AT AC CC CTGAAAGCGCAACCATTT 

GTCTGCCTTCCGCAGGAACAGTAAGTTATTCAGATTCCAGGCTCTAATGGTC 

TTCTGGAGGGAGTGTGAAATCTGTATGTATACAGGAGTGGCCTTTGGATT^ 

AATCAACTTCTTGTGTCTCAGCrTTCCrAGGTTTTTCT 

TGAGCGTCCTCAAAGACCCCCCTTTGGTGGTGCTGTGGTTTCTTCTGTTGTCT 
2 0 CCACTAGCCTGCAACAGAACCACACTGTAGCTGCCTCCTCAATTCCCAGTC 

CTGAGCCCTACTCTGGGGCTAGTGTTAGACATGTTGCTAGTCTCTTGTAAAAACATAAG 
AGGAATCCCAAGGTGAGAGGCTGAGGCTCTTGCTCTAGGTAAOVGGAAAGGTA^ 
GACTGAGCTGGTGGGCTAAGGAAGTACAGAAGCTGAAGC^TAAACTGAGGAATGTGGGTO 
ATC^CTGGTGCATGGGGGAGGGGGACTGCTAAGAGCACTGTTTGAGTCT 

2 5 TCTTCCCCCTGGACCCTGACTCCCCCAGAGAAGGTGGACACCTCAGTGAGAACAAAAAGAGGC 

CAGTGT CCTGCTT CAACTCTGCTG TTAG CCTC C CAGATGGTG ACAGG CCTG AT ACIAAAGGAG AAT AGAAGCCAT AGCTTTG ACT 

TCTCCTCAAAATGATGCTTAAATTCTTGAAATGATATTTCGGGGGCACCTAAAAGAGGCTGAAAGGGT^ 

CACTTAGAGCTAGAAAAGAAGTAACATCGGGAGAAGAAGGCGAATAAAA^ 

3 0 CCTGTTCAGTCTCCAGATTGGTGTATCTACTTCTCTACTCAAATACTTCTCTTTCTGCTGCTCT^ 

CTCTCACATOAAAATGCCAAGGAAGTGGCTCAGAAATTG 

TCCTCACCACTGGAAAACTTCCCAGTGGGCAATGGGGCTTCTGAAGAT^ 

CAGGGTGCATTGGCTCAAATCCTAGAGAGTCTCTGTGGGTCAGCTTTGCT 

CGTT AG CT ATG AAGTG TTGTCGCGTCTAGTGG AG CATG T AGCAT CT ACCTG CAG AGGCCGG CTCGAGG AG CTTTGT CAGT ATTTT A 

3 5 CX^GCCAGCAC^CTGCCTCTGCTTTGTAGAGAGAGAAGAAACTCAGCAAAGAATTCCCCTTGTC 

TCATTGCCCTTCCATTCCCAGCnTTGTTTCAT 

CTTGCATGTGTGTGCCTGGAGGCCGAAGAGGCGCTGGATCCCCTAGAACTGGAGTTACAGACCATCATC 

TGGGAATTGAACCTCAGTCCT CTGGAAG AGCAGCCGGTGGCT CTTAACCACTG AGCCGT CTCTCCAGCCTCAGCCAGTTCTTCGTG 
AAGCTATACTCTCTCTTGGCTGAGACTTGAGAAGATTTTGAAAGTATC^^ 

4 0 TAAGGAGAGGAGGGGGCACCGGAACCTTCTCTGAGCA.GAGCCACGTGGGTCCACATTTCCTTGACT 

TCTGTACTAGCCAGGTTAGTG ACAGG AACACTGGGAGG CCCT AG CAGTATCTT AAGAGTGCGGGACTCCATTAGACCATGTCCTGA 
AGCTCCCA.TGGCTGCTGGGTTGGCAGGGGCTGCCTGGCCTGAGGTACTCTAGCCT 
CTCTGGGATGCTGGATGAGTGGATAGGGCTGCCCAGACAAATGAAGAAATGTCCTGGCACAGAGTC 
GCTC!AACGGTTCCACCCTTTCCTGGCTGTTGTTTTATCAGTGAAAA 

4 5 TCIATAATTCAAAAGGAAGGGAGCCTCGGGACACCTGAAGTCACTCTGG 

C CTTT AACCGCTTCCT CCAG GAGCTAAACTTCTTTG AAAG T A CTGG AG CCTTCGGGAT AAAAAAAAAAAAA 

TGTCCACATATTITACCAGTTTTGTCATGGCCTTGTTTGGTTCTG 
CCCATGTCCAGCTTTGCTAAGAAGAAAACTGCTGCCTGGACT^ 
ATGAATGGGGGCACCAGATCATCACTGATGAAAGCAACTTGCCTCCCAGCCtAG 

5 0 AGATATACGAACATGACCTACATTTAACTACTAAAAAAGCACAAAC^ 

GGCCCAGGGCGTCAGCCAACCCACTTGAGCCCAGCAGCGAGGAGCCAGGAGCCA 
CTACAGAATGAGACCCTGTCTCAGAAGTAAAATAAAACAGAGGCTGGAGAAATGGCTAAGC 

GG ACCTGAG C TTGGTTCG CAGTACACT AATGG AGG CT AG CT AAGAACTATCT AT AGACTT AGCT CCAGG GGACCCAATGC C CT CTT 
CTGACCTCTTCTGGGTGCCAGGTACTCGTGTGCTGCGCACATACACACATACA 

5 5 AAAATATTTAAATGAAAGTAAATACTAAAATATAATAAAACTAAAAACTTCGGACCAAA^ 

CTACTAATAGGATGTACTAGTATTCAGAAATAGGATGGAACTGACCAGCTCTAGAAATAGTCTCTAGCAT 
CATTAACAGAAACGGCAATTTTCACACAT<X>CACCTTG 
TAAGAAGTCAGAAAAGACAGGGCIAGTGAATGACTGCTGATGAGAA 
TAATAATATGGGGAAGGAAGTGAACTGCAAGATCGTGTCTGCTGCAATATTGTC 

6 0 T AG ACGAATTT AAAACCCAG CG C ACAAAG CCAT AGACAAAATGAG ATGG AAAAGCAGTCTTG GAG CTC T CT 

GTAAAACTCGATTCCTGAATGCTTTATATTAAGGAACAAAGACTAGTC 

TAAAGTGCTCCTTCAGAAATGAAGTAATTAGAAGTGACTTGGGATGAAAAGGAAGA 

ATGATTTCAAACTCCACTACAGGAAGATGTCCTGGGAGGGAA 

ACAAACT AG TAACAG AG TT CATG AAAAACAAGGG AAGTT AG AAGAAC CT ACTGT AAGCAACAGT ATTTTTTCAAAG^ 
6 5 AACAAAACAAAAAAACAAAAGAAAAG CT AGCTTT GG AAGT CT AC CTGG AGGATGTCAC AAAAG TTTT CAGGTCAG ACGGCTTC CAT 

TTGGACGACTATGATAACAAACTACAAGTGAGAAATTTCAGAAAAGGGAAGCGCCACGTCTGCAGAGCA 
ACTCATTCATGCGTGAGCACACTGAAATTGTTAAAGAATATGGAGTGCA 

GGGAGGGGGTGGGGAGG AGAGCT ATG CAAAGCTGCCTCTCTCCTTGTGCTAGGACX3GTTT CATAT AATAGAAGTGATCAGTACT AT 
ACAATGACTCACGTGAGAGCC^CTGCCCAGGAGAGAAAGGAGAGGAAAA 
70 CTCTTCTCTTCTCTTCTCTTCTCTTCTCCTCTCCT 
TCTTTCTTTCTTTCTTI'LTTTCITT ^ ^ 

ATATATAGACCAGGCTGGCCTTGAACTCACAGATATTGGCTTGCAACTGCCTCCTAAATGCT 

GCCTGGTTTAGGAAGCCATTCTTAACAAGTTGTTGGGGAAAGTGGCT 

TCCAATCTAAGCCCCGAACACCTGGCC^IAGGAATGTGAACTTCATTCACT
GATTGCTTGGTGACTGATACAATGAATGAAGCCAT^
TTGTTTTCCACAAACCAAATAAGCTTCCTGGGTTTTGCTl^ 

ACTTATCACATGCX3TTCATGTCAAGAGCTTGATGTGGCATTTACTGAACTAATCCT 
CTTACTTAATACTATTGATAATCCTTCAGAGAAGGAAGACCCAGTGTAACATTTCAGT^ 
5 TTAAAGCCATAGGTTAGCTCTGTTCTTAGATGCTAGTCAGAAGCCTCCT^ 
GGGCCAGAGAGCTGAGAATCTGGGACTGAGAAATGCT 

GGAGGGATGGGGCAGGAAGGAGGATGGGGAGGAGAGCTATGCAAAGCTGCCT^ 
CCAAGCAGCTGTGCTTACCTACACACGATTAACCC^GT 

TCTGAGGAGCnXSTGTAGGTAGTTAATGAATGGCTGCTGGGGTAAGGGGAATCATT^ 
1 0 TCCAATAAATAAGCCTTGATCCACAGCCCTAATTAGAGTCTATC^ 

AGACAATTCTCTCTCTCTACACACACACACACACACACACACACAC^ 

TTGGGGG<jGGAAAGAAGGATCTCAGTAGGAGTG^GAGAGGAATAAG<3GG^ 

TGTGTGTGTGCATGTGGTCGTCAGAGAATACATTTTAAGAAAATATGTTTAAAGACT 

ACTTTCTCTGCATGTATATATCTAAACATCAGAGAGGGCATCAGA 
1 5 AGTTGAACTCGGGAAGAGCAGCCAGTTTAACCACAGAGCCACCT 

AATATCACTCCTAAACATCACAGCTGTAAAAGGATAGGATAAGGTTTCGTATAGGATCTACAAAGCCTTC 

T ATTGTGGACACT CTT CAG ATG AAGTCTTG TTTTT ATTTTT AAAGAATTG CCATGACAG AATGG AAAGGAG A 

AAAAAGAAGTTTCATCAGAGGGATGTGAC^CCAGAGACACTGTGAAGAGAAGGTCT 

T AGAGG AGGGAAAGG C CAAC CGAAACTAAGG ATT ATGAAAAGGT CACAAGT AAAC CAG AT ACTTG AT ATGAT AATTT AAAAATTGA 
2 0 ATTAAATAAAAGGAGTTTGATGGAGGTACCCTGTGAAAGTAGATAAAACTTGCGCAGA 
TGTGCTGGGAGTGGACCATTTCTGTTGTTGGTCCCACAGGCCCACCAG 
AGCCCACCAGAACTAGACITGTGAAGCCGAAGATACIATGTGCTG 
GGCGCTTCCTCCCAGCTAGCTAACCCCTCATCATGCCAGAAGGTGCTATTCAG^ 
CTCAGCTGGGGTCCCTACTTCCTACATCACTGACCTGCCAGGCAAAGGGCACCT 

2 5 GTGACCAACCACTTCCTGACTGCAACTCCTTCAGAGGAAATTCACATCTGG 

AGTCACAGCCCCAGTAGGGCAGGTACAGATGTTGTTCTGCTAAACAGATGTCATGTGTCCA 
TTGTGCATAGGAATTCGTGAGGATGTCATGTTTGCTTCGAGAAGTCCACCCCCTCATG^ 
TAGTCACTGAGAATAAAAAACTGGATGCTCAGCCTTAAATAAGACCTGTATATCACCCCCCCT 
GKK3AGAAAGAGTGTAAGAGTCAGAGGATAGAGTGGAGTGTGGTGGJATTTTCTTGG 
30 TACAAAGATAITTCTATTTGTATGTGTATGAGTGTTTACTTGCATGTATCTATACCATA^ 
GAAG<3AGTAGGATCCCCTG<3AACTAGAGTTACAGAGCATTGTGAGTTGCCACTTGGGTGCT 

ACATCCTGTGGTAGGGTGCAGGAGGATTTACAAG<3ATTTTTGACAGTTGATG^ 
CCACTGTTAAGTTTCCTGTGCTCCTGTAATTAATCTCTCGTCTATGCTTCTGGAAACAACTCT 

3 5 AAAAAAAAAAGTCTTG CTGGTG TG ATG ACTCAGTG ATT AAAGGTTCTT AAT ACCAAACCAG ATGG C CG GGTT AATC CCTAGGAAAC 

CCACGGTGAAAGGAGAACCACCTCCCAACAGCCATCCACTGACCTTCCAGG 

GAAGGGGGTGCAGGGGATGTGGACGGCGATGAAGAGGGTATTGAATACTATGCATACATTAAACATGTGTATGAAAACG^ 

ACCATGACTGTGTATAAGTAAGACATACCAATAAAGCATTAAACGTTTTAAAATAGAAAAAGCCAGTCTCT 

AGCTCTGATTTGGAAAGAAAGGGAAATACCATTTGGAGTTTCTGTTTGAAACAAC^ 

4 0 TCT CCAAGGCTGTATGGTACAGG ACAAGGGGCAC CCTCATGCATCACCAGCrGGACTGG AAGTATTGCTCG<3T AGCCTAAGACATC 

CAGCAAGCTTAGGTCACTCCGTCTGCCTGACTGCCCTTGGCACAAGTGGCT^ 
GCAGGCTCTCCTTGACTCTGTTTCCACAACITGCTTCTTCn'CT 

TCTGCCCCGTCTTTTTGCCCTAGGGCC^TTCCTATATGTCTGAAAACATTCCTTGACTGGCTGACTCT 

ACCCTGCTCTATT AAATCAGTTTCTTCCTTCTTAGGGCAGCGCAGTTAAAC CAGGAAATCATC CCTCTCCTCACATAC 

4 5 CAGOGTGTTATCCACACAGTTTACTCACTGTAAATAGTTCCCTTAAGATAGTGGCAACAACTAGT^ 

GGG CCAATCT ACTTT CT AAAAAGC CTG CTT CT GGG AGTGGG AAAG AGGT CAC CGTT C CG ATTCTGAAATG C AAAACTGGTGAT AAA 
CTCAAAATTTCTCTGOGTCTCAGACAGGACTGGACGGAGTCTGTGGTCCTTTTCrTGA 
GAGCAAAGGTTGAGTATGTGGTTTGTCCCCCACGTGCCGCACTGCGGGAGC^ 
CTTGACGTTTGCCAAAATGCCAAACTTTTTCCrc 

5 0 ACCTAGAGCTCCGTTCGTGC^GACGTCCAATTCTCAGGGTTCTGGC^ATCACAGCAC^ 

TCAGAGCGCCTCTTCGCGACTTGCCCAGGGAGGCAAGGCGAGGCGGATGC^ 

CGAGGCCGAGGCGGGXjCTACAGCGAG<3CrCGATCCGGCCTAGGCCTCX5CTCC 

CTG<X3GTAGGGGCCGGGCG<3C^CCATGGCGGCGGCAGCGCCTGCTC 

AGACCCGGACACGGGAGACGAGGACAGTCGTGAGOTCCGAGTGCTGCAGAGCCTGCGCGGCAGGA 

5 5 GAGGGGACCCCCGGGGCG<3CGCGGAGGGTGG<3TO 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNIJNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNOT 

CACTTGTGCCCGCCC C ACCTTCCCG CXX3CGG<3G ACTTGAATCGAGAGCCGGCG 

6 5 CTTCCTGTGAGCAGGTCTCAAAACTCAGTCATCCAGTCCGCGCTCAACCCACTTG^ 

CGATGAGTGTAGACTCTTACTCAGAGACGGCCCAGTTCGTTTGTCATTGCTATC 
TCGACTTCCCCCAGCTTATTGAGATG<3TCCCTTTTCACT 

CTTTTTTAATTCAGATTATTTTATCCCCAGAATGAAATCGGTCTCCTAAATAATGCCTGGCTATAAGAGA 
AATTCAATTGTGTTAGAATGGCTGATACTGCTCTTTCTTTGGCAGAGTATAAACTTTA^ 

7 0 TTCTCCCACTTGAGTTTTTGATGTTACTCCCCTTTCAAGGATTTGGAATTTTATTAAAACCCCAGG 

CTTTTCAATTCCCTGACGTCATCCTGGGACTCCCAATTCAG^ 
GTGTTTTGTGCGTCTTAAGTTCCCAGTATTTTAAGTTAC'lTrCTTAGGCTG 
ATTTGCATGTATGTAATCTTTGGGAAAAGCCAAATACACTAGAATTTTCTT 
TTATATTTTTCTGTGACTTGATGCTAGACACTGTTGAAGCCTTGCGAATCTATA^ 
TCTGTTGGAGTTGAGATTTAGG<3TTGGGCTGTTTTTAGGAGTAAGTTACCTTCTG
AAOVGTGTGGTAAAAATATCACTCCAAGTGAAATTCGGGTCTGCAGATTAAA^

AACTTGGTTTCTGTGGCTTAGGCAATCTCACTCCCAGAAATTGACCATTAATTTCTO 

ACTAAACCACTTGTTTCTACCCTAGACAGTTTATTTATT^ 

AGG AT ACAT ACTGAAATCAAGAACACCCCTCCCATTT CTTG C C CTT CTTTT AG CTGGT ATGAT CAAGGTTGGTTGC CCT CCGG ATC 
5 CTTGTAAGGAAATGCCCTTCTGTGTTTCCrGAA^ 

GCTGAAGTCACAGAAATGGGATGATGGCAGTTTGACAATC 
CCACACTGAGAAACCTTGGGGAGAACCTGGTAGCTCCCTGTTA 

CTCTCATCTGATCCACCTGAGAGGAAGCACTCCTGGGTAGTGTAGTCCGAATCATCCTTAAGA 

ATATTGTGGAAG TT CT AG AGGTT AG AT AATGGTCCACAC TTT CTT CAGTG TTTCATTTCT AACTTATTTGTGTC CCAT AGATTTCA 
1 0 CAAGGTAT AAAT AGT ATCAAGTTGACAGAT CAGAAAACCAGCGGGTGGGGGAG ATGGGCTCAGTTGGT AAAG TG CCTG CTT AG CAA 

GCAGGATCACCAGAGCCCAAGTAGAAANNNNNNNNNNNN^ 

NNNNNNNNNNNAGAGAGAGAGAGAGAGAGAGAGA^ 

CACACACAGCTAGTGCACACAGTCAGTACATTAACAAGTACACACA 

ACACACACAGAGACCATCATATAGCTGTTTTCACACAACCAGTT^ 
1 5 TGATTAGTGCCCAGTITCTTAGGGGATTTTAGTGTCCTA^ 

ACA.CAAAGTTGTTAGATATCTTAGAATTTTCATCTGAACCnTTAOT 

ATAAGTAGCATACCTAAAGGTAGTTTGTATGCACCCCCCCTCCATTAT^ 

TGTGTGTCTGTGTGTTTATGTCTGTCTGTCTGTCTTCAGGCTTGATA 

ACTGTAAACATTTTGAAAGGCTTCTACACTACCTT^ 
2 0 AGATTTGTGGGAAGTG TTTGAT AT ATTGTTCAGTG AGATCTGTG AG ACAG TGATG TGGC CCAAGAACT ACT CACTT AGG ATTTTGA 

CATTTTCAGACCAAGAATGACTTCCGGCTGTTTGACAAAGGAGATAGATTAAGT 

C CTTCAG CCCCAG CCCCCTAAAAAAAAATCTGGGT CAGC CATGGGC CATCTCTTAGGAAGGAGTTAAGT AGGAGAT AAG CCTTATC 

TGATGCTCTTGTCCAGCCTGCACTAACGTTrCCTGGTGTAGACAGO^ 

TTAGGTAATATTCCTGTGACTTCAGGCAGCTGTTTACTCTTGA^ 

2 5 TGCTTTGTCACCAGAGACrTGGCTCACGTTGTTTC 

ATATCCTTCCAATGCAATCCAAGTCTCCTCTGACCAGATTTGCCCTTGTTTCTAGCCATATC^ 
AOTTAATTCATGACCATTTTCrrrCTTATAAAAATCATrrGT 
ATATTTGTTGCTGCTGGTGGTGTGTGTGTCrCAGTCTCACTAGATCAGGG 
AGATACTTTCAGAGCACTGTGTATCTGGATAAGAAAGAATTTCCATATCTCTGGAC^ 

3 0 agCTTCCTGAGTCTCTCTGGAA GTTCTTTA TTC TTTT^ 

ACTTGCCTCCTTAC<3AAAAGGATTACATTTTTTCTAAATGCTGGCTTATTAATTAGAATATTAA 
AGTTTTGTTACTAATTGGTCCCCAGAGCTTGGAAAAGAAAATATTAAACTCATACr^ 
AGGTTTAAATGTTTTGGTACTAAGATGTAAATCAAGATGTTTTATTGACTACCCA^ 
ATTCATGCTAATAATTCATTATTATCAGATGTTTGAAATGAGATGATGAATACTTA 

3 5 TAATAATTCATTATTGTCAGATGTTTGAAATGAGATGATGAGTTC 

GAGAGGAAAAAGGGACCTTTG T AAAAGTT CACAG AGCATTCAG CT AAT CATTTGCTCTGTG CAGGACAGGAATAAACCT CAGGGAG 

TTCATAGAGTTTTTGTAACTGAGAACTCITTGAATAG 

ATCCTTTAATTATAAACCCTAATCCTATTTCCTTGTTAACGTGCT^ 

ACn^TATATTTATACCCTTTTATACCCTAGTAACAGTTTTGTTGTATAG 

4 0 TGTTAAATTTTTATC^CATCTAGTTTTGTGTAATGTTTGTTTTT^ 

TAGCCCTGGCTGTCCTGXlAACrGACTCTGTAGACC^ 

ATTAAAGGCGTGCGCCACCACGCCCAGCTTTGTGTGATGTTTTAATCATCAAAAATCTCT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCGCGCGCGCAGGTACATATGCATGTGTGTATATGTAGATGCCAGAGGATGA 
TCTCAGACATTGTCCTCAGGAAACACTGTCCACCTCrCTTATGGCTTGGAGCCCTACTA^ 
45 CT CCATGGATCCCTCTGTCTCTACCATCTCTGTGCTGGGTTACATG CAGGACCCTTCACJVCCCAGCATTTTTGCCT AGTTTCCAGG 

GATTAAACTCAGGTCCTCAGGTTTGAGGCAACCAAACATGACTTAC 

AGGTTATATCAAATATGAGCCAGGTGGTGCTGACATAGGCCTTTAATCCCAGCACTTGGGAGGCAGAGACAGGCAGATA 

GGAAAGAAGAAAACATTTTTATTATTGAGTGTGGTTTCT TlTrCTl 'TT 

5 0 AAGATTTATTTATTATTTATATGAGTACACTACAGCTGTCT^ 

GTGAGCCACCATGTGGTTGCTGGGAATTGAACTCAGGAACTCTGGAAGAGCAGTCAGTGCTCrTAA^ 
CCCAAG CCTTCTCTC CAGCCCGGGTATTTTCTTTATTTACATTTCAATTGTT AC CCTGT^ 
TCCCTCTCCTTCTATGAGGGTGTTCCCCCACCCACGACTCCTGTCT 

CACAGGACCAAGGGCCTCTCCTCCCACTGAAGACAGAGAAGGCCATCCTCTGCTACATATGTAACT 

5 5 TGTACTCCrTGGTTGGTAGTTTAGTCCCrcGGAGCrGGGGGGGGGGGGGATA 

TCTCTCAGTCCTTTCTCTAACrCCTCCATTGGGGACCCCATGCrGAGT 
TCTCAGGAGCCTGCCATATCAGGCTCCTGTCAGCATGCCTCTTGGCTTCCACAATAGTGTCT 

GG ATCCCCAGGTGGGGCAGTCTCTGGATGGCCITTCCrrTCAGTCT CTG CTCCACACTTTGT CTCAGTATTTGCTCCCGTGAGT ATT 
TTCTTCCCTTCTAAGAAAGATGAAGCACCCACACTTTGGTCTTCClTCrTCT 

6 0 TATTCTGAGCTTTTGGGCTAATATCC^CTTATCAGTGAGTGCATA 

GATATTTTCTAGTCCCATCCATTTGCCTACGAATTTC^TC 
ACATTTTCTCTATCCATTCCTCTGTTCTTTCCAGCTTCTGG 

TGTTATACACTGGAG CATCTTCTGGGTATATGCCCAGGAGTAGTAGAGCCGAGTCCT CAGGT AATACTATGTCCAATTTTGTGAGG 
AACCACCAGACTGATTTCCAGAGTGGTTGTACCAGCTTACAATCCC^ 
65 CCT CTTTCTCCACATCCACAC CAGCAGCTGCTGTCACCTGAGTTTCTGATCTrAGCCATTCT AACTGGTGTAAGG 

GGTTGTTTTGATTTGCATTTCCCTGATGACTAAGGATC 
AGAATTCTTTATTTAGCTCTGTTCCCCATTTTCTAATAGGGTTATT^ 
TTGGATATTAGCCCTCCATGGGATGTAGGGTTGGTAAAGATCTTTTCCGAATCT 

CTTTGCCTT ACAGAAG CTT CACAATTTTATGAGGTC CCATTTGTCGATTCTTGATCTT'AG AGCAT AAGCCATTGGTGTTCTGTTCA 
TAAATGGCCATCTTGCTGAAAGCAAGCTACAGATTCAACGAAATCCCCATCAAAATTC<^ 

AGC AATTTGCAAGTT CATTTGG ATT AACAGGACAGTG AAAACT ATTCT CAACAGT AAAAGA^ CAT CTCTGACT 

TCAAACTGTATTACAGAGCAATACrTGATAAAAACTGTATGGTACTGGTGTAC^ 

AAGACCCAGAAATGAACCCACACACCTATGGTTACTTGATrrTTGAO^ 

TTCAACAAATGGTGCTGTCTCZAACTGGAGGTCATCATGCAAAA
GTCCAAGTGGATCAAGGACCTCCACATAAAACCA

G TGTGG'n'T (J IT ATAGTG CT ATT AATG AT AGACTAACCGTGCAT ATTGAGG AG CC CCT ACTTTT ACCT CAAAAATG AGGACTTGGGT 

GTGTCTTTCAGAAGATACCCCTCAAATTATTTAATAGTTGCTTTACGAAGTG 

TTGCTTTAGACTTGAGTriG'l'lTAATATACTTCTTTTGTA 
5 GTTTGAATCCTAAACATTTTCCAGTGGGCAGTAGTTAATTCAGTA^ 

AACCCCTGGTGCTGAGAAATAGGTCTAAGCTGTGAGTGGGATGAGAAAGTCTTT^ 

CAGTAGCGTTCATTCATTCTCTGCTTAGAGAGTTAGAAAGCTTTA 

CGTGAAGACTAGAGGACCCAAGTGGCACACGTTAATCACTTGTTGGACACT 

TACTATGGATGTTGAAAATGCACTAGATTTAAGCTATGAAGACTGCTTTACCT 
1 0 TTGTACCCATGCTCATCTGCTCCTGCTAAGAACTTAGTGAGGTGATTAG 

ATGTTTACTGTTTGTGTCAAATCCCACCTTAACTTTTTGCAG 

CTCTAAATAAAATC^TCATTTTGAGGGTCTACCAGCTTAGTTTGTA 

GTGTATTGTAGCTTTTCCACTTCAGAGAAATAGAAGCAAAATATGCA.GTC 

TATATCCTCACTTAAAAATAATATAGTATGTATAATACCTCATCTCACATCATCTAGATGTA 
1 5 TGTTGATTGCAATTTAAGTTGTCTCTTGTCTTTG 

AGGTGTGTGTGTGAGTGTGGATGTGTGCTACTAGGGAGAAAAT^ 

CATGTGTGTGTGTGTCTGTGTGTGTGTCTGTGTGTGTGTGTGTGTCTGTATGTGTGTC^ 
CAGAGTTTCTCTGTGTAGTTCTAACTGCCCTAAAACTCACTCrrGTAGACCT 
TCTGCCTCCCACGGGCTGAGATTACCATTGCCTGGCTGCTACATGTTATTT^ 
2 0 CCTGAATTTGGGAT AAT ATTCTTG ACTCITGCACT CATGTCTGGGCTT CAT A 

CAGATTTTTAAATTAAAGAGXinTAATAGAAAAATGAATGCAAA 
AATGCTGGrTGAAATTGACCAGAAATGAATCAGAGAAATAAAAGTAT 

T CAGT ATTGAATGAATG CTT AAGGAAACAGATACTGTTCGT ACTTTCATTCTGATTTTAC^ TTT AT AAAACCATGTT AGAG CATTT 
GCTAGTATCTAATAAAAACCTAGATGTATTTTTCTTATGCTT V I ACTTTTG CCAT CT CATCCTGCAGAC ACAAAG AT ATTTTGTG T 

2 5 TGACCTTGTTTGTTTTTTAAAAAGGCACATTGAAGTTCT 

AG CAG TT AAACAGAAT AAACTACAT CT AT ATT ATG ATG ACCC C AAGG TG T ATT ATTGAGTGAAAAAAAGCAAATTGTT AGAGT AGT 
ATGTATGTTTATTATCTGTCATCCATCTATCCGTCTGTCCATCTATCC 
GGACATCACAAATGCTAATTAAATTTTCTATCACrGAGCTACAAGCCAGCCC 
GTTTTTCCCCCCTATAAATTAGATAAATTTTATTTAAAAATGTCTAG 

GGAATTAGTCTTCTGTAATGACTTGATTTTCAATAGGGAGTATAGTA

TGAATTTGATAGTGTCAGATATTGTGAGACTGAATTTAACTTTTGAT^ 

G CAT ACATTG AAATG AAT AAT ATC AAATTTT CTGTTTT^ T AATTGT GTGGG A 

CATTAACAGCAAGTGCCTGCACCTGCTGAGTTATCTCACCAGCCCTCCT 

ATCATGAATGTGTGAGTGCAGGTGCCTATGGGAGCTAGGGGAGTTAGAGGTGGTTCGGACCAC

ACT CAGGTCTT CTTCAAGAGAAAC ATGT GTT AGC CCCAT AATTTT ATT^ CAGAAAAATTCAATG 

GATTCCTATGTACTTTGCCTG1TTGACCAGTTGCTTATGCTAAGA 

TATATAAAGGTCCTGAATAATTTTGAACCTGTGTATTTGGACAGTCTTATT^ 

G CAGTGTTT ACTG AAT ATTTG CTG CTTT CCCGAT ATTGG G CT AGG ATACTTTG CTT CATTTCGTT ACAGG ATTG CTGTT CTTT AAG 

AAGGAGCAGCAGAGAGGCTCTTTAGCAACACCAGATTTCAGCAGGC^

CTGTCTTGTAGTGTACTGGCATGAATTCATTTTTGTTGTCAA 
ATTAAAGCTAGATTGTTGTCAAACACAGTAAAAATGAGTCCAGGGATTTGTAA 
TTGGTTTTTCTCTTCTANNNNNNNNNNNNNNN^ 
CTTTCTTTCTCTACTTCCCTTGTTTTTTGACATGAAATCT 

GAGTGCTAGAGCTCAGGTGTGTGCCACTGATCTCTCTGTTCTTTAATGCACr^

CACCTTTTATAAATGGGAGTTGGGTATGTGTATCTTCTTTGATAAGAGTTCCATTA 
CCTGTCT ACATGCCCCGAG ACTCTCC CAGCATCAGCTTTT AGCTTTTTC 
TTGCTAACTATCZATGGAGCTGTAGGGACCAATGGCCTAGCTCAGCAGAGATC 
CAGGGGCCTTTAGGGACCITTAGGGACCTGCTCGAAGCTGACACaiG 

5 0 TGGTAACATACGTCTATAACCTATGGACATTT CCTGGCAATACCTTGTG AAACTAG CCTTTTCACGTACAGT AGTTTCCTAACTTC 

TTGTTCTAGACTAAGCTTGAGTCAGAGGTGAGTAGCATGATCACAATATAGATAACTG 
TGAACGTGGTGGCTTGAATAAGGGAATTTTATATAGATGCTAACrTTTAAGAACCT 
TGATTTGCTAAGTACTTACT1TGTTTTAGATATTGTGCCAAGTG 
CCTTTACACAGGAAGTTGAAACTAAGGACATAACAGATTCT 

ClACTCATTTAACTTGCTGTATTGTCAGTACTACTTGTTTr

TGAAGACTTTACTATATATTATCATATCATACCTAGGGTTGTCT 
AACCTCAGTCCGTATTTTCAAATATTTTAAAGTTTTTTAAAGATTT 
CTOTGTTTACATGTGTACCTCATAGCCTCAAGGAACACGATATCCCCT^ 
CTGTT AGGAACC!AAACriTAGTTCTCTGCAGTAGC7LAC 

6 0 TAAGTATGTATGATC CTT C TT T GTTrTTTC TTGTTAGTGTGA^ 

AAGCTAAAAGCTTCTGTTTAGGTTGACATCTCTGAGGACATCA 

T CTCT CTCTGT AGTTTCTCTTGAG T1TTTT CTTTCATCTCTTTC CTTTTCATT AC CT CAGT CT AAGTT AG AACTCTGT AG CCTTCC 
ATTTTAGGTTCTCTGTTGGCCTCCATCCTGTATCCTCCTGTT 

TTG AAG ATG CTTCTT GTTCCCACCT C CATT ACCAAAG ATG ACAT CACATCC CATG T AAGG C CTGTCCGTGTGG T AGG CCTTCT ATT 

GGAACATTCATCTTCTGGTCTTAATTTCTC

TTTTTTTCTTGTACAAAGAAAATTTATTTCTCT 
TTTITTCCCCCATTTTTTATTAGGTATTTAGCTCATTTACATTTC 

CACACAAAGTCTG CGT^n'CCAATGGGCCTCTCTTTCCAGTGATGGC CGCCTAGGCCATC'l'TlTGATACATATG CAGCT AGAGTCAA 
GAGCTCAGGGGTACTGGTTAGTTCATAATGTTGTTCTACCTATAGGGTTGAAGATCCC 

CCTCCATTGGGAGCCCTGTGATCCATCCATTAGCTGACT

ACCATTGAAAGATTTTAAGTAAAATAACATGACATTCTTTAAAAATCTGCT 

CTTTTTATTATGGTTTGGCTTGACTTTTTC 

TATTACTTT G TTCAGGACCCrrGACCTATTATGATTrCCTCTGAG^ 

GTTTT AAAGGT ATG CTTTGAAGT AGGG CTGGGTTT ACCAAGT CTCTTT CAGACAAGGATAGTCTCAGAACT ACT A 
7 5 CTTTAAGAAT AG AGTGGAAAGAT AGAT AATCATG CT ATTGGCTTTTGACTATG CATTGATG CTCTTGCTG CTG CAGATT 
TACTTTATAAAAAGGAACAGAATTGCrTACCTAATAGGGTTGTTCT

GT C AT CCAG AT ATTT AAGAGAT AGTGT ATC CAGTACT ATGCTCAC CT CTT AAACAACAAT AAAAC C CACTGAT AAG T A 
TATATACAGAATCTTCCCTAAAACATCTACATAAACACGTGTACACACACAGGTG 

CAG CTTTGATGT AAGTATT CT AG TG CCAGCTACAATTT ATATGGT AGACT AGGAAGATGGCTCAGTTGGT AAAATG CTG CAA 
AATCATGAGAATCTTAGTTTGATTCCTGCAACTCACATAAAAAAAAAA

TTGGGGGAG ATGGGACAGG AGGACT CCTGGGACTTGTGGGTT^ CT C CAGGTTCAGTGGGAGGC 

TCTGTTCCCT AAAAATTT ACATGAT AGGCT CTAAACAGAAG CATG CCCGCATGG AG TTACTGT ATAAT CT ACACAG AG AGGACAAG 
C^TCAAGACAAATAACGTCGGGGCTGGAGTGGAGGCTAAGCAGTTT^ 
AAATTCTCTTAGAGAGGATGTGAGTTTTTAAATTAATTTAATATAAAAACTGATA^ 
GGAAGAAGAGGTAGGAGGGGGAAGAGTGCAGGGTTATCTCTGTCTGTCCAAGGCCAACGTGGAA^
AAAAATTGTCTGTCTGTGTCTGTGTGTGTGTGTGTGATGAAGAAAAGCTATATCT 

GTG AAGGTGTTTTGGC CAACTT ACACTG GATCTATGAAAGAAGAAATATCT AATGAAGAGTTT ATTTTGTT 

GTGGTTTCATAAAAGGGACAATGGCTGCCCCGTGGATTTCTTCT^ 

GTTTGTTTGTTTGGTTTGGTTTTTCGA 
1 5 TGGCCTCGAACTCAGAGATCCACCTGCCTCTG CCTCCCAAGTGCTAGGATTAAAGGCGTGAG CCACTACGCCCGGCGGATTTCTTC 

TT AAT ACATACGAAACCTGT AACTGGT ACTGCTTGCT ACCAG AAACT AC CAAAACT CAGTGTCCTT CTGTTTT C CAGTTTT CT AGT 

GAATGACTCTGATTAGTGGAACTTAAATTAGTAGAACITAGAAACA^ 

TAGAGTCTCCATAATACCAGAAAGCACACAAGTAATGCTGTACCGCT 

TTAAGAATAGAACTCAGTATGCAC^CTGGAGTTGCCTTGAACT 
2 0 TGTGTG CCAC CATGCATGG AGAG TCTTTTT ATTTT AT AATGGCATGTTTT AAAAAT ACCTAACAGTGGC CT CTG ATGAATT A 

AATAGTCTATTCCAAGTTTCCTCCCATTTAAATATGCTCTGTAGATAAAAGCCTAAATGAGG 

GAGCTCCCAAACAGAGGGTGGGTTAGTGCTAAGATACTAGAGAGTGGCCACATACTCTGGT 

AAGTGCTTAGTGTTACTAGTGTTCGTAAAGGTTCTAC^CAAAGGAAGGTCTGA^ 

GAGGAGGGTTAGAGGATGGGTGATTTTGACTCAGTCATGTGAGTTATAGCCTAAACTTGC 

AAAATGCAGGCTTCAAAT6T6CTTCACTTTGTACTTTATGTA

GATGCAGAGAGGCXSGCAGTGCITTCXrrTCGAACCATCGCCrAGGGTGCT 
TACAGTTTAAAACTCCTCATAACTGATAAACATTACTGTGATTATTTTG^ 
TTATTTGAATGTCAACCAGTGGTAAATAGTGAAAAAAAAAAAAAAACATCAGGAAAACTACT 
ATTTCTTACACTATCTGAGACrTGGCCrTTGATTATTTTAAG 

AACCCTTGTTTCTCTCTTGTGGGGTGGGACTGGTAGATTATTTAG

TAAAACAGGGACACGTCCTAGGGTAATGATTTAATTTTCT AAACT CTCTTTTACATAT^ 
ATCTATGTATATATATATATATATATATATATGTATGTATGTATGTATGTATGTAACTAGG 
TTGGTTTTAAAACAAATCCATTGTGTTATTCAGAGGCAAAGGAAATTCTGA^ 
GAAGGTCCTATCTAGGGAGAGAGGAGGGACAGGGGAGAGAACCTCAAACTCAC^ 
TATCACAGX3CATGACCACCATTCOVGCCIAGTTCATTTTGTTTTTGA

TTGTTTTCCCTTAAGATAGTTTCTCTGAATAGCTTTGGTTGTCTTGGAACT 
GATCTACCTGTCTTTGTCTCCTGAGTCCTGGGATTAAAGGC^TGTACCACTACT 

AGTCAGGATTCCACACAAATGTTCCC^ 

GATTTCAGTATACACATCAAGTAGACACTTGGGTGTCGTCCTTTTGGTACAC^

TGGGTTCTTTACTCCTTC!AACAAACATATCACCAGAGTTTT^ 
GGAGCTGCAGAAACCCTTAACAGAAGTTCAGCCCATTACAAATTCTCTCAAGGAA 
;UVGTGTCATTTGAGGTCAGGACTGAATGATAAGAATGTCCCAGCCGTGTAAAGAG 
ATAAACTTGACATGGCTGATGAATGGAGAAAAGTGAACCAAGGGAAGAAC^^ 

AGAGTGTGATTTAGAGATTAGACTTTATTCTCAGACAAGTGGGATGAGCTGAGAAGTGACT

GCTTGGTCTGGATTATGGCTCAGCAATAGAGCATTCGTCTACTCT 
AAGGAGAGAAAAGATCACATCTCCTGGTGAAGAACAGGTTGGTGAAGAGTAGCAG 
TGATCCTGTTTGGGTGATTCTAGTGAAGGGGAGACGTAGCTGGGTTTTAGATGTTGTCT^ 
GAGCTGGATTGTGTGAGGGAGAGGCAGGAAGACGATAAGTGAGAAGGAAGTGGTGGTA^ 

AATAAGATTGGTTTTGGGGACTTGCTAAATATGAGATACCTGCCAGATATCTAGTGGTATGAATGAGG

CACAGAACTGGGAGCTCCTAGCATTAACAGTGTGTACTGCAGTAGTTTATAGAAAAAGGAAGGG 
GGCAGTTACAGCATTTCGATGAGGAGATGAGTGGAAGACATAGGAGAAAGGCCAGAAACCGTGTGCTGTGGAAT 
AGCTTGTTACATGTCCACTCTGTTAGATGCTGCAAAGGTGTGGATGTAGTAAG 
GACATTGGTGGCCTTGACTAAGCAGTTTCATTGTC^CnTTGGAA 

AGCAAGTGAGGGGCAGTACTTAGAAGAAATTCTCCITTGACAGAGGAGTCT

CGAGGATAACGGTCTGCTGCTTTTGTTTGTCTTAATTGTGGAGTC 

AATGCTGACT CAGGAGAGATTCATTG CAGGAAAATTCTTG AGATGGTT CTGGTGAGACAGGAGGGAGAGAAACTG CCCTTTCATAG 

AGGAAGGATGCAGTGATGTGTCTATACATAGCTCCTCTGGTCTTTTTACTTTT 

CTCATTTTTATAGA(nTCTCAGGAAAGTTCACCACCTTTTAGCTCA 

CTCTGAGCACAAGTTGCTTTTTTTCCCTGTACrrCCATATTATG^

AATAAAAAATTTAACATCAATTTAAACTTTTGTGTCAACAGATGAAGCAAAAAATTT 
GCGATTGTTTCTGCACrATAAATTTGX3ACCAGGAAGAAGTTTA 

TT AG AAATTATT CATTT G ATGT AT ATTCATGGTTTT ATGT ATTTTC CAG ATTTTT AT CAACAAATTGAGTTTCTTG AT ATTGAAG C 
TAATATTTTTTATTTAAAGGTTTTGTATTCTATTATGAATTrATTTTTTAGATTTATAGAATGTAA 

TGTCAGTAAGCTGCACTGACAACATTGATCTCCTTAGCAGTAGTTCTGGGA^

TCACCAAGAAGTTCACACCTCCTAAGTAGGCTGCTTGGTCTTTGAACT 
GATTATTAAAGTGACATCCRGTCACGTGCCCAGCGCTTGCTCAGTTCTGGGT^ 
GAAGTGCTTGCTCTCTCATGTGGCCTTTCATTGCCTTAGGAAGTAGTTTTTCTTAGTT^ 
GG AACAATTTTAGTCCATCTTTTATGTGACAGCATTAAATGTTTGAA 

ATAGTGCCTATTTGTTTGAGCTTTCCTCAAAGCACATCATTTTTACTTCrrCCAGA^

AAGCTAGAGAGGTCGATGGCTTTCTTGGTAAAGTACTTGCTGCACAAATAGGAGGTCA^ 

AAAGCCAGGTACAGTGGCTCCCATATAGTTCCCAGTGCTGGGGACATGGAGGGAG 

GTAGACAAACOnTTAACAAGAGTCCTGTCTCAAAGAATAAGGTAGAAAATGA 

araraaa <r& rp CCCT T TCTCCGCATCTG CATAGATACAAGTT ACTTA CT AAT GGATTATCAGG^ AAT AAGCTCTTGGATTGTTl^T 

atctgtcaaagtgtaattattctttgaaagtact
AATTTTGATATAGAACTGTGCrAAAACCAGGTGTCATCATACACGCTTTAAATCCCAGCA
CTGCTGAGGTTGAGGCCAGACTGGTTCACATAGCAAGTCCCAGGCCAGTCGGG 
AGAACCGAACCAAACCAAACCAAACCACCACCAAACTGCTGAAAGAAAGTC^ 
TGGGACAAGCATGGTGAATGAAACAGTTGTCGAACTGTATTTTATCCCATTGGAT^ 
5 CTCCCACACACTTGCCTGTGTTTTCCTGTGACATTTGCCTGTGTCT I'C l"f I GTCTTTTCTCCTCTG CCCTTCC 

TTCCCTTTTCCTTTCCTCTrCTCTATGCGGGGTCTAGCCTTC 
ATATTGGTGGTGTAGGCAGGCCCCTCTGCATAGCTCTTTGTCATTGGGTTCT^ 
GAACTAAAAGTAACCCTTGTAGTGGCTCCGTCAGTGGGTGAAGGAACCAGCTGT 

CCACATGGTGGAAAG AGAGGACCAACTCCCACAGGTTG CTCTCTGACTTCACATGAG CGCCCTCCTCAAAAAATGAGAATAGTACA 
TAATTAAAAACAAAACAAACAACTAAAGGTGCCTTAGTAGCCCAAA
CTTATGCCCATATTTAGAATCTTAATTATTTTTC^CACTTGTGC^ 

G AAATGTCACGGGAAGGCTG CATG ATGTGTCAT AT AATGAAGG AAGGT AAAAG CACAG CTGGGATTTG AG ACCTGAAAAT ATTTAA 
ATTTCAGTTTGTTACAATTTTAT ATCACAAGGTAATCAGTGATT AC^ CAAT AGT AATTTTG T AAATTGAG TCCAT AAATTG 
AG T ACTT ATAGTTTGT AAT ATCTAGC CATATATGAGAAAAT AAC C CTT AAC CTTTCTGAACGTTTGT AT AG TGG AAAAATGATGAA 
ACCAGTTTI V l T I'A rrrL ' m 'CCA^

T AAGGGT AT AG CTAGCT AAACT AGAT AATT CTTG CATTCTG AATGCCT C CAATCTT ACTTT CT AGAAT AGGGC^UUUVAT AAT CCTT 

CTTCAACTTGATTTTATTATAACGTTCTTAATACTTGTCTAA^ 

TTCTAACTTTTTCC^CCTTTGGAATTTTTTCT^ 

GATTTAAGAGATTTTGCTTCTAAATATCTCTAAATCAGAGCAAGTGCT^ 
2 0 ATTTCATAAAGGAAG AT AT ACAAGTATCTC CAAG ACACATGG AAAGAGAT CCAGTT CCATGAGCCAC CAGG AAAAT ACAGTTT AAA 
AC CATTTTTGTCACT ATT AGGACATAAT CTGT AATG AT AG AACTTT C C C CAGTTG ACTGGCTGTCTG AAGGGTG ATTGCAGAGGGT 
ACAGGGAAATTT CTGGGGGAT AGGTG AT AGGTTTTATG TTT AATTCTTTCT T AAATGTTTTGTTTGT AAC 
TCGTACCCACTCCGTTACCCTCCCTCATCTGATTCCCATGCACACT 
TTGTGGGTATGTATGGGACCCACTGAGTTTAATTAGGGCTGCTTCCATACGGGTAG^ 

CAAACAAATX^CTACACCACTGAGGAAAATGATTTCCCCTTTCCAGTAG^

AGTGCTTGTCGTGAAAGCATTGAGAACTTCAGTTTGGACAGTTAGCA 
GCTGAGGTGTGGAGCTAAGAAGATCCTTGGGACTTGCTC^ 
TGGTTTGGTTTGGTTTGGTTTGGTTTTTT AAAAT 
CTGATCCACACATtSGGAGTATGCACACCCACACACAGGTCAC^ 

AAAAATTAGCTCAGTCTTGTTAGACAAGCATGAAGACCTGAGTTTGGTTCCT

ATGAGCTGGGCTTATATGAACAGGAAGGTCCCTG<3AACTAGGTAGCCAGCAGTX3T^ 
GACCCTGTATCAAAAAATTAGGTGATGACACTCAACTAGGTGATTGACCTCTGGCCTTTATACT 
ATATCTATACATATACAAAGAAATCATATCACTGGGCTAAGAATATAACTCAGTA^ 
TATACAGTATTTAGCTTAATCAGAATXKK5GGGAGATGTAAAAGAATTTTAATC 

CCAAAGCCTTCTAGGACTCTGTGATCTGGCTGGAGT^

TTAATTTGTAGAAGGAAGCAGCCGTGAGGGGCAGACAAAATGACAAGTCAGAGAAAGAT^ 
TGCTCAGCTCTCACAACATGAGAAAGAGAGGTGACTTAAGAGAGCAGCATAGAGG^ 

GAT AAAAG ACAGAGAATAAGAAGGAGC CAGAAG ATTAGAACAG ATTGC CAGAGTT AGTTTGAGGCCAAGCAGAGCAATTCAGTCAG 
ATGCTAAGAGGAGCCAGTGTGAATCAGTCAGCTTGGAGAGGAGTTTGG^ 

AAGAGCTGAACAGATGAGATTATTCAATAGTAAGCCTCTGAAACAACAACTGTATCr

CATAGCATACACAAGCTCCCTGGCTCAAACCTCTGGAACCCACCCCCACTGCT 
TTGATTGAGGTAGAAGCTACACAGGTTTOTAATTTT 

AATTAACTCCGGAAAGATAATAAAAATGCTTAATGGTCTATTTCTAATATAATTAAATATTTTATA 

ATT CAG T ATT AT ACAG CT GTTG C CGGTT AT ATCTTTCT AAGAT AAT AAT ACGAATX3GCTTTGT CITG ACT CT 

TTGGGTGAGCATGGCTAGGCAGGT ACT AACT CTT CTGTT CACAG GACCTTCT

AAAGTCTCCTCAGTTCACCTCGATTTGTATAGAATTTGTTTGAAATTCTCAAGA^ 
AAAGTAAAATTTGAATGTGATTTTATTATTGGTACCATTTTAATAGTGATAATAAATATATATTATTTA 
TTTTCTCTCTAGTCCATATTTCAGTGAAGAATTTTATTTTGAGATTCCAAGAACTTTC 
AAAATGTTTTACAAAGAGATCTTCGTATAGGTAGGTACTATTCATAATTATTTATAATTACAACATTA 

CTGTAAGTCTTTGATATCTGCTTATTAAAGTAAAACCCATCAGGCATAGACGTTCACA

GTCCGTCCATCTCAGGCTCTCTATTGTGACATAGAGTTGCTGTCCATCTCAGGCTCTCT 

GAGAAAAAGAAAACCAAACAAATATTTTTATTAAAAAGCAAATACTAA 

AC^GCATAAGCAGATAATTTACTTCTGAAATGTTATACCCGG 

CCTGTGATCTGAAACATATAGTAGGAAAGACTCGTTCCTGTGCGGCTTTTAGGCCATGCT^ 

AGGTGGAAGTTGAGTCACTGCITGTAACTGCACCGATGTGATTTTTAGAATG

ATAAATATTTTAAAACCCrCCCrTTTAAGGTTGAGGACTTC 
CCTGAAGTACATCTTTCTAGTGTCACTTTCTACAATTCT 
ATATTTAAAAGAAAACAGATTCATAGGTTTAAAGATTTTTGTG 
CTTGAGGACTATTGCTGCAGTTGCCATTCTACTACAAATCTC^TTCCGTACT 

GCCAGGCGTGGTGGCGCACGCCTTTAATCCCAGCCCTTC^GGAGGCAGAGACAC^

CAGAGTGAGTTCCAGGACAGCTAGGGCTACACAGAGAAACCCTGTCTCAGGTATTGAGTT^ 
ACrTCTAATATCTTTTTAAAAGTTTAAGTAACAACTTTATTA 

AT ATTTAAAACTTATTTTTT AAAGACTT AGAG AAGGG AT CATTG T ACATTT ACATATT AT ACACAT ACACATG CA CAT ATCAGAG C 
AAAACATACGAAAAATGATAAGATTTTTGGAAATCCTGGGTAAAGAAAATAGTTT^ 

TTTGAAATTCTTTCTCCCCCCTCCCCTTTTTTTGAGAC

CAGGCTGGCCTAGTCCTGGGATTATCAAGATAAAATATTTGTAAAA 

GCTTCCAGAAAGAAAAAAAATACACACACATACACGCAGATATGTTGTT^ 

TAAAGTTGATACAGCrTCACATAAAAATTTATCATTTCTGTTT^ 

AT ACAGTTT GGGATACTTAAAAGGAAAGTATGTGTGGAGCAGAGACTC-GGGGAACAACCATG 

7 0 CATC CCATAAACAACCACCAAACCCAGACACTATTGCAG ATGCCAG CAAGAG CTTGCTG ACAGG AG CCTGAT ATAGCTGCCTCCTG 

AGAGGCT CTGCCGGTGCCTG^GCAAAT ACAGAGGTGG ATG CTCACAGCOVTCCATTGGACAG AGCACAGGGTCTCCAATGAAGGAGC 
T AGAG AAAGGACCCAAGGAGCTGAAGGGATTTGAAGC CC CAT AGGAGGAACAACAATATGAACTAACCAGT ACCCCCAGATCTCCC 
AGGGAATAAACCACCAACCAAAGAAAAC^CATC^TAGGACTCATAGGTCCCGC^ 
TriiGTC^aftGGRGAG^CCTTTJGTCCTGTGAAGGTTCGT^ 
TTGGTGAGCTCGGGGAGGGGGGAAGGATGGGGTTTTTTGGAGG^
AAT ATCT AAT AAAAAAAATCAAAAAAAAAAAAAAAG ACAT AAC CT AC CGTTGGGAT ACT T AAA AGG AAAGTAT x ill AAAACAAGG
AAAGACGTGAATT ACTCTT AAACT ACCCTGTATCTCCTTTGTTGGAG CTTTTGTC CAAAAC 
GATCCTTCCCAGGGCTTGCATTCAGGAAGG 

TCCrTTCATCCTCCrrrGTTCACGGTATTTCCTCTTTCCCTGAGA 
5 G AGCTAGCTTAGTAATGTTTGAT AAAATGATCTTTG AGGT A CTTTG CACATTTGTT AGT AATAGGT AAGAT ACTGTTTGT CCTTTC 
ACTTAATTGCxAGCATATAACTTCITATTAAATC 

CAACCTGGAAAATTATATAGCATTGCCATTTAGATTCTGTTTTTATCCT^ 

ACATTAAGTCCAGCTGAGTCCTTCTACTGCCCTGATTCAGAATGACCC^ 

TTTCTTCTTTAAGGTATAAGAAAAAACAAACTTTATAGAAACAAATTTT^ 
10 GGAGGATTAC CATGAGTTT AGGTCT AT CTTGGGCTAAATCG CAAGT ACCAGGCT AGCCAGGGCTACAT AG CAACAT CCAGTTTCAA 

AAAAAGGAGT AACGGTGGATTTTT AT AT CAG CACAT ACAAATT AAAAT ATTTT CTGT AAAAT ATAT AACAATG CTTATG AAC CAT A 

AGGTTATATAAACTATGGTTTCrCATCACrTTATTrATTT^ 

GTGGTAAAGAAACTTGGTTTTCACTGCAGCCGATT^ 

GAAAAACCAGTCAAAACCAACTAATTATACTAGm 
15 GACCTTAAAGT AAT AG AG AG CTGG AAGG AAATGG CTTTT CGATTTC^ ATTTTTTTTT ATTTT ATC^ 

TGTCTTCAGACACACTTGAAGAGAGTGCCAAATCCCATTACAGATGGTTGTGAG 

ACCTCTGGAAGAGCAGTCAGTGCTTTTAACTGCTG^ 

TGTGCACGGCACCTATATACATAC^TATATATACATGTATCCATACAGGAATTAAC 
TTTACrCCATTTGTAATAAACTAGATTTGTGGAGAAAAAAATGTTCCTTATTTAAAC 
ATTCCTACCTCCCCTCCTGTTTACCTCCTTTTATT^

AAAAGGCCTTGTATAAATTCTATATGTGAACAGAAATGTATTGTTGAAGTTTCTA 
ATTCTAAGGGAATCAATCTCTCTCTCTCTCTCTCTCTCTCT 
ATGATTGTGTAGCCATTCACAAGGGCATAACTCOVGGCAGGATTCT 
AAGTTTAAAACAAAGAGACAAAAAGCATACCCACAGATACTTT^ 

2 5 T CT CTGTC^TT AAATTCT AAAAAAAAAAAAAAAAAAAGT ATTGTG TG CAAAAATC C CTT CTT AAAT ACAG AAT AT AG CT AAGT ACT 

TTTTCTGTTTTATCTACTCTACCCCGTTTTAATCTTCTGTCTC 
GTTGGTGCCJUVGGGAGAAGCCATGTGAGGAAGTAGAATC^ 
TTCTGTAAAAC^AGAATAAAATTCACCATGCTTGACTAAT 
TCAGTTGGTAGAGTX3TITGGCTCGCATGTACAAAGTCCTGAGTGCCATCC 

TTGTCACCTGAGCACCTAC^AAGTCAAGGCCGCGAG

CCAAAAGCAAGTTAAGGAAGAAAGGGTTTAGTCC^CTTACGCGTCAGTTTCCAA 

AACCTGAAGAOVGAAGCCGATTGCAGAGGCCATGGAGG^ 

ACCCAGGGATGGCACACCCTACCCACAATGGGCTGGGC 

GGAGGCATTTTCTTTCTTT CT ' IT ' ITIT TAA

TTCATGTGGTTGTTGGGAATTGAATTTTAGGATCTCTGCTCACTCAGTCCCT 
TAAGTACACTGTAGCTGTCTTCAGACACACCAGAAGAGGGCATCA 
GGATTTGAACTCAGGACCTTCGGAAGAGCACTC^TGCrC^ 
GTTGAAGTTTCCTCTTTTCAAATAGTTCTAGCTTGAAGATAGTTGACTTAA 

TA CCAGATTATCCATTTTCAGTTTITGACACATGACTTAATAC^

TGCTCATGTGTACTTGAGGTTGGCTTCGATCTCACAGTGCTCTCTC 
TCTGTCAGTGATTTGAACAAGAGGAAAGAGAAGAAGAGGCITC 
AGTTTATTTATTTATTTATTTATTTAGTACGTTCTTAATGA 
GCGAGATGGCTCAGGTGGTAAGAGCACTGAGTGCTTTTCCGAA^ 
CACCCGTGATGAGATCTGACGCCCTCTTCTGGTATGTCTGAAGATAGCTACAGTGTATATATTAATAATAGTAAATCTT^
GAGTGAGCAGGGACTGAGTGAGTAGGGTTCAACGAGGACCTGACCAGAGCGAG^ 
ATG AAGG CTT ACAACCATCTGTAGAGTGACAGTATACTCACATACAT^ 

AAAATATAGAGCTCTATGTGTTCTTTGTGTAAATCTTGGAGGTGATACCAGAAACTAATACTAGAGC^^ 
GTATCTATAAAGCCATTATGTTTTATATATAGTATTGGC^TTTACATTTTTAGTACT 

cTTTCCAGAACTTTTATCTTAAATGGTAATTACT

TGGGAGGGAGGGATAATACTTTTCAGGGTAATATGCTATCATTGGG 
CTAGCCTTGTGAGTATAAAAGATTTGAAAACAGCTTGAAACTAATATA 

TT AT AAT AT CTTTT AAAGACATG CAAAAAG AAAAATAG ATGG CAG CAT AG AAAG CAATG AG ATT ACATTT AGCCAGGACT CAATCT 
AATGATTACTTTTAGTAATTATAAAATATGTACCAAGTATAGCCGTGAGTACT 

GC^CTATGAGGTAGGTATATTAACCTCATTTTGCAGATTATATTAATAG^

AGAGATCTGCCTAACTCCTAGTCACAGCACTGTGCTGTCTT^ 
AATTGCTCAGGGAGCATCAAAATATAAGAACAGAAGCCT^ 

GCTGTTTGAAGTCAAATTACAGGAACTTCCTCGGGAATGTGAGCTACATAGTTCCC^ 
GCAGTTAGGTTCTGACAGTAGCACACAAGATAGAAATTCACATAAGTTAAAA^ 

GTTGACroCCACAGAGAACATTAGTTTTTATCTG^

TCGGCAGCAATGTACTCTAAGGCAGGTGAGACCTGTGATCCACTGGTTGAT^^ 
GAAAAAGGAATAGTTATGTTTTCTTCTTTCACTCTCTT^ 
TAAACCGCTTTGGCTCCTAGCAGGTTGCGCTCTCACTGTACCCGC 
GCCAGTTAATTTTGATATGCTTTGACTAGCTGAATGGCTC^ACACTTCT 

TTTCTGAGTTAACATCTTTAAAAATCTGTATTTCATCTCTGCT

CTGCAGGCCTATAAGTTTCATCTTTTrGCrrTCCCCATC 
ATTCGCrcCACAGCAATTCTTCATTATCAGTCAAAGC 

GCCAAAATAAGACCAAACATTAGAACCAATTTCCAACAAAAGAGAAAAAAGATTCT 
GTGAGCTTGTGTATTTAGAGAAAGAATAACCITCKXn'GTCATTOT 

GGG ACCTAGAGATCACT ATTG CTAGGCT CG CTGGTCAGCAAGTCCCAGTG ATCT ACCTGTCTCTG CCT

CAACACTACAACCATCTTTGTTAAAAATTA I TTCTTT T 1 1 VI 1 1 T G AGAT TATACT ATAGTT ATATT ATTTCCCCTTTCCTCTTTC 

CAAGCCCTCCCATATATCCCTCATTGTTGTTTTTC!AAATTTATGACT 

TGTCGGTGTGTCTGTGGGTATGGGTGTGTGTGTGTGTGTGTGCTCGCGTGTGCGTGC^ 

CTGTTTC^TGTCAGATGCCCAATTGGTCrrAGCCTTCTGGGTAAGGGCT^
gtgtctgttgtcatccitcntccgcttgtatto

acataattcc^tagcaaactctctgatcctctggctctcacagtctctccacccctc^ 
aggagttgtattgttgatgaatcrcttgggactgggttccacgg 

cctttgt aaag agaagtt cc cttgat atgcaggatgaggat at aagggcagat attca c catctttattt a 

5 tacacctcttttcctgggtggctggt<xx;agttact 

cttaatgattcctaaagatgtcaaccattctttttatatgcct 
actcttagtgcctcttgtaaaatgctatattcatctttaagtctct^ 
tttacagagagaagttagctaaatttttgtcactgtccact 
gtctgggttaggcccgtttgtttaggggattgagtggatgaaatgatcag 
gccattaccactaattagtatttgtaactcaaatccgtaca^tttc

aaggaccacgtgtctgtggagatccaggaagtgacacgtgtaaatgtcggtcactcc1t 
atttcataccacgcacttgattgttatcttagccttttactttatct 

CTCATGGTAGCAGGATTTAATGAGTCCATTTTCTACCITTTTC 
TTGCCTGCATGTATGTCTGCACCACATGTGTGCCTAGTGTCTGTGAAAGCCAG 
GTTATGAAATGCCTTGGGGGTGTTGGAACCAAGGTCCT

CTTGGCAGGATTTTTAGAATATTTATTGTCTGTGTCTGTGTCT 

AGAAGTCAAAAACTGGCATCCGATCCCCTGGGGGGCAGGAGTCGGATGTGGTTGAGAGTCCArc 

GAGTCTTCTACAAG AGGAGTGCTTGCTCTCTGTC^ CTTTTCTAC CTTTTT AATAAAATACAA 

AT C l TT TTCT T GACTCTGTTTTTTGCCATTTTCCCCrCTX3ACA 
CGCTTCTTGGGACTTGATGCCCCTAAGCAAATGGGCACGTCACTTTTATC^

ATACAAGTGATGATAATACAGCCATCCTCTTAGCTGTGAGATCCTGAA 

T AGGT AC CTCGAT AAAAAACATG CTTTG CAT AAG ACTGTTG ATT ATGGG TGGGGT CT CACACAACATGT ACAGGCAGCAAAC CCAC 
ATGTTATGCCAGATGCAATGTTAGTTATGTCCnrrTCGATC 

GATCATACTGTAAGTTATCGATCGGTATATACTTTTTCIT

TCTGTGTAGTCCTGGCTGTCCTGGAACTCACTCTGTAGAC 

AATT AAAG ACGTGCAC CAC CACTG TTTT AAAT AGCC^TT AAG AAGAAATCAAG A(^TT AAACACATGTT CAG TT AAGGGATCATA 

ATATAAAAGCATAATTCATTATTATTATTATAGATTGATTTTAAAACTTTGTGA^ 

TTCTATTTCITTTTAGATTTTTTTATTTTTTTGAAAA 

AATTCAGATCTGCTTACTTCTACCTCCTCCTGAATGCT



AGTTGTGTCTAAAATTATTTAGAGGCAATTTCTATAACTAAAGCT 

CC^G CCACTCCAG AAACTG AACAGCTTAGGCAGTGTAG CCATAA 

TGAAACATACAGAGTTCTTTTTTATTTGTCTCTAA

T ATTTGGTACTCTTTGGAAT CCAGAAG AAGC ATCGGG TGTC CGGG AG CTGGAG TG CTGG 

GGTCCTCTGTGGGAGCAGCCAGTGTTCTrAACCAGAGAGCAGTCTCATGCTGAAACAGTCT 

AATATTTAAAATGTTCTATGATTTAAAGACTATTTAGAT 

ACCTGTTTTCrGTGCATTTTITGATACCAGGAGATAGCAGCrGCACT 
TTGGTATGCTGTCTTCTTATTTATCCAACTGATATGTTTTTAC^AGATTTTGAATCT

AAATTTAAAGCAATTCTCTCTGGATTCrCCCGTCTCTGACTTTCTGTACTGGGATTC 

GTGTATGTGTCTTATGTCTCCTAAGCGCTCCCCTCCTCTGTACT^ 

TATTCTCCTTTAGCTGTGGCrGATCTGATGTTAGATATATCT CATTTCAGTTACTTTAGTACTTAGCTCTAAAGTTATCATTTAAT 
TCATTTATAGTTTCCAGTTATTTGCTAGATTTTTCA 

CAAAO^TGGGTGGTATATATTTTAAGTTTGAGGCTGGCCT

ACTAAAAATTACTACATAGTATAGACCTTCAAGGTCAGTATCAAAACGCAGAGGTTGTA 

CCGAGTTTGTAAGATATATATGATTTTATCAGTATTTATACCTAAATTTTTTTTTCTG 

CTAGA UT T TTlTr CTGGACrA Tri ' T ' ri 'TGGTTCCAT^ 

ACTCTACATGACTGCTCGTTTTCTCATTGACGCCTTTTGTTGTTT^

ACTCATAACTGTATATTGCTCATTGTGTAGTAGTCATCTAGCCrAAAGTTGA 

AAGTCGGCATAAAGTAATAAAGGTGATTCAGATTATATCTTTAAATGAATGTCTAGAAGGGCTCTGA 

TGGAAATATCCCTACCCCTGATACTATCACTGAGTACACCAGGCCCTAGAC^ 

TTGTCTTGTCAGCTATGTGGAGGTAAGGCAGCACn'AATTGTTAACCGGTTTAGGATTGCT 

CAAGCTCTGGATTACAT^GTCCCTAAGCTCTTTCTTGAGGAAGCTAATGC

ATTCATCTCTTACCAGCTGTGTGTAAG AGAGAACTCTATTT^ CCATCAGTGCTTTG ACCGTATAGTGT CACTCT 

TACAGTCTC!TCTGGCTAGGTCATGAGCTTCTGGTCAGTAGTGACGGTGTCTTAC^ 

CTTCCTAAAACATAGCAGATTCTAAATCAATCTTCAGTA 

AATTTCATAGCTTrAGAAATTAATTTATTTTATGTTTTCAAAAAATATAAGTGCCT 

ATATAAACATTTATTAAGCAGGTATAAAGTTCCTAAATTrCT^^

CTTTGCTATATATGCGCTVTGCATGGACATACATTGGTGATTATTAGCCATTTAACAAATAA^ 

GTTAGTGAATAATTATGTGTGCCATTTGGATAGAAGTACATTGTAATATXTTGTTAGCAAAATAAAATA 

TCTTTTTGTCCTAGGGAAAGGTTCACCTTGAGTTA 

GTACAGTAAGTATATCCTTTTTATTCATTTATTTGTTTTTGTCAAAACCACCTAGA 

TTCTTATTTTAAATATTTTAGAAACTTC1AAAATAAAGC

ATTTAATCCACTT AAGCTATGG CAAGTCATT AATT ACT CAAGAAAGTTTT AAT G AATGTGGGTTG AG CAACAAATGT ATGT AAG CA 
TTAAGGCATGAAGGGTCATAGGAAGGAGCCTCTGGCTTCCT 
CACTCAGTCACTACGCAGAGCTTCAGGGAGCCAGGGATCAGGGTGAGGAGGTCAT 
AAGATTGCTAGCACCTTTCCCTCTAGGAAGCTCTCZAGTAGTCAGTCCAACrrCT 

TTCTTGCCTCTGTAAACGCTTGCTACCZAAGCGTTTCTC

TG AGGTT ATG AGG TTCATATCT ACAT AC CTTAGG AAGG AG TT AG AG CAGTTGTT ACTTT CCT ATTG CTG CT CCACACT AT AATT CA 
CTCTCCTAATGATGAAAACCCCAATTCCTTGAAGCATTGCCTITTCTGTGTTGT 
TCTGCTTTTATTAGCTATCTCAACTAATAAATAATAAAGAGACTAAGCGATTT^ 
GCTTAAGGCATTTTTTCCTAAAAGCTTTATAATC^TGGTAACATT^ 
CAGTTTTTGTCTTACTGATTTTCAAA
AGCACTCAGGAAGCAG^GAGGTGG'CTCT

TAGAAAGTCCTAGGACAGCCAGCCAGGGCTACAGAGAGGGTTCTGGGACAGCCAGTGCT 
CTACACAGAGACCCTGTCTTGACAACAAGAAAAAGACAGTAGAAATGAGTGTGCAGACT^ 
GAATTCTTTGGATAAGGCCCTCCTGCTCCTCCCTTCCCTGCrTGGGTTTA 
5 G ATT AG CTT ATTTGCAT ACTTGTTT AGGTTTTTCT CTTGGGTGCTTGG CTT AG TTTTT ATTTATGT AT ATTGTGTG CTTGTG TGCG 
TGTATGCCATACATGTATAAGTCCTTACAGAGGCCAGAGATGTTGGATCTCCTGA 

TTATATTTTAGGATGTAGAATATTAGGATGATTTCAAAGGAAAA^ 

TCTTCCTCACACTCAGCCCTAGGCAACCACAGTCTGATTTCTGTGTCTGi^^r 
TCTTTTGTGTCTGGCTTCrTTTACTTAGTATTCT

CAGTGTTT C CAGTGT CTCCTG AGCT AGGCACACGGGTTCTAC CACC CAGACAAACCCCAGCCC CTT ATTTTTTCCCATG ACTTG AC 

AGATAACTCAAC^UVTGTACATGCn'ACTTAAATAAGCCCAATTCCATACT 

AATGTTTTAACTTGAGGTTATGGATTTACAAAGTACAAACAGG 

CATGACTAAAC^AAATTTTTATTCCTACACTTTGTTGTGT^ 
ATCAAGGTACAGTACATACTGTCTTCrGAAGGCTrTTAT^ CTCTGCCACTCTC CCGACT

CTTTTCAAACCrCTTTTCTGTCrCCAAGTCTATAACATTGTC^ 

CGGAACTCAGTGCTTCCTTACACGCTGTGAGGTCCTGCGTTCAGTCTTGAGTAAACACAC^ 
GAAATAGAACCTAATTGTGTATAGCCITTGAGATTGGGTTTTGTGCAAATCCA 

CT AT CATTGTAAAAAGTCGTTCTGTGGGAAGGCGTTGCTAGAGAGATAGCTCGATAGTTGAAGAGC^ CATGTG CAG AGG 

ACCTGGACTACTTCCCCAGCTCACAACCGTCTTrAATTCTAACTA^

CAGGCATGTGGTAAACAGGCATACATGCAGACAAAACATTCTGGGGCTAGAGAGATC 

TAT AGAGGTTCCAAGTTCAGGTCCCAGTG CCTGTAACCGAATGG CTGACAACrGCCTGAAATGCCGTAACTCCCATGCATTAAACA 
CIT AT ATACTCGTGGAT ACATG ACATC CTTCCAGT AT ACATGAAT AAAT AATAGAT AAAA^ C CATGTT CTGT 

TCTGTGAA TTTT ' lT CCATTTATTTTTAATGTAATTATTGATAGA 

2 5 AACTT AC CTGAA(ZAT AAGTT AAAATT ATGTTTGG TTTTTCTrGG AT AT TT^ CAACTGTTTT CCCCTTT C CAGGT CT 

CCCCTTCGG AAACCCCCTATGTCATTTCC CCTCCACCn^CCTCTATG AGGGTGTTCCCCCACCCACCCACT CCCATTCTCCCACCC 
TGGCATTCCCCTACACTGGGGTATCGAGCACCCTOVGGA 
ACCATGAGCCAGCACCATGGGTCGTTCCATGTGTATTCTTTC^TTGGTGGT 
GCCTGTTGACACTGTTGCTCCCTCTATGAGX3CTGCAAACCCCCTCAGCTCCTTCAOT 

CCAAGCTCAGTCGAATGGTTGACTGCAAACTTCCTCCTCTGTATTTGT

GGCTTCCATCAGCAAGCACTTCCTGACATCCACAATAATGTCC^ 
CrCTGGATGGCCTTTTCTTCAGTCCCTGCTCCACACTTTGTCTCCAT 

GCCTCCTCTTTTACCTTTTCCTTGCTCCCATGTTTAATATAT^ 
G CATCAGGCATTT AACTT AATCCAGAAAAAAGGAAAG CAT AGTATTATTTAC^

CTGAATTAACATTCCTTTTGTCATTCTGTTTAAATATCCTCCCCGCCCCAATTTCCCATACTTCT 
ATACTCCTCATTTTCCTTCTCAGAATGTTTTGATTTTTCTCTATACACACACACAAGCTACA^ 
GCTTTGGGCGTGGGTGTGGGTGGTGATAGCATTAGTTGTTTT^ 
TGTTTGTTTTrrG T TCTTCHG^T T l TT GCTTTTTG^ 

CTGTTTCTTGCTCTCCTTTGGAGGCCX5ATTTCTGAAAGACATTTGTTTAAAGATACAGGAGTT

TATTC^CAAATGTGTAOlGCATCAAAGCrGACCAATAGATACTTACATGACAG 
ACTGGCCGTGTGTGGCTGTTGAAGACATGAAATAAGGCATATGTGACrGATATTTTGTTAAATTTT 
TTAGTGGCTGCAATATTAGTATAGCTCCAATAGAAAGCAGAAAGCCTGGAATCCT^ 
AC^AAATATATGATGAAGCTAGAATACCTTATGAAATTAGAATGTAAGGAAATACTCCAAAAAAT 

4 5 GAGACATTG AAATCAATCTGAATAAGTTT CAG AAGG CCAATTGTGGGACAGCTTCAG ATTTG AGGACAGCCTCAAAATTAGTATTA 

ATAATGGACTAAAGCTTGAAGAATAAATATTCATGAATCACTACTGATACAATAC^ 

GTTACTTTrCCTTACAGTATAATTCTAATTAATGAA 

GGTTGAAGCAGAGAGACAAGCACTCTATGCTCTCAGTTATATATGG^ 

GTTGTTCCCAGAG CCTCGGGAAGATGTAGGGG AAGGATTGGGAGAAGGAGG ATTGTTT AATGGGT ACAGAGCT AT AG CGT AGGAGG 

AATAATTTCTAATAATTTATAGCACAGAATAATTATGAGTG

CAAAGAAATTATGTTTAGATGTTAGATATGCTAATTTCTCTAATCTGATCATTACTGTGTGTGTGTGTGTGTGTATG 
ATGTGTGTGTGTGTGTGTGTGTATTGGTCTGTGACACTGTATTCATAGGTATGTACAATTACATGTCAGAAATAAAAGCATATATC 
GGGGCTGGAGAGATGGCTCAATTGTTAAGGACATTGACTCTGCTCTTCC^ 
GTTCACAACTACCTGTGACTTCAAGATCTGATGTTCTGACATAC^CATA 

5 5 GTAAATAATTGAAAAATAAAAGCATATAATTAAAATATAACTGAGATCTGGAAGGATGGCACAGTGGTTAAGATCACTTGATACTC 

TTTCAAAGGACCTAGGTTCTGTTCTAGCACCTAAATGGAAGGTGG 

ATCTTCACTGGTATCAGACATGTAGGTGGTGTAC^TACATGTATGTAAACATTCATAGAAATAAATAGCT 

GATGAGAAGTATCTAAGAGTACTAAGTATGTTAGCCTAAGTATGATTGCATTTTTGTATAGTCT 

ATTTCATAGATAACTCTTCACAGTAGCATATCAGGTAGTCTTGGTACTATGTACAGAAAAGAACAGCGAGCTA 

TCCCCCTCCAATAATAGACATGGCCCCCAGTGGAGGGATGGGGCCACCC!ATCTTCAAAATTTTTGACC

AAGGAAATTTAGGGAGAAAAATGGAACAGAAACTGAAGG-AAAGGTCAT^ 

C ACACCAAAC ACTG ACACT ATTG CTG ATGTTGTG CTTG T AG ACAGG AGC CT AG CACAG CTGT C CTCTGAG AAGCT CT ATCAGCAG C 
TGACTGAGACAGATGCAGATACTTACAAGCCAACACTGGACTGTCGGAGA 

CTAAGGGGATGGCAACCCCATAAC^AAGACCAACAGTGTCAACTAACTGGGACCCCTGCAACTCCCAG 
AAGAGTGTACATAGGCTGGTCCGTGCCCCCTCCCCCAATCCACACAGAGGACTACCT^

TCT AG AGACTTGG TGG C C CTTT ATCTTATTTTT AAAT AT AT AAAACT AT ATTT AAGCTG GGACATT CAC CCCAAATAAAACCTCTTT 
GGGAGTAGAAACTAGACZATGTGTCTGATTTATTTAAATGGTATTTATTTTG^ 
AAGAGCAAATGTCAGACGAACTTAGATTAAGTAATGTGAAAGCAGTTATCACAAGGTCAGAG 
GCAGTGTAGAATCCGATACTGCATCCTGGAGTTGAGCAAAATTTG^ 

TCAGTTTCTTGATTTCAGTAGCTTCACCTTGGTTAC

GCACTGCCACTGTCACACACCTGAACTTTTACACATTCACV^ 
GTCTACCTGTGTTTCTTTAATGAGACCTTACATTTTTT^ 
TTTTT AAATTTAGAAAT AG AAAACCTCATAGGAAGGAGGAAAGAAACAAAAA 
ACATGTCACCCCCTCTGGCTCACTCGTCCCATGGTGAGAAAGGAAGGACTCAGCCAC
ATCAGTTTCACGAAGAAGCCAGTGAACCCCATGATAGCAA

TCAAGTTTGGTACATCTTTTAGTCAGCCGCATTGAGTCTTTTACAAACT 

GTGTGTGGGATGACGGTTTGAGAGGGTAGAGGCACGTAGCCTAGGAGAGAATTG 

TGTCTATGGTGTGTGAGGTAAAAATTTTCAGTGTTCTTGTCTACAAT^ 
TTCTCACTGTAGGCCATGCTTATTCATTGTTTGAGTTGGACATAGAA

GGTTTTTAGGTATCAGTTAACTTACTTGGAAACAATTCATTCTGTT^ 

CATATTATCATAGAGCTTGCAAGAGGCCCACCCAGC 

GJACCCCAAAGTATTTCTTTGGGCTGTACTTTTTTTGGTTAGTTT 

AAATGAGTCTTGTTTTCAATTTTCTCTTTACTTGGCAATACAT 
GATGTTTCTCAGTAGGGGTTTGTGAAACAAAAACAAAACACTTGGCA

ACTTTAAACCAACTAACTTAGTAATAATTACTGAGGAATAAACCAACTAACTTA^ 

TCATTAGAGCGTGTTCTTTGCTTGCTATAAATACAACCATAGTCCGTACAGTAGATTGGATO 

TTTTGTACAGAGTGATATAGGAGCTGCTGITGAGACTCAGCATGAGAGTCrGG 

TCAGATTTTAAGTAAAACCACAACCATTTCCTTGTGTGTCATTCATGCT^ 
TGTAGTTTCTCATGTAAGGCTACACCACTTTCTTTTCra

ACCTTGAGCATTTCCCAGGAAGAACTGTGCTATGCTGGTCTGAGTACCGATTGTCCT^ 

ATGTAAATAAGTAATTTGGAGTAGTTGATACATTGGAACTACAAAAATGGAAACAG 

TTGAAAGATAGTGTTTTAGGTTTGGTAGTGCAGCTCAGTTGCAGGGCACTTGCCCT 

CAGTCCACAAGAAGGAAACGAGAGGGAGAGAGAGAGGGGGTAGATACTATGTTATGATGTT 
TGTTCTACCTTAGC^TCAAGGCCTXSCCATGGGTTGCCTCTCATAAAC

GGCCCCTCTAGGTAATGCTTACTGAATTGTTACTGCATTTTCAAGTTTAAAGT^ 

ATCAAGTGCTGAGTAACTGACATAAAATAATGCTTTAAAAGAATTTTTTGTAAGTTCTA^ 

TGTTATCACAGAAAGATACTGGGTGTTATTTCAAATCTGCTATGTGCTTATTAGGCT 

TAAAATGTACTTTAAAGTATATTCCTGGGCCAGGTATGGGGGATGCATGCCTTTAAT^ 

2 5 ATCTCTGTGAAT CAAGG CCAC CTT AG ACTGTATAG CAAACT ACAGG CCATC CT^GG ATCT AAAAGAAAACAAAAAACC^AAAAAAAA 

AAACAAATCATATACACCTGGCTGCCnTTACTCATTAACATTCTrACT 

ATTTTAGTGATTGTCAACTCAAAATAGCAAAACAGTTTCCATACCAGTATCTGGGAAGCCGCT 

GTTCATGTGTCGGTGCTTTGTGAAGAGAGCTCCAAACTCTAGAAGAACTACTG^ 

TTGAGTAATCGGTCTATAATTTATGAAAAACCAGTAACACTTTG 

ACTGACTAGTCTGGAGTTTGTGTGCAACTTAAAGCTCTTTGAATCAAGAAACAAAAG

TTTATCACCTAAAGCTGTAGTCTTTGTTTTTTCGGTCATCAAGTCTTAAAAA 
CCATGGAGCTTAAATACTGATATGCTATTATTACAGATTGGTTTTAAATTTAAAA 
AAAGACTTTTCACATCAATGTTAAAAATGTTCATCCTCACTGGATA 
GTAGAGGCTCACACATTCTGGCCAGCCTGCACTACTCACAACCATGTCTCACAGAG 

GAGCTGCTTGTrTGGAGTAAACCTCTTTTGTAGTTCTGTCATTGAACCAGACTATTGTT

ArrCTCAACTCAGGAAGTTGTTCTrGGTTTCTGTTTAATGATTTCT 
TTTCACAAGAACTATTTTTTTAAGATTTATTTATGT^ 

GTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGTGCATGCCAG 
G CTGGGAATTGAACCAAGATCCTTTAGAAGAGCAGCCAGTG CTCTTAAC CACTGAGCCATCTCTCCAACCCGAAACTTTTTTTTTA 
ATCACAGTTGATTTTGCTATTATACTACCGTTGCACAGACATTCCCATTGGTTGAGTCTAGTCAAAGA
TTTCTTTATAACTGCCTCTGAAATTGTCCATTCACTGAGAATTAGCA 
CTCAGAAGTACACTGGGTTCAGAGTTACCTTTCATGCTTCTCA 

TCTCACCACTCCTTCTGGTAGTGTAATCGGATCAGCTTCTCCTAGCCGTGGAGACAGTAACCCAATCTTGCCATGTATAGATA 
AAAACTTGCATCCTGTGTGTGGATATTCATCAGATTCAAGTTTGTGAGGTTTATATCC^ 

AGTTACTGT CTTCTCTGAGTTGTCCTAAAATGTAGGCAGAGGTTAGGCATCCTAGCTGGGAA

ATTAAGTTCTACMGGTATAACTCTACTACATTGG 

CAGTTCAATGAAGTCTTCTATTTTGAGGTAATTTCTTGTTTTATATAAATGTTAATG 
TCGTAAACCATAGCCATCAAAATTGTTTTATTAATTAATGCAGCTAATAATCTCTACCCAGAG^^ 

CATG CAGCTCTGGTTGGTTTGGAAGTGGGG AGGGCT AG AGCCCCT AC CACAT AGTCACATTGTCACAG CTCCCTTGAGAGCCTCCC 

CACACTTTCCAGTCTACCTGACACATAGTGACTTCT

ATCTGAAACCTTTTCCCAGCrGATATTAATGATCAATTAAATGTAAACCCTGCCTTCCATATCCT 
GAAGAACGGTAGCAAATCTGCCAAATATGAGACTACTTTGAGAGTTTGAAACAGTTT 
CTAAATCTAATTTACTCTGTAGATGTAGCTTGTTCATGCGGATCTCCAGCTGCCTTGACT 
TTGCATCTTCAGCTGCTTTGAAATTGCCCCAAATAAAGTGCTTTGGCATTCTTTAAAGTATTTAGTC 

GACTATAGAAAAACTAAAAATCAAAATATCTAAAAGGTTGTAATAACAGAACAA^

TGTAAAAGTGGGT AATTACATCTTGAAGTTTTTATTTTGTTTTTTTC CTCTCT CACCAAAGTACCCCAACTCATG 
CCTGATTGTTGTTCTTTAATAACAAGGAGCATGAGCrGAAGAAAATG 

AACCATGAGAATGAGAAGAAGATGACAGTTCTAACAGCTCCTGCCCAAAGTCACTCAOT 

GGAATTCGCTCCTTAGGAAGAACCTGTAGCCA

CTCAGGCCTTAAACTGTTTATTCATCTAGTCCTTTGTTAGATTGG 
AAGAAAGCACACACAGGTGCATATACACAGCTATAATGTGGAGC^ 

ATGTGGAGCAAGACATAAGAAAGCACACACAGGTGCATATACACAGCTATAATGTGGAGCMGACATAAGAAAGCA 
CATATACACAGCTATAATGTGGAGCACCCTCTCTCGAGCTCAGGGTTAAAACAGTCCAAAGCT 

6 5 AT ATT ACAGAG ACAG AACACTTTG GG CACAGAGACAG AAAAACT G AAAG TT AT AC C CAGAGAACAT CAAGTAAAGATTT AACT AAG 

TTTAGTAAAGTAACCAAAAAAGATGGGATAAACTGGTAAGTGTCC!GGCXK»TGCTr 
CCAGGATCTGTTAGCAGAGCCCAGAGACCTTCAAAGAGCTG 
GAAAGAAATCAAAAGCATCCATGGATCTGTGTAAAATTCCTCCACC^^ 
AGAGGTGTAGGCAGGAAGAGCAGTTTCAAGGCCAGTCrrCAGACCACCT^ 

TCATGACTGAAGCATGGTAGGTGATAACTATGGATCAGAATAGAGG

TTTTTAAAGTGAGGGGAATAAATCTAGCTCTGCCTGCCTCAAAGCAACAGTCT 
CCAGACMCATGGTGTGGCGTTAAGTTACATTTAGAACAAAATAAAACAACCT^ 
AGAGAGCAJIGAGGTAGCAGGCTAGCCCACAGGTAAACCGAAT^ 
AAAGGAGAGAAGAAAGAAGAGAGAGAGAGGCCAAGGACACTGACA^
GTAGACCAAACGCTTATAACACATGCTTGTTAGTT

CTCTTCAGAAGTCCTGAGGGTGGGGGAAGTGGATGCTCAGCTTGGTG^ 

CAGTAAAAACTGTGAGAAGGAAATATGAGTAGGTGTGACCCGTGCMACAGAGCGTTCTAAC 

TAGAACTGACGGTGTGGAGGCX^GGAGGCCTCTGATTTT^ 
TGCTTTGGAAATTGCATTCACACTTGACTCTTTATATTTGTAATO

GAT ACTTT AAACAGTCTCT ACTGAAAT CCTT CGTTTGTAGGTAAC C AG AT C CAG CAGTTACAG CAGGAAGT CC CAG TT C CAGGTGG 

AGG AGG AGGACATTGAG AAGCTGG AG ATC AGGTCAGTGC CGTC CTGTGGCG CTG GAGAACGGt- 1 TT TCTC CATG CAAAC CTCCAT A 

GAGCCCTTCATAGCAAGCTGATCGATGAACTTAGATGAACAGGCATCTAACTTAA 

CAGAAATATCAAGTAGTTCTCTTGTGTGAGCAAAGACAGCCTA^ 
1 0 TT CACT AC CTGG ATT ATAGT AT AATG AATGGAAG ACTGATT AT ATTT AATC CAATTT AG LTT AACAT 

ATCTGCTGACTATCAGACATTTCAGTGAGGTAAGGCGATACCCGCA 

TCAATATCTATGCTGTTTTGGAAAGGAAACTTTTGCAAATGCATGT 

TAATCATGTGTTCACATTGTTAGGCAATATTTTTCTAAATCACCG^ 

AGAAAAAGTGTGGTCTTGTTTTGTTTCCTCAATTCAGTAAAGTAACCC 
AAAAAAACAACCCrAGACTTTCAAACTTAGTTTCTCCTTT^

CACTTGCTGCCAAACAAGTCTTCCTGAGTTGTATTCCrrGGACCCTCACAACGGA 

GACCTCCACTGTGCACAGTGGCATGCATCCCTCCCCCAAAACTAGTTAATAAGG 

TTCATAAAAATTGCTGTAGTAATTTGCTTTCAATGTCTAGATAATGCATTATATGACAATGCT 

ATGTAAAAATATAGTACAAATCAAAATCACTTGAAACTACTCAGTGTGGTTTGCCAA^ 
TTGACTTGTGGAATAATGAAAACCTAGTCCAAGATGTTT^

TTTCATCAAGCCTG<3TAAGAAGACAGATACTCTAATGAACCAAATTT 

AAAT AAGT AAAAAG TT ACTGAT ATT AATTTGGTT ATG AATTTGTT ATT AG AGG AAA^ CACTTGA 
TCTGTGATACTGAGATAAAGAATTTAGTTGTCTA^ 

CTCTCTTCTCAGTTTGGACGGGGTTTCAGATCCTAGCACCTACCTCAGATCTCT 

ATCCACTGGTCTCTTTGGTCTTGATGGGCACCTGTACATGGACACACACAAAC^CACACACACT

GAAATG AG AAACATT AAG ACAT CG AATGAACCG CTCGTTGATG CTTGATCACATGTCAT C ATCTGCAGTTTGGATGCTTTT C GGT C 

AAAGAATTGTGGGTAGAAAGCAGAGGGCTTATTGAGAAGACAC^^ 

ACATTTACAAATGCAGATAACCTCGATGTGTTTTCT 

AGTGATACTGAATTAATAACATTAATGTTAATAACCITAATGTTTrGA 

CTATAGCCCTTAAAAACTAGTGTTTTCTGTCTCTTCX3TGTGTCTGAGCCCATGTTTCT

ACACTGGATATGTTCACCAGTCACTCTCAGTTTCTGAAGAGAGGACTTCAGAACCAGGAG 

CAGTGAGCTCCAGGGACTGCCTGTCTTTTCCTGCCCCX3TGCAAGGATTC 

TGCTGGGAACCTAAACTCGGATTCTCATGCTCTGCAGAGCAAGCACTTTATCAGCTGA 

TTTAAAGGTCACTTTTTAAAATGAAATGTAATATTTGTGGCTGCTTATTAAGTGT

AAATCAGCATAGTCAGCATTTCCATCTCTCAAATACCATTTCCATGTTGGATGCTGGAGAATTAGACT 

AGTTGTT ACGGACCAT AGTG AACCTGTTGGATT ATAAACAGAG CTGCTACTCCCACG CTG CATTGAGAGCCCGTTCTAATAATAAT 
ACCTTTCTCTGATG ATCTCCCCTC CTGC CTCT AGCAACTT ACAOTCTTTGTTTTTATGTGGATCAACTTTA 
TAAGAACITGTGGCCTTTGTCTTTCrrGTGCCrTACrTCACATGA 

GATTTCACTCrTTAATAGaVAACCGTCTGTTGATGAGTAAATGAATTTAAA

GAACCGCTCTTTAGAACTCAGCTGCCTAGAAGAGGAGTTGGTGTTGACTGTTTAGGGGTTGGGTTTT^ 
ATTGTTTTAGrrTCTGAGACAGAATCTCACTATTTAACCTTGACT 
AGAGATCCACCTGTCTTTGCCTTCrAAGTGCTGAGATCTAAGCTGTGC^ 
GTCTCACTTGCCTTTAAAGGAGCCTGCCACCAGACAGTCCT 

TTTAGACTAATTTTCAAATTTTATTTTrTAACTCT^

TAATTTCTAAATGCATCC!AGAGAAGGAGAAAAATGACTTTAATTGT^ 
TCATATTTCCTCCCTTCrrCCATGAAAATTCCTGAACTCAGTTTTTAAGTCrTGAA 
AAAATGAGTTGTTGTGTCTCCCAGGGGCCTCCACAATGAGCCTCAGACCTGCCCCCAC 
CAAATTTGACAGAACTTTCTACCITGGCCAACTTGGAATATTAACAGACAAT 

5 0 ATAAATACTTCATGTGCTCTTGAGTGCTGGTGTCrTTCAGGT 

GGCrGCCTGATTTTGGTTGCTAAGAATTGAACTCGTTCTTTGTTTATGATCr 

AATATTTTAATTTAAAGTAGTCTGTCAGAAGCATAAATGCTAAAATTTATGTAGCATAAAAGAGAAGAATCTGCTAAT^ 

ATTAATGTACATTGTATGGAAGTATAAAAATGTTAATGCCATTTTAATTGAAGATCTTGATCT 

GCATCCTCTAGTGGTATTAAAGATAACTGCACAAAACCTCAT^ 

5 5 CATAATAACTTATTAGTAGGCAGTACTCrrrGCTTTAAGTTTTTCTA 

ATATTGTATCTCTAAAAACTGGTTTTTGATGGTCTA 

GT CTGTGT ACTGCATGT ATGCCTG G TG CTTGTGGGGGTCATCAGATGGTGT CAGATG C CCTG C CAC AGG AATT AG AG AT AGTTGTG 
AGCCAACATGTGGGTCCTCTACAGAAGCGGCAAGTGGTCTCCACTGCTGCATCATCT 

TTGCAG ATT AT AATACAATGACT CATGCCACTTTCCCTTGCTTTCT AAGACGTCTTTTGTTG CCCTTATGTACATATT ACTTTT AT 

6 0 GTGTGTCCCTXGCGCGCGCAGCCTGTGAATGCCTGCAGGAATTGTTCT 

GCATCTCTGTTTACCTTGAGTCGGTTCTTGGTGGTGG^rGGAACCrGGGAGAA 

ACTCAGAGCATCACACATTTCATACTCGGAGTGATTTACCTCCTTA^ 

TTAGCATTGTCTTTTTTGTGTTGATATTAGCTAATTT^ 

ATAAGTTTTTTTTAATAGAAATTACACCATTCCTAGGTAAAAAGATGTAAAAG 

6 5 TAGGTCCTATTTCTGAGCCAGCCTCTGTTCTCTAGATTCTGGGAATATTT^ 

TTCGTTG TTTTGTTGT ATTGAAC CTGGG TTTCTGTG TGGCCTGGG AGTCAT AATCTC CCAAG TG CACACACG ACCACATGCCCACC 

TGTACTATGCCTGTATGTTTACTCAACAGTGTAGTAGAACACTATAGATAGGCGTTGACTCTTGA 

CTAGAATCAGACAGCCTTAATTCAGGTCCCIATCCTCGCCAGTAACCACrTG 

70 aTCCCGCTTGCTCGTTAACTCACTACCATCACGTGTA^ 

AATGGAAACAAATCATCTAAAACTOATGACCTGGGATCTCT^ 
GTACTATGGACCGCTGAAAACTTTGCTGCTAAAGTCACCAGATGTTCA^ 
TC^TTTGTTCTGGATTTGTATATAGGTACAAAGCITCAAGGGATAAA 
GGAGCCCTTGTGGTCAGCTCTG^G-AACTACTTTAAAAATCT^ 

7 5 GTCTCTGCTCTGAACrrrGGTC^GAC^ 
GCTTGCTGACAGGAGCCTGATATAGCTCTCTCCTGAGAGG

CATTGGATGGAG CACAGGAT CCACAATG AAGGAGCT AG AGAAAGT AC C CAAGGAG CT AAAGGGGTTTTCAGCC C CATAGGAAGAG C 

AACAATATGAACTAACCAGTACCCCCAGAGCTCCCTGGGACTAAGCCACCAATCAAAGAAAACACAT 

GCTGCATATGTAGCAGAGGATGGCCTAGTCAGTCATCAATGGGA(^ 
5 GGGGAATGCCAGAGCCAGAAATGGGAGTGGGTGGGTTGGTGAGCAGGGG 

TAG CAAATTTCATTTG AAATT AAAATATCT AATAAAAAAAAAG AAATT AAA^ CTT ACATGAAAAGGCT AAACTAATTCTGCA 

AATGATAAAAGATTTCATAGTGTCAGCTCTTACAGAGTGCTGTGTATTTGATTT 

TTATCTCTGATTATAATTTAGAAAACTAAATACTATCTTACCTCTG 

GGTTAGTGATTACGCATGCAACCATGGTTCCTTGATTCTGTCCTTGTT^ 
1 0 CTTATATTTTGGGTGAAATATGTGAAGATCAAAAGGATGCTGTTTTGCCC 

CCTTTTATCACTGCGGTGGC CGAG CTGG ACTTGAAGG AT ACT CCGT AAGAACTG T ATTTCATT AAATGTTT AAACTT CAATTTGCC 

AGATTGTTTTTATAGTCTCAGTTCTTTTGTATTGACC^ 

GACAGTTCTTAGAGTTACCAAACAGA 

CrTTGTTACTATTACTGATCITTGCrCCrGGATG 
1 5 GGCACACXSTAGAGCGTTGTGAAGAGCGGTCAGTGCCT^ 

CCACIACTACACCATCATCTTATGCACACAGACCCAC^ 

GC^CAOVGACCCACACTACACCATCATCTTATGCACACAGACCCAC^ 

TCATCHTATGCACACAGACCCACACTACACCM 

CACTACACCATCATCITATGCACACAGACCCACACTAGA 
2 0 ATGGCAGCAAACATTGCTGTTAATAGTAGTAAAGGTTACTAAAATTGTGTATATGTTCA 

TTCTTCTTTTTGCTGTCAATGGTTTTTGTCT 

GTGTCTGACTGAGATGATGAAGATAGTGGGAGGGCACTACCrrcAAAGTCACGCT 
CTCACAGGCAGAGCCTGTGACCAGCTAGAGCAGCCTTCT 

TGCGTGTCAGATGTTTACGTCATGATTCGTAACCACAGCTAAATACAGTTATGAA 

2 5 ACCCCAACATGCGGAGCTGTAGGAGAGGGCCCCAGCAGTAGGAAGGCTGAGA 

TGTGTTGATTAAAAGGGTCTGTCCATTCATGGGCTACrrGGAGGAATAGGTGTAG^ 
GGATTCATTCTTACTCACAAGACACttCTGATC 

GG AAATTGTTT ATGCTGCTTTAAGTTATCTT CAGGTGGAAG CTTTAAATTTTGTTAGATTTATTTTTTATAATATCAGAAGAAGAA 
GAATTATATTCATGTTGTTAAATC^CCTAGGAAAGAAGAGTAACTGAGAAATACTTGTATATTC 

3 0 CCTGTGAAATAGATCCTGTTAAGTTGAAAGAGGGAGATAATGTAGAGAATAATAAGGTATGA 

ATTAATTTTT ATCTTCATGTGAT ACCTGTCT CCCATAAATGGTTTATTTTATTTTCTATGTATGAGTGTTTTGTCTG CATGTATGT 
CAGTGTGCCATATGTGTGCCTGGTACCTGTGGAAGTCAGAAGAGGGCATTGAATCATCTGGGA^ 
CATTATGnrGGGTGCTAAAATGAAACCCAGGTCCTCrGCAAGGAC^ 
ATATACTCTTAAACTGTGCTTCTTGTGAAAATTACT(IATATATTCATTCATTT 

3 5 ACCTTAATGTTGATCGGCCTTTAACCTGAGTTGGGACTAGAA 

ATTTGTTCTCATGAAGATATTGAAGCTGTGTGACAATCAGAATCTTAGTGTTTACTT^ 
CTATGTGGAGAAAGTATTCAATACAATCGTAGGATCAAGTGTGAGCTGCCCCACTC 
AAATGGCTGCTAAGAAATTTCCTAGTAAGTGCCITGGTATACTTAATCACATTTTC 
TTGGTTTGTGTGAAATACTTGTGCATAGGTACAATCT 

4 0 ATTTTTTCTAGAATCTCAAAACATGCAATTCACITTATTTCA 

TAAGAAGGGACAGTTTGCATCTGTACAGTTTTCTATAATATGTAAGGAAATCTTTATC 

AACC CGTGTGTACCAG CT ATTTTTTC CAG TTG CAG ATTTTCAAAC AGT AG T AATTGGT AG TTTT AT AGAATTG ACT AGATTTGACA 

GCITGAAGATACTAAGATGTCTGTGGTGAGATGTCTTTTCAGAA^ 

ATGATCTTGCTTCAGGTTTTGCAAAAATATCTATATTAAGATTTTGCTTGGAA^ 

4 5 TGTTAGCACTTGGAAATACTCAAAATTreAG 

CACAGACAAC^CAACTATTTAGAATAATAATCAGGTTTATCr 

ATTTTGTT ATTTCATT ATGCTTCCATTCGTCAGACCCTTTATTTAATTCACT 

TCTTGGCTAATTACAATAAAGAGATAAAACATTTA^ 

TGATTTTGCATACATAAAGATGAAATAGCAATTAGATTCTGTTTTCAAATA^ 

5 0 GTCTGTCTTATCCTITACrCTTAACATGTTTG AAAC C^TTACCCA.TTGATTAGCCAATGAAACTGAGGTCTCAGTTGTTGTTACTA 

C CTGGAGCTCTGCTACCCTGCTAGCAGGGCAGGCAGTCATAATGATGG AC AG CTTG ACAGTTCCCT ACTGGGTG ATTTGTAAGACT 
AAACCCCACGTCAAATCCGGAGATTCTTTCCCTCACTGCCCTTTCTCT 
AAACAAGTCCTGACAATGAGTGATCATTGCTACAAGTAGCTTTCAATTCA^ 
TACTGTCrCTACCrAATTGCTTTTAGTCTTTAAATTGTTGGCTTTTCT^ 

5 5 AATTGTGGGTTCATAGTTTCAGAAGCCTCACTCCAACACCGACAGGCT 

GTTACGGCMCTTGTAGCAGAGACCGCTTGTCTCCTGGCAGTGAGGAGG 
CCTTCAAAGACATATCCAAAGTCCTCTGGTTCCGCTAATAAGCCTCCAGCT^ 
CAGTTGGAATCCGTCAGTGTGGTGAACGTTGATTAGATCGTGATCTAGTCATCT 
GTGGTCTACTGGfrCTTGTAGGCATTTTTCATCrrAATGAACTG 

6 0 TTTTT AGGGTG AATTT AAAACAAAAAT AT AAAACTGAAAG CTAGTTTTT GG T ATTC AGTGTTGT CACT AGG T AGT ACT ACG T AGGG 

CTTTAATGTTACITTTTTTTAAAATAATTTCAAGAATCTT 
GAGCAACTTTGCTTTCCGTTTGTTATTTTAAGTTACAGCATTTTTTTC 
GTTTTTACAAGTTATTTTCCTITCCTAC^TAGTTTACATGArrrri 
GCATGACGTTTCCCArrTTATCCCTGC^GATCACCCTCATGrrCAGTACTCT 

6 5 CTGTAGCGATACTCTCACCTCATGCTTTTC^TTTGCGACCTCATTATCCAGTAA^ 

TTATTT ATTTTTTGT AAAATG AAG AAGAAATATG C CATTTTTCTTTTT CGGTG AT AAG ACATTTCACAGTTGG CAGAATGCTG CT A 
AGTTGATGTCTGCrrGATGAGCAGAGGGGAGCACTCCT 

GGAGCTGGGGATGCTGGGGTGGTAGGTGGAGTTGGGGCTGGTTGTTAGCCGACCCT 
ATTTCTATATTCTGAGGCAGACTCTTGTTCCATAGCCGA 

7 0 GAGTACAGGCATTCCCACCGCTCCTGGTTTACGGTGGTTTAATC^TGGCAGTATTTTAAGATCT 

GAAGTTTATTTGAAAAGTATTCAGGAACCTGACTTACTTGG 
ATGAGACTAAAATGAAGTTCTGTTCAGTTACCTAAATCTAC!ATCTGCGTTGAAT 
TCTTTTAACCTATTTCrrAGTCCTTCATATAGCATATTTrTAGTCACT 
TTGATGCTTTAGCTGCTGCrrGTGTACAGTAGAATAGGAAACA 
7 5 TGTCTAACTCGCTTAGGTCTTCCTAGTCACTGTAGGAGTAAATT 
TAACTTCTCTAGGCTCATATAACTAAGACATCACATTTGGATCCTACACAATTTGTACT^
CACAGTCTATAAGCAC^CTC^TATTCAAAATCTrm^ 

CC AG AAGAGGG CATCAGAT ACCATTACAG ATGGTTG TGAG C CAT CATGT AG TTGCTGGGAATTG AACTCAGGACCTCTGGAAGAGT 

AGTCAGTGCTCTTAACCGCTGAGCCATCTCTCCAGCCCTGGTC 
5 AGACATATGCAGCCTTAATTGACATTCCCTACTTCAATGTACCCATG<3TCCATCT 

AAACTGG CTTCTG AATG TT CACTTCCCCTC CGAATAGCCTCACAGTGTT AG CTGAG AG C CATGCTG CAGTX3GCAGAAGATGG ATGT 

TCTGTGTAGCCTGTATCCTTTTGTTCATGGAGAACCCTTCTGTGGCTC 

TTGCCAGCCACATTCTATACAGAAGTAGAACTCCOlCCTCT^ 

AATTTGAAGAAAATCCCTCTAACACAAACACTTT^ 
1 0 TTGCCCCCATCCCCAATACCTCAGCATCTCCATCCAGTCTTTAGCCATAATTCTT 

AGAGACAATATGACATTTGACATTGGTGCTGTAGGGATGAG 

CCAGCACATGATTAGCCAAGTAAAAAGTAATAGTAAGCACTGCT 

TCATCAGGAACTGGAAGTTACAAAGTAATACAGGTGCAGAATTGATCA 

AAG AAACT AATT ATAGAAG ATGTTTT AG AGT AGACAAGAAAGAAT AAAG AA CAT AG CAAGGGCAGGAG 

1 5 ATCTCAGCGGTGCTTGCTGAGAGTCAAGGGATGTGTGCA 

AAG GCATGGG CTGG AGAGTTGG CT CCG CGGTT AAG AG CATTAACTGT ACTTCCAG AGGTCCTGAGTT CAATTCC C AGCJ^ 

C^TGGCTCACAACCATCTGTAATGAGATCAGATGCCCT 

AT AAATCTT AAGAAAG AAAGAAAAGAAGAAGAAAGGGTGAG CTGGAT^ TT AAG AAGGCAT ACTGCTCTTT CT A 
AGCCCCTAACTTOUTITCCCAGCACCACGT^ 
2 0 AGCTCTGGGGACGTGCACCCTGTGTGCACATACCTACACACAGATACACA 

ATAATGAGATGGACTAAGGGCTGGGACTAGAGCTCAATGTTCAGTCATTTGCCAAGC^ 
GGCTGTAAGAAAGGAAAAAAAACC^AAACCTTCACTTGAACCTCCT^ 
TAAACATTGATTTTGTAAACCAATGTTGAAATTTCAAAAGGA 
TATCCCTTTOAAGGCCCTCCTGCTCA^ 

2 5 CACTCTCTCTCCCTCTCATGGCATTTTCTATTCTGGCTAGTAACA 

CCAGAACAAGGGAGGACCTTTTCCTAATATGTGTATCCAAAGCATGTG^ 
AAAAATGATAGTTGAGAGAAGGAACTCATCAAGTGAGTCTTTTCTTAATGGAGAA^ 
GCAGGCACTCGAAAAGAGGGCTTGCTGCCTCCAGGTCAGAATCCTAGJ^ 
TGCTCTTTTGAAAGATAACAGCAAACTTGGGACATTAGTTTT 

3 0 GAATGTGTGCTGAGTTACTTTGAATTAAAAAGTTTAACATTTTGGTTGACTT^ 

ACTGGAAGCCAGGTGAGGAAGGTCCTGGGTATACAGTGATGGGAAATTT^ 
CAGCOSAGTGACGTGGTACTACAGTAGCCACGGTTATGT^ 

TGGGTCTAAGCCAGCCTTG<;TGACCTGACACTGAATAAAATGAGCTCTGTTCCCTCTTGCT 
CACTGAATTTAAAAACC&CGTTTAAGCCCGGCAGTGGTGACA 

3 5 TGTGTTC^GGGCCAGAATGGTCTAC^AAAGAGTTCCAGGACAGCCAGGGCTACACTG 

ACAAACAAATCAACAAGTTTAAATGGAGAAGTCTTGTGAAATTCAATTCAAACTTAAT 
TCTCTTCCCAGGTAGAGCTGTAGTGGGGTTCTCTTACAGGTAGAACAGAGTCCCAGGTGCTCC^ 
CATAATTTATAACTTTTACCAGATATATCATAATGGTCATATACCAGATATTCCTTCTCCGGA 
TTCCCCACTACATTCATTGCCTGCTTTTAATATGAGATATTGAAAGATAAAACA^ 

4 0 ATGTCTTTTGATAATATTGTTTACTGTGGTAAGATAGTTGGTTATGTAGACATGTAGGTCAGTAAGACTCA 

CTGT CCATTT ATT AT ATGT ACTTT AATG T AAATT ACACT ACT AT CAG GT AGACAGACCTGAAAGAT AACGTGGCT AA CAGACCCAA 
ACAGTTACCTTCCAGTACGTTCTTATGTGAGTGGGTGACTCCATTTCCTGGTCTGAAT 
GAAC!AGTTGATTGCAGAAGTCTTACGTGTTCTTTGAATATC 
AGAAAAAAAAAATAGGTATCTCTGATATTCCAAAAATAAATAGATT^ 

4 5 ATACAGTTGATTACATTATGTTTTTCTTTTTTTCTAAGTCCITAG 

GTACCACATTATGTTTTTCAAAGGGAACAAGTCATTAAGTGTT^ 
AGGTCACGCATGCTAGGCAAGCIATTCCACCACTGAGCTGTTCTACCGG 
TCGGTreCCTGGACTGGATTTTCrCTGTAGCriT^ 
CGTAGACCTACACCACCAACTTCTTTCATTGTCTCCCAACCCAAGT^ 

5 0 TTGTTTGAATGAACATTTTTAGCCATTATTTTAGAGCCATATAGCCCATTCACTCTATTCTGCT^ 

GAAAACIT^AAGAATTTTGATCATTCTTACCTTTTCTAGGATACACAGA 
ATTATTGGAAACTGGGGATGTCAGTCCAGAAAAAAGGTGAGTTTATAACTGTTTCCTCTT^ 
ACTAAGATTCCTTCTTCACTTTACTTCCTTACATGTTAAAAATAAAACTGAA^ 
GTGGGTGTTCTACTGTAGTCACCTTTTGAAGGAGA 

5 5 ACACAGGCACAGCTGAGTTATGTGGTGGTCTGTGCCATAGTCACCTAACA^ 

TCATGGAGACATG^ACAGCTTTATCCTTAGCTTCAAAATACAGGCCCGTCATTGTCTT^ 
GTCACTCATGAAGCCGTTTACTTAAGAAACTTATGCAACCTTACGTATAAAAATCAAACAT^ 
TTTrGAAGTCCTC^GCTCCCAGTGCTGAGGAGTTTTTTCTTCCA 
AGACCTAGAAGGAAAAGGTCAAACAGTGACTGCCAGTTCAAAGCTATATGCA 

6 0 CTCTCIACCAAGAATGTTCATCAAAGAAGGGAGAAGAGCCAGGCrCAGATCAATAAAGTAA 

AAAAAAAAGGCTTTGCTGGCACCCTGCATCTGGCCCAGCATGAGGTATC^^ 

GGAGCCTGATCATGTTCAGGGAGAGGCTGAGCATGATATCCCATGTGCT 

CATGATCCTGGAACATTCTGGGTTCGCTGACTTGGAGGAATG 

TTCCCAG CC CGCCTGCTGTCCCACATGGAG AAGAGTTGAAGAAGCTCT CTATAGTTGCTTCTGAGCCACGATCTGGCCTACT ACAC 

6 5 AGATGAGGCAG CACTCTTGGCCCTCTGGCTGCAGAC CTGG CTCCTTAAGGCAC CCAAGTG AATTGCCATGAGT AATT ATG ACTGAT 

TAGCCAAGAGGAGAGACACCACCTTACTTGCTACCTGACATGAATGAAATCTGCGGATC 
TGTGCCGCGCTTTCCC<X3AGTTGGCTCTTATCAGTGGGGAGAA 

TTGAGCTCATCTGGGGAAACAAAGCAGTTGGTATGGAAGCTGGAAGCCTTAAGAGGAAGTACTTTC 
TAGGACACGGCTGGTATGAAGAGCCAACAAACACACAAAGAAAAGCT 

7 0 ACCTGTATTAGGTACCTTTATTATGTATAACAAGAGGAGTTAGCTGATCTTAATGGGAAGTCT 

TACTTCCTGAAGGAGGAAGTTGTATTTGCTAGTTTCTG 
ACAGTCCTTGACAGGGCTATGTTCTTGCCATCAGTCCACATTCACCCT 

C CAGGTTCTATAGGGAAAATTTATGAAGTTGTTCACAATTTTTCATTTGTGATA CCT aagctgggcccat 

TCCTGTGTGGCTGGTGGGAGCAGCCCCTGTGAGCrc-GAGGCCrCAC 
7 5 TCCCTGCCITCTTCCTTTCCCTTGAGACTAGAAATCAAAGGGCrGGGCTGTATGTACCT 
ATCAGACCATAGGTTCAGTCCCCACTACTACCATTT^
CTGCATCCCTCCCTGTGTTTACTCTG^CCTCTGGAGA^ 
TCAGGAGACCCCCAGGTAGATTTTTATTTCTTACTTGGTCCTTTAGACCT^ 
GTTAAACCTATCCATGCATTTGTAAGTACrTTTCAAATTCTCT 
5 ATATGCAATGTTTTAATACATATTTACATTGTATATTGGT 

AAAGACAAAATACAAATTTGCATACTAACGAGAAGGATATACTTGTT^ 
TGAGTATATAGAGCATCTTCAGTGTTTTTCAGAGTTG^ 

AAATGG CAG AAGTAT ATTTAAGGCCATTTTTTAAATTGTTGGAAATTTTGTC CTAATCTCAAGCT AGAATTTACTTAGTGTTTCTT 
TGGTTTTTGGTTGG TTGGTTTGGGATTTATTTGTTTG TTTAGTG AAACTTAAGT CAG CAACAT AAG CAATTAAAGTCAAACATT CT 
1 0 AAT AAAATC CAAAACT ACTAATTGAT ACATG CTGACT AATGATGTGGG AG AAT ATT AT AGGTTTTTGTTT CTGT AATTAAG CCATT 
ATGTTTTTGTTAGAGCAAAAGCCTTCATGTCTAGTAAAAATAGCTCAGTTC 
CTTCAGGGCAGCAGTGACTGGGGATGTGTGATGTATTGGCATGCAGAGTGCAAA^ 
TCAGAGTTGGACATGTTTGATTTTGAACTAGTTCTT^ 

CATGTGAGACACGACTAATAATTTAAGAACTGTGCGTATTTAACATACACAA^ 
1 5 AAAGTCAGTCATGTGTGAATTTCTCAAAATGTTTCAAGAAG^ 

TCTAGTTTAAAGAAATCAACATCrrCATATTCTTCCAAAGAGTCCATTTCC^ 

CCATGAAGTCTTTTTTGGGTTCGAGAGATGGCTTAGAGGTT^ 

CAACCACATGGTGACTCACAACCATCTGTAATGGGATCCGATGCCCTCTTTT 

GTACATAAT AAGTAAAGAAATLTTTTT'l TTT T T T 1 AAAGGTCrrriTCTTCCAATTCTATCAAATAAAGCTGTGAGATTAAGCAAG 
2 0 TCTTAACAATAACTACTGTAAGCCTTAACTCTATAAATACAGATTTCTATGCAAC^ 

ATTTTG CATCG TAGGAT AT ATGGCAACAT C C ATCAATT AAATGTCAATGG CAT AT ACCCAGTTTT G AAGGCCAGAAATG CTTT^ 
CTATTACTAAATAGATGTCCCCTGGAGGATGGAACTGGCTTTGTTAAGAACCTTTGTTCT 

ATCTAGTAGAGTGGGTACTCACTTGCCATAGCAAAG CAAAATGGGTCAC CAGG ATGGCTCAGCAGGT AAAGGGACTTCTTAG AGG A 
CCTTATTGCTGGGGTCACAGTGATTGGAAGGAGGGTACCAGCTCCC^ 

2 5 ACATTCAC7VTAAAGTCAGAAAAAAATGTAAAAGAGTGTTTTCAATATAATGAGT 

AAATAAAAATATATAAATGCCTACAGCATAAAATGTCGATATAAGACCTACAAT 

GTCTTITTAAGTCTTTAATTTTTTGGTCCTTAACTTGTTATTCATTCA 

AAAAAATTATCTTAGGATATAGCTAAACTAAACTTGAAGAAAGTAATGTTTCGTTATCA 

CAAGG CT AGTCTT AAAG C CTGTTGAGTTTCT AT ACATG ATTTAGCTGTTGATGG CCAAG AACATTTTT ACTG AAG T ACCTGTTT AG 

3 0 ACTGAAGACAT ACTTT CT CG AAG ACTCT ATG AGTG AAAATGCT CC C ATCAGAT AGGG CTTG GTTTTGTTTT ATTTTTACAG AGTCT 

CAGTATATAGCCTTC^CTATCCTAGTTCTTAGTATGTAGACCCGGCTC^ 
AAGTGCTGAAACTAATGGTGTACACCACTATGCTGAACCTAAATTTTTTTCT 
TATGCTTTTAATTTGTAAAATAAAATTACAAGTTTTGTATANNNNN^ 
NNNNNNNNNNTACATTTACTGTCTCATGTATATCTGTAGTCCT 

3 5 AGGCAGTGGTGGCACACGCCTTTAGTCCCAGCACTTC^ 

GAGTGAGTTCCAGGACAGTCAC^CTACACAGAGGAAACCCTGTCTCa3GGGGAAAAAA 
TGTGTCIT'TCTGTCTGTCTATAGATCGGACCTAGGATTTTACATATCCTAGACTGTTCTC 
ACGAGTGTCCATGTAACTCATCCCATACTGGCATTTATACTAGTGACTTTTAAGTGACTCGTAACTAAA 
CACAGTAAAAGGAAAGGTAACCAGCAGAGTTCTGGATACAAGGTGGTTATGCAGTTTGG 

4 0 TGAAGATGTTTACAGCTCTAAGTAGAGTGAATCTAAACCAGCTCCTTACCT 

ATTGG CTTTTATCTG ATACTTCTTCATATAATG CTAAGAAGTG AGAT ATTCAAGTAGCCATTAAACTCCACTG<n , GGATGGCCCAT 
CT ATT AG AACT ACAG ATG AG ACAGCAT ACTTTG TT ACTGTTGTCCCTGG AT ACTTG AAC CTG CAC CAC CAAGACAAAAAGG ATT CT 

CACCCATAGCCCCAGCTTGCTGCGCCITCAGTCCTGTAGAGGAAGCACT 

4 5 TGTCGTGTGTATCCTAAATCCTAAACAAAATAAAGTTAAAAACAAAAGCGAAGA 

TATCTCTTGGATAAGCAGATAGATAGGATTCCTCTCTAAGAGGG^GAGGTGCTTTGTGAA 

GG AAGCCCGTGACT AGTCCAGCTT ACAT ACTCATGAAGCTAAATG CTGTTCACTG AGTGGCCT CCTGTTTCT AGCAATGGATGCTA 
CAAAT TTTCTTG AG C CACA.T C CTT AT ATTCTG CAT AGAAAATTTG T AAAT ATTT AT ATTTGATTTTAGGTT AG CTCATTTGTCAAG 
GTCACCAGTT AT ATAT AAT AGAAATCAG CAAG CTATTTTCTATGAAT AACCAATG AATGGG AGGGCTT ATTGGTTACAT A^ 

5 0 ATAAATCCTTGTACTTGTATTAAATATCCCCCTIT^ 

ACTGTAAAAT AGTACATG AGAGC CCTGTGCCAGTGTTTGCTG CAAATCAAAGATGAGAGTTTT AAAAGGGAAGAAAAAT AAATG CT 
TTGAGCCAGGTGGTGGCAGCACATGCCTTTAATCCC^ 

TGGACCACAGAATGAGTTCTAGGCCAGCCAGGGTACCAAAAATATGGATAAGGGCCAGATGTG 
CACTGAGGAGGCTGAGACAGGAAGATTACAGTGTGTTTCAGTTCAGCCTCAG 

5 5 TT AAGGATT AT AG ATTGAGTT ATTT CT CTAGGT AAGT ACATT CAG AATTTG AG T AAG TT AAGAAACAATTI TTGCCTTT AAT CC CA 

GCACTTAGGAGGCAGAGXSCAGGCAGATTTCTGAGTTCGAGGCCAGCCTGGTCrACAGA 
AGAGAAACCCTGTCT CAAAAAAC CAGAGAGAGAG AGAGAG AGAGAG AAAC3^TTTTTTAGAACGC<TTT^ 
CTTTGTATAACCAAATGTATTTTATTTCATAGATTACTATAAGATGAGCCT 
GCAATATTTAAAACTTCTTGAGTGGACATGATGTCACAGTGGAAAATTAAAG^ 
60 TGT AGTAAAACAAAAACTAT ATK3AAATTACCCITAGGCT 

TACTATATATGCAAATGTTTCAGAATCCTAAATCTAAAGCACm 

CTTTAGTAATAAGAAATTCCCTAATATAATCAGTCACATG 

GCCTTGGCTATCCAGAAACTCATTGTAGACCAC^CTGCCTCAAACT 

AAGGTGTGTGCTGCCACCACCACCTGACAGATAAAATAATTTTTTATAATATACCTTATTTA 

6 5 TCTCCCCATTTAGTTCTTGGATGAAATTTCATCTACGGAAACTAAAGAGTCCAGTGGTACGAG 

C^TAATTAACTAAATGAGACATGAAAGTTATAACTCACTGTAGACAGT 
AGTTTTCATTCTGTTCCAGAGCITCCTAAGCGGCTTCAGCC^ 

ACACGCTGACT ACACAGTCCCAGGCTGTT CTGTAGTTAGG AAATGGCT CTGTTTCCTTACCGAGG AGTGGTTGTTACTAGG AATAA 
TGAGATGGAAATAACTTTCCTGGCCCTTTTTATTCTGTATAGCACAATTCT 

7 0 CTCATGGTTAAGAGAGGTAGTGGCTCCTGTAGAGGACCTGGGTTCAAT^ 

TCCAGTTCCAAGGGATCTGATGCCCTOTCAGATTrCCACAGGCAAT^ 

TAAAAGGAGAACTAAG ACAG CCCTGCATTGATAG AAGGACGTCCTGAGTCATCCC CAGACGTCACTG CTGTGCACTAGTAGC CGTG 
CATGGCCXaCGATGGCACCTAACAGTCAGGGTCAGCTTGGT^ 
TCTGTTATGTTGGCAGTGAGATGTATAAGAGAGCTCAAGGAAGAACCCGTAT^ 
7 5 TTGACGAGCAGAGAGCTCACCTACCACAGAOVGCAAGGTAGAGTTGTG 
TGCCGCGCTGATCGCTGCACAGTAAATGGTTCACW

ACATTATGTAGTATATAATAAAACTATAACGTAAAAGCCTC'GTGTAAACC^ 

ACAGTAT AACAAT ACAAAG CAG AAAGG CACAGACTT AACAGGTT AACT AACAAC CACAAG CACTCAG CT AATCTGTT AACTCATGG 
TGCTCAGTAGGTTTTGAAGTCCCATGAAATCTGATGGGTAAATACCAACTAGGAGAGCTTCTCCT 
5 TAAAGGCCTGATACAAGGTACTTTGAGAGAATATAGAGTGCTTCATTGGGAATTCCTGAATAGTAG 
TTTTAAAGAGATGAGGTCTTGACCCAGAACTCTGGGCTTAAGCAGTCTCTTG 
GTGTGCACATGTGCACACACACATATGAGCAGTTTAACCrGAAGGAATGAATAG 
TTTTTACAGTGTGTTGCATTTTAG<5CATCTG 
TCAGTAGGCTTACCTTTGTTTTTAAAGCTGTCTTTGAATA 
1 0 TATGGATACCATAGTTGTGTGATAATGGTTATTTTAATAGCAGAAGCACAATTTA 
AGGCTCTAAACAGTGGGTGGAGTTATAGGACAC^GACAGACAGAT^ 

TT AAACAG CCGAG AT AACAGAGAGAGACTG CTG CTTAAAAAAACAATCAAACAAAAAACCCTAATAATTATAACAACAAT AGTG CA 
ATT AAATAAAATTT AAG ATAAATT AGTCAT CTTCTGTGAAT AAACAAGTT ATTTGTTCCTCAGTT AAAACACT CT AGC CAGG CG 
GTGGTGGTGCACG CCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGCGGATTT^ CTGGTCTACAAAGT 
15 GAGTTTCAGGACAGGGCTACACAAAGAAACCCTGTCTTGAAAAGCCAAAAAAAAAAAAAAAA 
TAATCTAACCCAGCATTCCAAAGCCTGCTAGTTAGAAAGTTGTTG 

TATGTACATATACACATGGAGTTTGCATAGGAATTTCATGGACCTCAGAAGCCTCAAGTGTACCT 
TTCnTCTAGCCAGTGATGCTAAAAAAGGATGGACTTTCACAG 

TTGGTTTTATGTTGGAGGAAATACACAGGATGGTGAGATTATTTGCCAAAGCTTACAGTTCTAAGCATG 
2 0 CTATTTTGATGG ATTTTTTTCC AAATTGG ATT ACAT C CTTTC CTG CTGT CT AAAAAAG AATT CATCGAACAT AAGTTCACATTTTC 

TGATTTCTTGGGGTGGTCATGGGTCATATTACTGCATTTTTAGAGTGCCAAAGGGTAGGTG 
TTTGTTCCCAGTTAAACACCTCAGAAGCTGCGCTTTTGTGTC^ 

GGCTGTTATCTTAAGTTT CCTTTGCTGTCAGTGACGTGTCTCACT CCCATCCCTGTAGCCAGCCTTCATCCCTT ATGGCTAATATA 
CTAGGGATTAAGGTGCAGATACTATAATTGCrAAACTACACCTAAGATCTAA 

2 5 AAATAAAGTGTCCGAGCTTO3TATTTGCTGCTTGCTCAG 

T ATTTTTTG TTG CT ACAG CT ACT ATATT AAATTTT GCT AT AC AAT AAATTTTGAGG TGCCGGG CGTG GTGG CGCACGC CTTT AAT C 
CCAGTGCTTGGGAGGCAGAGGCAGGCGGATTTCTGAGTTCGAGG 
CACAGAGAAACCCTGTCTCAAAAAAAAAAAAATTTTTTT^ 
TTCGTTACAACATACTATGAATATATGTTTCAACTCTTGTTTAA 

3 0 CCATATAATGATCCTTAATTTTCCTCAGAACrTTAAAGTTATAATACTCTGACCTATGGCC^ 

TTACCTCAGAAAATAAGTAGACCCCTAACCTAGTTAGTAGTTAGATGGAAAACT 
GTGCGCATATTTTATTTTTATATATATTGCTCCATTGTTAATAGCT^ 
ATTTCrGATGTGTCTCrCTGGAAACTCTTGTAACATTTCTTTAGTAAGTT^ 
I T lTrcrrA TlVlT CTATTCCTOCCTAACA 

3 5 ACTTGCCAACACTTTTAAGTCCTATGAGTCTAAGCITGTA 

GAGAGATCTTTTGTTTCTAACTCAGATTGTAATGTTATCATGGTC 

CCTAGAGCATCTAACATTTGTAGTATTAGAATTGGTTCCAAATGAATAAGTACAACTTTAGAGTATG^ 

ACAGATTTAAAGAAAGAATATTTTTAAATTATCTGGCCTCCACT 

TTGTCCCCCAGATCATACCTCTGATTCATTCCCTCAGATGTTCT 

4 0 ATTTGGTTACCGCCTTGACAGTTTGCTCATCTTGTC 

TCATGTTATCTCATTGCACTGCCCTGCCACCCAGCTACAGCCTTGCTTCACT 
TTAGTTAACCATCCTGCTATGTCTGAGGCTCCAGCTCCAGCCTGCTGTGCCTGCAGCT 
CATCAGCATTTCCTTCCGACCCTACCTTCTGTCCTCGCCAATGTTATCT 
TTGTTGAATGTTATTTCCACAC^AAGTTCTCTAACTGCCCAATCTAGAGTAACT^ 

4 5 TATCTTCGTCACAGTGTCCATCTCTAGTTAATATTGCTCTAC 

C CTG TTT ATTGTT CTG TCTT AAATT ATT ACAAT AG T ACCTG GCAT AT AG T AGACATTCAAGATG T AT CTTTTT AAAAGT ACATG C C 
ATGTCTGTTT GAATCACCTT AAATTTT AGG CTTTG T A C CAG GAAAGT GT CTGAACAACT ATTTTGTATT AATGT CTTTATTT ACAG 
CAAAAACAAAATCACAATTAACCAGATACTGTTCTTTGTCAAAAAAAA 
ATAGAATGTTTTTAGCTTCTTATATTTCAACCCACACGT^ 

5 0 GTAAAAATGCCATCACAAAATGACAGAAGTCAAAACTGCATTTGTCATTAGCTTCTTCTA 

GCTGTACTGACATGCATCATCTTAGAGGAGGCCCTTTAACAAACTCAAATTCC 

ATTCCATCTGCTCCATAAACTCTCACATTGTCTCTAGCACCTCCTAAGCAAACTT^ 

TGACAGCTGTAAATAGTTTAAACCATGCTTTGGAGCTAGCAAGAT^ 

GCAGTCCACTGTCTC^GACCCATATGGTAGAAGAAGAGAACTGATTTCTGCCAGTTGTCCTATACCTC^ 

5 5 CACGTGCAGGT ACATACATG TG TG CGTTCG CACACACGACACACACACACACATG CACACATATACACAC AT ACACTCAAACAC AT 

ACACACACACT AAAAT AAAATG T AATTTTAAAAAAATTTAAACAAGG CT CATGGTGCTCATGGTGG CACACG CTTTT AAT CCCAGC 
ACTTACAAGGCAGAGGCAGGTGGATCTCTAAATATGAGGCCAGATTGGTATACAGGA 
GTCTGGAAGGAAGGAAGGAGAGAGAGAGGGGAGAGGGAGAAAAGAGAGAAACTTAAGAAT^ 
CTCAGTTTCCAGCACCAACATGGCGGCTAGCAGTCATCTGTAACATCAGTCTCAAGATATGGAAAGCCCT 

6 0 GCGGGGGAGAGGGAATTGTTTGTTTTATTTTTTTGAC^ 

CCA(ZATGCATTAGGAATAGAGATGTAGATCAGAGCTAGA£CACTTGCCTA^ 
ATACAGACACTAAACGATGATGTTGTTGCTGTTGCTC 

ACAGAAAGGATTCTACTTGAATAGTCATATTCACTGTGGTCATAGCTCCTGAGCCT 
ACATTAGTTCTTTCAGCTGCCACATAATTAGTGTTGCAAATTTGTAATAGCATAGAAAATAATTGT 

6 5 GATGCTTTTCCTTGAAAATATTTTCATTTATG 

CTGGAGAGATGGCTCAGCGGGTAAGAGCACTGGCTGCrCTTCCAGAGGTCCTGAG^ 
AACCATCTGTAATGAGATCCGATGCCCTCTTCTGGGGTGTCTGAAGACA 
TTTTrAAAAAAATTCTCGTTCCATATTATACATAAACCATCCAGAAGTCT 
TAGCCTTGCAGAATTAACATCTCATCTTTATC^TTATTACAGTAA 

7 0 CAATT TACACT ATTCCAGTAAAAAAT ATT CTTG CTGTGGAAAAATTGG AAG AAGGCTCTTTCIAACAAG AAAAATGTAAGTTG CATA 

AATATTGAAGGATTTTAAGTCATCAGGTGTACGTAATTTTGGTA 

TCAGCTGGTGGAGGACCCTCCAGTCCCCAGGATGACATCTCTGTTCCTTAGCATCTCCT 

CCTG CAT CTG AAATT ACCTTTTTG TTCIT ATGCAGAG^ C 
TTAG CCC CATT GG CTAAGATGATCTTTGGT ACTGAC CTG CTT ACTG CAC CAATT CATTC CGACACATTAG CAG ACATTTCTTAG C C 
7 5 AAGCAATCCAGGTCTCCTAAGTGCTTTTTCriTTTGTA 
AATAAAGATGTTCCAAGTGATACATACAGAGAAAACGCTGTACATCCAJ^
TCCTGTGCAGGGTGAGCAGATGCAACGACAACAGGCTCAGCTCTTTCCACCCCT 
CAGGAGACGAGTGAAAGCACACCAGGCTGCAAGCCGTGCACC^CGTGAGTAACT 
TGACAGACGCTATGTAGAAGTAGCTGGACAGAGGCCGCACGGTGCTGAAGACATTTCrATAA^ 
5 GTTGCCCAGTGGTAGAGTGTCTCTCTCACCTGACACACACAGGGGCTTTGGTTT 
AGTAAGTAAATTTCTATAGACACAACTGTAAGTTAGTGGTC 

ATATATATGTTTATTGGGACTGGGGAGCAGTTAGTGCTTGTTGTTCTTGTAGAAAG 

G ACACCAGCTCCAGAG CACAGATCAGATACCGTCTTCTAGCCTCTGCAGAGGCATGTATAGG CACGCACGCATGCGCACACACACA 
CACAGATATACATATATACATTAAAAATATT TT TT ATG ATTTTACTT GAAAATCriTTTCAAG ACrTTG AGTC CTT AACAAC CAG AC 
CTTGTGATCCTGTTG

MOUSE SEQUENCE - mRNA 

ATGGCGGCGGCAGCGCCTGCTGCGGCGGCGGCTTCTCCCGAGGCGCCGGCCCTCT 

GGACAGTCGTGAGGTCCGAGTGCTGCAGAGCCTGCGCGGCAGGATCTATGAAGCAAAA
AAATGCGCGATTGTTTCTGCACTATAAATTTGGACCAGG

TTCAGTGAAGAATTTTATTTTGAGATTCCAAGAACTTTCCA^ 

TCTTCGTATAGGAAAAGTAGCCATCAAAAAAGAAGACTTGTGCAGTCA^ 

ACTaVAATTCAGAAGTTCAGGGAAAGGTTCACCTTGAGTTAAGAOTGAATG 

CTTGTTGTACACATCAAGGCCTGCCATGGGTTGCCT 
CCCCTCTAGGAATGACCAAAAGAAGACAAAAGTAAAGAAGAAAACAAGCAATCC^

CCAGATCCAGCAGTTACAGCAGGAAGTCCCAGTTCC^^

AATAATGAAAACCTAGTCCAAGATGTTTTCCTGGGAGAGATTAAGGTTCCTGTCAAT 

CTGGTACTTGCTACAGCCAAGAGACAATGGAAACAAATCATCTAAAACCGATGACCTGGGATCT 

CAGAAGACTGCGTGCTTCCATCAGAGTACTATGGACCGCTGAAAACTT^ 

TCAGCTGCTTATATTTTGGGTGAAATATGTCAAGATCAAAAGGATGCTGTTTTG

ACTTGTCCCTTTTATCACTGCGGTGGCCGAGCTGGACTTGAAGGATACT
CTACCCAGTGTCTGACTGAGATGATGAAGATAGTGGGAGGGCACTACCTGAAAGTCACGCTC
GAATCCTCGAAATCCTGTGAAATAGATCCTGTTAAGTTGAAAGAGGGAGATAATGTAGAGAATAATAAGGAG 
TGTGGACAAAGTATTCAATACAATCGTAGGATCAAGTGTGAGCTGCCCCACTGTGATGTGTC 

TGGCTGCTAAGAAATTTCCTAATCACCCTCATGTTCAGTACTCTC

ATACT CT CACCT CATGCTTTT CATTTGCGAC CT CATT AT CCAG AT ACACAG ACAGTT AGG ACATTG ACT CT CAT CTCAAAAACCAT 

TCAGATTATTGGAAACTGGGGATGTCAGTCCAGAAAAAAGTCACGGTTCAATUV^ 

AAGAAGAAAGATATTTTACAGATGTGAAAAAGTTCTTGGATGAAAT^ 

CCTGTGCACCTGAAAGAAGGTGAGATGTATAAGAGAGCTCAAGGAAGAACCCGTATTGGAAA

CTGCTTGACGAGCAGAGAGCTCACCTACCACAGACAGC^

TGGAAAAATTGGAAGAAGGCTCTTTCAACAAGAAAAATATGTTCCAAG 

AACTGTGTGGAAGCCAATGAGTGG ATAG ATGTC CTGTGCAGGGTG AG C AG ATGCAACCACAACAGGCTCAGCTCTTTCCAC CCCTC 
AGCGTATCTCAACGGAAACTGGCTCTGCTGCCAGGAGACGAGTGAAAGCACACC^ 

CAGACATCCAGATAGATATCGATGAAGACAGAG AAACAGAAAGAATTTATT CTATTTTTACC CTCAGTTTACTTAAGCTGCAGAAA 

ATGGAAGAGACCTGTGGGTCTATAGCTGTGTATCAGGGGCCACAGAAAGAACCTGGTTATTCTAAGTTCACCATTGAGGATTCTGT

TGCAACCTTTAAGACAATTCAGCAAATCAAAAGCACCATAG
GTAGTGCAAAATACGGAAGCAAGGAAAATCCAATTGTGGGAAAAATATCATAG 

MOUSE SEQUENCE - CODING 
ATGGCGGCGGCAGCCCCTGCTGCGGCGGCGGCTTCTCCCGA

GGACAGTCGTGAGGTCCGAGTGCTGCAGAGCCTGCGCGGCAGGATCTATGAAGCAAAAAATTTATTGCCATATCTTGGACCCAACA

AAATGCGCGATTGTTTCTGCACTATAAATTTGGACCAGGAAGAA^

TTCAGTGAAGAATTTTATTTTGAGATTCCAAGAACTTTCCAGTAT^

TCTTCGTATAGGAAAAGTAGCCATCAAAAAAGAAGACTTGTGCAGTCATA 

ACTCAAATTGAGAAGTTCAGGGAAAGGTTCACCTTGAGT^

CTTGTTGTACACATCAAGGCCTGCCATGGGTTGCCTCTCATAAACGGACA^ 

CCCCTCTAGGAATGACCAAAAGAAGACAAAAGTAAAGAAGAAAACAAGCAATCCACAGrrCAA 

CCAGATCCAGCAGTTACAGCAGGAAGTCCCAGTTCCAGGTGGAGGAGGAGGACATTGAGAAGCTG^ 

AATAATG AAAACCTAGTCCAAGATGTTTTCCTGGGAG AGATTAAGGTTCCTGT CAATGTGTT AAGAAGTGATT CCTTTCATCAAGC 

CTGGTACTTGCTACAGCCAAGAGACAATGGAAACAAATCATCTAAAACCGATGACCTGGGATCTCTTCTGTTAACTCTCTGCTATA

CAGAAGACTGCGTGCTTCCATCAGAGTACTATGGACCGCTGAAAACTTTGCTGCTAAAGTC^
TCAGCTGCTTATATTTTGGGTGAAATATGTCAAGATCAAAAGGATG 

ACTTGTCCCTTTTATCACTGCGGTGGCCGAGCTGGACTTGAAGGATACTCCAGATCAATATTTAGAGGAAACTCCCTGG

CTACCCAGTGTCTGACTGAGATGATGAAGATAGTGGGAGGGCACTACCTGAAAGTC^^ 

GAATCCTCGAAATCCTGTGAAATAGATCCTGTTAAGTTGAAAGAGGGAGATAATGTAGAGAATAATAAGGAGAATCTGTACT

TGTGGACAAAGTATTCAATACAATCGTAGGATCAAGTGTGAGCTGC

TGGCTGCTAAGAAATTTCCTAATCACCCTCATGTTCAGTACTCTGCAGTGAGCAGCTTTGTCTTCCTTCGTTT
ATACTCTCACCTCATGCTTTTCATTTGCGACCTCATTATC

TCAGATTATTGGAAACTGGGGATGTCAGTCCAGAAAAAAGTCACGGTTCAAAAACT
AAGAAGAAAGATATTTTACAGATGTGAAAAAGTTCTTGGATGAAATTTCATCTACGGAAACTA

CCTGTGCACCTGAAAGAAGGTGAGATGTATAAGAGAGCTCAAGGAAGAACCCGTATTGGAAAGAAG 
CTGCTTGACGAGCAGAGAGCTCACCTACCACAGACAGCAAGGCAAAGATGCAATTTAC^ 
TGGAAAAATTGGAAGAAGGCTCTTTCAACAAGAAAAATATGTTCCAAGTG
AACTGTGTGGAAGCCAATGAGTGGATAGATGTCCTGTGCAG^

AGCGTATCTCAACGGAAACTGGCTCTGCTGCCAGGAGACGAGTGAAAGCACACC^

CAGACATCCAGATAGATATCGATGAAGACAGAGAAACAGAAAGAATTTATTCTATTTTTACCCT 

TGCAACCTTTAAGACAATTCAGCAAATCAAAAGCACCATAGAGAAGCT 
GTAGTG CAAAAT ACGG AAGCAAGGAAAAT C CAATTGTGGG AAAAAT ATCAT AG 
HUMAN SEQUENCE - GENOMIC

CTTCCTTTTCCTGATCCTGCCCAAAGTGAGGAGGTTTGTC 

GGACCCTGGGAGGAGTTCACCTTATCGGTTGCCTGACACC^CCCCAGAAGTGGCT 
GCITGCAGGGAAAGGGGTGAAATATAATTTCCTGAGCITCCCCCAA 
5 CT CCAG AGAAG AGG CGTGGAAAGCAG CTG AGGTGGTAG CCAC CAGG CTT CCAGG CACTGG C CATCACAAGGGAAAGGTGAATCAG C 

AG CGAGG CTGGGG AGAG AT ATTCTAG TGAAGAG CCAGATGGAGGATGT AGGGTCTGAGAGG CAGC CAGAG TGGAGTCTGAGTCAT C 
AGGG AAAACAAGCAGG CGCTG CCAAG CCAGG AGGAAC CAAATCACAGGT ACT AT AGAT ACAGCCT CAAT AXJCAGG C CCTCACACAA 
CTCCCAGGATTGAGCCCGTCTCACCCTCAAAAGAAGTATCCAGAAAGTATCCAGA 

1 0 CAC CATCTCAAAAAAAT AT AAAAAAAG AACG CAGATAT CTAAGAGGCTG AG CATTATTTACCTAAATGGATGGACCATTTATATAA 

TGTCTATTT AC^TCGCTG ACAC CTGT AATTC CAGTACTTTGGGAGGT CAAGG CAG AAGG ATTG CTT AAG C CCAGG AATTTGAGACC 

TGCTTGGGCAACAAAGCAAGACCCCCATCTCAACAAAAAATAAAAATAAAAATAAATG^ 

GAGGCCAAGATGGGAGGATCCCTTGAGCCCAGGAATTCGAGGTTG 

AGAGTGAGACCGTGTCTCAAAACAAACAAACAAACAAACTC^TGCATC^ 
1 5 AAGGAAGAGAAGAACAATTTTAGAGAAATGAAGATGCTACATCTTCTTT 

TTCACACATACTAGTGTTGCCAGTTTTTCTTGTCCTCCTCCC 

CCACCCACACCCAACTGTAGAGAAGAGCGTGGAGGGTCCAGACCGGATCTTTGGGACIT1T 

AGGAAT AGAAATG ACAT AAGCCACAT GGGT ACATTCT AGGGAG AAAAGAAT CAAAG CCAAG CTTTT CTTG CAAGAAG CAACTGAGA 
GTGGGGGCCA1TGATGTGCTCAGGGGCATCAAGTGGGAGCAGGAGGACAGTCAG 
2 0 TGTGCT ACT CTGGGG AAAGAGGGTGAGGGATCTTGGTTTTGTTGGATATTGG AG AAG AT AAGATGAAA^ 
GCTCCAGCCTCKJTGCTGTGACACACACCTATGATGCCAGCT 

ACTCACCTAGGCAACATAGCGAAACCCCATCTGAAAAAGAAGAAGAAAGAAGAAGAAGAAGAAGAAGAAG 

AGAAGAAGAGGAAGAAGAAGAGGAAGAAGAAGAGGAAGAAGAAGAGGAAGAAGAAGAAG 

AAGAAGAAGAAGAAGAAGGGGAGGAGGAGGAGGAAGAGGAAGAGGAAGAAGAAGAAGAAGAAGATGAGGAGG 

2 5 G AGG AAGAGGAAGAAGAAG AAGAAG AACTG CT CTAGATT ATATA CTTGT AT CTCATTTTTT AACAGTAGAATT ACT AT CAGGACTT 

TGCTTCTTTGAACTCTACCCACTCAGGCACTGAACAACrGCAGACCT 
CAAAAGATGTATTAACATACAGCAGAGATTCTCAACGGGGCGGTGA 
TTTGATAGTCACAACTGGAATATGCTGGTGGCATCTAGTGGATAGAGTCTAGGGTGCT 
TCCCCAACGAGAAAGAATTACCCATCCTAGCCAAAATGTCAGTAGTGCCACAGT^ 

3 0 GGCAGTTGTTTCCTTACAGCTTCCCTTTCCAATGACAAAATAAAAG 

TATTCATTCTTAATCAGCACT ACAT ACAT AT AATAATCTCAGCCAAACCATAT 
AATTCCCATTGAAGAATTCCAGATCCTGTGGAAAAAACGGAGTTGAATAA^ 
TATTTGTATTTTCAATATGAAATATGAAAATATAATAAAACTAGCACAGCAAATAAAA 
ATGGGTAATC^GGTTACrCTCTATGCTTTGCTC^TTCTGAGAGAATCCTCAGAATCT 

3 5 AGTTTAGCTGAGGTGTTAGTACGTTTGTTTTGCCTCATTGCAGATAAGACATTCACr 

GGGGACAGTGGTATATTTGGCCTTCCATACAGATGCTCTATAAATGCCTCTC^ATTTT 
AACAGCCCTTCCCAGTTCTATCTCTGCACAGACAGCCCAAGAATTCAGTCCT 
CACAAACTAGATGTTTTACATTCTGGTGGCCATGTCGAGTAATTATT^ 
GTC^GTTTTCCAAAATTCAGTTATATGATTATAAAATAAGTACCAATG 

4 0 TT ATCTTT AT ATATGTG ATAAGGATTG AGGGG TTCACTCC CT CATT C CATC C C C AATCTTG CCCTCTTTT AG ATGAAAGGCAGG AG 

GC7U3TTGCCTAGGAGAGGGCAACCAATAGCCCTGGCCTAGGCAGGTCAATGCTC 
AGGGCTAGGATACCAGGCCGGGCTGTTCCTAAAGACTCTGAGCCCA^ 

TTGCCAACTTTGCTGAAAATGTGGAAGCCCCGCGGGTTCTGAAGGATAACGCTCTCGGCTTGGG 

AT CATAAAGTGGTGAG<MTCCCTTTTTGTGACAGTGGCAGGGGGAGCCAGAG CAGG AG AGGAAGAAT AT AACAAATCCAGCAACCC 

4 5 CATCATGTTCCCCACACCCTACTCTGCCAATGTAACCCACTGGGCACTCCTGAAAATCAATAATTGG 

GGCAGGTAGAAAGTGGAGACACAATAGAGAACTGCGGATTATTTATGTTAACAGGATT^ 
CCTAGATATTTGGTCATAATAACAAGTATGAAAACACCTTGCAACTTTGAA^ 

AATTGTATCTAGCCTACGTTTAGAACACAGATTATGATCGATGAGCTTCTGCTCTCCTTGGCTGTCT 
TGTGACAGACGCAGCCCCAGCCCAGTGTGAGGATTGGAAACCTCAGACCCT 

5 0 GAGCTATTATTGCCTTTTCTCTGCACCTAAGCTTTCTGCAAAG C CCAG ACCCAG CC AGCTGGTTCTGTGGTCTGCTCTG ATTTCCC 

TTGTGGATTTTAAATTCAGAATTCTAGGCTCATTTGTTAGCAGCTTCT 

TCGTTCATTTATCAAACATTTAATAAGTGCCTTCTGTGTTCTTCTATATTTTCT 

CAGTCTTTGAGATGGTTTCAGACAGGTAAAC^AACAGGAATAGTAAAT 

TACATGGT ACAT AATG^CCTAAAGGAAAAAGAGGCCAGTT CTG TGAGACAAAGTCAGGAGGTGA^ 

5 5 AAATATTTTCATTTGAACTTAGGGACTATTATTGGAAGATAAG 

TACTGCTCCCCTTTCTGTGT'CACTAGAAAACACTGGTTTGGTTCA 
TTCATCCCCAGACCCAGGGGTATCAATGTTGCTTAGACTAAGAAAATTAT^ 
ACCTCATGGTGGTGATTTACAGGAAAGCTACCTGGAAAAGCGTTCCTCCCT^ 
CCACTGAAAGTAATGAAAAAGACATTTTAGAAAAAAAAAAATACACCTTCCTGCCATTGGATGTTGA 
60 GCCATATTGTGACCATGAGGCAAAAGCCAAAAGAATTGCAAAGAAGCTAACCCAATAAAGTCCTGTTTATGAATGAATA 
G AAAATG AAACAAGTG AATGG CTG CACAACATTTCCAGTTTTCTTTTTACACCT 
TTTTGAGTTTTGGGGGTTTTTGTTGTTTTTGTTTTTTGATTTGT^ 
TCTTGAACTCCTGGGCTCAAGTGATCCACCCAACTCGGCCTCTGAAAGTTCT 
CACCCATTGTTTTGTTTTATCACTAAAATGACAAGAATGCACTGCAGATG^AATGGATA 

6 5 TTGAGAAATAACTGAAGTCACACTGACCTAAGGGCATCAGCTGTATTGTGCTCAGATATTCAGGCCTT^ 

GGTGACAGAATTTGAAAAAAAAAAAATTCTATTTTTAGAGAGAG 
ATAGCTCACTGCAGCCTCAAATTCCTGAGCn'CAAACAATCCrCCCACCTCAGCCT 
GCCACCATGCCCAGCTAATTTATTTATTTATATTTGTAGAGACAGAGTCTCACGGTGTTC 
TCAGGTGATCCTCCTGTCAAAGTTGGTATTACAACCGACTGTTGGAT^ 

7 0 GTACCGAAGGAGTTGGAGGTTTTAAGGTAAAAATTTGT^ 

AATGAAATGTTGAAAGTATGTTTTCTGAACATTTAAGGTCTCTCCTCACCTC 

TGGATTT AGAG ATTTTTTTTT AAT AATGT AATT AGTG CAT AATG CTTAACATCAAACAGT ACTCTGATGGGG CAAATGGGGAAACC 
AAGTCATCAATGATCAAAGGAACTTGCTTTCCCACCAGTCCGGCAGA 

ATCTACATTTAAAT ATTAAAAAGAAAG CACATTT CTCTT CCTCCT ACTCGC CAG AAAAAATGT AAT AAAT AACTC^ 
7 5 ACAACCCCTTACAATGCCATGTTATAGGTTGGTGGGTGCTGA 
GCTGCAGGTTATAGTCTTCTTATTAAGAGACAATGTAG

GACTGAATGAGTTTAAAACTGGTGAAGAAATAGGAAAAGACATAGTAGTGAATGGTTGCT 

GAAGGGATGG CCTTTAAACACATT CATT AAT AATTTGAGGAAGGAAG TGAAGTTCAAG TTTGTGATGATGCAAT ATTGTT C CAGTT 
ACT AAACATCATGAAGAAAAG AG AGGAACTCT AG ATG AATTTAAAACAT AATG CAAGGG CAAT AAGGGGAATG AG ATGT AACAGC C 
5 TTCTCGTAATTATACTGTATGTCTTGTTAGTCTTTAGAAAACITTATTTrC 

ACTTT AGGAGG CTGAGGCTGGTGG ATCACGAGGT CAGG AGAT CAAG AC CACAGTGAAACCACG T CTCTACT AAAAACACAAAAAAA 
ATTAGCCGGGCGCAGTGGCGGGCGCCTGTAGTCCGAGCTACTAGGAAGGCTGAG^ 
CTTGCAGTGAGCCGAGATCXSCGCCACTGCATTCCAGCCTGGGCAACAGAGCA^ 
AGAAAGAAAGAAAACTTTATTTTCCAAATATTTTATATTAAGGAAACATTGA 
1 0 CAGTAAAAAC^GAATAATTAGAAGTGACTTCXMATGAAAAAGCAATGGAGTTTGGG 

CTGCAAAAGG AAGACTTGT CTGGAAG AACAAAAAAGTGG AGTGT CC CTTT ATGTAAAATTG TTTT CAT ACCTTT AGTG ACCATTT AG 

TAACAGAGTTATGCACTTGGGGTTAAGCTAGTGAAAAGGAAGGGAAGTTAG^ 

TTTGTCCIATTGGATACGTTTGTCCATTGGATACATT^ 

TCAAATAATTTCTCTGTTTATTGGA^AAAGAAATAGATGGATGAGGCATCGOT 
1 5 GTG AG AAAAGGGG CT CAGTGTGGCTGG AG AT CAG AGGAGT CAGGTACG CTGG ATCCAGTT C CAACTCTTGG CTGAAG AATTTGAAC 
TTTAGTCATTTCTAGCATTGTrCATTACCrGTAGTGTGCTCTTC 
GAGCAGGCAACTGATAAGCTCCATTTTTTTTGTTAGGAAA 

ATTACCAGAGTTCCAGTCAAAAGCATGAGGTCAGTTACrrGAGGTAATCCTACTACGTGCT 
CTGCTCCCAATACTTATTTAATCTTAATAACAACCCIT 
2 0 CAACCGGGCGCAGTGG CTTACACCCAGCACrTTGGG AGGCTGACACGGATG^ CAAG ACCAGCCTGG 

GCAACATGGCAAAACCCCATCTCTACCAAAAATACIAAAAATTA^ 

GGCTG ACATGGG AGGATAGCTTGAGC CCAGGAAGTTGAAGCTG CAGTGAGCTG AGATCGTGCC^CrGCACTCCAGCCTAGG CAACA 

GAGCAAGACTGTCAAAAAAAAAAAAAAAAAAAAAAAGAAAGAAAGAAAGAGAAGAAAC^ 

TTGCCCAAATGTAAAAGGGGTAGAAAO^GCTCTATGTCTGATTCTA 

2 5 T AACACGTTGAT ACTCTTT AAATG AG AAT AATTTT AACAAGG GCAATGACAG AG AATGG AAAGG AAGGAAG AGATTTCAG CAG AGT 

CAGAAGGATGTGGCAACTGAAAACAGCATTGGGACAAGGGATAAGAAAAAGTC^ 
GAACTCAGATTTGGAAAGGGAGACAGGAAATTAAGTTTTGGACITATGTT^ 
GTTCACITTATCCAACAAGTGTAAAGATCACATGGGGAAATAGAAGTCACCTGCAAC^ 
GGTCCATGTTAAACTCTGACCATTCTCTGGGGCTGTCCAGCCTT^ 

3 0 TCCTATTTCTTATCTCCCAGTGCCTCTGGCCrTTTCT 

T AGTCTCTG AGGTCTGAGG CCATCACTATGTCTGACCCTCAAAAAAT^ CTTATTCCTGTTGTTGCCCTGG 
AAAAGAATTACCCTGCCCGATTAAGTCACAGTTTCTTCCTTC 
CCTCATCACATACCAGGGTATTATTCATGCAGTTTACTCACTC 
TACACATTTCCTAGGGTCAGTCAACTTTTTAACAAAGCCZ7A 

TGGGGGTGGCTGGTCCTTTTCTTGAACCAGGTTTTGC^ 

CTGGCGGGG AGT ATTTG GAT AAT ACTGGAAG AG C CTC AAATG CATCTCTTG CCTC CTTTTG AAT AAG AT ACTTTTG AGGT CCAC C C 

ACCTGCCAAATTCCATTCTATTTTCITCCCACAGGGAGTTTC 

TGCAGGTCTATTCATT'GGGGGAGCCCAATTCCCGGGAGGATTTAGG^ 

4 0 AAGGCCGCTCAGAGCCACGACTGGGAGCTACCCrTAGCCAGCGCCC 

GGGGTGGCGAAGGCGGTGAATGGAGACCCGGGGGGGGOXSGGGCGGGGCCGC^ 
GGCCCGGGGGCGGGGCCAGGCGCGCGGCTGGCCTGGGCCTAGGCCGCT^ 

CTG CGGC ACGGGCCGGGCGGCACCATGGCGGCGGCGGCGCCTGCTG CTGCGGCGG CTTCTT CCGAGG CGCCAG CGGCGAGTGCGAC 
TGCAGAGCCCGAGGCCGGGGACCAGGACAGTCGCGAGGTTCGAGTGTTGCAGAGCCTGCGGGGCAAGATCTGTAAGCGGGGGCTGG 
4 5 GCTGAGGGGACGCCCTGGCGGCX5CTGGGCGCGAGGCTGAGGGGGCGGC 

CGCGGGCCCGGGAGGGGGCCTGGGCaSGGCGCGGTTGAGGCTGGAAGGGGGTGTGTTGG 
GTGGGCCGGGGCTGCTGCTCTCCGGAGGGAGTGGGAGCTGGAGATGGGCCTGGGGTGGGAGAGCCCG^ 



CGGCCGCCCTCAGACTCAGGCCTTCGCGCGGCTGCGGGCGCTGGGGAGTGGCCCCGCGGTC 
5 0 GCGCGGCCCGAGGAGGTGGCGGGCTGGGAAGTGGGGCTGGCCCGAGGGGAGGGGGCTC^ 

AGAGGTTGAAAAAGGCGCCCTGAGCTGCAGGAAACGGGCCGCCCTGGGGCCCGAGTGGCTACCGCGCGGTCTGTTCGCGG 
GCCCCGGGAGGAGCTCGGCGGGACACCTGTACTTGCCCCACCTCGCTGC^ 
GACTAAAGGCAGGATGTAGTGGCTTCCTGTGAGCAGGTCTCAAAACACAATCA^ 
CTTCTCATGCGTGTAATTGCTACTTTAGTG^ 

CTGTGGACAGATTATTTGAGTTCTGTCATTTGAAC!ATTT^

ACAGTGGTTTATTTAGATGAAGTG LTl T' l ' l ' bT I TTCTGTTTGTAAGAGAGATCAATAACATTTATTTTTGCATTTGTCGAAAATTA 
GCATTATTTTCTTCATGCAGTATTCT^GATTGGAAACATCC^ 

TCACTTTGAAG AAAATTG ACTTG CATT AAAGTGG CTAACAATTCTTTCCTGGGCAGGATGTAAAATTTT CCTCTCCTCTAATACCA 
GT ACTGTTG AG CTCACATTCTCCCACTTTTCCTCTTTTCAGGTGGTTCACGTATTTGGG AT^ 

GTTAACTTTTCTTATCTTTTTATTCCCTGAGG

GAGAGGTAGATCTTAGTTCCCAAAATTAAAGTTACTTTCTAGGGCATAA^ 
TTCTTTTTTCTGTAACCTTATATTTGAGGGGAAAATTTTATTT^ 
AATGGGCCAAAGTTTCTCCCTTTATATGATTTTCCAGATTTTTACCAC^ 
AGACTCACTGGTACGTAACTGCAGGTTTTACCATGGAACTACATATACACATGTCTTGGAATTG 

CTTAGTTGTCCTGTGCTTTTGTCTGCCCCATGCCAAAGA

AAAAGAAAAACAAAAAACCACTTTGTTTCTATGGTTTAAAGC^ CTTGG AAATTG AT CATCCAGCTG CTGGGAATGATAT 

TACTGCTTTAACCACAGTTGAAGTATTCTATCTGTTATCCTAGTC^ 
ATGGAGTGTCTGCITTCTAGAAAAAAAAAATGTATTTGTATACTTATATACAGAAGCTCT 
AATGATGTAATCCACAGTTGGTTGACIATTCAGAACCTTGTGAAG 

GTAGGGTGGGGACTCAGGGGTTGGAAGAGTCATTAGCAGGTGTC^

AAGCAGCTTGGGGAGGGTGTAACCGTGTCAGCTTTTCTCTTCACCACA^ 
GGAGATGTGCTGGCAGATTTTATACTACTGATTGAGAGCCTCTTTTCTGGAGAGTT 
AAGAAATTTTCTTATTTCTGGACTTGCATAGGAC^ 
TCTTGAATGTTCAAGAGATAATAGTCCATAACCTTCTTTAGGGATTTC^ 
AATATCTGTCAGTTGGACAGATATTTGTCTCTTGGACAA^
ATAACAACATGAGACCTCAATATATATAAAG^

CGGGGGGCTGGGGAG^AGGGAAGAAGGGACATGCAAATATGTTTCAGGTC 
ATCCATCTATAAGACAAAATATTGTGAGCAACCCTGGATTTG 
TTCCITAAACTTTTTCAAGAGGTGAGTITGGTTGTTTGAT 
TACTCGACTCCATTCTTCGGTCTGCACTCCTC

CTCCCCTCAAAAAAAAAAAAAAAAAAAAAACCGTATTCAAACAGAAAAGAGGGA 

TCATGCTGTTATTTGX5TGACCTCCTTTTTGATTAGCACTTGTTA 

GGAGAGCACCCATGCCACCATATGTGTGGGCTTATCTACTCAAC^A^ 

TGAATTTTCAG ATGTGTGGT ACAGTGCGAGAT CTGTGGG ACTATGTGTTCCAAAACTCACTT AGTTT CTCTGGTGTCTTCAGACCA 
AGAATAGCCTAGGGTTATTTGACAGAGTAGGGAAGATCTATAGGGTATGGTTAAGTGCTTGAG

AGTT ACCTGGGTTTGAATCTT CACCT AATTTC CAAClTiTGTT AATGTCTTTGGG C CTTCATATT CT CAT AT AT AAAATGGGG AT A 
GTAAAAATACCT ATCTCACGGGTTGTTATGAGGTTTAAAT AAGCCTCCCTCCCTACTCT CTCTGCCTGCTCTTT 
CTCACTCCCCAGGATCTGGGTCATACACCG AC CATCTCTTAGGAAGGAGTAAAGT AGGAG ATAAAAACTAATCTTCAT CTGATGTT 
CTOGTCCAACGTATACTGGAGTTTCCAGGTGT^ 
AT CTT CAT ATGGCG TTAG C CAGCT CACTTATCITGAAGCTT AC CIT C GATGCTTTTCCACT AT CCA

TTGTATACAGGTTGTTCCCTCTGCTTGGGATTGTCTTTCTTCTT^ 

TCAAAAGCCTG CTCTATGTGATCTCCTTCCCAAAACCTTCTGAGACACIT^ CTGCTGGG 
GAACACAATTCCTTGCTCATTTCCTTATTATAGCIACATTTCA 
CAAGATAGAATTGATTCTTGTTCATTTTTAAAAAATC^ 
CCCTACCTCACAATGTTATCCTGTGGATAATTACTAGATATGA

AGTATTAAGAGAAAACTGTATGAGTCTGCATCAGGAAGTTCATC^ 

ATGCCAAAGTTGCITTGTTAGGAAAAAGATTAAATTTTTCTTAATGCTCTGATTACT 

AATTGGTCCTGGAGCTTGGAAAAGAAAATCTCATACT 

AACCGTCTATTTAATTCTGTGAAGTATAAATGAAAACATTTTATTGACTATCCAGTAAAGT^ 

TATTCATGCTAAGAGTTCATTTTTATCAGATATTTGAAATGAGATGATGAGTTATTTAGGATGG

GGATAGAAACTGGGGAGAAAAGGGTTCTTTGTAAAAGTTCACAGAGCAGTCT 
ACCTCAGTGAGTTCATAGAGTTGTTTTITTAAAA 

CATC ATTTT AT CCTTT AAT CATAG CTGCT AAT C CT ATTTC CTT G TT AATGTG CTTTATG AAATTATTTG ACAAT AAT ATCZAG ATTA 
CATGTTTTTGCTACTAATAAAGTTCTATTCrrCTTTTGAGAACTGC^ 

TTTTTATGTGTATTAAGACATTTTTATGGCATCTCTTTAGGTT^

AACATTAAAAGTTTGTATCAGTACACATACAGAGA 

AAATTCTTT AACCTTGCTG AG C C CGTATTTCCTTT ACTTGT AAAATT AGAGG TTTGG ATATGGT CTTT AACATC CCTTCT ACCATTT 

TGATTATTGCTTTTTAACTTGAACCATTTGCrTTAAGGCAGTAC^ 

TGTTTTGCATGATTCACTGTTTTAAAAATATTTAAGATTTATTAGATAAG 

AATT AATT CAGCAG CACG T AAT ACAT ATCG AAGGGTCTGTT AAG AG ACG CT AAC C CACTGG ACTG AAATGGGC CT AAG CTGTGAAT

CGGATG AAAGATC CTATTTTATCCT AAAG TCT AT ACT AAG C AAT AG CTG TTTTT AG ACATCTGT ATT CTT AG ACAATT AT AGAAAT 

TTAAAGCTGGAAGGTAGAGAACATGAAGGTAGAGAAAATAAAGGTTCTTACCC^ 

GCAAGTGTCAC!ACAGTTGATGGCCrGTTCGGCACTAGTCCACT 

TACTATGAATGTTGCAAATGTACTACATTTAATACACAGAGATTATACTACTTATCTGTTTATATATTGAGAGGTGGT 

CACTGTGCCGTCTACACACTCTTAACTGTTCCTTCTTAAGAACTTTGTGAAGTGGT^

CTAGGTG^AGATTGAACCTGGATGTTTAGTTACTTGTGCCATGGTCT 
CCTGTGAACTAAAAAAATTATGATGCCCAGACTGTACC^ 

TGATGTTGGGCAGCTAGGATAGTGAATTTCTGCTTTTGTCACAAATAGATTAGC 
GTGCTATCATTTAAAAGGTATCAGTAGAAAGGTTGGTATAGTTACAGTGAGATTTTAGTTTGTTT 

ATTCTATTTGATTAGATAATATGAACAC^GGTACAAAATTCAAGAGGTACAAAACAGTATATACATTGAA

TCATTATCTTCTAGCCATTCCCTCCCAAGAAGCAAGCATTACCAATT^^ 

G CATTTACAAACATAAATTTATGTGCACTAATATATTCCTTC CACCAACATACAAATTATAGC^AAAT ATATACATTTTTCTGTAT 
CTTGCTTTTTTACACTTAACAACATAGCTTGAAG^ 

ATAATATTTCATCATTTGGCTGTACCACAGTTTATTTAGCCAGTCCCCTATTAGTTGACACr 

TACAGACATTACTACAATGAATAACCTTGTAAGCATGTCATTTTTAATATGAGTGTCT

TTAAATTGATGAATTGAAGGATATGTGCATTTTTATTTTGGAAAGACAATGC^ 

TATGACTGCCTATTTCTTTATACCCACGTAGGGTX5TATTTTAAAACTTCTT 

TAATTTGTAGTTTTAATTTGCACTTTTAAATGAGAGTGAGGATTGCTCT 

CCTCCCGGGTTCAAGCGATTCTCCTG CCT CAGCCTCCCGAGTAG CTGGGATT ACAGGCIATGTGCCACCACGCCCAGCTAATTTTTG 

TATTTTTGGTAGAGACGGGGTTTCACCATGTTGGCCAGGAT

AAGTGCTGGGATTACAGCCGTGAACCACTGCGCCTGGCCCTAATTTTTAAATTTTAA 
GGATCTCTTCATATTTTTAAGAGCCATTTTTTTTGATGTGCTATGTGAACTGGTCTATT^ 
TACAGTCTTCTTGATTTGTGAGAAAATGCTTTATGCTGTAAGGAAGTTAGCTATTATTTTAAAAC 
CTGTATACTTTGGCTGATGCCTGAATTTATTCTTAAGTTGATTGTGAGATCCAGTGAGGGCAGTCAG 

ATATAACGTCTGATGGTTGAGAAATTGAGAAAGAACTGCTTGAGGAAAGGTAAGTGAAGACTT ATA^

AGATTGGGAGAAAATATTTGCAGTGTATCCTTCAGACAAAGGAAATGTGTATAACA 
CACTTTATGCGTATTAACTAATTTAATCCTTACAGTAACTCTAAAAATACCATTATTAT 
GATACACAATAGCTAAGCTAAGTAAATTGACCAAGGTTAGATGGCT 
TCTAGAGTCCATCCTGTTAACTCTTAACCTACTGTGCTATTATGCCTCTCTATTTATT 

AGACTTAAAAACAAGAGGCCATTAGAAAAATAAAGGATACGAATAGGCATCTCACAC^AAA

GAAAATATTGGTAAAGTAAAAATTAATCAGGGAAATAAAATTTTAAAACATCAATT^ 
TGGTGAGGGTTTAGGGAAATAAGTATTTTCCTCATTGTTGTTGTGAGTTTAAATTGATA^ 
TATCTGTTAAGAATTTAAATGTCTATACACCATGCATTTTTACTTCTACCTATTTG 
GAGGCATATAO^GATATTCAATGTTGCCTTGTTGGTAAAACAGAATAATTGGAGACGA 

GATAATATGTTCATACACGTATTCTATAGCAACTAAAAAGAATGAG

TGTTGGGTAAAAACAGCAAGTTGTAGAATAATATGTGTAGTATCCT 

TGTAGATGGATAAATACATAAAAAGAGGTCTGAACAGTTGTACAGATAACAGTGATT 

GGTGCTGGGAAAACTGGCTAGGCATGTGTAGAAAGCTGAAACTGGAACCCT 

ATT AAAG ACTT AAATGTTAG ACCT AAAACCGT AAAAAC CCT AG AAG AAAAC CT AGGCAAT AC CATTCAGGACAT AGG CATGGGCAA 
GGACTTCATGTCTAAAACACCAAAAGCAATGGCAACAAAAGCCAAAAT^
CAGCAAAAAAAACTACCATGAGAGTGAACAGGCAACCTACAGAATGG

ATATCCAGAATCTACAAAGAACTCAAACAAATTTACAAGAAAAAAAC^CCC 

TTCTCAAAAGAAGACATTTATGCAGCCAACAGACATATGAAATAATGGl^ 

NlWNNNNNNNNNNNNNNNNKNNNNNNl^^ 
NNNKNNMNNNNNNNNNNNNNNNNNNNNNNNN^

NNWJNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNmiNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNKNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNIWNN^^

NNNNNNNIWNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNKNNN^ 

NNNmiNNNNNNNNNNNNNNKOTJNNNNN^ 
NWCCTATAGGGAGTATATTTATATATTATATTATCAAAAGTAACTATTTATATAAATGAACr^

GTGC CAGTCTCTGTTGCTCT ACTTTGG AAGACT ATC CT ATCTG CAAAACAAGAGT AAT CAAATGTT AGG TTTTG T AACATTTTG AG 

TGCTATAGAAGAAATAAAGTCAACTTTTCTGTTAAGTTTTTTTGG 

GG CTGAGGAGGGTGG ATGGCTTGAGCT CAGG AGTTTG AGAC CAG CCTGGGC CACATGG CAAAACTC CG TTTCTACAAACG AAAAAA 
AAAAAAAAATGCCAGGCATAGTGGTGTGTGTCTTCAGCCCTAGCTATTAATACTTGGAATGCTGAGGCCA 
CAGGAGGTTGAGGCTGCGGTGAGCTGAGGTCGCACCACTGCACTCCAGCCTGGGTGACAAAGTC
AAAAAGTAGAAATAATACAGCAAATTCTTATATACTTTGCCTGTTTGACTAATTGTT^ 
TACATTGTTTCAGAGAAATGTGGTAAAAAGGGTCTGAAAAT1TTGGATCCTTTAA 
TTAGTTTTGAATTITACAGTTTrATATAAAATATTCTTGGTAGATGGGGTC 
TGTTGTGTTAGGATACTTTGCAAAATATTTCATGTAATCCAATTTCATGAAGAA 

AG ACAGAGGTTATTTGCTCAGG ATTGTAAGGG AG C AGAGCCAAGTTTTCAGAAAGTCTTTATGTTACCAAAGCCCACATCCCTTT C

TCTACAAAGCTGCCTGTGAGATGATATATTGGAAGGAA1TGATGAATACATTTATGTTGTCAAG 
TGGCTATGGTCTAATGTAGTGGATATTCAATAAAATTATGATGTTTGAATGCCA 

TCATGAATGTAATTATTTTAGCTGCCTAACTGCTTATGTGGGTTTGTATGTACTTTTGAGTGGTTCTTG 
ACATACACACACACAGATATACACATACACACCTTTATATGCT 

GGAGGGGGGGCGAAGATCAATATGAATCCTATTTTGGCCCCCTGGATGCTC

TGTCCATGTAGCCCTGAGTTCTGCCAGCACCTGGCTTTTT 

GCCTTGCTAAGCTTCTTGTAGCTATAGGTACCATTGGCTTAGGCCAGAAGAGATTCCT 

CACTTGAGGAAGGGGTCTGTAGGGAGCTTCTAGATCAGCACACATTCAAAACTGAAACC 

GATCCTGCCCGCAACTTGGTGACGTATCTATATAACCCATAGACACTTCCTGGCA^ 

GTAGTTTCCTAACTTCTTGTTCTAGATTAAGCTTCAGTGAAAAGTGATTAATATGA

TGTGCACATGACTTCCATGCAGCAACTAGCCCCGATTATGTTCTGTAGACTTTATCACTGA^ 

TACTTTGACTTTGTCTTGTCACT AG CCATTGT CTTACTTT ATGTGTAATTGT ACTT ATAT ATAGTTCAGGTTTT ATACCCACAAAAA 

CCCTTGTGATATAATACCACTATACAGGATTGCCTTATTTATGTGGAATTGTACTT 

ACACACACGAAAAAACAGTTCTTTGCTTCACTGAGTCrrCGGCTAGCCACCCAAACTG 

CAAGTGTTTGCTCACTGGTACTACCTGkTTTTATCATA

GTGTTGATTTTTTTTTTGAGACGGAGTCTCCCTCTGTCACCAGGCTGGAGTGCAG 

CTCCCAGGTTCGACAATTCTCCTGCCTCAGCCTCCCGATTAGCTGGGACTGCAGGTGCACGCCACCACATCCAGCTAA 

TTTTTAGTAGAGGCGGCATTTCGCCATGTTAGCCAGGATGGTCTCGATCTCCT^ 

ATTACAGGCATGAGCCACCGTGCCTGGCCAATGTGTTGTTTTTTTAAAGATTACCAAT 

TTTACAGATACAAGTCATTTGCCAGGAGTGAGGAAAATTGCTTGACAACT

CTCTGCTTAGTGGTAGATATTCCAGCAGTCACCATATGCCTATTGGTACTATTTCTCA 

GTAACTTTCCAGTGGTGTTTTCAAGATGACAGTTAAGGAGCCACTAAGGAAGGAAAT^ 

AGTTCTTGACAGTCCCCCCTCCTTTTTTTGTTTTTGTTTCTT 

AGACTCTTTGGAAGGTCATGTGTGATAAAGACAGTTCTTCTCTTCCTTTCTCCATTCAGAA 

GTTTTTTTCTTGTGTAAGATTTATTATATACAAAATATTTTTTAAGTGTGTGAAAACT^

GTTTAAAGCATGteTTTTCTGCTACTTTTCTACTAATCACTT 

AGTTTTACTAATCATATAGCATAGTTGTGAAAAGGTTAAATGACTCGGTATACTGCCTGGCATTTTGTGCTC^ 
TCTAACATGAGCAGTAGTGATACTCATAAAGTATCTTTTCATAGTTTTCTTAATGGA 
ATTCATACTTACATGTATATCCAGTTCTTGTAGTGTTCATCAGGTTAAACTGGCAATAAGTGTTAATA 
CAGTGTTACTCTGTGCITTTTAAATTTGAAGTGTTC

TAGAGCTATAGAGAACCTAGAAGATTAACTGGCTAAGGAGTTGCAAACTC^ 

AGCAGAGATTATTATGATTTAGTAAGATACACCTGTCTGCAGTTTCCACTCAAAATCATTTTGC 

AACCAGAAATACAAATTTATTrATTGTTAGCTCTCAGTTTCTAGATGGTCA 

ATCTCTGG CTAG CCAACCTCAG ATTTGTATCCTCATTTT AGAGATGTGG AG CTTGAGGT ATAAAGGTTGGACGTGTTTCCATGTAT 

TTCCT ATAATAGAAATCCATTGATT CTGT AACAAAGGACT CATTTTAAACAATG ACCTCCTCTGTTATACCCAGTTGTT AT ATTAC

AGGTTATTTTTGGTCAGGAGCTAATGGCTTGGATGAAGGGAATTGTATATAGATGCTAATTTTTA^ 
G CACTAATAGTTACT ATCCATTGAAT ATTTACTCTGTCTTAGGCA^ 

TTGAAGCATTATAGTTACCTCTTTTTACATAAGAGGTTGAGAGTAAGGACATACAGATAGTTAA^ 
CAGATCTCTGCCTCTXTTAATGCATTCATTTAACTTGCTATACTATACTAGTACT 

TCATATTGCAAAAATTGAAATTAAACTTTAGAAGAAAATTTCAAAGACCTTATG

ATATGGAATT AG ATTGG GACAACAG CATTTTAATTTT ACT CT AAACT AG TCAAT AC CAG AGACT AT AT GAT ATTTTT AAATG TG CA 
TTAGATTCATTCTTTCTCACTCTTTTTTTTTTGTTGTTGTTC 

CCTG AAGTTCTACTGTTCTGTTT AGGTTG ATATTTCT ATT ACTGGTTGAG CATTCCAAATTTCAAAGTCCAAAATTCCAAATG CTC 
CAGG ATG CAAAACTX3 AG CAC CAACATG ATTCTCAAAGG AAATACTCATTG GAGCATTTCAG ATTTT CAGATTTGGGATG CACAACT 

GGTAAGTATATATAATGCAAC^TTTCAAAATGTAAAAACATTTGAAATCTGAAAT^

TATTCGATCTGTATTAGAACATT ATGTCCTGAGTCATTCAAGCCAGTTAATGCCA CTGTGGATTCTGTCTCCATAGTAG CTC 
TTCTATTTTTTCTTTCCTCCTATCTCCCriTTGTTACCTTAA 
CACAO^GAAACTTTTAGCGACCCITGTTGCCTACTGAATAAAGTTAAAC 
CTGTCCTTTATAATCTTTTCCCrCTCATTCCCCCrTCATCTCTCTTTAT^ 
GGTGCTCTTCTTGTTCCCACCCTCCAGAACTTGTTCATGTTATC
TTTTGAAAATATATATCATTGAGAAGAGTTATTGTGAAGATTAAG
CATGTATTAATTTATTCAGTAAGTGGTAGCTATTATTATGATTTGGCTTTCT 
TGTCCAGGACCCTGGCTCTTTGCAGTITTTCTAGGGGATTAAT 
ACCTTTGAAGCAGACCTGGACCTACTACATCTCITTCAGATGGGAATAGTTTCT 
TTGAATAGAAAGACTCTAGGCTTTGGAGTTTGATTTATGCCAGATAACC^
ATGAAAGGGAATAGTAATGCTTATCTCACAGGGTTGTCGTAC^ 
GC^TCTAACTACTTAATAAATCTTTATCTTCTTTTCTGTCTT 
AAACCTTCAGGTATAAATGCATGCACATAATCCTCAAACATCCAC^ 
TACATGAACAGAAGCATATAC^TCAGAATCTCTACCTTTGATAGTAGCCTGATA^ 
GTAGGATTTGAATTTACTGTCTGATGACTGTTAC

AATCTTAACAGTTTTGATTTGATGGTGTTTATCTTTCGCGCn^ 

TAT AGTT ACATGT CTCT C CT CITnTI' CAGGT ATTTATGCAG AAGTT ACT I~i'CTCAATG AAGC CTT CCTTGAGCT AC CTCATTTCT 

TGTGGTGCCCATACTTATTAGTATTAGGCCATACATTTTTACITATTT 

AATAAGGACACAGACTTTTTGTCTGTTTTGCTCACTGTC^ 
ACTTCAGCACTGGTTTATTTTCAAAACGTCTC

CTT TTTAAAAAATCrTTGGAGTAGTTGACATAAATCAAC^ 

GATGTGGTATTGGAACCTCTTTTTTGCCCATGCCAAAGTTATATACACACG 

TTTTATGCTAAGGATGGTGCTAAGAATTGGTCCCCTTCCTCAAGAAGTTTATAGTTCAGTGA 

TGGTTGTGTCTCATCTTTACCACCTCAAAACATTAGAGAGCCCTGAGGC 
GTGATTTTGTATTGTCTCTTGGTTTTGTAAATTTTATGTACTTACTGATGATTCT

CTTGGACAGAITGTCTGCACTTGGATGTATAAGGCATCTCAGATTTAATA 

GGCTGTGTCTCTAGAAATAGCATCrcrATTCATCCAGCTGCT 

ACACCCTGCAGTAAATCCTGTTGGCTCCATCTTCAAAATACATGCCCAA^ 

CTCCCCTGACCTGCAAGTTTTCGCTCATTGAGATTACTACAGTAGCCTCCT 

TGATTTTAAAACAAAAAATCTTGCCATTTTrATGCT

TTCCCGTGCCTGTAAGACCTTATTTAACCTGTCTCCTCCCATACCCTTACTCCATCT 
CTTATTGCTrTGTAGCCATACTCCCCTCCTO 

CTTTCTGCCTGTCCGTCTCCTTCCCCAGATATCTGTATGGCTTATACCCTCAATGACTTCCTTCAG^ 
ATTGAAAGGCCITCCCTCCTTCCCTATATAAAATAACATTGACCTCCTCCCTGTCTCTACCCT 

TCTTAACCTGCTTTGTTTTCTTGAATAGCAATAAATATTTATTTCT^

GGTATTTGTCGTCCCTGTATATATTGTACAACAGTTTCGTATGGGCAGAGACCTGTTTTT 

G AACAGTGC CTCGTGTGTTTAGGCACTT AGTAAATATTTGTTG AGTGAAATG CAACAAGT ACAAAACAAT ATGATCACTGTAGTAG 

AGATGCAAGGTACAGTAGAAACACATGGGAAAGAGTGACATGACTACATAGGAATTGAGGAA 

AGAGCTCTTTGGAGAGAAAAACAGGTAGGGCATTTCATACAAAGAAAATA^ 

GTTCAAGGAACTAAAATATAGTACTGCTACAGCATAATTTGTGGGGGTAAAGATGAACTAGATGAG^

CACCTGATCATGGAGGGATGGAGGGTTTTTGTGCCAGACTAAGGAATTTTAGA 
GTGAGCCATTGAAGACTTCTAAGCAAGGGATGTTTTATTTATATTTATTTATACGTGTCTCT 
ACGTATTAAAAAGGTAACTAGCAACATTGTGAAGGTTGAGTATCA^ 
GAAGATATGATGTAGAAAAACAATTACTATCACATGTGGGCGAGAGGATATT^ 

GTGGGAAGATGTTCTTGGCATACTrGXZAGTACATTTGTGAAAGAGAA

ATCCACCCTTTAAGAAAGAAGACTTTAAAAAATTAGTTAACAGTTATGT^ 
GTACCAGTTATGAATCAACAAGAAAAGATGTTACAGAATAAAATAAATTGTGCTTC 
AGTTGGTTGTCATTGATATACCTCCTGATCTTGATTTATTGTTGAAGGAA^ 
TTTGCTGTGGGAACAATGTATCTGCCTTTGTAGGACAAATGTCT 

AATGAGGGATGAACTATAAAAACATTTCTTCCITrCATTCAAATGCA^

GTCATATCTTAGCAATATATTGAATTTCTTGAATACTTTATTCTTATTAATTTAGGGTCAAACTATGAT^ 
AAATAGAGGCACCCAAAATGGTGCACACGATTTGAATGTTAAGAATTTGGATTTGGC^ 

TATAATCCCAG CACTTTGGX3AGGCCAAGGCAGGCAGATCTC CAAAACC 
CTGCCTCTCCATAAAATACAAAGCAAATTAGTCAGGCATGGTGGTATGTGTCTGTAGTCCCAGCT^ 

G AATCACGTGAGT C CAGG AAGG CGAGGCTG CAGTG AGCCAT G ATTGTG C CACTG CACTC CAG C GTGGG CG ACAG AG CG AG AC CCTG

TCTCAAAAAAAAAAAAAAAAAGAAAAAGAAAAAAGAATTTGGCATTCC!AAATTC^ 
TTTTTAAAATTTTATCCAAATGCTACAGAAGATGA^ 

TCCCAAGTTAAAGGATTCCCTTTTTAAGTGTGCTGCTATTCAACTCACTATCTGAT 

TTGCTTGGTAATTT CTGTGTCCATGTTTG ATCCAG AT AGC CACGTGTAGCCGGTGG ATTT ACCCAACCATGATGACCAGTAACTCA 

AAGTGGGATTTTCTAATTTGTTGGAAACAAAGGTTAGAAAAGAATTGAGATTA

AAAGAGCTTT AATAAAAAAATTTTAAGTGGCTTAC AAAATTGTTG ACAAGG CTGAAGAAAC AGACTCTAGATCT AG AAAAAG CCAT 

GTTGCATCTAGGCCACCAAGAGAGTTATTGCCTCTTTTAGGATCAGAAA 

CCTGTCTTAGAACCTTACGACCTCTGCCATGATCAGGAOT 

GTAAGCCGTTGTTGTTGCTGGCCCTAGAGCCAAGCCCTGCCTACCACAACCTGTACrGCAGAATG^ 

TATTCATGAAATCTGTGACTGAATACTAGGACTTCAACAAATACTGCTGTGA^

AACAACCCAAACTCAACCTCCCTTCTCTTCTGTCCTTGAAATTTCTAGTGAGTC 

AACTGTAG CCACAAGAG AGGCTGGGAAATGTATTTTTTAACTTT CTCITCTCTGCAGCACAGGAAG ACATACACTGAAGG AGGTTG 

AAATGGATGTGAACAGTAATCTACCATATTTACTGCAGGTTTAGAACCTGGTTCTCCTTAAGG 

AGATAAGTTGAGAAAGTTTATAAATAATACCGTCAAACAATGTTGAGA 

ATGACTACATTAAAAAAAAAACTAGAATTTGTCCTTCACCTCTATGATGAATTAAGTGG

TTTAAGATGTGGTTAGTGGATAAAAAGACCAAAAGAGGGAGTATAAGAAAAACCCTCATACAAGGTTACCT 

TAGTGTTGTCAAATAATGAAGAGTGGTCAAATAAGGTGGGACIX1AAAAGCA 

XTGGGTTTTTTTTTTCITTCTTTACTATTGA 

TCCCATCCCCTGGGCTCAAGCCATCCTCACACCTCAGCCTCCCAAGT^ 

TTTTTGTATTTTTAGTAGAGATGAGATCTCACCATGTTC

GCCTCCCAAAGTGTTGGGATTACAGG(ZATGACCACTGCGCCCAGCCTGTTAGT 

T AGGAT AAGAGGG CAAGTTG CAGGTTT AGGCTGTG AGTTG AAGGG AGCACTTG AAAGGAGAGATT AT AG AAT AAGTTTT CAAGAAA 
TTTGGCAATGGATTITTTTrrTTTTO 

AAATATCACACACAAGCAAGTACAGTCTTCCAAAGTTCTTC^ 
ATATTCTCTTTATGCTTAGAGATATAATAGGCACAATATTACTTTTATTATATC^
TTATAAAGTGCCAATTAATCACTACTATAGT^

ATTGATTAGATGAATTAACTTTCAACAAAGTTTATTTGAACTTCAG 

CTTTTATAGGAAAGGATGCTTGGTACATTTTTTCATATGACA 

TCAGGTTCATTGGATTCTTCAAAATAACCTCATTGTCTCTCT 
GTTGTTlTCTCAAAACTGCACCrTATGTGAATGAAACAGAAACATGTTCA

(K3AGCATATTTATTATGGAGCATATTTATTAAGTAGAAAATTATTATGGTCT 

GCGACCTAGTTGGACTTTCGTTTAAAAAAAAGTCTAAATTATGCAGAAG 

AAGGCTGAAAGTAATGAAAATAGAATTGCTAGACAGTGTCCTGGAGGGACAAGA 

ATGAGCTTTGGATGAGAAGAGGAATAACTAGTCITTC 
TTAGTGACCTGGTTTATTTGTAAGGACTAACCCATTTTAGGCAGTTAAT^

GACTTGAGATGGACCTAAATTTGTTTATTCTGTTTAAGTAGTAACTT^ 

ACTATTCCATTGG TTTTCT TATTTrAATTrTATAARTAATGTGTTTACrG 

CTTCTCTAAAATATATTGGTTTAAATGGCCTTTAAAGTTGATAAGCAGAAAGATA 

GAAAGCTGTGATTGAGTGATTTGTCTGAAGTCAT^ 
GGTAATGCTGTTTCCCTTATGCCAACTGATTGCATAAGGGAACTAGTGTACTGAT^

G CTAGAGT AT CAAAGGACTTCAAAACTTCAAAACT AG ACTTTTT AGGTT AT^ C CTTCAACAAATGTTT CTCAGCTGT CT 

ACCGT ATG ACAGT CACTGTTT AAGATCTTG AAG ATCAAC AAG TT CTTGGGC CT CAAGAATGAGGGAAC CAAAAAAT AAACACTGAA 

TTAAAAATGAACAGAAATTT AGAT AAT CAT CAGTG CTAT GAAAG AAAGAAAATGGAGAAAT AGAAT ATGG T AGGAG AACT ATGTG C 

TTT AGAT AGAAT AGTCAAGGAAG ATTTCTCTGAGTT AGTG AT ATTTGAG CTGAG ACTG AATG AT AAGG CACTCG CCATTGAAGAT A 
TCTAGGAGTGTAGCATTCCAGACAAAAGCCCTAAACTTGAACATGGTGTGGC^

GGGG TTGGGGGAATGGAAGGAG AAG AGATCAGAGAGACC^TT AAG C 

TAAGTGAAATGGTGTGGTATTGGAGGAGGAGAGTGAGTGAGTAACCTAAAGAATATC 

TGGATTGTAGGAGGGCCAAAAGTGGAAGCAGGGGCACCAGTTAGGCAACTCTCAT^ 

GACTCGGTGATTCTGGTCAAGGC^UUVGAGAACTGGTTGGATTO 

TATGGGAGTTTGGGGAAATAGAGGATGGTGAGAGTAAGAGAGAAAAA

CAGATGGTTTTTCTGGGACTTGTTAAATTTGAAATGCCTACT 

GAGGAGAGGTTAGACCTGGTGTTGCAGATCTGGGAGGCGTTCGCATTAACACCATA^ 

AAGGTGGTTTGGAATTGAGG CATTGCATGTTT AGAGGT AGC C AGGGGG AG AAGTCAG CAGAG AG C CAGTG ACGTAGG AG AAAAGTC 
AGAAATACTGTGTTGTGGAAGCCAAGAGAAGAGAGCATTTTAAAGA^ 

CCGGGTATGATGAAAACAGAGAAGTGACCG1TGATTTGGCAATGTGGAAGGCATTAGTGTCCTCAGCT

TGGTGGGGTCAGAAGCTGATTGGAGAGGGTTGAAGAATGAATGGAAGGTAAG 
GTTTTTTTTCCTTTTCTTTTTGTGTGTTTTTTTAAG 

AGGAAAATTAATGGTGATTTGGGAAAGAAAGATTAATTGTGGAGTAAAGTCCTTGAGGTAGCACT 

CAAGT AGAGGG ATT ACC CTTTCAT AGGAGTGGGG ATGCAATT ACGT AAT AT AAAAAAG GG ACCTCAAG CCACTTT ACACACAG C C C 

CTCCCTTCAGTCTTAACAATCTCCTTTAGACCAGCTTGATCTT^ 1TCTGTG CCACA TTAA

TATGTTCTTTCAGATIGAAGTCATCCTATATAAGGC^ 

AACAGGTGAAGCAAAAAATTTATTGCCATATCTTGGACCCCACAAAATGAGAGATTGTT^ 
AAGTTTATCGTACCCAAGTTGTGGAAAAATCTTTAAGGTTGGTAGAAAAAATTGGTAAATT 

AAT ATT ATG CATTTTCGAGATT AT AAT AATCAAGAAGACAAG AC AG AAACA CATGTGGCTTTT C ATTG CC TCAAA 

ATGTATCTTCCTCTTAATTGTTCCTAATTTTATCITTGTTTTCTGGAGAAAA^

CGTTTGAACTTTCCTCAAAGTACATCTTTATTTTATTTATCTAAAGAATAAGTTA 
AGTGCATTATCAGGGAATGAATTCTCAGATAGATGTCTCATCTTTGA^ 
ATTCATAAAACTGGAATATACCACCTCTGCACCGTAGCAATGTGAACAT 
AGACAAAAGTOU^TAAGGAGGCCGGGCGTGGTGGCTCACGCTTGTAATC 

TGAGGTCAGAATTTCGAGACCAGCTTGGGCAACATGGTGAAACCCCX3TCTCT

GGGTGTCTGTAATCTCAGCTACTTGGGAGGCTGAGGCAGGAGAATCG 

ATGCCATTGGACTCTAGCCTGGGCAACAAGAGTGAAACTCCATCT 

GGGGTATTATTGGAATGAGCATGTGGTGGATGAAATATTGAAT^ 

GGGATCTG TT T TTT TT 1 T 1 l'ATGCCTTTAAAAAAATATCTTTCTTTTACATTTGCCTTTGAAGTTTCCCT^ 

ATTTTTTATAATTTTAAATATCrrATTGAATTCrTCTTAGAAAAGGTATTGT^

TTCAAGGAGTAAACTIGAAATCTTTAGTCTTAATTCTCATATTGTTCT 

ATGTGGATGTGATATATAAAGGAAATTCTGCATTTATAATTGTTTAGAAAATAATGCAAAGAGATACAT^ 

CTTTCATATCATGTTTTCATATCATATTTGACTAATAljTrcr'l IT 1 GCATTGATTTCTATGTTGAAATTAATTAGAAGGCTCTAGT 
TACTTTGTGAGACCAACTTTTAAAAGTTTAGTTTGGTTACATTTGAAG 

CAACTAAAAAGCTGAACCAACTGTTTTCTAGACAAGATCAGATAT

TTTAGTTATGACATCATAAAGATACTGCATTTC^TAGCGATATTTCTAGAGAATC^ 
GTACACTTGACAAAAGAAACTATC^TTTCAGAGATTCGCGTCT 
CTTAAAATGAACTTATAAAAATAAGCTreGCAACATGAGGATGGGC 
TAAAAGATGAAAATTGGTAAATTAGATTTTATCAAACTTAGAGGCTCCTT^ 

AGCCACACCTGGGAGAAAGTATTCTGACAAAAGACTTGTATCCAGAATTTATTTA

CCGATGG C CATTTAAAAT ATGGGGAGAAT ACTT AAACAGAT ACTTCAGAAAAG ATATGGCCAA CACATGAAAAG GT ATTGGAC 
ATTGTTAGTCATCAGGAAATCAATTTAAAACCATTTTTGTCACTATAGATTCTAG 
CTGGTTGCCTGTAAGAATATGGTGGGAGTGATTGCAGAGTCACAAGGGAACTTTCTC 
ATAAGGTAGAGATTACATAGGTGTATACATTTATCATGTTACATTCACTAGAATAG 

TGGAACAACTGGAGCATTGCTGTT AGGGG TG T AAAACAGT ACAACCACTTTG AAAACAG TTTGACAGTTT CTTG T AAATT ACACAT

TCACCTGCCTTTTAACTCAGCAGTTGCATTCTCAGTAATCAA^ 
GCTTTACTTTTGATAGCCCAAACTGGAAACAACTCAAATAA 
CAATTAAAACAGTTGAACTACTATGTACACAACAACATCG^^ 
GTACATACTATATGATTCTATTTATGTGAGATTTT 

TGGGGTAACTGAAGGAGAGGCATAAGAGAACTTTCTGGGGAGATAGC^GTGTCCTTTATT^

GTATAAATTTATCAGAAGTCATCAACCTCTATACTTTAAATTGGATTT^ 
AAAATACCrrGTTTTAATTTTTTAAAAATATTTTTTAAAGT^ 
TAAAAAAAAATTTTGAAGTAGCCTITTCACATAAGATTG 
TAAGACZAACMATGGTATGAATGACACTATCCTCACCTCTCAAATGG 
TTCTATCCACAGGACrGCTGAAAAGCTATTTGTTCAGAACT
AGAACTTAGTTGAAATTTAGCATTGA^
CTTTATTGAAAGTGATATTATTTGGCCAGGAACA^ 

CAAGGTCAAGAGATTGAGACCATCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAA 

ATGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTG^ 
ACGCCACTGGACTCCAGCCTGGCAACAGAGCAAGACTCTGTCTCAA

TATCGCAAAGAOUAAAAACCAAACACCACATGTTCTCACTCATW 

GAGCATCACACACTGGGGCCTGTTCTGGGGTGGGGGGAGGG 

AATGGGCGCAGCACACCAACATGGCACATGTATACATATGTAACAAAGCTGCA 

AATAAAATATATATATATTAAAAAAAGAAAGTGATATTATTTGAGTATTTATACAATTATTA TTG 
TATTTTATATTATTrGAAGTAATGAGCTTTCCTTTrCTTTCTAG

TTT C CAGT ATTTG T CTTTCT ATGTTTATGATAAG AATGTTTT ACAAAGAGATCT C CGT AT AGGT ATGT ACT ATT CATAATTAT CTT 

TAAT CACAATGTT AATG TTT AT ATTCATTCGTTCTCTTAGT AT AGTTTTGAAAATTG CAGAATAAT AT ATG CTCAACATCT CTGAT 

CTTTrTCTGTAGACTACATGTTCATTTGTGA 

TTGCAGCTCCCCrAGTCACTATGTCATATGTTTTTATAATTCCrG 
TTGGCTGTTTCATATCrrAGGTTCTCTAGTTAAATGTGAAATTTAAAAGGG

AT CTTG CT CT ATCT C CCAGG CTGG ACTG CAGTGGTGCG AT CTTGGCTCACTG CAAC CTCTACTT CC CAGGTT AAGCGATTT CT C CT 

GTCTCAGCCTCCTGAGTAGCTOtfWATTACAGGCGTGTGCCACCA 

CCATGTTCGCCAGGCTGATCTTAAACTCCTGCAGGA 

CCACGCCTGGCCGGGCAGGGACT C1T1V1T ACGGAGCACOT 
AATGTTTTT AT AGAAAGATTG CAGTTTTGATC CTTC CAAAGAACAT CTG TTGTTGAGG'lTTTTCT AAT AGAAGT AAT AATT CACTT

TAATCTATTTATACATTTTCTTCCTGTCTGTATTAGTGACTC 

GGTAATTTATAAAGAATATAGGTTTATTGGCTCACAGTTTTTCAGGCTG 

CATG AGTGAAAGG CAAAGGGGAAGCAAG CACATCTTCACATGG CTGG CAGG AG AGAGGAGAG TTGGG ATG TGCTACACACTTTT AC 
AATAACCAGATCTCATGAGAACTGACTGTCATGAGAGCAG 

CTCCTCCAACAGTGGGATTACAGTTTGCCATGACATT^

CTCTTTTTTTTTGAGACGGAGTGTCACTCTGTCGC^ 
TGGGTTCACGCCATTCTCCTGCCACIAGCCTCCCGAATAGCT 
TTTTTAGTAGAGACAGGGTTTCACCATGTTAGCC^GGATGGTCTCGATCTC 
GTGCTGGGATTACAGGCGTCAGCCACTGCTCCCGGCCAATAAATATATTTCT^ 

CATGAACrAGAACAGAGCTAACTGGCCACATATGTGTCACATTCCTGATCTTTGC^

AGGTCTAATTTACATTTATTCTCTTTCTTTTTAGTATCCT 
CCTACAGCTTAAAAAATTAAGCATTATTTGGCCAGGTGGAGTGG 
AGNNNNNNNNNNNNNNNNNNJWAAAAAAAAAAA 
TTCTGATGTTTCTCTGTGTTTGATATTGAACTCCTTAACAA^ 

CTCT AT AATCTGGCTTCT ATGTTTTTTTG AT AAAC CTTT CTTGAAATTTG C CAACATCATTTTAATC AAAT AACCTTT AAAACAAT

ATGTTTTTCTTAATTATAAAACTGTTATATGTTCTTAGTATTrGG 
GTATTTCTTCCATTCTTTATTTCATACATTTGTAT^ 
TTTCCCCATATCATTAATATTTGTTAAAACrATTTGTTACATAATATCCT 
TGGTGAACATTTACATTTCCAAATTTTGGGCATCCTCAGTGGT^ 

TCTTTTTCCCCAGGATAAATTCCTAGAAGTGAAAATACTGAGTOVAGGGGT^

TATTGCCCCTCTGCCCCAC7\GAGTATCATACTAATATGGACGACATAAATGTTGTACT 

G CAT AACCATTTT ACCAC CCCCTGGCCATTCTTGCATATTAT CATTGAAAAAAATTTG CCAATATGATAGGTG AAAAATAACACAT 

TATTATTTTAATGTGTGTTTTCTGATTTTGAAGGAATTTGAGGAATTTT^ 

ATTGATTGCTCATTCCTTTGCCTATTTTTCTTTTGG 

CTTTTGTCAGCTATATACATACCAAATACTCTGTCTCACAAAAGGCA

TTCACTCTCTTGATTGCTTATTCX3GTTTC 
CCTCTGATGCCTTCTTATATGCAGACTCTTCTGTTTGT^^ 
GTTTAACATTTACCAAACACTTCTCCGTGGCACACAAAGGTTGGG^ 
GACITCTAGTCTAATACTAATATATTAATACTAATATATTAGGGTCTGTGCTGGAGA 

GAACAGCTCATTTTATTTGGTAGGGGAATAAGAGA

CTTTTTTTTTTTTTTTTTC 

TCCACCTCCTGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCAAGTAGCTGGGAT 
TTGTATTTTTAGTAGAGACAGGGTTTCACCATGTCAGCCAGGCTGGT 
CCAAAGTGCTGGGATTACAGGTATGAGCCACCACXSCCCCACO^ 
GGGAAGAATGAATCATGTTTGTCGTGTACCTGCTGGAGGAGGGTGATA

CTAGAAACCATTCTTAATTCCAAAAGCTAAGGAACTTGGACTTTATTCrc 
TATTATATGATATACAGTTATATATAGTTACATACACATATATATATATGCA^

TATATATACACACACAGTTATATACACACGCAGATATTATATATTCTAA 

CTGTTGTAGGCCATAAGAAGTTGTTGATTTGGAATAGGGAGCGGTTAGAGGCA^ 

ATGAGTTAGGGCAGTTGCAGTGGAAATACAGATGAGGGAATGTACTTCAGGGGTTTCA 

GAGATGTGAAAAGGAGAAGAGTTTAAAATGGTCTrGTTTGTATGGCTGTCATTAAATTAGA 

TTTTGAGTGAGAAGGGGAGAGGATGACTTTGGTTTCAAACA 

TTG ACAGTTGG AAAT ATAGATCTG AAGCTTAGAGAAAAG AG CT AG AGATG TAG ATTTG AGAGTT ATT AG CATTAAG TG AT AGGTG A 

AGCCTGTGTTGGAATGAACGAAATTACCTGGGAGTGGATAAGGAGGGTTTCAG 

C^GGAAGTAGAAAGAAAGATGGGAGAGATTAATGTCAGAGAAGATAAGGAAACC^ 

CAG AT AAGG ATTGAT AG AGG CT ATTGGT AGTT ATT AATAAC CTTTG AG AG AATG TAT AACTT CGG C CAT CAT ACCTT AAGAAAAAT 
AAAGTTGAGGTTTATGATTTCCAGAAGTTGCCTTTGTT^ 

AGTAAAAATCAGTTGTTTAGGGAGTGGTGTCAGAAAGCTGTAACTGTTGGTGTGATAACT 
GGGATAAAATTTATACATTTAAC^GAAAATTCTTTTTCCTTTTG 

TGG AAG TGG TTG ATGG TTTT ACIT AT ACATTTTT AACATTTTTCT AG CATT AACACACAGG TTTT 
TTrTrTTTTTAATTTTGAGCTATTGGAAGTTG 

G CATTTTCTGG AAATG AAATTTGAGCTAGAATTT AGAGAAT CATCTTTT AAACTTAAGAT CTT AG ATTCAG T ACAAG TTGAG CAT A 
GATAGATTTTAGGG G1 UT T1T ATTTTTAATTACATGA 
CATTGTACCACK3CAAAGAGAGGAAGCAGT

CCAGATTCTAACTGTCTTCTAAATCTGGATC^GTC^ 

AGGATACATCTGTAGAACTGACCATGGCAGGTTCAT^ 

GTGTTGGGGCCCAC^TCGGCTCCTGAAAGTCTCCGGGAATGCCATAGGCGTGGGGCTGGTGCT 
TCAGCACAGGGAGGACCTGAGAAGACX3CTGATCTTCCATCTGTATCCCAGAATC

GGGGTGGCACCTTGGCCTCTGGTTGGCCTCATGAATAGTGAAGTATAAGTCCTGCAAG 

AGCT CT TCT T TATACCTCTGTGCTTGAGATGCGGGGCAACTGCTTTTC 

TCAAGAAATTCCCATTCTTGCAGGAAACTCTGGACTTCCTGGCTACTCCATGGTTTA 

CTGGACTG CTTTT 1 1 1' CTGCTTCATTTTCAGGATGTTTCTGGTAGTTTCAGTCTGAAGTAT CACTTTCTCCATTTTTATTCCTGAG 
T TTTC T T CTGGGATGATTCTCACTGAGTCrGCAGGGCCTC

CTAGGAGAGTOU3AAGGAGCCTCAGTGTGGAAATGTGTTTGCTTCTCTCT 

AATTTTAAATTATGGCATAGATAATCCCA' lTl ' I ' l X ' T 'l 1 rT TTTGAGACGAAGACTTGCCCTGTTGTCCAGGCTGGAGTGCGTTGGC 

ATGATCTCGGCTCACTGCAACCTCCGCCTCCTGGTTTTAAGCAATTCTGCCT 

CCCCCACACCTGGCTAATTTTTATATTTTTAGTAGAGA 
CCTTAGCCTCCCAAAGTGCTGGAATTGCAGGCGTGAGCCACCGTGCCCAGCCTAAATTACTTT^

TCITC CCAGAGGT AATAATTGTT ATT AATT CGGGG CAT AATTGTTT AT ATT AGGCATCTTT AT ATGCT AGG AACTG TT AG TACAAA 

ATGAGAGAAAAAGAAAAACTTTAAGCATATATTTTTATTTCCAATGAGG 

CCTCCTATTATGTATTTTTTAGTTATTTTGTTATTCAAGAGCTCAAACAA^ 

AAACrTGACTATCTTAGTAGACTCTGGCAGTTCTCCCCGAC^ 
TACTAGAATGTCAGAGACCTGAATACCAGTCATGACTATGTCACTATCTATCTATGTGATCTTCA^

GGTTCAT C CAT AAAAT AAGATTGGGTTT AAAATTG CT AAGGT CTTTTCCAACACTTTG AATTGT AATTT CTG CTGACATTT AG CTT 

TT AAGCAAAATTTACTGTGTATAAATTG CTTATGGCTACACTTTGGTGTCTGTTTG CACAGGGACT ATATCAGTTATCTACTGCTA 

TGTAAGTACTCCAATACTTACTGGCTTAATACAGCAACCGTACATTTACrr 

GCTGAGTGGCT CGGCTGGTCTCGCCTGGGG CTX3 ATGTGCAGTTATCTGGGAGATTGGGGCTGGCTGGTCTGATTCATATC 

GGTTGGTGCTGACTATTAGCTGGACCTTCCTCACCACATGGTCTCCTC

TCACATTTGCTGJVTGTCCC^TTGGTAAAAGCAAGTCACA 

CCAGGAGTTATGGAGGCCATTAATATACCAGTTTACCACAAAGACCAAAATATG 

GTGCTGCCATTGTCTCTAAATTGCTGCTGTTGACTGTTTG 

AGTAATATTTGTCACTGTTTTTGTTTGTTTGTTTTTTTGAG 

ACACTGCCACGCTCGGCTAATTTTTTGTATTTTAGTAGAGATGGGGTTTCACCGTGTTGCC 
CAGGCAATCCACCCACCTCAGCCTCCCAAAGTGCTAGGATTACACGCGTGAGCCACCATGCCCGGCCTGTCA 
TCTTTTAGCCATTTCTGCTAGTGCTCAAGTAGGAACAGGACTATCAGGATTTTCCT 
TAAGAATATATTGTATCTAAAAATGGGTCCCAAG^GTAACAC^GTATTTACTGATGGCA 

G AAAG CT AAAAATG CTTTTTTTTT ATTT AAAG AT CAACAAAGG TGGTTT CTGTG CC CACAGTT C CTT GATTT CCTTTT GTGTTT C C

TTTCTTTGTCTGATTTTAAGGAACTAATACAGCATCTGGTGTGTATATGTTC 

GACAGTGAGGAAATGAAGTACTAACTTTAAATGATTTGTTGGGACGTTTGTTTCT^^ 

CTCACTGTGTTGCCCAGGCTGGACTTGAACTCCTGG^ 

ATGCACCACTGCACCCAGATGTGTTTGATTTTTTTTGAAGTTGGTAACTTCATAGCTAGTGAAAT 

ATAGTAATCAACAACTG<5CCACAGTGGCTCATGTCTATAACCC

AAGTTCAAGACCAGCCTCGGCAACATGGTGAAAACCCATCTCTACAAAAACT 

TTACTTCTGGCTACTTGGGAGGCTGAAATGGGAGGATCACCTGAGCCGAGGAGATCGAGGCTGCAGTG 

ACACTCCATCCTGGGCAGCAGAGCAAGACTCTGTCTCAAAAAAAAAATTAAAAAAAATACATAAATA 

AAACTAAAACAATCTTTGGGACTATTAGAGGCACAAAAAGGGT^ 

CACATCAGATGTAGAGTTTGAGTAATTC^CCCAAGATTACACAGCTAATAAGTAACA

TTGTGTTGGAAGGAGCACAAACAGGCATGCCTTGGTTT 

CAGCITTCCTCATCTGTAAGATGGTGATTATACCCTCTACCTCCAAGATTGTTAGGA 

CTCCTACCTGGTAGAGACTCAGTAAATGCTGGCTTTTCTTTCAGTTCAGTGCT 

GTTAATGCCTGCCAGTTTAGTGATTAAACAACAAATCAGAATCGTGA 

TCCTTCTTTCCCCATCCTCTCTTCT^

TGTCTTCTCCT(TTCCCTCTTCCCTGTTTTATTCCCTTGGGCrGATG 

GCTGGACATGTTGTAGGTATCCTGCTACCTATTAACTTCAAATATTTTTCTTAATTTTCCT 

CTAACTTTGTGGAATCATGTACTCrTTCTTCTCTGTTTCTTCAGG^ 

TGACATCTTAAAAAGGAGCTTAAGAATAAGAGCrGGCAATAAAATTACA 

TAGTCAGAGAAGGGTTTCCCTCTCATAGGACAAGAAACATTCAACTCTTCAGTGA

TAGGAGGAAGAGGCTGCTTGCTTTAGAATATATTACCTGTGTGACTAAGAGAGGATTCTCT 
AGATGTATTTCCAGTGATCTGAGAGTAGTGAGTCATAAATAGGACAGGTAAATTTCrACTCTAGT 
AATAGAGTTTGGTTAGGGAAGCCTGGAAATGAAATCAGCATTTACAACACAATGCCATTGT 
GTCAAACCAATAAACTCCTAAATAAACTTTCTAGGCAACCCTCT^ 

TATGTATGTAGATATGTACATCTATATAC!ATATTCATAAATTGTC

ACCAGAAGTCTACTGATTCGTTTCTGGATGGTAAGTAAACTCTCTTAAAATTTTGTGCCAGATATTAT 
GAGGACATAAAAGATACACAGTTACAAAGATTCTTAAAGGGAGGGTAATCGTTCT 
TAGATATATTTCTGTATGTACATATATATTTTTTGAGTATCAAACTGTTATGCATTGCCTTAACATTT^ 
AATCTGTGGTAATTAGGACTACTTGATAGAATTATGGTAATCATTTGTGGTGTGCAATATTGAA 

TTAAGCCATTCCTGGTAATATATGGATTTGCCAGGAATTTGATACAATTCATAGATAAAAC^

CTCTITAACCTAAATTCTTCCTGCAGTTTAAGGAACRTTTGAGT 

GTATTTAATACTATAAGAATTTCTGTTTTCCCATCIATTTTTATCTC 

GTGTCATTTTCCATGTGATTTGAATGAACTATAAATC 

TATTCAAAAGAAAGTTG^GAAGATACGTATTTGTTATCAGGACCAATTTATTT^ 

TAGACATATAAAGATGAACTTTCAATAGGCCTCCTTCGATTGTATTTTTTTAA^

ACAAACTCTTTAATAAAATACAGCTGTGTACTTTACTATTTTCTTTAAAAT^ 

AGACATGAATGAGAAGCTTATCAA^AACTGTAATGTGGTTACATTCTAA^ 

TAATTCACCCTTTTAAGGTATACATTATATTTGCTTTTCAGTG^ 

TTATAAAAGTTAACAGTAGGGGAATGTGAGTAGAAAATATGGAAATTTTTAGTGTACGGTTCTTC 
TATTTCAAAATACATCTTTTTTTCTTTTTTCCAGCACTAAT
GTGTGGTGGGGGCCTTWVTATATTCACAGAGTTGTGT
GTTTTCCTGTTCCTTCC1TTTCACTTGATATACAAAAGTTGA 

CAAATGTTGAACTAAAAGACCAAGGTATATCATAAGTACAAGGAAATAAAGTAGAATTAGGGCCT 
GACAAAAGAAAGTGAACTGCTTTTAACTTTATATATCCTTTGTTGA 
G CAAAACTGTGT CT ACTTCTTT AAAACTTG AATTG AGG AAGGAAATG CTTTGT AC CAAG ATGG CAAAT ATTTTATT AT AAGAACATT

TCTGTTTCTTCCTCTCATACTACTATGTTCATTGTACTC 

TTTGTATGTATTGGAGCAACATAATCAGTGTTTGCAATGTTTTGATAAAAGTA^ 
TAATTTTCAAGTCAGTTTTCACTGTTGTTAATGATGTA 

AAATGCAGATACATCTG CTCAAAACTTGTC A ' I ' X ' IXjT ' 1 ' CT 1 AGGGTCAAAAAATTGGGAAGATTAGTTTGCACTACCATTGTAC CTT 
GCTTTTT AT AATGAG TGTACTCTGGGT AAAAACATGAAT CTG AATAAGT ACT AAAAAAATG CAGTTG CCTGTAAT ACAACTG AGGT
TTTATCTAATCCCCTGATTCAAAATGACCCAGACAAATGT 

TGTAGAAAAAATGCITTAT AAAAATGGATTTTTATGTCA^ CATTTT AAAAT ATTTT CTAT AAAGT ATT AT AT AAA 

ACAGTG CT AATG AGT CTT AGG ATT ATACGAAG CATGTTTTCTT AT ATTGTTTT ACTT ATTTTCTC CAAAA 

AAGAAGACTTGTGTAATCACAGTGGCAAAGAAACTTGGTTTTC^ 
AAGG CTT ATGTAAT ACAAGAGATTGT CACAGG AAATGAG T AAAAAATG AACT AATT AAACTGGTTCATG TAAAT AGAG CAT ATT AT

ACCTTATTTTAAATTCTAGCATCAAATATTATAGACTTTTG^ 

GAATAATGCTCATGTATATGCATATGTGTAGATTTGTGGAGATTA 

ATTATTATTTGTCTTTGAGGGGAAAAGTTTTAAGGGAAAATATGTG 

TGATTTTGTCTGTCTTGATTCTCTGTTTACTCTCTAG 
CTGTGCACAGCCTTCACTCCCACACCCACCCTTTCCTGTTGTTCACCTCGCT

TTTTCTTTTTTCCAAAATAAGAGACCTTTTTCAGATTCTG 

TTAGAAAATCTGAGTTGCTCTTAGAAATTATTTTTCTAACTAAAGGGAATCTTTTTGCTCT 

TAAGGATTGTGAAACCATTTCCCGAGGGCATAATTGCAAGCAC<3ATTCTTCCTCTCTAAAAG^ 

CAGAAGTTTAAGACAAGAAAAAAGTGCTAC^CCCACACCACAGATAATTT^ 

TCTCAAGCATTCTCTTTGA1TAATTATTAAATT

ACATACTTTTCCTGTTTAACCTACTTTGCCATGTTTTAATATTCT 

GTACAGTATTGTAAATGATCCATGTGAGAAGCAATGTGAAGAAGTGGAAAGTGTTTAAATTAAAAGGC^ 
TTTTTATATGTAGTTACTTAAATGACATTGAATTTCATT^ 

G AG AATTAAGTGAAATATCTGTAAGAATAG TTT AAAAG CTG TTAATG CTAT ACAC^TGT AAAAT ATTGT CTGTTTACCACCAGT CA 

GTCTGGTGGCTATTTCCCTAATATTCCTTTACCAAAAGGCAGATATC

GAAAAGGCATGATAGGAAAAGGCTTTGAAAATTTAAACTGAAATTTTATGAAC 
ATTTTATGTAAATTTGCTAAACAATTTAAGAGTTCCAGCTCTCAA 

TACTAAAGATTAATACAGAGACACAAGACTTTATTTGATGGTGTGTGTGTGTATATATATGTG 
GTATATATGCATATGTGTGTATATATATGCATATATATGTATATATATACAC^C^CATT 
GATATACATATATATGCATATATGCATGTATATATACACATTTGAGAGATATATATCTCATCT
CAAC^GTTACCCAGATATAATATAGTCATTTAAAAATTTTGTACAAGTTATTT^ 
TTTGTCTTAAATGATACTTACTTAATAAGATTTGCTCACTATTTAAAACAGATTTTAG 
GACATATAACACTTTTGGGTTAATATGGTACCATTGAAAATGTTGTTGTCCCACTGCT 
GAAAAGATAGAAAATCATCCCATGGTTCATACAAATCCTACTTTGTTAATAAATT^ 

CAAAAATACCATGAAACATAAGAGGAATTAATAGGGAAATAGGTGAAAGCTAT^

ATAATTACTATTAGTAATTATACCIAGTGTATCCTAAGCACAGTACTAAGTACTT 

GGTAAATAGCATCACCTGCATTTTACAAATAAGGCAAATGAAATTCAGAG^ 

TAGTGGAGCCTTGATTCCAAAAGAGATCTGTCTGATTCTAAAACT 

ATTAGTCATTAAAAAAAATAGATTAGGAGTATCACrTAAGTAACATCTGAGTGTAAGGAAGAGAAGCCT 

GAAAATGGGAGAGGCCATTGGAAAGCTCCTAAAGAGTTTCTTCGAATGTC!ATCATAGGAACTT^

AGTTGTCAGGCATTTGCAGAGGAGATGCATCn^AAGTGGAGGCTAGATTCTTAG 

CTGTAAAATGAGTATAGAAAATGAAAAGTTAAAAATTTTTGC^ 

TACACTAAGGTATTTGGGTGG1T1GT1TGTTT1TGGTT 

AAACT AT ATACTG TAT CTTT AAACTGT AGACCT CCCCCCTGC C CTG ACTCAG T ACAG CAAT AGATACTG CAAGTGG CAC ATGGAAA 

CTG AGACC C ACT AATC ATTTCTTAT ATT CCT ATGTTCACTGTG CTT ATTG AG C AT AAGTT CTCACTGTTTAAAT ATTGTTTTTTTT

CTTTCCCTTTCATGAAGTGAATAATGTTTTCTCrCAGTCTTTCCCT 

AG AAAAAAACACCATTTTGAGATGC CATTGTATTCTTT1TGT ATG CTTCTCTGTAAATCT 

ACTCCAGTCTCITTTTGAG^CCTAAAAGACGCCACACCAATTTTT 

ACTCAGTTTTCTGCCACAATCTAGAGAGAAAGTTTGGCTAAACTTGGGTCAGATGTCT 

GAAAATTCACTGTAAGGCIAGTCCACTTTAGGGCTTTGAGTAGGCATGTTCCCT

ATTGCTACCTCAGAGGGAGAGATGCTTGTTTCCTAGTTCTACCACTAATTAGC^ 
TGAGGATCTTCATCTATGAAATGATAATGTCAGCGCAAAGGGTTCCCTTAGTTGTGAAGTTCTAGGG 
AATTTAGCTTTCCAGTGGAAGGATCCCAGTAGTATTAAGATGGGCTTTAT^ 
TCTTACCTTGAATTGTCTCCCAGAGATAATTTTTTATTATTGCTGATACTGTTA 

TTTCATCTTCAGTAGTTCAAAATATTGTTAGTAACATTTCATGAGTGTATTTTTCTA

ATTCTACCTTTTGTAATTTTTCCTTCTGGGCTGAACCATTTTTT^ 

CAGAG AGAG AG CIAGAAATGGTTCTTTTTACCAATGACATTGTGTCAAT AAGT CTAGTGG ACATTTTTCAGTCCTCATCTTACTTGA 
TCTTTG AGCAG CTTTTGATACTGGTG ACCACTCCCTCCATTTTG AAG CATTCTTTTCCTT AGTTTCTGATCCTGTTTATGGTCCTA 
GTTATCATCTATTTCATTTGGTATTCTTTCTCAATATTCTTTGC^ 

6 5 AAATCCTCTTTTGGCTTCAGTGAACTACAC^CGTGCTATTAGTTATTAGGTCCT 

TTCTTTAUT'rCTTAGGC!ACAGTGATAAAAAGCATACTTTACATATGGTTC 
(^GCACTTTGAAAGGTTGAGGCAGACAGATTGCTTGAGC^ 

ATAGGGGGGGGAAAAAGAGTGATTAAACCAAATCTCCCATATGGAGAATTTGCTGCTCTCATTAAT^ 
ACTTGTCCAAGCAGTAAATCCAGATGTGCTATCAACCATGTCCTATATGATGCAA 

7 0 ATATCTTGAATTTCTGGATCATGCCATAAGTTATTGATTGTTTTATAGGTGTT1TT 

AAAGACATTTAAAAAG AAATG CAATGATTAT AG CTAAT AAT AAAT ACATGAG G ATTTTT AAAACATG CCCTATTCAAGACATAACA 
AATGTAAAAACATAATTCATTATTGCAGGTTGATTTTTTTTAAC^ 
GATCATATTAAATACTTCTTTAAGGAGAAAACAACACCTTCATAAAGAA 
TTTTGCTTTTTCATCTGATGGGCGATTGTGTCCA^ 
7 5 GAAGTTGGGATATTTTTCACCTC^TGTACAGGCATACCrTCTT^ 
TTACAAATTGCAGGTTTGTGACAACCCTCrrCT
TATGACAGCCTTTTTTAGCAATAACATATTTTTAAATTAAAATAT^ 

CAAT AT AG T AT AAATGT AACTT ATAT ACACT AG AAAAC CAAAAAATTTTTG TG Al T i"! CTGTTT AG AT ATTTGCTTT ATTGAAGTG 

GTCTGGACCCTTAGACAGTATTTCCAAGGTATGCCTGTAAAAACATTT^ 
5 ATGGTTGTTAAAATTTAGTGTTTTGTGATAAAAACAA 

TCAAATACCTCTATATTAGTCCATTTTTGTGTTGCTATAAAGGAACATCTAAG^ 

TGGCTTACA GTl ' C " rT CTGCA)GGCTGTACAGGAAGTATGGTGCTGGCATCn» 

GTGGAAGGTAAACGGGGAGCAGGCATATTACATGATGAGAGAGGGAGCAAGAGACAGAA 

TAGATCTCACATGAACTAACTAAGCAAGACGCACTTATCAC^ 
1 0 TCTGACACCTCCCACTAGGCCCCACCTCCAACACTGGGGATCACATTTCATC^ 

CAACCrCTGTTCATCTTATTTTCCCAAACTGTATGCAGAAATTTAC^ 

CAATTGAAATAATCACAGATATTTTTGTGTGAAATTITTC 

AATAGTACCTGTATTAGCCCATTTTCACGCTGCTC^TAAAGAC^ 

GTTCCACATGACTGGGGAGGCGTCAGAATCATXX3CAGGAGGTC 
1 5 AAGCAAAAGO^AAACCCCTGATAAACCCATCAGATCTTGTGAGACTTATTCACT 

CCATGATTCAATGACCTCCCCCttJGGTTCCTCCCACAAC^ 

G71CAGCCAAACCATATCAGTACGAGAGTAGTGTAAGTTCCTCAGAATGCCCCTTAATATAAGATAAT 

TAT CAGC CTAACAT AGT CAGTG CT AT GGAAG TCAT AT AG AGCAAT ACCTT C CAAGAATTTGGTTT ATTTTTT ATT AAAG ACGT AT A 
AT ACAGTGCAAGGTAT CTG ATCTT CC AACATTTT CTGACAT ACATG AAAAGT AT ACTGTCATTG AG CTTC CTTT AGGAG ATTG AAT 
2 0 AATAAGAATGCAGTTGTAGACCAAATCACATTACATGTTTATTATCAAACATAGTA 
TGAGATGGAGTTCTGCTGGAGTGCAATGGTCTGATCTCAG 
AGCCTCCCGAATAGCTCGGATTACAGGCATCCGCTGCCACACCCGGCTAArm 
TGGTCAG<3CTt^TCTCAAACTGCTGACCTCAGGTGATCTC 
GCGCCCAGCCTCCTTGTTGGATTCTTAATGTTACTTCTTC 

2 5 GTGGCTTGTCTTCTATCAGTTTTGGAAAATTCCAGGCAATTATCITATCT 

CTGCTGTGACTCCAGTTACATCAATAATAGGCCATTTCACCATGTCT 

GTGGATATTTTCTACTGACCTGTCTCCCATTTCACCAGTCCTCTTTTC^ 

GGTTTATTGCTTAGTTCTAACTi'TITTATTTCATT^ 

TTTAAAAAATAGCTATTTTAGGCTGGATGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGG 

3 0 GAGGTCAGAAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCGTGTCTCTACTAAAAATACAAAAATTAGC 

CATGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAC<5AGAATCGCTTGAACCCGGGA 

CGCCACTGCACrCTGGCCTGCGTGACAGAGCTAGACTCTGTCTCAAAGAAAAAAAAAGCA^ 

CTGCGTCTCTTGTAAGTCCACTGCITTCCTCATTTTTTCT 

CTGGAGAATAAAATATAAAATATATTGGAGATAATTTGAGGGCCTGAATGGTATAT^ 

3 5 CAG GAACTGCTGTTT CT AT CTTGTTGG C CAG AACT ACATCAT ATGG CT ATCCCTG ATCTCAAGGG AGGCT ACAGAAT GT CAGTTTG 

CTGGGCTCATAACTGCCCCATCAAAAAACAAAACACACACACACACAC^ 

TCTATTAAATGATTACAGATATAAGTAAAACTGAACTTTTGTGTGAAAAGAGACTAAAAAGTAT^ 

GTTTCTAAAACTGAAAAGCAGTTTTAAATAATTTAAAACTTTATTTT 

AAATTAAATTXK5TAGTATATACTTTTAAAATCTGAAAATTAAATTC 

4 0 AAATCTTCTGAAATTTCACCAACCAAAGGTGTCTTCTATCAACATTT^ 

ATTTTTATAAGTTAACATTATAGTATGTACCCTATTTTGTGATT^ 
ATATAATGAGTTAGTGTTTTAC^GTTTAGAAATTATAATTTCCTTAATTTAACr 
ACAGTATGTGTTTTGTGTCTGCCTGrrCTGTCCTCACACTTTTCTTGACCT 
CTAAGGAATGTTTTATTTACC^TCAGTGGGAGTAATGAACATATTAGGTAGGCTCrGA 

4 5 CTCATT AAAT ATCCTTTTG ACTTGCT AT AATACT ATGATT^ TT GT AAACAAAATGATT AT CTTTTT CT ACTCT ATT G T CAT AG 

TTTTTATTTCTTAGAACTGGAATAGATCCCAGACATTTCCTCAGAGA 

TATGTT AAAATTGT AAAACTG CAAATTTTTGTGCCGTTGG CTTGG CTC C CTTG CT AGC C AAGTAAT ACAT ACATG CTGT CACAACA 

GAAAATATGATTTCAAAATTCTAGTTGAGTAATATTACAAATTACAGATCA^ 

GTAAATACTAAATGCAC^AATAATGAAAGACCTAGTTTACAAATTATATATGATTT^ 

5 0 AAGATTTGCTTAATTTAGTTCCAGCTCCACGTAACTGCTATTAATTTTTCTACCACAGATC 

TCTCAGGCAGACTTCTGTTTTAAGATCCATCAGTATTTTTTC 
GACCTCAGATTCATTCTTGTATAAGTCCCCTTAAAATAGTGAGGGTGTTTCA^ 
TTGTTCTAAAGAGGAACAGTGACTAGATAATGAACTTTCCTACCT 
CCCTGAAATCTCCACATGGCAATGTGAGAATTTGTCTTGTCAG^^ 

5 5 TTGTTAGCG CAGAATTG CTCAT C AC CAG Ct^CAGGTTAGAAAT ATCAAG CTCT AGATTAT ATTGTTC CTAAG CCT^ TTGA 

GAAGTTGATTGTTTGAATTCTTTTGAACTTACACT^ 
TTATTTTCAGTGTCCCATCTTTCCATCAAGACTCTTGGCCATAT^ 
CTTGTAAGCAGTGACTTACATCTTTAACATCTATGGATTCCTACCCACTC^ 
AATCTTTATTGTTTGATCAAAGTAAGTAGTGACAAATTGAGTTTTATTGGCTGTAACITAGTT 

6 0 ATTTCTCTGTCTTTTAATGAGACTGCTTCTGTCCTTAATGCTGGTTTATCTTCTG^ 

TTATCTCAAAATTTCCTTAAATTTAGGTATTGTCTTTC 

TAAAATTCTTAAATTTGAATACCATGTTTGACATAAGTATTTGTTTCATATCTT^ 

TCATTTAAC AAGTACTTAGTGTAGG CAAAGCCATGTGCT AGTGGT CAGTGGATAAATACTGTGGTG ATTTGAAAAGGCAC ATACCT 
TT AAT ATTTTGTCAG T AAAAT AG ACT ACT CAG TTTGT AATCTTTTTGT CATT AT AGGG T AAAGTT CAC CTTGAATT AAAACTG AAT 

6 5 GAACTGATAACGGAGAATGGAACTGTATGCCAGCAGCTTGTTGTACAGTAAGCATTT 

CTCC^TTTTCTATTCrTTCGATTTTAAGCCAACTCCTTC 
TCCTTTTGTGAAGTTGATTTTCAATTTAATTTTCTCATAGTCT 

TTGAATATGGGTTAAATGATATATATGGGCATTAAGGATAAGGGTGTCTAACTCTCTGGCAGTGGAG 
AGAAGGTCCTCTAGAAATTGTCATACCACAGAGCAGAGCTTTAGAGACTTTGGAGCTAGAA 

7 0 TATTGATTCCCACCTACTTTGAATTCCTTTTTACCTCTTCCACCCT 

TTCTGGAAATCrcTTGTAGCCaUVTCCCCTACTCTTTTCCTCCCCT 
CCTTCACTTCTTCTGGGTGTTTTCCTCTATTTTTGCATTA^ 

TCAGTTAGAGGCTTTGTTTTTCTATCATGTCCACATATTGTAAGGTCAGGGAGTAGAG 
GACTGTAACTTATTCTCATCAATTGTGAACCCCAGCTCCTTTGAGGTATTCT 
7 5 ATAATATTTATTTCATTAATTTTTTCTG^ 
AAGAATAAGGAATATTATCTTAAATAACTCCAGTACAGTTACCAAATTCAG^
TATAGTTATTTTCTAATATTTTAATCTGCAGTTATTATATTCCACTTTrGTTAAC 
TCCTTCTAAGAAAGGATCTGCTCCAAGATTATGTTTTATGTtTAGTTGACATGTCT 
TTGCTTlTCrTTGATTGTATTAACkTT^ 
5 TAAAAATAGTTCAAGATCTGTTTGCAGCTTTTTCTCCACAGACTGACTC^ 
ATTAGGTCTTCCACTCCrCCCTCCCCTGCTT 
TTGCATTCAATTTAAGGGTTrrrrTGT 

AATAGAATATTTGCATGATTTCAAAAATCAAAACTATAACAAAAGGCGTTACT 

CACCCAGCTCCAAGCAACCATGATCTGATTTCTGTCCCTACAGTTTTTG 
1 0 ATGTCTTTTTTGTGTCTGCTGTCTTTGACITAGCATAATC 

GTGTTATTTGTTTCTTGCCATAATTTGX3TAGATACTGTTATCCT 

AGCCTAATTTCACACAGGAAGTGTTGGAGGCTGGCAGTTTGATTTTTTTCC 

CAGAAAGTTGTAAGGATAGTACAGAGAGGTCCTGTGTACCCTTCATTCTl'UT^ 

CAATACCAAAACCAGGAAATTTACATTGGCATAATGTATGTATATAGTTATAGATCATITT 
1 5 ACCACCACCAAGtfVrACGAAACTCTTCTGTCACCATAAAGATATCTCT 

AACCCTTGGCAACCTCTAATCTGTCTTTATATCTATAATTTTGTCATTTC^ 

CCTTTTGAGATTGGCTTTTTTTACTCAGATCTATCTGAATGACTGTCT 

TTAAAAAAATCATTTTTGCCAATCTCTGTCTTTTTTT^ 

AAATATGCCATTTTTTGTTTGTTTTCTGTTTGTCTC 
2 0 TTTAGAATTCCATTTTGATTTATTTATATTGTGTTTTTAAAAAATCT 

AGGT ATT ACATT AT ACATG CAT AACT AAT CATAATGTACTGATGTCAACATTTT AC CARTT AG AAT AAAATGTATGT ACTTTAC CT 

TTCTTTAAATCCCTTTACCCTCCCCCATTTGTAATATACTTTGTC^ 

ATGTT AT AATTTTTGCTT CAG C CATCT AACACAG TTGAAAAG AGTT AG AGGAAAAGGAAAGTGT AAG T CTG TTGTCT CT ACC CACA 
TTTTTGCCCTTTCCATGTTATTTTTTTCCTTTCTGAT 

2 5 TCTTTTAGAGTAAGTCTCITGGTGATACATTCT 

TAATTATACTAGATATAGAATTCTGAGTGGACRGTTCTTTTCTTTCC^ 

TTCTGATGTGAAGTTCATTGTCATTCAAATTCTT^ 

TTGCCTTTAGATTTCTTTTCTTTTTTCTT^ 

3 0 GTAGCTGGAATTACAGGCATGGACCACCATGCCCA 

TGCTCTCGAACTCCTGGCCTGAAGTGATCCTTCTGCCTTGGCCTCCCAAAGTCCT 

CTGCCTTTAG ATTT CT AT ATGTTTTAG CTTG ATTGTGATGT ATCCT GG AAT AGATTT ATTTAATTT ATC CTGTTTGGGGTTCGCTC 
AGCTTCTTGACTATGTATATTTATGTCTTTTGCCAGATTTGGGGA^ 

TGCTATCTITCTCTTL"r'rCI ,, rrCAGAACGCCAGTGG CACAAATGTT AGGTOTT TTTTI ACAGTCCCAGAGGTCCC CCTG AGGCTT 

3 5 TGTCCTTTCCrTTTCCCCCCAGTCTATTTTCCCTCrG 

CTTTCCTCTGTCTTCTTCATTCTTCTGTTTAGCTTATCTGTTGAGTT^ 
ATATTTGATTCTTCTTTATAACTTTTTTCCTTTGGTGAGAGTTTCTTTTGCAGAGA 

AT AATTGTTCATTG CATT ATT CACAATAG CAAAGACATGG AAT CAA CTT AAATGG C CAT CAATCG CAGACTGG AT AAAG AAAATG C 
AGTACATACATGCCATGGAATACTAGCCGTAAAAAAGAAAGAGATCATA 

4 0 TCCITAGCAGACTAACGCAGAAACAGAAAATCAAACCAAATACT 

TGGACACATAGAGAAGAACAACAGACXSTTAGGGCCTACC^GAGGGTGGAGAG 

TGGGTGCTAGCCTTAATACCTAAGTGACAAAATAATCTGTACAACWUVCCCTTGTGACACAAGTTTGC 

CATGTACCCCTGAACTTAAGTTAAAAAAAAATTGCTCATTGAATTATT^ 

TAACAATCATGTCATCTTGGTGTTAGTGTCTTGTGTTAGTCCATTCTCACRTTGCT 

AAT CAT AACAAACACCAAG CGGGAG CAGGTGTCCTACATGGC^VGGAGCAGGtfUVCAGGAG AG AGAGTCGGGAGGT ACT 
TTTTTTTT TTTrrm TTTGAGATt^AGTCTCACT 

GTGATTCTCCTGCCTCAGCCTCCGAGTAGCTGGGATTACAGGTGTGCACCACCA 
TGGGGTTTCATCATGTTGGTCAGGCTGGTCTCGAACTCTTGACCTCGTGATGTGCCCACCT 

5 0 AGGCATGAGCCACCAAGCCTGGCCCTCTTTTAGTTATTTTTAAATC 

GTT AAATACGAGATCTT ATCCATTATATCTAACTATATT1TTGTACCCTCACTTCC CGTCCCAG CCCC^ 
TCTCTGTTATCATGAGTTCAATTGTTTGAGTTTTTAGCTCCCACAAGTGAGACCATACGA 
TTCACTTAACGTAATGATCTCCAGTTCCATCCCTGTTATTTTGAATGACAGGACCT 
TTATGTATATATACCACATTTTCTTTATCAGTTCGCCTGTTGATGGACACrTAGATTC 

5 5 GCCACAGTAACCATGAGAGCGCAGATACCTCTTCAATATACTGATTCCCTTTC^ 

ATTTTTAGTTTTTTGAGGAACCTCCAAACTGTTCTCCATAGTGGTTTTATTAATT^ 
CTTTTCTCCATATCCTCACCAATATTTGTTACGCATGTCTTTTGGATAAAAGCCATTTTAACT 
TAGTTTTGATTTACACTTCTCTGATGATCAGTGATGTTGAGCACCTTTTCATAAAT^ 
AAATATTTATTCAGATCTTTTGCTCATTTTTAAATCAGATTATTATATTCT^ 

6 0 TGGTTATTAATCCGTTCTCAGATGGTTAGTTAGCAAATATTCTCCC^ 

GTTGTGTAGAAGCTTTTTAACTTGGTGTGATCCCATTTGTCT 
TATGCAAGAAATCTTTGTCTAGTCCAATGTCCTGGAGAGTTTCCCAATGl'T^ 

T AAG CCTTT AATC CATTTTT ATTTG ATTTTTTT AAT ATGGTG AG AGAT^ CATT CTTCTG CAT ATGGATATCT AT 

TTTTCCCAGCACCATTTATTGAAGAAACTGTCCTTTCCCCAGTGTATGTTCTTG^ 

6 5 CTACGGATTTGTTTCTAGGTTCTCTGTAATGATCTGTTGGTC 

TAGCTCTCTAGTATAATTTGAAATCAGATAATTGTGATTCTTCCAGTTTTG 
TCTTTTGTGGTTTCATATAAATTITGGGATTrTTTTTT^ 

AATCTGTAGATTGCTTTAGGTAGTATGGACATTTTAACAACATTGATTCTTCCAG 
AGTGTCCTCTTCAATTTCCTGCATCAGTGTTTTACAGTTTTC^ 

7 0 GTTTTATTTGTAGCTATTGTGTAAGCAGTATTACTTTCTTGATTTAT^ 

GATTTTTGTTGTGTTAATTTTGAATCCTGCCACTTTACTAAATTTGTTTAT 

TTTTCCAAATAAGATCIATATCATCTGTGAACAAGGATGATTTTACTTC^ CCTTT ATTTCT ATCT CTTG 
TCTGATTGCTCTAGCTAGGACTTGAATAACAGTGGTGAAAGTGGGCATCCTTGTC^ 

TTGTTC CC CATTCAGTATGAT ACTAGCTATGGAT ATG T ACT AT ATGGCTTTT ATTGTGTTGAGGTTTGATC CTTCTC 
7 5 TTTG AGGG TTTT ATT ATT ATG AAGGACTGTTGAATTTTG T CAAAT ACTTTTTGG L^T CAG'i"i\i aaATGATCATATG^ TT CTTGCCC 
TTCATTCTGTTGATATGATGTGTGaCACT^

ATGATGAATGATCTTTTTAACGTGTTGTCGAATTGGGTTTGCTCATATT^ 
TTGGCCTATAGTTTTCTTTI T 1 TGATGTATCTCTCTCTGGTTTGGTATGAGGGTAAC^CTGGCCT 
TTCCCCTCrCTATTTTTTGGACTAGCTGAGTAGGATTGGTATTAGTTCT 
5 CAGGTCCCAGGCTTTT CTTTG CTGCAAGACTTTTT ACTGTGGCTTCAATTTCATT ACT 

TTTTCATGGTTCAATCTTGGTAGGTTGTATATGTCCAGGAA 
GTTTCTTAAGCTTTTGCTCTCCATACTCACTGTCTTGTAGCAG 
CATCCTTATAGATATTCTGAAATTCATGACATTCT 
CTTCTTGCTGGACTTTGGTTTTGGGGGAGTAAT^ 
1 0 TCCCCACTACAACATTTTTTGAACTITAGTGAGACCTCCCTGCCATCCA 
CATCT1TTTCCGTGACCTATTGCTGTAGTCATTTTCTCGCAG 
GAAAAACTCTAATTGAAAATGACCTCrrCCTCTGTGTTCAATCTGCACATG 

CTTT AAATTTATCAT CACAGG C CTCACATGGGCACTT AT AT CTG CC CTGGAATTTAACTGTGCTTCTCTAATACAGCTATAAGATT 
CTCTACCACCACAGCCTGGAAATTGCTTACATCTTCCT 
1 5 GCAGAGGAGGTACTCCTTTTCTTGACATTGAATTCTTCCATCTGAGCT 
TCCTTCAGTTATGCCTTCTTTTTCTTGTTTTCTCAGCTTTACCTrCT 

ATCTCTGTCCTAAAAGGCATGCCTCCCCAGACCTCATCTCTGCCCACCCATACCCCACTCCTTTTTATCATTCCC^ 

TGTGTGTGTTTGCAATTCTCCAGTCAGCACTCTCCAGTAGATTTTTCTGTGATGTGGAAAATGC^ 
2 0 CAGTAGCCACTAG CCATGTGTGGATATTGAAAACTTGAAATACAGCTAATGTGACCAAAGAG CTGAAGTTTAATTTTATTT AATTT 

TAATTAAATGTAAAAAACCACACATGGCTAATGACTACAGCATTAGTGCAGCTTCAGTAAAAGAACCTGGACT 
AGCTTATTTCAGG ACTTGGG CAGGG AAAATATG AGATGAGTCTGGAGCATCTTGGGGTCTCAAAAAGTAAGAAAGTG CTAAAAAAT 
AATGGGGATATGTAAAAAGGtflCATTGAAGCCAATGTAAAGGGACTTC 

TAATAATAGTGGATTAAAACTTATAGAATAAAATAAATATAAGGAAATACTGGTACAAATAAACAA1TAAA 

2 5 T AGGGAG AAAGGACTC ATTTTT CCT AT AGT AGAATTCTAGTT AATGAATGT AGG AG^ CA 

AACATCACAATAATAAGTGGGTGAGGGGACAATTCAAAAAATCATCTATGGATGCTAAGTGAGCATAAGTG 
TGATAGATGCCACCTTAATTGAGTGATGAAAGTTAACATCACCAATAATC 

GGACACCAAGATACATAACCTCAACCTAATCATGAGTAAACATCAGACAAACTCAGATTAAGGA^ 
AATACTGGCAAGATCACCGGGCAAAGAAGGAAAACTAATGAACTG 

3 0 CACTATGGGATCCTGTATTGTGTCCTAGATCAAAAAAGGACATTGGTGAAATTTGAGTAAGGGCTACAGATTTCAGTAAT^ 

GTGGTTCTACAAAACGTTAACATTTGTTGATGAGTATACAGAAACTATTATTTTAACAGC 
AAATAAGGTTTTTAAAATTGCCTGTACCTTGTGCATTCA^ 

AATCACATTGCAAACTTAACACATTTAAAACTGAACTACAGGGAGGCTGAGGCAGGAGAATG^ 
GCGGGGAGCCGAGATTGCGCCACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCTC 

3 5 TCTATCCAACTCCTCCCCCAAAGTCTTAGCAAATGTTTAATTCCGCTCAGTATGTTTTTCATCT 

TCTTCTGTGTCTGGATGGATCATATGGAATATAATTATAACITTTAAATGTCTTTGTCTGCT 
GATTCTGTTTCTATTGATTTTTCTTTTCATTATGGATCATAATTTCTTAGTTC 
GACATTGTGAAGTTGCTGGATATTTTAATATTTCTATGAATGTACTTGAGCTTTGTTCGGCACT 
TTTGATCCrTTTGTGGTTTACTTTTAACTTTTGTTAAACAGACAAG 

4 0 GGCAGTACCCATGAATTACCAGGTTITCTACTGTGGCTGATGGTACCTCTTGCTAGCCATGTGTGGG 

TATTCTTTTCAAATGGTTCTTTCCCCAGCCTCAGATGATTrCCTC^ 

ATTCTTGTAGCTCCCTGGAGGTTTTTCTTTCTCTGTGCAACTTTCTTCC 

CCTCCTCCTGCTGACAGCTTTTGCCTCTTCTTCTCAGAGAA^ 

CACTGCTGATTTTGTTCAATTCGTTTTGGGTGGAGAGTAAATCTAGTTCCTGTTACT 

4 5 TTTCTTTTAGTTTAAAAAAAAATCCAAAGAAAAGAATCAGATCAGTCTTAT 

ATTTTAGATACTATATTGTAAATGCTTTATATGTCACATATAAATAAAATATCAAATTCCTTATT^ 
TTTCTGCCCATCCATGATTTATGATATGAAATTGCTTACTTACAGATTC 
ATTTCCAAATATATGTAATGAATAACCTATATTTAATAATATTA 
TAAAAATTAAGAGTACTGATGATTTCTGTAGGGTGCACTTCTGGTTTATGGGC^^ 

5 0 TTC^TAAGTTTTTTATAAGTCAGTACTGCATTTTCCTCTGCT 

AGGTTTCTCATAATTCGTGTTTATGTGAGATAAATGTGGTATTACAAGGACAGTA^ 

GAAGGGCACTGGACTTCAGTAACAAAATTAAAAAGTAGAAGTTGAGTCTTGGCCAGCTTCTTA 

CTTACCTTCTAACTTCCTCAGATAACCTGTGAGCCTTACTTCTTTTA^ 

AGGTGCATTTTGTGAAAGTGCATAGTAAACTGCAAATGCCGTGTCAGATATCATTTTGAGGTTTAGCC 

5 5 AGCAGTATGTACAGTGGGAAAATCTGACTGTACCTCAACCCAACATTAAAACCCAGAATTATCT^ 

GTTTCAAAATTTCTTTCTTTATTTTTTTATGCTCAGCTATAATAGAAATGAGTCGTATTATCATTTCTGTCT^ 

AC CAAT AGCACTTCACT C AC ATCTG TG ACCTGTGGGTTGTGG AG AG AAAT AAGG ATGTACTCAACAG CTTG CATCTTT CAACAAAG 

AAT ACTTTTC AGCAACACT CATG AGTTT CTCTT AGGCATT AG AAAATG C AGAC C CTT AG ACATT AAACT AAACTCAT AAACT CAG T 

AATATTTAGTGAGAAAATTGGAGGAACAAATCLACTAAAGTTAGATTCT^ 

6 0 TAGATTAGATGACCTTGGTTTTTATGGTCTAAAACATTTTCTACAGAATGGTGCTTACTG 

AAAAGGCTGCAGAATTGTGTG AAGCTTG CCCT ATATTTATG AATCAG ATTTTT AAGTGG AAG CTGCAGTTGTTTCCTTG^ 
TTCATGCTTTAGCAGAGGATATGATGCCATAGAGCTGGCTrGGm 
TTTAGAGGCCACAGCAGTTTTGATGTATTATCAAACCGCAAAATGTACCTTGTTGGG 
TACACATCCCTTATCTAGAGTGCTTGAGACTGTTTTGAATTTCT 

6 5 GCCCATTAAGCATCCCAAATCCAAAAAGCCCAAATTTGAAATGCTGCAGTGAGCATTT 

AGTTTTGAATTTTGGAGGACTTTGGATTTTTGAATT^ 

TGTTTAGTCTGATTTTGAAAATACAGCATAAATTTAAAAATGCATACATACATATGT 

GTAGTATATAGCCATACAACTt5CAGACACATGGAAAACA^ 

TATCTnaTTTAACTGGAATCTAATATGTTAAATCTCTTTTATTCTAC 

7 0 GGCCAAAGCTGTGACCCTTATGCAACAGTTTCTCTAGTGGGC 

TTGATGTTTGATTTAAAAATTTAAAAAGTAAGATTCTACGAGCAAATGATAAATAGCT 
GAAAGAAAAATCTTTAGTGAGTTCTA1TATACACTTGTCAATAACAAAGCCCTCTT 

AATCTGC CATGCTCTT AGGCTTTAAGG AACAACTT CAG TTTT AT ACT AGT ACIAATGT AGT ACITT AAAACAT AAACTTGAC^ C 
ATG CATCAGCATT CCT ACTTTTCAAAT ATCCTC AATTTTATG CATGTAAAATC CTGTTTTAATGAATGC CAG TTCAG AAT AACAAG 
7 5 ACAATTT AT ATATTTTT ATCTCGAAGG CAG CT CAGG ACTGAAAATTG CCTTTTCT ACTCATGAAAT CTG TCCTCT ACAGAGAG AAT 
TCCAACTGTAGAAAAACTGCTGAM3TACTAGAGCATGGCAA
AAAACCTATAGCACTTTGTAATACGGTGTGTTATATGACTTCATTATGATTTCT 
AACTTGAAGGTTTAGAATCAGGAAGCAAGAAACACAGTATTCC^ 
ATCCCCAGAATCTCTAGTTATTCTTACrTTGTAGAAAATCACTACTAAG 
5 ACCTAGT ACCATG CCGGT ATTT ATT AGG TG CT CAAACAAT AGT AT G CT ATT AG CAG AT ACT CAGTCGT ATG ACATT AAATG CT ATT 
AGAGAGATTTATTTTTCTGTGTTAAAAATGTTCAATCT 
TTGGTAACATACAGTTAGAGACAATTTGGCATATGAATTTGTGTT^ 
CCCAGCACTTTGGGAGGCCGAGGCGGGTGGATTACCTGAGGTCAGGAGTTC^ 

CTACTAAAAATACAGAATTAGCTGGGTGTGGTGG CTCATGCCTGTA^ CCAG CT ACT CGGGAGG CTG AGG CAGGAG AATCG CTTG 
1 0 AACCCTGAGGCGAAGGTT ATGGTGAACCGAGATTGTG CCATTGCACTCCATCCAG CCTGGGCAACG AG CAAAACTC CAT CT CAAAA 

AACTAAAAAATAAAAAGTCACTGTTGGTTTTGCTATTGTACT 
ACAAGATTTCCTTATGCGTTTTTTGTGACCTCCTCTC 

G AAG C CCAAGAAGT AAGCTGAGTT AAGAATGTTTTTCAAGTT CCTTGGG^ CTT AT 
TGTCTTCCTTCTATTTATTCTGAGTTGCTTATAGAATAGAATGTGTT^^ 
1 5 GTAGAGAGAATAATACAATCTTGGCACATAGGGGATGCCT 

TTTCOVAGCAATATTTAAAACTGAATCTTCCCTGAGTCACCTTAA 

TTTTATAATTGTGTACTGTTAAGTTCTACAAGGTAAAACTCT 

AGAAAACAAGCAATCCGCAGTTTAATGAAATCTTTTATT^ 

T ATTTAATGCT AGTTGATTT L T TT 'lT T CAAAC C ACAGTCAT AGT AAT CATTTT ATCAATT AATG CAG AT AAGCATTTCTCT CCTGA 
2 0 AAG CAGT AGAACCATGGTGTGATGCAATGGATGCCCAA^ 

C^GGGCCATACCCACTCCCCAGTCATCCTC^CTAT^ 
ACATAGTGGCCACAGGATTCTTCTGCAGGGTACCCTTTGACAOT 

T C CAGT CAAGTTG ACTGAAAG AGT CACT AAATGTGACT CAAGTTTCTTATCCTGTG AAAG AAT AAT CGG AAGGG AAACTAGCAGCA 
CTACCTGACTCAGAGTTTTTGTAAGACCCAAAGCATGTGAG^ 

2 5 TACTTTTTTTTTrAAGCCTAATTCATTCTGTGGACG 

CAGTTTGGACAAAAGCTTTGGAGTGCTTTTCTTT 
AAAAAATCAAACTCTCAAAAAGATTGTATTCTAATAAAGCA 
TTAGATTTTGTGCCTTTTCTTCTGTGTGTATTTTTC^ 
GCTCTACTTTAATAAAGGAACACAGGCCGAAGAAAATGCCA^ 

3 0 AAAAAACAAAAGAAAAAGTACCAATGTGAGACTTTTCCATTTCCTC 

GGTTGCATCTAAC CTGG AGAAAACAAAGAAGGCAGCAATGTAAAGGGCCGCTG ACTAAAGG CACTCACACTCCCCATTAGCCCACA 

TACCITAAACCTCAGCTGGATGGAATCTACTCTGCTTCAT^ 

CCTGCCTGCTGTAGCAGGAGTACTTGTTAGGCCTTGCAACATTC^ 

CATCTCTGTC CTTTAACTG AAG CAAGGAACACAAAG CAG ACAT ATGTATG CATAT ACACAACAGT AAAATG CAGT AG AT ATT GGGG 

3 5 AGGTCAGATGAACTCTGCTCCATTAAAGGCTTGAAGAAATATTGGAT^ 

CATTTTGAGCAGAGGCACAGAGACAGAAAAGCAGAAAGTTATGTCCAAAG 
AAATAATGGAAACTAAGATTAGAAAGGTAGAGTAAGACCAGCTGATGAAAGGCCTTGAGTGCCT 
ATTTTGAGCCrATAAGTTATGAGTAAAACAGTGTTTGGGAAGOAATTAACTATGATCTGAGGTC 
CAAAGCTGGGGAGACTGTTGTACTTTAGGTGTAAGAAAAATGGGGAGGGGATGGGGTATCrrr 

4 0 GAGGAAGAGGCAGATAAAAGAGAGAATGCAAAGGGCAAGGTAACAAATTGGTTATAGAAAAAAAG 

TGCCCCCAAGGTTCCAAAT CTGG AAAAATAGAAG AGT AGGG AAG AAGGAAAATAAAA 
AGGGCTGACATTTTAAATCCTCCAGTAGGGAGTTTATGACrcAAACTTAATATGA 

TTT AT AT AAACAG C CAAG AAAG C CAAG AGAAT AG AGGT AT ACAAT ACAGG AGAAAAGTGAAG AAACT AATTTCT G CGACT CAAGT A 
TTT AACTTTCAGC ATGT CATTT AATTT CTGT ACTTCAG CTTCCT CATTTTT AAGTGAGGG T AAT AAAT CT AG CC CTGT CCG CCTCT 

4 5 GAGCAGCGTTCTGATGAACAAAGGAGATAATGCATGTGAAAGCACTTC 

AGTGCAAAATAAAACATAAAATTTTGAAGAATAATTTATCTATAGATTCT 
GGTAGTGCCCAGGTAAATTCTATAGAGTGTTTCTGGGAAAGGAGATACAGTTGTATGATCA 
GGTAAGAACTGAGAATTTATCGTCAGATGTGGTGATGAAGAGAGATTTTAGGAGAGT^ 
GTTACTATAGTGAGTGGGTGGTGAGGAAGTACAAGCTGTAAGGATGGATTGCAGTTTTCAG^ 

5 0 GGAAGACCATAATTTAAGGGGAGAATTITTCTTAGACTTGAGTGAGATG 

CCTTATTCTTTTCTGCTACTTCCATAGTTGCTTTTATATTCAGCTA^ 
CCACAATTGAATATTTTGCAGATGATCATTTTTAAGCTGTAGGAAA^ 
TTTACAGGTAACCAGATCCAGTAGTTACACCAGAAAGTCCCAGTTCCA 
ATGTGCCTTGGGGTTTTACAGAATTGCTTTTTTGTAT^ 

5 5 AGAATTCTCTATAGCAAGTTGGTTGATGAAACTTAGAGGAATATGCAACT 

GTTTGCAGAAATAGTCAAAGTGTATTAATTTTATAAACAAAGATGATCTAATTTTCA 
TCCCAAATTGTTCCTTACATTTCTGGATTATAGTGTAATGGATGGAAAGGAAGTTATAT 
TTCTTCTTAACATATCTGCTGTCTAACAGAAATTTCAGTGAGGTAAGAAGATAAT^ 
GTTTCCTGC^TCTTAAATATCTATGCCCTTTGCGAAGGAAAAATCTTATAAATGCATGTG 

6 0 TCTAGATGGTATATTAATCATGTGTTTGGAGTGATCAGGTAATGTATTTTTCTAATTTTTCT 

CTAGAGAGAAACTGTAGTCTTATTGTAGTTTCTTTCT 

GACAG TTT AAG TG CCTTCTTTCAG ACT ATTT AAAATTACT^ CTG CTGT AGT ATTTTG CTTCT CT A 

GTCTC^ATAGTGAGGTATGTATATGGTGTTGCTAACATCTTTTAAGCATGTAAAATTAATGCTG 

AAGCTGTTTTGTGGTGTGAACTCTCTTAAACATTGTT^ C CAGTTTTCAATTTTCAGGAT CG ACTTGTGG AACA 

6 5 ATGGAAACCTAGTCCAAGATGTTTTCCTAGGTGAGATTAAGGTTCCTC 

TAAGGGCCGAGCATTTTAGTGAACTCCATAGTTTAATTCTCTTTAGTA 

TTTTAAATTTGCCATGTAATAAGCTAAACGTCATTAATATTGATATG 

ATGACirGACCTGTGGTACTGAGATAAAAGAACTGAATCATCT 

AAAAATGGGAAACATTTAAAACATTACATAAGAGTTGTTCATGTAAGAAAACGCTTGGT CACAAAT CAGTGTCTGCATTTCAGAAG 
70 TTTTATTAGTGGTCATAGGAACTGTGGGTATTTGAAAGCAAAGGATTTTATT^ 
01 T CT TCCTTCACCACACCCCATTGAAATGTACTGTAGA 
CTGGCACTTTTTAGGGCTCTTGGTTTTCAATAATGACT 
TGTTAAATTCAAATTAATGTTCAGACATTACATTGAGCTA 
ATAGCACTTAAAAGCGTGTGTATCCTTTCAGTATGTTGCCCAGC^^ 

7 5 CATGCCTTTCTCATTTGCCTTTTAAGGAGCACITTT^ 
GGATATCAGTTTCATCTGTTCCTAAAAGAGCCTGCCTCTTCT
AGTTATCACTTGTAGACTGATATGGAGACTCTCAAAATTTTTACTTTATTTT^ 
ATTGGTCTTTATGTTGGCATTTTGTGTATTATAGTTTGTAAAATGC^TC 
AAAATTGATGAAGATTTGGGACCTCTCATCCTCCCACTACACCAT^ 
5 AATTTCTGAGCTCAGTTTGTCAAGTTTTTAAAATTATC 

GGTCCCTCAAAATGAACTTGAAATGTGCCACATAATCTGTTTTTTAA^ 
ACTTTGTGCAGCTTCCTTGGCAGTATGTAAGAACATTAACAAACT 

AG T AAC CTG CCAGAAAT ATAAGTG CTGAAATTT AT AT AG CATGAAAT AGGAACAT CTGGT AATTCAT AT AT AATTC CACAGTTGT A 

AGGAAACATAAAAATGTGAATGCCATTTCTTTTAAGTTGG 
1 0 CCCTTGTGGGGTTGTTAGTAACTGCATGAAAACTCAAGAAAAGATTTATTTACCCT 

G ATGTTAAGAGAG T AATCAT CAGT AATTT AATT CTGTTGTTTG AAAAGTCT CAAAAT AG ATTAA ri 1 1' T AAATTTGAAAGT AAGAT 

TCAAAT AAT AGAT AAAATGAATT ACIKXXnTTTT CTTTTCAACAGAAG CACITT AT ACATG ATTTT AG ACT AG ATG AAT ATTTGAT 

ATGATTTAGCTTTCCAGTTTGTTAATTTTGACACCTTTTC 

CAAAGCCTCACTCTGTTGTCCAAGCTGGAGTGCAGTGGCATGATCT 
1 5 TCCTGCCTCAGCCTCCCAAGTAGCTGGGACTTCAGGCACACACCACCATC 

TTTAACTATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTTCCT 

CATTGCGCCC^GCCTTTGGTCTCACATTTTTAACGTATCTGTTTAA 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNIWNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

GCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTCACC^ 

TCGGCCTCCCAAAGTGCTGGGATT ACAGGCGTG AG C CAC CG CACCTGG CCCACG TT CAGTT ATT AATAGAACTGCTTCTTTATTCT 

2 5 CTCATTTACATAG CATT ATG T ACAAAT AATTG CACTTTG GGTTTG TCT ACCTCAATTTT AATT ACACAG ACC CTATGGTTAGAATT 

GTTTAAATGCTGA lTlT TTTCTGTTAAGGGACTGTCAAAACTGCTTACrCIT 
AAAAGTACATGTTTAGAGCAACTGAATGCTACTTTCAGTTAAAAATTTCTC^ 

ATTTCATTCAAAAACTG ACAT AT AT AT AT ATGAGTTTTTTGTTTTG'IT'IT CACTT AG AAATG C CAG AT AGTC CTGAACTATTT AAC 
AAATCTTGAGCCITTTCTCTTTAGCrrCTAATTCATACAGAATTATTTAATT 

3 0 TTTTGTAATAATATTTTAAAATAATCZACATTTATATAGTTATACAAATGACTATTGATAG 

TTATACATCACATATATAAATGTTATATGTGTAACATTTACATAGTGATTACTGTGTGCC^ 

T AATAAcrrrrri' AG GG CTGTTGTAGGGTTTT AATGTGTTAAG ACACT ATTAAT ACTC CTT ATTTG ACAAATGAGG AAAAAG GACA 

GAGAAATTTAGCAACTTGTCTAAGGTCACACAGCTGGGATTT^ 

ACTGTAAGTTTGAACTCTGGCATATTTCTGAATATATTTAAAACTCZAAAAACATTC^ 

3 5 CCTTGCTTATAAAATATTATATAAAAGAGAATCAGAGAGTTCCCTACTCATTTAAAAAATGA 

ACAACATGTTGTGAAGTCTTTTTTTTTTTTTTT 

AAAATCCAAGGAAACAAGTTTAGAAAGTATGGCAGGAGTGAACCCTTCTGGAGTTCTTAAG 
AATGAGTAATTGGTATAGCACTACCCTAATTCCTCTACATA^ 

ATGAGAGTAAAGAGGGATGTCATACCCCTCCCTOVGTTATTTGGAGCTAAAGGAAATAA^ 

4 0 GTAAAAACCAAAACAGAGTTTGTTTTGACTGTAGAGTTCAGAGA 

C^GTTGTAACIXIAGTAAGCAGTAATACAAGTAATGTTAATCT 
CTAGAGTAATAAAAAAGGATTCCAAAGAGGTAGGCATGGTACTGCCTATTCAAGC^ 
AAGAATTACTGTTTCCTTGGCAAGACTGTTGTCTTAGCTGAGAGATTCTTAAGGCT 
CACCACAACGCCITTTGATAATTTGCCTTTCCAAATAGC 
4 5 ATGGTAAATGGTGATAATTGTTGTAACCAAGTGACAAGCACATGGG<3GTGGTGGTAAGAAAG<K3 
TCTTCTCCCTACTTTTGTAAGTGAGCATTTCTATAATAAAATGCATTGG 
ATTTCTTTTTCATAAATTTAATAATTGACCTC 
TCTT TlTTU " l^Al w X " X " lT TTGAGATGCAGTCrCTCTCrrGTCGCCCA 

CACCTCCTGGOTTCAAGCGATTCTCCTGCCTCAGCCTCC CAAGTAGCTGGGACTACAGGCCTGTG CCACCATGCCTGGCTAATrrr 
50 TGTATTTTGGT AG AG ATGGGGTTTCACCATGTTGGCCAGCCTGGT CTCGAACT CCTGACCTCAGGTG ATCCACCCGCCTTGGCCT C 

CCAAAGTGCTGGGATTACAGGC<KSGAGCCACC^CACTTGACCTTCAGGTTCACTT^ 
TTGCAAGGTTGCCX5AACATGCCCTGGATTGTGAAAAATTGAC 
TTCCTGATGTG<3TTTCGCTGTGTCCCTACCCAAATCTCACCTT 
GATAATTGAATCATCGGGTGGTTTCAAATCCCCATACTGTGCTITGTG^ 

CTCCCCAGCCGTGTGGAACTATGAGTCAATTAAACCTCTTTCCTTTATAAATTACCCAGCT 

AAGGACA.GACTAAATACACTTCCCTTAGAGGTACTTCTGAGTCACCTTTG 

T CTTT ACAT CACTATTT CTT AAATC7VGACAGTAGATGTTCTG 

TTGT ATTAACCATCCCTTCATCCnTTAAATTTGCCTTGGTT AATTCG CT AGTTGTCTTTT CTGTCTTTATCTTATCTTTCTCAAAG 
6 0 CTAATAAATGTTCTATTAAAGCCTTCCTCCTTCTATTTTTAAATC 
ATTAAGTATTAAAACTACTATTTCTTTTATAAAAAGCATCT 
ATCATTTTGATGATTAATGGCTCTAAGCCGTGATCATCAGTGGCT 

CTTGGTAGAACT ACACAG CATT ATCAATG AAG CATTCTTG CCAG AACAT CAAATCTGAAT CT AACCAAG CCT CTAG AT CT AACT AC 
CAATATAGAGGAAATACAGAGAGGATGAAGGAATATGGTAAATGACATTGTGGAGGTAGAGATAAGAGTTAACA 

6 5 TGGAAAACTCTATAGGACCAACAGTGAGGAATCTATAAAAGAGACTTAAGAATGCTATTACCAATTCTAATCT 

GGATTCGAACAAACTGTAGAAAAACATTTATGATAAGTTTGGGGAAATTTC 
TATATGAAATTATACCTGGAAATTGCTTrAAAA^ 
TCAGTCTTTTATTAAAAAAAAATCAGTGTATGTATTTTTAAA 
GCTTITTTCTCATGTTAGTTAGAAACTTCTAATTGCTTTCAA^ 

7 0 CTCTAGTCAATATCCATGGGAATGTCTACTACTAAGCCCATGTC 

AACAGCAATGCTTAAAAAAACTCAAACTGCTTAAAAAT 
CIX3GGCTTTGTAATTTTAAACTTTAGAAAGGCCT 

GGCCAGGCATGGTGGCTTACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTA^ 
AGCCTGGCCAACATAGTGAAACCCTGTCTCTACTAAAAATACAAAAACT 
7 5 CTCGGGAGGCTGASGCXSGGAGAATCGCTniAACCCGGGAG^ 
GTGACAGAGTGAGACTCTGTCTCAAATAATAATAATAATAATAATTCCAAA
TCTCAAACAGTATAGGGGAAATATTGACAACATTTAAAATGCATTTTAAT^ 
TAAAATGCATTCTTAATGTATTTTACATGTATTCACAAAATATAAGAAAATC 
GAGAAAATCCCTCCCTTTACTGTAGCAAGTGTATACCATAGAGGCAGTGAGTAAGAGAG 
5 AAATGGAAAGACAAACTCTTCCTCAGTATAAGTAACCAGTGCTATAAATTO 

ACAGAGGTTGAGGATTTTCTGCAAATAATAAAAGAAATATCAGTGCAGATTATT 

AGC ATTTT AGCAGGACTTG GGT AACAG AGGACTGAGAG AG AGAATTT CC CAGACC CAAAAAGCAAAT A CT 
CTT AGT AAGG AGGG TTGAAATT AAATTCAAAATTTTAATGTTTGCTGTTT ATTG CTTTCTG AACACAATT ATTTTT AAG TT ACTTA 
TGGC CTATT AAG CTGTAT AGCATTTT C^CTTCACAAGACAGTACAATT ATTTTTCATT AT ATTCATAAAT ACCT 
1 0 TCGAGTCCATTCATTGCATGAATATTATGTACCCGTTAAAAAGAACAAACTGGATCTA 
GAAGTCCTTTCAGGTAAAAAAAACAAGATACAGGAGAGTAGAATATGATAACA 

TTT ATAAATGTGT ATAT ATGCACACATTTT AAAAAAT CT AT AT AAAATT AT AT AAGCATGAAGAAAAT AG AG AAGGG TCCAT AG TA 
GGTAGTTAACAGGTTCCTAAGAGAAGCGGAGGTt3CAGAATtK3TAGTCT^ 
AAAAGAG AAAAT AAAGAAAACTGTTCCTTAAAAAAACAAAAGCACT 
1 5 TTTGTCTATGTGTAGTAAGGA TTTTTT AATATTAAATAA 

CACAGATTCTTTCTTGGTTCTTGAAGTTrGAGCTTCACTACAA 
GTTAATAGCAAATCCAAATTTGTAAATTTAATTTGGAGTTATT^ 

AAAAAAAAT ATGCAG AT AAAT ATTT AT AAATAAATTCCTTT AAT AT ATTCT AATTG TGAT AGTTT CTGCAG CAT AATTT AAAATT A 
TG CT ATTTT CAGTGT CCAAATAAGGATGTT AT CAGTGTT AGTATTG ATCATCTGG CTGCAC AT AG T AACTT AAAAT AATTTTGTTT 
2 0 T AGAATAT ACT AATAAAAG ACAT AT AT AAT ACAGTTTTT CTG CAGTTTGGTTTCTGGTT AT AGTT AAGT AAAG CAAAGATTCACT A 
TTTT AT AT ACAGCACAAG AAAACGC CTTTIT AAAAT AATG AATTATTGAAGAAT AAG AAT AC AG TTAAG TTTTTTT CCTGGATGTT 
TTTAGATATTTTGCTCTAATGGAATAAAGGGGATGAAGTATTTCTAATC^G 
GAACAATTTTTTAACTCCTGGAACTCTTGTGATATCTTG 
GGCTAAAGCCCAGCZAATTCTGAAGTTTTAATGACCAAATTAGTTG^ 

2 5 AAATTGCCTTTCTCTAATCCTGATATAGTCTAGAAAAAAATAGGTAT 

GCATATTGCTGAGTATCCCCCCAAAAAGGAAATATGAATACTTGATTGTCA^ 
TTTTGTTTTGTTTTGTTTTTTAATTTTAGATTCAGGGGTATAGGTA 
AGGCrTATTATCCCTTCACCCAAGTAGCGAACAAAGTACC^ 
GGAACCCCCAGTGTCTATTGTTACCA.TCTTTGTCTTTGTGCATACCCAGTC 

3 0 TTGGTTTTCTGTTTCTGCATTAATTCGCATACXK3TAGTGGCCTCCAGCAG 

TCTGTGACTGCGAACAAGATTAGTCTTCAGTTGATAATT 

TTGTCTTTTTAAATTCCCATGATAAAAAGCITTTTTT I'lri'T'l "I 1'TTAAACTCTTAGATTTCTGGTtTTGGTCCTGCAAATTTCTC 

TTAAGTrCAAATTTTTGGAAAAAGAATTTACTTTAAAACAAGGT^ 

TTGAG ACGAAGTCTTG CTCTGTCAT CCAGGCAAGCCCAG ACTGG AGTACAGTGG 

3 5 GAGTTCAAGTG ATTCT C CTGCCTCAGC CT C CCAAGCAG CTGG ATT ACAGG CACCTG C CACCATG CCGGACTG ATTTTTGTATTTTT 

AGTAAAGACAGGGTTTCACCATGGTGGCCAGGCTGGTCTCGAACTCCT^ 

CTGAGATTACJVGTCATGAGCCACTGCACCCGGACTGTCTTAATITTCAGCTAAATCC 

TATTTTTGATAATACATTC^ATTTTAAATGTTTACCTTTTATTATCA 

CT C CTTTG AGATCTT AATTTCTGT AAT ATCTATGCTTTTTG TTT CCTGGAT AAAAATGTG TTT AT AG CTTTG T AATCCAATGT AT A 

4 0 ACCAATATTAGTTTTCTGAAGATt3AGTATCCTGCTACTTATGA 

TTTTTTATAACTTAACTCTTAACTTGCCTCATTGGCT^ 
TTGCTACAGCCAAGAGACAATGGAAAOVAGTCATCCAAAACTGATGACCTG^ 
CTACGTGCTTCCTTCAGAGTACTATGGTCCTTTGAAAACTTTGCTC 
GATATGATTTAACACGAAACC^CTAAAATAATTCTATAAATGATAAGGAAAAGATTTCAA 

4 5 AAAATTATATACATACATATATATACACACACACATTTCTATTTATCTG 

TCATCTCTGATAAAAAATTTCGAAAATTAAATGTCCTCT 
TTGAACAGTTAATCATGTTTCCTTGATGTlTGTGCrTGTTT^ 

TTTGAGTGAAATATGTCGAGATAAAAATGATGCTGTTTTGCCCCTTGTACGACTGCTGCTGCACO^TG 
CCA.CTGCTGTGGCTGAATTAGACTTGAAGG AT ACAC AGT AAG AGTT AT ATTTT ATT AAATG^ 

5 0 CCTTAGTTTGACA^ATTGATTTTGTAGTCTC 

CAAATAACAAAAGGCAGTTAAGCTATCAAACAGTATTCACCTACCre 
GGAAATTTCATTTGTTACTAATCTGCTTCTTAGTGTAATGGTTCAC 

TTTATTCATTATGTATCACCCTTATATGCATAT6TATGACATTCTAAAAAGCAGTCAAQTGACTTCT 
CACTGTTGTTTTTTTACACACAGAGACATACATATACTCCACATT^ 

5 5 ATATATATATATATATATATATATATATACACACACACACACACATATGT 

CTCGCTTTTTCACCCAC^TTC^AGTGCAGTGGCATGATCTTGGCT 
CTCAGCCTCCT13AGTAGCTGGGATTACAGGCACTCGCCACCACGCCCAGCTAA 
ATGTTGGCCAGGCTAGTCTCACACTCCTGACTTCAAGCGATCCAC^ 
CATCACACCTGGCCATTATTTTAATATATTTTrGGTAAGTTTGAAAAA 

6 0 ACTAAAATTGTAGGTCACTTATGCCTT CTAGTATGTCTTAATCTATTTCAG CT AAATG CTTCTTTCTT AAATTGGTCAAGTATGTT 

ATATTATTGGTTTCAAAACCTTTGTTTACTACATTTGGTTTCATCTATT^ 

TTCCCTGGCTACC CGATGT CTGGATC AGATC ATGAAAATAGTGGGAGGGCACT ACCTG AAAGTAACATTAAAACCT 

AGGTACAG AATATATCTCCAACAATG ATAGTTTGTGG CCTT ATGATAGTTGTTCTTTTATCAAGTGAT CTATTG ATGGGAAGGATT 

TATTCATTCATGTGTTACTGAAAAAATAGACTGAACACTTTAGAAGCA 

6 5 GGTACAGTATAGTATAATCTACATTATTTCTTTGTTATGGGATTTCTATTTTAGCT 

GATATCATtSTTTTAAGAAAATTGTATATGTTATCCCTGCATAGGACATTTrACGCT 

TTT ATT AT CAGAAAAAT AAGTT ATTTT CATTTT GTCAGACT ACTT AG AAAAAAATCATT AACTG AAAAATTT A I T T TT CAGAT ATG 

TGACTCCTCAAAATCCTGTGAAATCGATCCTATTAAATTGAAAGA 

TTATTTATAATTGCATTAAAATTGGTCTTAATGGTGTACCTTTCTCCC^ 

7 0 CTGTTTATTCTTCACATAAGCCTTTATTATTAG 

TCACTTT ATTTTTG TT CTTGATTGT CTCTTTTTGG CTCT AGT AA CTAGTTG CAGCC CC CGTTCT 

AGCCATTCCITTGGGCTCAAACTGATCAGXSCTAAGTGACACAGTGACTTT^ 
TGATTGTGTTAGC^TCACAAATTTTTATTGCAGTGC^TAATAGTGTCACAATC 
TTTCCTTTT(^ATCATTTTATTCCCT 
ATTTATTAAATTAAGTCA






HUMAN SEQUENCE - mRNA 

ATCGCGGCGGCGGCGCCTGCTGCrcCGGCGGCTTCTTCCGACM 
GGACAGTCGCGAGGTTCGAGTGTTGCAGAGCCTGCGGGGCAAGATCT 
AAATGAGAGATTGTTTCTGTACCATAAATTT

TTCAGTGAAGAATTTTACTTTGAGATTCCAAGAACTTTCCA 

TCTCCGTATAGGAAAAGTAGCCATCAAAAAAGAAGACTTGTGTAATCACAGTGGCA 

ACTCCAATTCAGAGGTTCAGGGTAAAGTTCACCTTGAATTAAAACTC 

CTTGTTGTACACATCAAGGCATGCCATGGGTTGCCTCTCATAAATGGCCAAAGCT 
CCCTTCTAGGAATGACCAAAAGAAGACAAAAGTAAAGAAGAAAACAAGCAATCCGCAGTTTAATGAAATCTTTTATTTTGAAGTAA

CCAGATCCACTAGTTACACCAGAAAGTCCCAGTTCCAGGTAGAAGAG 

AACAATGGAAACCTAGTCCAAGATGTTTTCCTAGGTGAGATTAAGGTTCCTGTGAACGTA 

CTCWTACTTGCTACAGCCAAGAGACAATGGAAACAAGTCATCCAAAACTGATGACCTGGGGTCT 

CAGAAGACTACGTGCITCCTTCAGAGTACTATGGTCCrTTGA 
TCAGCTGCTTACATTTTGAGTGAAATATGTCGAGATAAAAATGATGCTGTTTTGCCCCTTGTACGACTGCTGCTGCACCATGATAA

ACTTGTTCCITTTGCCACTGCTGTGGCTGAATTAGACTTC 

CTACCCGATGTCTGGATGAGATGATGAAAATAGTGGGAGGGCACTACCTGAAAGTAACATTAAAACCTA 
GACTCCTCAAAATCCTGTGAAATCGATCCTATTAAATTGAAAGAGGGAGATA^ 

TGTAGACAAGTT ATT CAAT ACAATTGTAAAAT CAAG T ATG AGCTGCCCCACTGT AATGTGTG AT AT CTTTT ATT CTCTAAGGCAG A 
TGGCTACTCAGAGATTTCCTAATGACCCTCATGTTCAGTATTCTGCAGTG
GTAGTATCACCTCATACITTTCATTTGCGACCTCATCATCC^ 
ACAAACTTTGGGAAGCTGGGGGAGTCTGTCCAAAAGCAAGTCAAGT^ 
AAGAAGAAGGATATATTATAGCAGTTAAAAAGTTCTTGGATGAAATTTCATCT 
CCTGTGCACCTGAAAGAAGGTGAGATGTATAAAAGAGCTCAAGGAAGAACTCGGAT^ 

CTGCTTAACAAGCAGAGAGCTCACCTACCACAAACAGCCAGGCAAAGATGCAATCTACACAATCCCAGTAAAAAACATTCTTGCTG

TGGAAAAACTGGAAGAGAGCTCTTTCAACAAGAAAAATATGTTCCAAGT 
AACTGTGTAGAAGCTAATGAATGGATAGACGTACTCTGCAGGGTGAGCCGATGCA^ 
TGTGTATCTAAACGGAAATTGGCTCTGCTGTCAGGAGACTGGTGAAAACACTCTCGGCT 
CAGACATCCAAATAGATATTGATGAAGACAGAGAAACAGAAAGAATTTATTC 

ATGGAAGAGGCTTGTGGAACTATTGCAGTCTATCAAGGACCACAGAAAGA^

TGTAACAACCTTTAAG ACAATTCAGCAAATAAAAAGCATAATTGAGAAG CT 

CCAGTAGTGCAAAATATGGGAGCAAG-GAAAATCCAATTGTTGGGAAAGCATCTTAGAGTTT 

AAATATTATTTT 

HUMAN SEQUENCE - CODING

ATGGCG^CGGCGGCGCCTGCTGCTGCGGCGGCITCTTCCGAGGCGCCAG CGGCG AGTGCGACTGCAGAG CCCGAGGCCGGGGACCA 
GGACAGTCGCGAGGTTCGAGTGTTGC!AGAGCCTGCGGGGCAAGATCTGTGAAGCAAAAAATTTA 
AAATGAGAGATTGTTTCTGTACCATAAATTTGGACCAGG 
TTCAGTGAAGAATTTTACTTTGAGATTCCAAGAACTTTCCAGTATTTGTC^ 

TCTCCGTATAGGAAAAGTAGCCATCAAAAAAGAAGACTTGTGTAATCACAGTGGCAAAGAAACTTGG

ACTCCAATTCAGAGGTTCAGGGTAAAGTTCACCTTGAATTAAAACTGAATGA^ 
CTTGTTGTACACATCAAGGCATGCCATGGGTTGCCTCTCATAAATGGCCA 
CCCOTCTAGGAATGACCAAAAGAAGACAAAAGTAAAGAAGAAAACAAGCAATCCG 
CCAGATCCAGTAGTTACACCAGAAAGTCCCAGTTCCAGGTAGAAGAGGAGGACATTGAAAAGCTA^ 

AACAATGGAAACCTAGTCCAAGATGTTTTCCTAGGTGAGATTAAGGTTCCT

CTGGT ACTTGCTACAG CCAAG AGACAATGGAAACAAG TCATC CAAAACTGATG ACCTGGGG T CT CTTCG ATT AAAT AT ATGTTAT A 
C^GAAGACTACGTGCTTCCTTCAGAGTACTATGGTCCTTTGA 

TCAGCTGCTT ACATTTTGAGTGAAATATGTCGAG AT AAAAATG ATGCTGTTTTGCC CCT^ CTG CTG CACCATGATAA 

ACTTGTTC CTTTTG CCACTGCTGTGGCTG AATT AGACTTGAAGG ATACACAAGATG CAAACACAATTTTTAGAGG AAATTCCCTGG 

CTACCCGATGTCTGGATGAGATGATGAAAATAGTGGGAGGGCACTACCTGAAAGTAACAT^

GACTCCTCAAAATCCTGTGAAATCGATCCTATTAAATTGAAAGAGGGAGATAATGTAGAA 
TGTAGACAAGTTATTCAATACAATTGTAAAATCAAGTATGAGCTGCCCCA^ 

TGGCTACTCAG AG ATTTCCTAATGACCCTCATGTTCAGT ATT CTGCAGTG AGCAGCTTTGTATTTCTTCGTTTCTTTGCTGTAG CC 
GTAGTATCACCTCATACTCTTCATTTGCGACCTCATCATCCAGAT^ 

ACAAACTTTGGGAAGCTGGGGGAGTCTGTCCAAAAGCAAG

AAGAAGAAGGATATATTATAGCAGTTAAAAAGTTCTTGGATGAAATTTCATCTACTGAAACTAAAGAGTC 
CCTGTGCACCTGAAAGAAGGTGAGATGTATAAAAGAGCTCAAGGAAGAACTCGGATTGGAAAAAAGAATTTTAW 
CTGCTTAACAAGCAGAGAGCTCACCTACCACAAAOIGCCAGG 
TGGAAAAACTGGAAGAGAGCTCTTTO^CAAGAAAAATATGTTCCAAGTAATA 

AACTGTGTAGAAGCTAATGAATGGATAGACGTACTCT

TGTGTATCTAAACGGAAATTGGCTCTGCTGTC!AGGAGACTGGTGAAAACACTCTCGGCT 
CAGAC^TCCAAATAGATATTGATGAAGACAGAGAAACAGAAAGAATTTATTCCCTTTTTACCCT 
ATGGAAGAGGCTTOTGGAACTATTGCAGTCTATCAAGGACCACAGAAAGAGCCTGATGATTA 
TGTAACAACCTTTAAGACAATTCAGGAAATAAAAAGCAT^ 
CCAGTAGTGCAAAATATGGGAGCAAGGAAAATCCAATTGTTGGGAAAGCATCTTAG
Table 25

MOUSE NOMENCLATURE 
ICSGNM Gata1
Celera mCG3964


HUMAN NOMENCLATURE 
HGNC GATA1 
Celera hCT10890 

MOUSE SEQUENCE - GENOMIC

TGCCTATAGTTTCAGAGGCTGAGTCCATTATCATTGTGGAAGCAT^ 

GATATCCTGATCCATAAGGCAAAAAGAGACACATCAGCTTTTGAAGCT^ 

AACAAGGCTACTCCTCCTAATCCTITCTCAAATAGTGCCCTACC 

CTTTTTTTTTTTTTTTTTTTT 1 1 " ! T 11 J . AGTTTTCAAGACAGGGTTTCTCTGTATAGCCCTGGCTGTC 
GTAGACCAGGCTGGCCTGAATTCAGGAATCAGCCT

CTGGGCCATTGTTATTCAAACCACCGTCTGTGTGTGTGTGTGTGTGTC 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNIJNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNWmNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNIJNNNNNNNNIWNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNN1WNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNIJNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNiJNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNIWNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 KNNNNNl^NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT^^ 

NNNNNNNIJNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNI«NNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNTJNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNIJNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNION^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNN1INNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNlfNNNNNNNlJNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNl^NNNl^NNNNNNNNNNNNNNNNNNNN^ 

NNNNIJNNNNNIYNNNNNNNNNNNNNNN^ 
g0 nnnnnNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNJJNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
65 nnNNNNNNNNNNNNNNNNNNNNNNNI^ 

NNNNNNNNNNNKNNNNNNNNNNNNN^ 

NNNNinWNNNNNNNNNNNNNNNNNNNNN^ 

NN^n^NNNNNNNNNNNNNNNNNN^ 

NNNITONIJNNNNNNNNN^ 
70 NflNNNNNNNNNNNNNNNNNN^ 

KNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NN^NNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

AAAGX3CTGCACAGTTGAGGTCAGCCTGGTATACAGCCGAAACCCTCGTCT 
ATTGCTTGTTTCAGGTTGCTCTGCCTGATTAGTTTTCTTGTC

CCAGGTTTCTCTGTGTAGCCCCAACAATCTTGGAACTCT^ 

TCTGCCTGGAATGAAAAGGTGGGAATTAAAAGCXn^CACCC^ 

GCCTTCAACTCAAAGTAATCTTCTTGTCTCAGCCT 
TTCCTTGTGCATACTCATGTGTATTCAGATGCCCGTGAACGTGTCAGTGTGTTC^

TCGCCAATCATTTTCCACTTTACTGAGGGAGGTTCTCAGGGAACCTGGA^ 

TCCAGGGATTCTGTCCCCATCCTTCTACTTGCTGGCATGATAGGTAGACAATATGTC 

GTGGAGTCTTATGAAAGTCAAGGTTATGGGGTGTGGTGGTACACCCTGTTAAG 

CCTGCGTTCAAGGGGATCTAGTTCTCACTTCTGGCCTACACACACAGA 
1 0 GACACACTGACACACAGG CACACAAAG AAAAAAGTT AT CTTTTAAAAACCAAAGT CAGTTGAG TGCAGTGG CATGTGCCTTTAGTC 

CCAGTGCTTGGGAGGCAGAGGCAGGAGAATCTCTGTGACTTCAAGGCrAG 

TACACAGAGAGCTCCTGTCTCCAAAATGTTAATTAATTAAAAAAATAAAACTAAAGTA 

TAAAGTTCTGAGACTAATTCCTTTACAACTATGGTTAAGTAAACAACT^ 

ATCTGTTGCTACACATATATACACGCACATACACATCTTGAGGCTAAATTAGGAT^ 
1 5 AAGTTTGTGTGTTAAAAGTGTGTAGCrrTTAATACATTAAGGCAGTCAATAATATTCT 

AAACACATCATTACTAAACCATCTGACATTTCTGCTGrATATTTAGTAAAATTCA 

TOTCCATACTTAATTATTTGAGTATATATA^ 

CATTGTAGACATGAGTACAGGCCCTTGGAAGCCAGATCCCTCAGCTAAACCATCT 
GGTO3TGTTACATAGTACTGGCTGATCCAGAACTTGCTATC 
2 0 TGAACTCCTGGTTCTCCTGCCTCTACCTCCAATATACCATCATGCCCCAGTCAGGTC 

TGGC CTG CAGCAAG ATGTGG CAGCTT ATAT C CATGAGTCAAGCA CACG CAC C C CCAAGTCTGACACG AGAGGATCACCAG AAGTTG 
GAGGCCAGCTTAGGCTGGCCTGTAGTGTATGGTTCTGTCTCAAAAG 
CTrTGGGCTAGGGGTACAGCTCAGTTGATAGATCACTTGCCTAGCATC 
ACATCATGTACCCCAACTCAGGTGTGGCCATCTGTTGTCA^ 

2 5 GCGTGGTGGCGCACGCCTTTAATCCCAGCACTCTGGAGGCAGAGGCAGGCGGATTTCT 

TGAGTTCCAGGACAGCCAGGGCTATACAGAGAAACCCTGTCTCGAAAAAACAAAA 
AAAAGCAAAAAAAAGTGATTTAGTCAATCAGGCCACATTGGGAAGAGGGATAAAGACCTCAA 
TCAATTCTACCAGACCAGTCCCAATTCTACOIGACCAGTCCCAATTCTGAGOT 
AAATACAGTGACCGTAGGGATGGCAGCTACAATATAGTGTGGGGTACCACACCT 

TGGAAGGGGTGGGACAAGCAACTCCATTTCACTGG

TGCTTGTGTCAGCTTCCCAGGCAAGTGCTACCATGCCTACTGTGACTTTATTTTCAGTCTGACC^ 

AACAAATATTTTCTGAAGACATTGCTAATTGACCATCTGGTGGTTTGCTTTCCA^ 

GGTTGGTCTGCTTTGAACCGCTTTGGGAGCGTAAATGAGAGATGTCAG^ 







TTAGTGGGCTGGAGAGATGGCTCAGTGGTTAAGAGCACTGACTGCTCTTCTC 
TGGCTCACAACCATCTGTAACAAGATCTGATGCCCTCTTCTGGAGTGTCTGAAGACACCTAGA 
ATAAATCTTAAAAAAAAAAAACAGAGTTAGCAGGTATTTCGCAACCAGTAACTATAACA 
NNNNNNNNNNNNNNNNNNNNGAAACTGAGTCTGGGATGGTGAGACT 
4 0 TACTCTTCTCAGAGGAAGCCAGTAGCTTTGGG^ 

TACTGACCCC CCCCCCAT AGAACAGGGAGGAACTAACGGGG CCCACAGTCTGCAG ACACAAGTG AAGTGGGGGAGGGGATAAAGCC 

GGAAACCACGGGGTTTCTGACAAAGTCAGATGGCAAGATAC 

ACTCCTATGGGAGCTGTCAAGGGGTCCAAGGGATCTTGGTCCTGCATTCTGCCT 

AATAAAAAG AGG AAATGG AGGAGG CCTTG AAGAATGGGAG AAAGAG AAG CAAAGG CTTG CCGG AGG AGGAGAAAGAGGAAGGTTAG 

4 5 AAGGAAAGTAAAGGAGGAAGGAAAGAGGAGACACAGAAAGAGGAAGAGAAATATGG 

GTGCGAGGGAAGGGACCAACAGAGAAAGGAAGGGAATACAAGACAGTGGAAAGAGG^ 

AC CTGG AGAG AC CTGGAG ATGAGG AAGATG GGAGG AGGGAT AGG AG AGG AAGAAGAGG G ACT CAG AGGCAGCAAACTGATGGAGT A 
G T AG AGGT ATG ACAGAC AAG AGG CAACAGG AGGATGGGGAGAG AT CAGAGG AAG ATGGG AG AAAGGGACAAAGG ATGATGGAAGAA 
GACAAGAGACATGACAGCCTAGGGAGGGATGGAAGGGAAGGGGAGAAAAATAATCTGAGAGCC 

5 0 AGTG ACAG ATT CGG AGAAAG AGAAAAAAAAGGAGGG AAG AG CCAGG CGGTGG TGG CGCACG C CTTT AAT CC CAG CACTTGGGAGGT 

GGAGGCAGGCAGATTTCTGAGTTTGAGGCCAGCCTGGTOTACAG 
CTCGAAAAAAAAGAAAGAAAGAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAA 
TCACAACCCTTTCTGTCCTCACAGGTTCAACCCCAGTGT^ 
TGCCCCAGTTTGTGGATTCTGCCCTGGTGTCCTCACCATCAGArrCC^ 

5 5 GCATCTTCTTCCACTTCCCCAAATG CAG CCACGGCCG CAG CATCAGCACTGGCCT ACTACAGAG AAG CTGAG G C CT ACAG ACACT C 

CCCAGGTAACTGTTGGGTGGCTGCCTTGGCACTGGCTGGAAGCTGTCAGGGTTGCCATAG^ 
TGAGGTCATCITGGAAAGAGTTAGAAGCTCTGTGGATAGGTGTACCGCCCCATTTGTA 
TAGTGAGCACACGGAGAAGCCAGGCTGGAAGTGTGTACGAAGCTGTCTTATTTGTCAC^ 
CAAAAAGGGATGGAGGGGATCGTCCTTGCTATAGTGACCAGAGGGAC^ 

6 0 TAGGAAGACCAGAAAACTAGCCACCATGTTGGGACAGACGAACACCATTTC^ 

TTTCCTTTTCCTCTCTACCCCTTCAACAGTCTTTGAGGTGTATCCAC^ 
ATGCTAGCTGGGCCTATGGCAAGACGGCACTCTACCCTGCCTCAACTGTGTGCCCC^ 
GAAGACCAGGAAGGGAAGAGCAACAACACGTTCTTGGACACCITGA^ 
TGCACTG CCTG CATCG CTCCCTGTCACCGGCAGTGCITACGGGGG AGCTGACTTT 
65 CTCTCAGCTCAGCAGCCTATTCTTCCCCCAAGTTTCATGGAAGCCTGCCATTGGCCCCT 
GGAGGTTGAGTTGGGCTAGGTGGCTCAAGACATAGCTAAGCT 
(^TGTGTGAACTGTGGAGCMCGGCTACTCCACTG 

GGATTTCTGAGTTCAAGGCCAGCCTGGTCTACAGAGTGAGCTCCAGGAACAGCCGAG^ 

7 o CCAAAAAAATTTAAAAAAGACTTATTTAAGAACGCCACCTGGGCTGGTGA 

CGAAGGTCCGGAGTTCAAATCCCAGCAACCACATGGTGGCT 
GAAGACAGCTACAGTCTACTTACATATAATAAATAAATAA 

CCCGGCACTATTATACTTGCCCTCTTCCTTATCCTTTCCCCATAGGGACCATCCATCCCT 
CCTAGAACTCCCATCCAAGG CCAGG AATTTTT ACAG CTTCCCCA^ 
ATTGATAGATTTGGTTTGTTTTTTGTTTGAAA
TTTTAATTTGCTATGGTCCTCCTGTCrCTGT

TCAAAGTCTAAAACTAGAGATAGAAGAAGTCTTGCATAGCT^ 

ATCCCAGAACTGGAGACAGAGGCMGTGGATCTCTTGTGAGCT 

CTACACAGTGAAAGCCTGTCTTGACAGATAAATACACAGATACATAGATACATAGATGAAT 
CTTTCCTTAATAGGACTCTCATCCCTTAAGCTCTTTTCTCAGGC
CAGCACTAATCTCCTGTCCTTTTATTCCTCAGATTC 

AC^CTCTGGCGGAGGAATGCCAGCGGAGATCCGGTATGCAATGCCTGCGGCCT 

ATGTTGCTTCCTGTCCTCCAAGCACACTCCCGGCTCCTCATCCAGCC 

TCCCTCTTTGCTCCTCTTTCTCTTCrCAGAAGTTGTGTA 
1 0 TCCATCTGTCCTCTCTTTCCTCCTTGCCACCTCCCGTGATTTGAGTCTTGGCTACCT 

GAGATCAGGGC CACACACAAGCACAGGTCTCACCTCCCTAG CACACCCTGGGT ATATTACTCT AGCC^TTGAACTAATTCCCCATT 

CTTAACAAAGTGGTGTTAAGAGACTGTCCCTAGCC^ 

GTTTCTACAGGTGAACCGCCCACTGACCATGAGGAAAGATC 

AGCGGGGGTCGAATCTGGCTGGAGCGGGAGCGGCTGAAGGACCAGCTGGTGGCT^ 
1 5 AATTG TGGGGGAGGTGGCCTCAGGCTTGGCACTGGG CACTG CAGGTACTGCCCACCTCTATCAAGGCCTAAGAC CTGTGGTACTGT 

NNNNNNNlWNNNNNNNNNinW^ 
NNNIWNNNNNNNNNNNNNNl^^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
CATGATGCACCAACGGAACTGTGTATCTGAAGCCT^ 

TG ATTACTTGTGGATGAGGTGGCGGGAGATGGCAGGGTGTCATCAAGTTGCAGGTAAAAC^ CTGTG CGTGCAATAA 

2 5 CAGGTGAGGAGACATGGAAGGCTGTAAAGGTGTGCTTGTGCATGACT 

GATTTTTGAAACACTGAGCAAGAACTGAAGTGTAGGATTGTCCTGAAATCAA 
ATAGTGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGATCTCA^ 

TCCTTTCAGAGACTGTCCAGAATGGCCATGGCCCT ATAAAAGTCCCCAAAGCAGCTCAG CCCT CCACACCTTGCAGACAC 
CCCTT ACCTGAATAG CAAGTCT ACT CAG AATTGCTCTTACACAGGTTCCTGGCCTTACCTTG AAAAAAG AACCC^ 
30 TCATAG AAACAGCTG CCCTGACTGGTGCCGGAAGCACAGGCTAGCCCTGAG ATCCT CTCACTCTTGTA 

CTTTCCTTATCTCCCTGCTCrCACTCT 

AGTTGATTTGTTTTTCTCGATTTTTCTGAGTGGGGGTCTCACTCTGTATGTG^ 
AAATTCTGGCCTTGGACTCATGGCAATCCTCCT 

3 5 TCCAATAATTC^CAGGCAGCCC^TCCTGACCrGGATAGGAGCCGCAGTCCCTACCAG 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNIWNNNNNNNNNNNNNNNNNN^^ 

NNNNNimNNNNNKNNNNNNNNNNNNW 
45 NNNNNlWNmflWNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNtWNNNOT 

NNNNNrWNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNN^^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNOTW^^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNlWNNNNNm 

NNNNNNNNNNNNNNNNNNNNNNNIW^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNIWNNNNNNIWNNNNNNNNI^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNinWNNNNN^ 

JJNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNIWNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNW 
55 nnnnnnnnNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^ 

AAACCAAAAACAAACAAACAAACAAACAAAGGACAGCTGAGGTA 
AATTTGACTCCCAGAACTTTGGTAAAACAGGTCATCTTGCGGGAG 

TGGGATTTCCTGGC7VAACCCACTTAGCCTATTTGAGGAATCCCCAGCCAGTGAGAGAAACT 
7 0 GTGAGATGGTTCAGCGGATAGGAGCACTTGACTGCTCTTCCGAAGGTC 

CCACCCATAATGAGATCTGATGCCCTCTTCTGGCACATCrGAAGACAGCTC 

AAAAAAAAAAGAAAATGTGCTGTGTCTGAGAAACAACATCTGAAGTGGTGCT 

CACAGTAACACACATACAOIAACCCACTCACATGCATGTCAGGAAAAAGAC^ 

TGTCTGGACCCCGCATTAGACCCTTCAGCTTGAATATACAAGATTTCACAGATCTGTCT 
7 5 ACAGAAACACCTAGATTTTTAAAAAATTATTGTTTGATGAATCrrTTGGGATGTTTTTA 
ATGCCTGCGTTGTGAACTATTTTATGTTGGGCACAAATGTTTTTA

gcaccaccg atggtccacagagg caaatgtcagtgtgcaatcggatgttttcag aag caccccctgtgtgagtggaggcat aaaca 
actgggaac^^atataaaczagaagcatgcacaaaggtcattctcccaaaagaggaggc^ 

ttggtccataac c ct ctgggct aaagttgt cat atg tgat acctgactt att atataagttt ctggcttgaccaacta ctg aaag c 
5 (^ctcttctggctttaatataatcttatat 

ctcataacgttt<;tttgtttgtttgttagtttg 

AACTCACTCTGTAGACCAGGATGGCCTTGAACTCAGAAATCTG 

CCCCCCCACCCCCCAGAATGTCTCTAATTTACCAGCTAATATAATCATTCTCCTG^ 

TACCTTTGCAGTAAAAGAATTC(nX3TGCC 
1 0 GTCCCATCCCCATGCCTCCATAAAAGAAAACATGTCAGTCACT^ 

ATTATATGCAGAAGTGCCGTCCACCGCTTCTAGGGCC^ 

ACCAACTTCGTAACAAGAACTCCAAGTACCCGGAGGAAAGTAG 

TATTCTGATTGGGGCTTATATGAGAAAGAAGAAATATTTACTTGTGCTGGCTGOT 

ATTACGTCCCCATCTGCAGCACACCCTCTTGTTCrr^ 
1 5 TACAATTGGGCTGCACTTTCACCAATAGGTTCTGTTCAC 

TCATTTGCTATGGAGGCTGCACTTTCAGCAATCGCCTCTCC^^ 

ACG CCTTCAAACCAGT AC CACCTGGGTGACTCTT ACACTGC CAAGTTCAGCTG AT AGCG CAAGGT ACAAACTTG G CTG CCT CTGGA 

TCTCCCTTCL"ri Tl TL T TTTTCTTTT ^ 
2 0 CCTGCTTCTGCCTCCCTATTGCTGGGATCAAAGGCATGGACCACTTCC^ 
AAGTACACrATAGCrGTTTTCAGACACACCAGAAGAGGGCATACGATCT 
TTTGAACTCAGGACCTCTGGJ^GAGCAGTCAGTGCTCTTAACrGCT 

TT AATTTCTCATCT ACAGATGAGCAGAACTGAAT AT CCCAG CAAGACAAAGG ATTCACTTAAGCAGTTGTGCT AG CTTGTT AAAT C 
TTCAGCCCCTACTGACCAGAACCATGGAATATTAATTCAAAACAACAAATC^ 

2 5 TC!ACAAGCCAGGTCTCTATCCTCGGCATTCCTCTCAAAAGTCTC^ 

CCCTCCCCAAAACAACATGGCCAGGGTTGTCACAGCAATGCCCCGCrCCT 
GGATGGAACACCATGACCAAAAGCAACGTGCAGAGGAAAGGGCTCATTTCAC^ 
AGCAGTGAGGACAGGAACTCCTAGGGCAGGAACCTGGAGGCAGGAGCAGATA 
TCCTCATGGCTTGCTCAGCTGCTTTCTGAAA 

3 0 TCCTCAATCACTAATTAAGAAAACGCCTCAGGAGAGGCCTGCAAACCCCAG 

GAGAGGCTCTATTTAGCACCTGATGGGAGCAGATACGGAGACCTTGCAG 
GGGAGGACAAGTTGTAGAGGTCAGAGGGATCAAGGATTCCAGCAGGACACTG 
TCAAAGACACCGAACCAACAACCAGGAGCCTGCATGGAGTTTGCT^ 
TTCTCGACTCCTAACAATG^^AGCAGGGGCTGTCTCTGACT 

TCTTCTGAAGGGAAAATGCAGGAX3AAGTGGATCTGGGAGAGAGGGGATT<KM 

G T AATG T AGGAGAG AAGAAGAAAAAAG ATTGAT ACAAAT AACTG TC CT CT AAGT AC C CAAG AAT AG AAACGT AAC CATTT AG AG AG 
GGG CAAAGAAAGC CTT ACAGGC CTGCCT ACAG C CTG ATCTT ACGGAGG CATTTTCTTTG CTTTTTGT AATTTTG CT AGTT ATGAT A 
GATAATGTAAATATTTTTAGAGAGAGAGGGTTGTCAA^ 

4 0 AGAGAGATAGATGCCTTCAGTGCTTGAGTCCCAGCACCCACATCTAAAGGCT 

CAGTGTCCTTTCTTGCCTCCTTAGGCATACACACAGGCAAAATCCATGAAGTATGCT 
CZACTTGGGAGGCGGAGGCAGAGGCAGAGGCAGAGGCAGAGGC!^ 

GTCTACATAGTTAGTTCCAGTACAGGTGGGACTGCATGGAGAGACCCTGTCTCAACAAA 
GCGGCCGCCTTAGCTGTTCGTGGAGCGGGATCrTGCAGTGGTCTTT^^ 

CTCCACAGGGACCATGATCATGGCTTTGTAATTTAA

CGCOVATCGAGTGACTGACAAGCTGACCCCTATTTATGATTACATCTC 

CAGATG CTGTCAGTT ACCAG CTTGGTTT C CACAG TATTG AACTGAACAATTCT C CACT AGT AC ACATGGCTG CCAGG CT CTTTAAG 
GAGATGTGTTACCGGTGCAGAGAAGATCTG^TGGGAGGAATCATCATTGT 
TGTACCCATGAAGGGTATGATGGTAAGACAATCTTTTGCCACCGGAGCCTCCGGAAGCT 
50 TCCATCCTGAAGGCACGACCAAGGACGAATGTCTCCAGTTCACTGCCA^ 
GGAGGAGTTATCCGCTTGGCAACCATTCAAGGATCAGGGGTAG^ 

TGCCACTTTGTCACCTCCCTGAAGCCTAGATGCAAAAGCT AAT AAACAGTGTT AAAG AAAAAA^ 

AAAACAAATGGTTGGCTG^GTGGTGGTGGCGCATGCCTTTAACCCC^ 

GGCCAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGGATATAC^ 

AAAATACCAAACAAACAAACAAAAAACCCCAAATGGTTCTTAAGCT

GGCAAAGG CAGGTGAATTTTTG AGTTCGAGGCCAGCCTGGTTT ATAAAGTX3AGTTC CAGG ACAGCCAGGGATATACAAAG AAACCC 
TGTTTCAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - mRNA

6 0 ACTCGTCATACCACTAAC^TGGCTGAATCCTCTGCATCAACAAGCCCAGGTTCAACCCCAGTGTTCC 

GGGGCCCTGGGGACCTCAGAACCCTTGCCCCAGTTTGTGGATTCTGCCCTGGTGTCCT 
CTCTGGGCCAGAGGGTTTGGATGCAGCATCTTCTTCCACTTCCCCAAATGCAGCCAC^ 
GAGAAGCTGAGGCCTACAGACACTCCCCAGTCTTTCAGGTGTATCCACTGCTCA^ 
TATGCTAGCTGGGCCTATGGCAAGACG^CACTCTACCCTGCCTCAACTGTGTGCCCC^ 
GGAAGACCAGGAAGGGAAGAGCAACAACACGTTCTTGGACACCTTGAAGAC
CTGCACTGCCTGCATCGCTCCCTGTCACCGGC^TGCTTACGGGGGAGCT^ 
CCTCTC^CTCAGCAGCCTATTCTTCCCCCAAGTTTCATGGAAGCCTGC 









GTCAGAACCGGCCTCTCATCCGGCCCAAGAAGC^AATGATTGTCAGCA^ 

ACT ACAACACTCTGGCGGAGG AATG CCAG CGG AGAT C CGGT ATG CAATGCCTGCGG CCTCT ATTTCAAG CTC CATCAGGTG AAC C G 

CCCACTGACCATGAGGAAAGATGGAATCCAGACGAGGAACCGCAAGGCATCTGGCAAAGGGA 

CTGGAGCGGGAGCGGCTGAAGGACCAGCTGGTGGCTTCATGGTGGT^ 

TO^CTTGGCACTGGGCACTGCAGGTACTGCCCACCTCTA 

TATCCCTTTTCCTGGACCTCTOCTGGGA 

TCGCCCCACTCAGTTCITGAAGGTACACAGAAT^^ 
CTGTAGCTATGTAGCTATGAACTATGTAGATAAGGGTGACCCCACATTTCCCATTCTCCTGCCTCTAC
CAGGCATGTATTGCTATGCCTGACATTTTCCTTCCTTTCAGl'U 

ACT ATGCAG CCTAGGCT<K3CTTTGAATTCCTTATGT AGTC C/^ ACT AAGCTCCAGCTTGAGATCCTrGTGCCTCAGTTCCTGAGTG 
CTGGGATTACAGGCATGTGCTACCGTGCCCGTTTCTCCCTCCTCrTAGAGCCACT 
GCACCCTTAAAGATTTTGTAAAATAAAACCACCTGGAGACATGCGCT
G 

MOUSE SEQUENCE - CODING 

ATGGATTTTCCTGGTCTAGGGGC CCTGGGGACCTCAGAAC C CTTG C C CCAG TTTG TGGATT CTGC CCTGGTGTC CTC AC CATCAGA 
TTCCACAGGTTTCTTTTCCTCTGGGCCAGAGGGTTTGGATGCAGCATCTr 
CAGCACTGGCCTACTACAGAGAAGCTGAGGCCTACAGACACTCCCCAGTCT^ 
ATTCCTGGGGGCTCACCTTATGCTAGCTGGGCCTATGG 
TGCCCCTTCCCAGGCCCTGGAAGACCAGGAAGGGAAGAGCAACA^ 

ACCTCCTGACG CTGGGGACTGCACTGCCTGCATCGCTCCCTGTCAC CGGCAGTGCTTACGGGGG AGCTGACTTTCCCAGTCCTTT C 
TTCTCTCCCACTGGGAGCCCrCrCAGCTCZAGCAGCCTATTCT^ 
CAGAGAGTGTGTGAACTGTGGAGGAACGGCTACTCC^ 
TGTATCACAAGATGAATGGTGAGAACCGGCCTCTCATCCGGCCCAAG 
ACTAACTGTCAAACGACCACTACAACACTCTGGCGGAGGAATGCC^^ 

GCTCCATCAGGTGAACCGCCCACTGACCATGAGGAAAGATGGAATCCAGACGAGGAACCGC^ 
AGCGGGGGTCGAATCTGGCTGGAGCGGGAGCGGCTGAAGGACCAGCT^ 
AATTGTGGGGAGGTGGCCTCAGGCTTGGCACrGGGCACTGCAG^ 
AGGGCCTGTCAGCCATCTTATGCCTTTTCCTGGACCrCTGCTGGGATCGCCT 
CCAGCAGCACCAGCGTGATCGCCCCACTCAGTTCTTGA 

HUMAN SEQUENCE - GENOMIC

AGCAGCrTTCCATCTCTGCTGTGACTGCAGGCCGTATTTGTCACATTCTTCT 

CAGTCATTGCCATGGGGAGCTGGACATGCTGATCTGTTAATCCTATCCATAA 

GTTCCAAAACACACATTTCTGACACAGTAGGGAGCTTGGCT 

ACAAATTTCCTAACACCTATTCACAGTGGACTGAAAACCAGAACCACAGAGCCT 
3 0 TTCCCACACGCAAACCAACATGCCTTGCCAAAAAAATATGGAACGCTTCACGAATTTGCATG 

AATCirCTCTGTATCGTTCCAATTTTAGTATATGTGCTGCCGAAGCAAGCATG 

GATATTTTGAAGATAGGGACTGGATGAGTTCAACAAAAAAGAGGATGTCAGCT 

AACAAGAGAAGGGACAGACAGATGAGTCAGGCAAGTGGGAAGGGAAGTGTGTGGATTCCTC 

CATGCGTGAGAGTCTCCTGAAGTAAAGGAGCCCCTGGTCTTCTGGACTCTGGTCGCTAAATAGTCTTCAGA 

3 5 ACCCTGTGACTGAACCGCCAAGGAGCCCCAAGAGTGCTGCGAAGAGAGGGAGTGTGA 

TCCTAAAACCTGTAGCATTTTCTGGCAGTGGAAAAAAAGAG^ 

CCAGACATTATATGCCTGAAGATGGATGATTTATGGATGAAAGGAAAGAGAAAATGA^ 

TGGGTGGATGGGTGAGTGGGTAGGTAGATGAATGAATGGGTAGATGCTCTGATATTTCCTCCCTCCT 

CJ«3GTGTGCACAGAGAGAGGAATATGAGITTATGAATTGCAGATTTGCTATA^ 

ATCAACAGATTAGTGGTTAAACAAAATGTGGTATATCCATTCCATGGAA

AGCATTACTCTGATGAGAAAACTAGACAAAGATATCACAAGAAAAAC^ 

AAAAAT ACAAG T AAG T AGAAT CCAGCAACAT ATAAAAGGATTATACACCACGG CCAAGTGGG ATTTATCCCAAGAATGCAAGGTTT 

GGTCAACTAATGAAAATCAATCAATATAATGCATCATATTAACATAATAAAGAACAAAC 

AAAAAACATTTGACAAAATTCAACATTCCTTCATGATAAAAACACTCA 

4 5 AAGGACATCCACAAAAAACCCACAGCTAAGATCATACTTAATGGTGAAAGT^ 

ATGTCTITTTCAACACTGTATTGGATGCTCTACCTGGGGCATTAGGCAACAAA 
AGTAAAACTATCCCTATATGCAGATGACATGATCTTATACATAGAAAATGT^ 
AGTGCATAACCATTAGAGCAAGGTTGCAGGATATAAGATTTTGTATCA^ 
AAAACTATTCCATTTACAATAGCTTCAAAATGAATAAAATACT^ 

AAACCATAAAATATTATTGAAAGAACTTAAAGAAGACCT

GTGTTAAGATGGCAATACTCCCCAAATTGATCTACAGACTTGATGCAATCCCTATCAGA 

AC AAGCTG ATTCACATGGGAGTGCTAGGGG CCCAAAATACAACAAT CTTGAAAAAGAAAAACAAAGTTAGAGG ACTCACACCTTCC 

AGTTTAAAAACTTACTATACAGCTACAGTAATCAAGACCATGTGCTACTGGCATAAAAACATAC^ 

TGAAAGTCCAGAAATAAATTCATACATTATAGGCAACTGATTTTTTTGAGACAGG 

5 5 CTCACTGCAGC CTCAAACTCCTGGGCTAAGCGATACTCCCACAT CAG CCTCCCAAGTAG CTGGGACTACAGGCATGCACCACTGTG 

CTCAGCTAATTTTTAAAAAACTTTTTGTAGAGACAGGGTCTCAATGTGTTGGACAGGCT 
ACGTTGGCCTCCCAAAATGCTGGGATTACAGGTGTGAGCCACTGTG 
TTCAATGGGGAAAGAATGCTCCTTTCAACAAATGGTGCTGAGACAGCT^ 
CTCATACCATATACAAAAAACAACTa^AACAGATCAAAGACCTAAATGTAAGAGCTA 

6 0 CCCAACCTCCAGGCCACAAACCCATCTGTGGTCTCTTAGAAACTGGGCCACACA 

TGAAGCTGAGCCCGCCTCNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNIJNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNAACCTACAGAAGCACAATATTTGAAAATCACATATCT 

7 0 TCAACAATAAAAGGAOUUVTAACCCAATTCAAAATGGGCAAAGGATTTAAATAAA 

AOXn'GGCTCATGCCTGTAATCCCAGCATTTTGGGAGACT^ 

CAACATAGTGAAACCCCATCTCTCCTAATAATACAAAAATTAGCCAGGCATGGTGGC 

ACTGAGGCAGGATAATCttCTTGAACCCAGCAGGra 

AGCAAGACTACATCTCAAAAAAAAAAAAAAAAAAAAAAAGAAGATAGATG^ 
7 5 GTCACAGGGAAATGCAAATCAAAACCATAACAAGATACCACTTCACACCC^ 
AATAAAAAGTGTTGGAGGCCAGGCATGGTAGCCTGTAATTCCAGCACTTTGCAAGGCCAAGGCAGGCAGATTGCTTGAGCCCAGTA
GTTTGAGACTAGCGTAGGCAACGTGGCGAAAATCCGATCTCTACGAAAAA^ 
TGTAGTCCTGGCTACTCAGGAGGTTGAGGTGGGAGGATCACTTGAACCCAGGAGGTTGAGGCT 
TGTACTCCAGCCTGGGGAACAGAGTGAGACCCTGCCAAACAAAAGAAAAAAGAAAAAAAAGA 
5 AGAG AGAGAGAG AGAGAAAGAAAGAAAG AAAGAAAG AAAGAAAGAAAGAAAGA AAAAG AAAAG 

AAAAGAAAGAAAGAAAGAAAGAAAGAAAGGAGTGTTGGAGAGAATGT 
AGAACACTTGCAGTTCCTCAGAAAGTTAAACATAGAATTACCATATCA^ 
GAAAAACATATGTCCACATAAAAACITTCACATGAATGTTCATAGC^ 
TGGATAAGCATAATGTGGCATTTCTITTTCCTTTTATCriT^ 
1 0 CAATCATGGCTCACTGCAGCCTCAACTTGCTGATCCTCCCACCTGAGCCC 
CCCGGCTAATTTTTCTTTTTTGTAGACACGTGTTCTTGCT 

CACCTCAACCTCCCAAAGTGCTGAGATTACAGGTGTGAGCCACCGCATCCTAGCTATAATGTAG 
TATTATTCAGTCATGAAAATGAATGAAGTACTGATACATGGTACAATATGAATGAACCT 

G C AGTT ACAAAAGGCCACAT ATTATATGACTG C ATTTATATGAAATG T CCAAAACAGACAAATT CAT AGAGACAAAGAGATTTGTG 
1 5 ATGGCCCGGGGATGGGAGGAGAGGGAAATGGAATGGGAGGGTGTA 

TTCTGGAATTAGGTAGTGGATAGTTACACAACCTCGTGATATACTAATCGCTTT 
AGAGC^AATTTTATGGTATGTGAATTATGTCTCAATAAACAGAAAAAGAAAAAAAAGAATGAACTA 

GAAGAATGTCACAGACATT ATGCTG AGCAAAAGCAG CACAT ACTG T ATGATGTG CTT C CATTT CCATG AAGTGCAAAAACAGGC CA 
AAAAAACCCCAGGTATCATATGGTGTTTATGGGTA 
2 0 AGCTCAAGGCAGT AGAAGGAGAG AGGGAAATGGAAT CAGTACAAG AGCATCTGTAAT ATCG CATATCCTTTAAAAACAATCTGAAG 

TCAATAAGGCAAAATATTAATGTTTGACAAAAGTAGATGCTTGTTACAGCACCCTT^ 
AACAGTTTGTTTCTGAGGGCAAACGCTTGACTC 

TCAGGCTCTGTCCCCACCCTGATGGGCTCCACAGTACAGAAGAGGCACAA^ 
GCTCTAGAACCCCAGAAGATGCCAGGAGGGAGTGAGCGAGTCAGGGAAGGCTTC 

2 5 ACTTAGGCCTGACGGAGAAGACGCGCGGCCAGGAOVCCCCACCCCCGCCCTCGTCTCCCCCAAAGCCTGATCT 

CCCTTATCTGCCCACTCCCAGCTCCCTCCTTGCTGGCTGAACT 

GGGAGGGAATCGGGTGAGGCCTGGCCTCACAGCCTCGGGGTTTCCAGCTCTTGCTGGAG^ 
ACCCTTGGCTTCTCTTCCTGAGCGCTCTGTGCTCT 

CTTAGACATGCCTCGAGCCTGCCATTCCCTGTGAGGACAGATTTCCCTATGTTGCGACCGCT^ 
30 GATAATTCCCATTTACAGAGCACACCATTTATGGTGTGCCAGCAGGCCCTGTGCTGAGTGGTTCC^ 
CTTTACCCGTTTTCCAGATGAAGAAACTGAGGCTCAGAGGGCGTCTGGCCCAGGAAT 
TTGAGACAAGAACCCAGGGCCCTGGAGGAACATCTTCATTTCCACACACCT 
ACACTTGCACCACAAGGAGCTGGTGAGAATGCGATAGAATTATGGGTGGAGAGCGCTTA^ 

T AATG CCGCTGCTGTTATTATTAGTCTATTAATGCT ATTATT^ CG AGACTTCAT 

3 5 CTCTATTGAAAAAAAATTTITTGCTGGGTGTGGTGGCATGCACCTGAA 

TGAGCCCAGGAGGTGGAGGCTGCAGTGAGCCAAGATTGTCACTGCACTCCAGCCTGAGTGACATAGTC 
ATATTTAGTACGTATCATTATCACTA1TGTTGTTAGAATTAGAATAGTTTTAG 
TAGATTAAAATTAGTATTAGCATTAGATGAGTATTACTATGAAAATTAGTATGTCCTC^ 
CCAGCAGATGGGAGGGTAGGAGGCAGGGAGGGATGGGGAGAGGCTGTCAATGGGT^ 

4 0 TCTGGTGTTCTAATGCATAGTACAATCACTAGAGTCAACAATAAATGTATATTTC 

CTCATCACAAAGAAAAGATAAATGTTTAAGGTGATGAATATGCTAATTACCTTGATTTGAT 
AAACATCATGTTTTACCCCATATATATGTACAACTATTATGTGTCAATCATAAATTTAAAGAAATAA 

ATCAAT ATT ACAAT ATTGGT ATT ACT CTT AG TG TCAG CAT ATT ATT AGCATT AGT ACT AT AAG TAG T AG T AGGGTT AG AAAGGG CT 
ATGTACTCTCCCTTGTATGCAGTAGGTACTCAATAAATAAATAG<XX3TACTTAATATTTATTG 

4 5 AGATCATGTAACATGGGTTAAAACTCATTTAGTCTCCCCAGTCA 

TACAGATGAGGAGCTAAATTCCAGAGGGGATGGTAACTCTCCCAAGGTCACCTGATCCATTAGTG^ 
GGATGAGCTGGCTCTAGAGACCATCTCCTGAAACCACTGCCACTTAAGTCAGAGGTGTCTTTGTA^ 
GAGGGTTCAGCATGGGGCGTGGCACACCAGCAGGAGCTCAATAAATAGGAAT^ 
CATTGCATATGAGGACACTGAGGCTCAGAGGAGTGAGATAAGAATGGTAA 

5 0 CACrGACCTGGGGCTCCACCCTTGGCCTCTGCCCTTTGTAGGTCT 

TGTCCCACGTGTGCATGTGTGTCAGATGCTTTCATATAGAAAAAGTGCTCAGGTTAGGTACAAGTC 
CTGTGGTTGTTGTCATTAATATTTGGAGGAGGTGGGGCAGGGCIAAGAGTGTGGGGACTGGGGTGC 
TTGTCCCCTGCTGAGGGCTGAGGGCAGAGCCACAGGCTACATCAATCCACATCCTCCCATCC^ 
AGGGTGTGATCCAGTCTGCCTGGACCTTCTTCTGAGTCTGTCTCTGAATATCCCTCTGTCTCT 

TGTCTTTAATGGTCTCTGGTTCTCCCCGGGTGTCTCGCCCTCTTTGTCT 

CGTCTGTCTCCTTCTCrATTCATCTCTGTCACTCTCTTTCTC CCCATTTCC^TCTCTCTC CGCCT ATCTCTGTTTTTGTTTGTCTC 
TCTGCCTCTGTCTCTCCCACTCCACCCCTTTCCTTrCCTACCCrrATCCCACTCCT 

TCCCGCCTCCAAGGCAGCATGGCGGGCAAGAAGTTGAGGCCACTGTCCCTGGGTGTTCCTACCCCCACAC 
60 CCTGTTACTGCGG CGCCAACAGCCACGGTCGCCTACATCTGATAAGACCT CTGGCGTAAG 

CCCCAGTGGGGCGCTAAGTGAGTGTGCCCCTGCCTCCCGCCAGCACTGGCCTGGCCTGCAGGCTTAGCCTGG^ 
CCACAGGCTCTAGTTCAAATCCAGCAGAACCTCTCTGAGCCTCACrrCTTCTCACCT 
CCCTGCAGCCAGGAAGACGCACATACACAGGAGTCTAGCCCACACCGGCCCCGCACAA^ 
AGTGAAGTCATGAAACCATATCIGCTATTTrCATTTATCTTGG 

6 5 CCTAGGTCGAGCGAGGTCCAAGAATCCCCAGGGTGGGCAGGGAGGGTGGAAGAGGGCCTCCAGTGCCCAAG 

ATGGGACCCGCCCCCTCCCCTGGACTGCCCCACCCACTGGGGCACCAGCCACrCC 

AGGGAGGGAGGGAGGAAGGGAGCCTCAAAGGCCAAGGCCAGCCAGGACACCCCCTGG 

GCGGCCGAACCCTCCGCAACCACCAGCCCAGGTCAiSTCT 

TACCCTTATGCTTGCTCCAGGCTCCCCAACCCACAGCTGGTCCCCAAAGTGTCAGC 
70 CCCACCTCTCTTTCCAACTCCCATTTCTACCCCAAATCTACCCCCTAAACCC 

GATATCTATATCCTCTGGATCCCTTCACTTTGCTCGCTATACCCCTAGGTCTGTCT 
ATGACTTTATCTCCCCAAATCCTTTCCCTGTCCCATATCIGATTT^ 
CAAACCT^CTTTTCTGCTCCCAAACCCCTTATCTC 
C^CT«aTCTGCCAAACCCTAGCTCCATCCTCTTAC<ACCATCT 

7 5 ATCTCCGC^AACCATTTATCTCTCCCAACTCCCTGCCTC 
AATCTGTrCTGCCGCATGAACCCTCATGTCTGCCTTC
TCTCCAGCTCCAAGCCTTTCTCTACTCCAATTCT 
GATCTTTGACTCCTAAACTCCATCTCTCCTCCA^ 
ACACTACTTCrCAACTTTATTTCTGACTCTCAAACTTCGCC 

AAACCTCTCTGACTTAACCCGATATCTGACCTTCAAATTCCACCTCTATCT 

ATCTGCTCCCTTCAGCCCGGTGAGACCCTCCCTTGAGGGCTGGGATT^ 

GTGGAGGGTGCCAGCTGCTGCCTGCTGCCGCCGTTGGGGCAACACTTACTCT 

CAG C CTCG AG AT AAACTTT ATCTCTGT CCGGAG AG TGATAACTGGGGGTG AGG AGG CTGGG CCCCTGC CATGGGAGGGGTGGG CAG 

1 0 CCCTGGACTCACCAACGTGTGAGGTGCAGGGGTTGGGGGCAGTGGGGCTGG 

ATGAGGGAATAGTGGTGAAAGAGTGAGCAGAAGACACACAATGAGACAGGGAAGAGAGGGAGAA^ 
AGACAGAGACACAGAGACAAAGAGCAAGAGACAAAAGAGACATGAAGAGAAAGGAAAAACAG^ 
AGACAGAGAC^GACAGATGGCCGGAGAAAAAGAAGGAAAGACCCGGGAGCAGAAA 
AGCACTGTGAAGAGGCAGAGACTAACACAGAATTAGAGACAGAGAGGAAAGAGATGGAGAC 

1 5 TTCAGAGAGCAACAGCAGGCACCCTGGGGCTGAGAAGGTGTTGGCGGTGGGCACACT^ 

CCAAGCCTGGGCCTGCCCCTTAGCCTGTCTATACCATGGGTCATGTAAGCAGGAGTGAGG^ 

TAGCCCTGTCACTCCACCCTACAGACACAGAAACAGAGCAGGCCAGCACAATAGGGCCAGGGCC^ 

GAGCTGCATGTGGCTGGGTACGGTCAGGGGTATGGGACTAGTAAGAGGCACGTGGACATAAGGTC^ 

ACAG AGGG AACAGACAAACAAG CAGG AAGAGTGGG AGAGATG CAG AGATGTTCAG AGAG AT AGG AG AGG AG AGGG AG AAAG CAAG A 
2 0 GTAACAGAAAGAGAGAGCCACAGAGAGAAACATGGGAAGACAGA 

GAGGTCGGGCXSCAGTGGCTC^CGCCTATAACCTCAGCACTGTGGGAGGCCGAGGTGGG 
CCAGCCTGGCCAACATGGCAAAACCTCGTCTTTACTAAAAATACAAAAATTAGCTG 
TACTAGGGAGGCTGAGGGACGAGAATCACITGAACCTGGGA 
TGGGTGACAGAGTGAGACTCCGTCTCAAAAAAAAAAAAAAAAGAAAAAGAAAA^ 

2 5 AACACAG AAAG CTGGGGAAGCTACAGAGTCAGACTTGAGAGGGAACAGAGAGTCAGGGGAAAGAG CC CCAGGGGATAGAGAAGTGG 

AGAGCTAGATGACAGACACACACAAGGCAAAACAGGCAAAG<5GCCATCCCACAGGTCT 
CAATAC^GCAGATGGGGAAACTCAGGCCTGGGAACTGAAAGAGC 

CCGGTAGAGGTACCTCCCACCTGCCACCCTCTTCAGAGGAAGTCAGTGGCCTTIKSGGTGAT^ 
AGATAAGCAJGTGGGGGGTACTGACCCCCCCCCAACCAAGAAGTAGGGAGGAACTAAGGGGGCCT^ 

3 0 GTGGAGGGAGGAGGGAGTCAAGCCCGGAAACCATGGGGTTTCT 

GGGAGCAGAGGATGGTGAACCCCAAAGTCCTGGGTGAAGTGACC7VAGAGGCAAGGGACTCT 
CCCAGACTCTTCTAGAGGGGGAGGGAAGAAAGGACGGGGGGACGGGGAGAAT^^ 
GGAAAGGGAGAAAAGGATGAACAGAAAAGGGAGGAGAAGGATGAAGAAATTGGAGGG^ 
GTGGAGAGTAGATGTAGACAGGGGAAGAGAGGCCGGAAGTGGTGGCTCACACCTGTAATCCCAGTACTTT^ 

3 5 AGGATCACTTGAGGCCAGGAGTTCAAGACCAGCCTGGGCAACAGAGTCATACCCCATTTCT 

GTCCCTGGTGGCACACACCTGTAGTCCCAGCTACTCAGGAGGCTGA 

CGAGATCGTGCCACTGCACTCCAGCCTG<5CAACAGAGGAGACCCTGCCTCAAAAAAAAGTGTGTC 

AGGTGTGTGAGAAAGGTAGAAAAGGAGAATCAAGACAAAAAGAGAAGGAGGACAAGGAAGAGGG 

AGGTCAAGGAGGAAAAGAAGAAGGTGGAGGAGAAAAAGGAAGCAGGAGAGAATGAGAAAAGAGTGGA 

4 0 GAGG<tGGGCAAAGAAGAGATGGGAAGAAGAGA<3ACGGGAGAAAGACAGGAGGA 

GG AAGAAGGG AGAGG T AG AAG CAGATGAG AGTGGAGG AAGAGCG AG AGGAGGGAGG AG AGGAAAGACAAAC GGCAAG AGGGGG AG A 
ACGAGGAGGAAGATGAAAGGAGGGGCAGACAGGAGTGGAACGGGGAGATGCAGGAGGGAAAAGAGAGGAGAGGGAGGA^ 
AGGGGAAGGAGAGGGGAGACTAAGGTGAGGAGGAGGGGGGAATGGGGAGGTGGGAAGGAGAAATATGGAGACTGAGG 
GGGAGG AGGGGG AAAGGAGGGAAGAGGAGCAGGTGAAAGGAGGTGGGGGGGAAGGATTTCTGTGTCTGAG^ 

4 5 GCAGGTTAATCCCCAGAGGCTCCATGGAGTTCCCT^ 

GCTCrGGTGTCCTCCACACCAGAATCAGGGGriT-rCTTCCCCTCTGGGCCTGAGGGCTTGGAT 

GAGCACAGCCACCGCTGCAGCTGCGGCACTGGCCrACTACAGGGACGCTGAGGCCTAC^ 

GGCTCTCTTGGCATTGGCTGAGTGCTGTTGGGGTTGCCATGGAGATCCTTGGCTA 

AATGG CTGGAAG CTTCTCAAATG G ATGTG C CG ACCACTTT C CCTAGTT AAG TG CAG AC CTGGG AATT CCAATG CTC CT C AACCT GC 
50 CATATTGGGGCGGCCACACTGAGAGG CAATACTCG AAGT ATGTGGTGGTTG CCCTAGTTGTCG AGTGATCCGTGGAGCTCCAAATC 

CCAACAGTCATCCTCAAAAACCCACTTGGAAATGGTCAGAGGTTATTGGAGAGGC 

AGCATC CAATGGCCAG CAGCTGTTCTGGTAGCCTGTGGAAAAGCTGGG AACTTGGC CACCATCTTGGGGGTGCTGGG AACCACTGC 

ACCCTGACGTGCGCTGACCCTAGACTGATTTTGCCTCTTCTTTCCTCCATCCCTACCT 

TTGCTC^IACTGTATGGAGGGGATCCCAGGGGGCTCACGATATGCCGGCT 

5 5 TGTGTGTCCCACCCGCGAGGACTCTCCTCCCCAGGCCGTGGAA^ 

AGACAGAGCGGCTGAGCCCAGACCTCCTGACCCTGGGACCTGC^ 
CCTGACTTTTCCAGTACCTTCTTTTCTCCCACCGGGAGCCCCCT 

CCCCCTGCCTCCCTGTGGTGAGAAATTGAAAAAAGG ACAG GGAAGTTG AGG TGGGAGGGGTGGC 
CTAGCTCCCTCTTCTCCrCTCCACCCCACCAGAGGCCAGGGAGTGTGTGAACTC 

6 0 ACAGGACAGGCCACTACCTATGCAACGCCTGCGGCCTCTATCACAAGATGAATGGGCAGA^ 

CGCCTGGTAAGACCACAGACCTGCTTCACCTTATACACAGGAACCCCTGTCCT 
ATAGGAACGCTGTTCTCATGATTACAGGAAGCTACTGC^GG 
TGCCTTCATCCTACACAGGAAAATCTCCTCATCCCATATGGAAATCCT^ 
TACAGGAACCCAGATCTATAGAAGCTCCTATCCTCAACCTACCCTTATAAGCT^ 
6 5 CAGTCTTTTCGTCCTTGGCTTCACACTGGAATCACCC 

TGGGCATCAGGATATTTTAGCTCCCCAAGTGATTCCGATACGCAGCCA 

GAACAAAAGAAACTTATACCAGATAGGAACCCCTGTACTGGCCCC^TACAGAATCCTCAGGCA 
CTTCAACCTGACCCTCACITCTTGGGTCCTCCTGACATCC 
TCACATCCTGTCGTCCCCTAGATTCTCAGTAAACGGGCAGGTACT 
70 GAGAAATGCCAGTGGGCATCCCGTGTGCAAT 

CCCTCTGCTTTCCCTGTCTTCATGCCACACTGCCCCCCACT 

T^TCCAACCCCCTGCCCTAGACCT^^ 
GGTCTGGAGTTGGGGACACCCGCAGCCTCCTTTTTGGCAGGTGAACCGGCCACTGACCATC

ACCGCAAGGCATCTGGAAAAGGGAAAAAGAAACGGGGCTCCAGTCTGGGA 

ATGGTGGTGGCIXMGGGCAGO^TAGCGGGAATTGTGGGGAGG 

CTAC CAAGGC CTGGGC C CTGTGGTG CTGTCAGGG CCTGTT AG C CAC CT CATGCCTTT CC CTGGACCC CT ACTGGG CTCACC CACGG 
5 GCTCCTTCCCCACAGGCCCCATGCCCCCCACCACC^ 

CCAGAGGAGGGGTGGTGTCCTTCTCCTCTTGTAGCCAC^ 

ACCTTCAAAGCTTTTGTAAAATAAAACCACCAAAGTCCTGAAATTGTGGGTGTGTCT^ 
AGTGTGTAACAATGCCTGTGGCTCTCTGATGACAGTGCATGGCTGCAATCT 

ACACTTTG TGACCAAGGGTT ATGTTGTGT GATTGG AACAATCT ATGTG AAACT^ CA 
1 0 TGGT AAGAGGCCACGCAGGG CTTG CTGACGGTGGGGGTX3GTGCCGGGTG ACTAT AAG CACACTGGAAG TG TGTGTGCTGGAGCAGG 

TGATAAGGAGGTGTGCGGGGCTGTGGTCrTGTGGCTGGGCATGACGCTGGGGGACCT^ 

TAGACAGCATGACAGCATACGTGGTGGGCACAGAATCTGGTAAAGATTCTTGGA^ 

TTGTGGGTCTGGCGATGGCAATATTTCAGTGCAAGTCCCTGTGCATGGGCATGGGTGTGC^ 

GCCATGTCACAGTGTCTCTGAGTGGCTGACAACATGAGCAGCTCT 
1 5 AGGGTATAAGTTAGGGACTCTCTTCCAAGGGACAAAAGACACATTCAGGAAAG^ 

TGGCAAGGTCAGAGGCCAGTGCTGGGGCGCAGGGTTGTCCGTGGGTCTTCCTGTCGGACT 

GCCTGCAGCCCTGTGGGAAATGGTAGAGAATGAGAGGGGGAGATTAGGGGGACAAAG 

AGGTGGGGCGCTCCACACAGTGCCAACCCCTTCCCCAACATTCCCACTCACCT 

CCTCAATTTCACAAACACA CAGGAGCAT AATCTGAGTCATG CCTT AAG C AT CCT AAGG CAG ACTG CTTAAGTGGCTCAATGGAGG C 
2 0 CACACAGCCAGTGGCAAGGGGCCCAGCGGGGCAGTCTGGCCTGGTGTCCATGTGCTCAGCCA 
TCCTAAGGGCTCACTGTGGGTCAGGCGGCAGGCTGGGAGGCTCCAGTTGCTITAG 
AAAATATACCCATACTCATTTCACTAATGGAAGAGTGAGGCCCAGAACTGATGAGTAACTTACCCT 
GGATTTTGGATGGGGTGACACACCCAGCCCAGCAAATAAGTGCTGCTCACACTTA 
CGGATATGGGGGTTGCAGGGACCCACAGGTCCTGACCATATTATAC^TAAG 

2 5 GCGGGCATTCCAGCTCCACTAAGTAGCCTGCTGCAGAATCACAGGAGTCTATTATGCTC^ 

GATCCCATAAAAGACACGTGTCTGGCCTCCCCTCAGAAGCCCAGCCCTCACACCCTGCAGGAGGCCCCATCCTCTA^ 

TCAG AACCTGTGCTGGACACATG CCTGGCC CCACCTTGTTCAGGAGCCCCGGG CCTCACCCCCATAAGT ATAGCTCCCCCAGCCTG 

GTACAGAAGACTAGCTCTGAAGCTCTAGGGGAGCCCTTCATCTAATTATCGACAGGAAAC^ 

AGCGCTTACCCTGAATAGGAGGAACCTCTGCCTAACCCCAGCTAGGGACATCTGGACTAAGCTGTAAGGAAGCCCCT 

3 0 GGACAGGAACCCTTGTCCTGACCATCCACACAAGCCTCAGTCT 

TACTCACACnGCTGCAGTCCTGTCCrCCAGGTAGCATTTTAAAAACACCTCACTC^ 
GCTCTGTCTTTACAGAAATCCCTGTCCTCAACCTCTTCAAGGACCCTGACTOT 
ATCCCTGATGTTGCCCTGCACAACAATCCATGACCTC^CACCCTATCCCATAGAGTCCACAGGAA 
TGAACCT(^GTCTTTAACTTGTATAGAAAACCACTTCCTGACCTTATGGCA 

3 5 ACCTCAG CCTATACAGGCACCTATGGCCTCACCTGATACAGGACCCCCCCTTCTCA 

GAGGCTCCrrGGGCCTTACCTTGTATTAGCTCCACTACCCTGACCCTCTACAGCAGGGGTTCCCAACCCCCA^ 
CTGGTCCATGGCCTATTAGGAATTGGTCCACACAGCAGGAGGTGGGCAGCAGGCAAGCA^ 

GTCAG AT CAG CAG TGG CATT AG ATTCT CAT AGG AG CACG AAC CTT ATT G TG AACTGTGCATGCAAGGAAT CT AGGTTG CGTG CTC C 
TTATGAGAATCTAACTACTGCCTGATGAGCTGAGGTGGAACAGTTTCATCCCCAAACCATCCCCCACC^ 

4 0 TTCCACGAAACCXX3TCCCTGGTGCCAAAAAGGTTGGGGACCGCTGCT 

GACTGGTCTCTGCIAGAAAGCCTTTGTCCTGAACCTCAACAGAAAT 
TTTACAjGGAATCTCTGATCTTTATAGGAACTCCTAGCTTTACTCT 
ATTCTCTATAACAGCCTTTTCCCTGGCCTTGTCCCAGAAGCCCCACCTGCACCCT 

AAACACCGAC CTGAATCT CTG AAGG GG CAG TG T CCT AACT ACACAGAGG AT CTTTGTC CTGAACCCG CACAGG AAC CTCT ATT AC C 

4 5 CCCAGCIAGGAACCTGTGCCCTGAGCTAGCACAGAAAATCCTGCCCTTACACT 

TC CAAAG AATGAAC CAG T AT AAAAAAC CT CT ACACTGT ACAAG AAC CTCTG TTCT GT ACTTGTT AAACAAC CCAT AATCAG A^ CA 
CCATGAATACCTGCCTGGAGACTGTACAGGAACCCCTGACCTAACCCCCAATGGAATCACCTGATCTGTCTGTCTCT 
GCrCTGATCCTGGAa^GAAATACCTGGCCTGAGTACGCTGTGATCCT 
AAAAGGAACACCTAACCTTAATCTGTATGGGAAAAAATGCCTTCACAGTTTTCCCT 

5 0 TTCCCTGCAAACCIX3CCTTACCAGGTAACAGAATCATTCT 

CCCCTACACAGTAAGAGTCCCrGTCTCAATGCTTTAAAGAGAAATTTC 

GGGTAATTTT ATATTTT CAATGT AG C CACATTAAAAAGAGAAAAAAAAGATAAAATTATTTTCAATAACAT CAATAT ACTT AACTT 

TACCCAATATATTCAAATATTATTTTAACATGTAATCAATATAAAAATTACTG 

ATCTC^TGTACATTTCACATTTATAGCACATCATCTCAAGTTAGACTGGCC^ 

CTTCCTGAGCTCACTCTACCTATTCCAGCCTCCTCTGCAGAT^ 

AAAACCAACTAACTCCAAGGACCTAGAGGAGGACAGAGTC!ACAAAAT^ 

TCAG ATTGG CTTTTG AATGAG CAAG AAAT AAACATTT ATTCG TTAAG C CC CTG AAATTCTGGGTT ACAG CAGCT AGG AT AATTT AG 

6 0 CCTAATTAATAATAATCTCTCCCCCTCTCTCT 

TGCAGTGGTGCAATCTTGGTTCACTGCAACCTCCGCCTCCCAGATTCT 

ATTAGCCCGGCTAATTTTTOTAGTTTTGGTAAAGATGGGATTTCA 

ATCTGCCGACCTAC^CCTCCCAAAGTGCTGAGATTACAGG 

G ACAGGAT CTCACTCTATT ACCCAGG ATGGAGTACAGTGGCACAAT CTCGGCTCACTGCAACCTCTGCCTCCCCTCCCAGGCTCAA 

6 5 GCGATCCTCACACCTCAGCCTCCCGAGTAGCTGGGACTACAGATGCACACCACCACACCCAGCTAATTrrr 

ATGGGGTTTTGTAATGTTGCCCAGCTGGTCrCAAACTCCTGAGCTCAAGCAA 
ACAAGTGTGAGCCATTCTGCCTGGCCAATAATAATCTCTTGACCGAAATCTAGACACCT 
TCTGAACCTATACAGAAACACCTGTGACCCAGTACTGGCCCACCTGATCTCTTTGTAC^AAG 
CCCCATCCTGGCCACGAATGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAAGCT 

7 0 TTTGAGACCAGCTTGGGCAACATGGCGAGACCCCATCTCTATAAAG 

CCCAGCTACTTGGGAGGCTGAGGTAGGAAGACAGOTGAGCCCA^ 

TTGCACAGGAACAGG AAC CCAT AGTCTGGAG CT CT CT AGAAACCCCTG CTTGATT CAGT AAAGGT ACAATATAG AAACCATAGG^ 
TCCCTGTCCTCATTCTGTCCAGGAACATTTCGAATGACCCAC 
7 5 TCCTCACACACCTCACACAAAAGGAATACCTCTCCTTAGGAAAAAAGGCCCCTACCCrC^ 
AGAGACCAAGGCCTGACACTACAGCACAGATCG^
AAGGGTTGTTTTTTAAAAAGAGAAAAAAAGAAAAATGCCACTGAG 
CCCTTTATAAAGTTTGCTGACCCCTGCTCTTCAGAAACCTCTCTT 
CCCTACTCTATCAG<5CTGCAGGTATCAGAACTCTAGAATTTTTTTGT 
5 GGGGTTTAAAGGTCTTAGGTCTCCAACTCTTTTGCGCTGAGA 

AGGACATCTTCAAGAGG ATGAGAGGGCAAAGCT ACTG^ T CTG ATGCAG AAC CC CCGGACTG C 

GAGCTGACATTTAGACATGXJAGftGGGAGTTGCGGG 
ACTGCTTGGGAGCAGTGGTTACCCGTTGGAGGGGGCAGATAG 
10 GGCCCTGG<3CACTATTATTATCACAACTTT 

TAAGTCCCGTAACCTCAGTTTCCTTATCTGCAAAGCAGCCTTAATAATGG^ 

TGTCACATAATACTTGCTATCATTACCATTCCCTGCCCCCACTAAGACTCCTTCC^ 

TAATATTCGGCCAGTCTTTCCTGTGTACCACCATAGTGAATATTAACTCGGAAAATCCT 

TTTGGCCG CCACCACCAACGTCC AGTGGCAACCTCGCTCTCGG CCCAGCCACAACCAGCCCGCCCGCGCT ACGTCATCACX3GCGCG 
TCGCGCGGGAAGTCGCGGGGAAAAGGTAAGATGCCCCTTCCTAATTCCCTCGCTTCCTTGC^
AAGGGCTCCTAAGTGACTGGACAGGGACGGAGGGGCGGCGGCTATGGCAT 
GCTTGCACCGCAGCCCTGTCGGGAACCGCCGGCGCCTGCCTTTTACGTAGG 
CCCGTCGCCTGTGAGCTCGCGGTGTGGCTCAGCCCAGCTGGC^ 

ATGGAAAAGAAAAGCCTGAGCGCTCGCGTCCCCACAGGTCGCCAGAAACTTG^

GCTGCGTCCAATGAGTGGAGCGGTGAGTACAGCGCGTCX^CGGCGGAGGCGGGAAG 
ACTGGCTGAAGAGGGTTAGCGAAACGTTAGCGGTCGGGTCT 

GAGCGGGGCTGGTTGAAACGCTAGGGGCGGGATCTGGCGGAGTGGAAGGTACCGGTCCGG 
TCGTTAAGGGTGGAGTCGAAACCGGGGTCGGGGCCGGGGCCGGCTGAGTGAAAGGGTGGGTGATTATCCCG 

GGGCAGGTTCGTGAAGGAATGGGGTCTGGGCIGGGCTTAGA

TTGGAAAGGGCAGGCCCTAAGACGGACTGTGTGAGCCCGGGGGCTCATTGCTCCCT 
GTAAAAGGGGTGGAGTCTAGCGGGCCGGGCGGGGCCGGGTAGAATGGCCACCTGA 

TTGGAAGGCTGTGGAGAATCAG ATCG CGTAAGGAATGGAATCTAATAGAGGGGGCGGAG CCTGGAAAAGGGCAGAGAGGTGGGGTC 
CTCAGGGCACACTAGCCCCCTCACATACCCACGCTCCTCCCCCCACCCCCAGAACrc 

CCTCAACCGGCCAGGATTCCACCACAACOIGGCAGCGAAGAAGTAGG

TCGGTGAGGCCCTAGACCCGCCCTGATGAGGGGGAGAGGGAG 

AGAAGCGAAATATTAAAAAGGGAGCCGTTCCCCGCTCTATCCCCAATCTAGCGGAGGTAAAGAAGAAAGGCA^ 
GGCCAAGCAATGGAAGAAGACCTAATOSTGGGACTGCAAGGGATGGTG^ 

AAAACCCAAAGGTTCCCCACCCTGGACAGTTCCCTCTGACTCAGGGCCTAAAATGGATCTGTGTCT 

GGATCrGAACCTTGAGGCTGAAGCACTGGCTGGCACTGGCTTGGTGTTGGATGAG

ACAGGTGAGGCTGGGTCCTCCAGGCTGTCTCCAAACCACAAAAATACACACACT 
ACCCCAGCCTGGATACTGGAAGCAGATGATTTGGGGAGGTAGGGCTCACACTTG 
ATTGAGTGCCTATTGTGTGCCAAGAGTTGGTGATACAAATGTACCAAACAACAG^ 
GTAGGAGGGCAGATCATTAAACAGCCAACAGCAGTATGAGGTGTTTAATGAAGACAGACTTAA 

AATAGAGGATCCTAAACTAGTATC03GGCTTCCCAGAAGCCTTTATAGAGGAAGA

TC CCGTTTG CTTT ACTTTT CAAAAAG AATCAG CAAAT CCTTTTGG C TAT ATG TTCAAAAT AT ATT CACCATCTGAC CTCTC CTCAC 
CACCTCCACTGCCACCATC^TCATCACTCACCT AG ACTTGCAGAGGCCTC CTC CTACTTCCAACCTCACCTCTATTA 
T CTGTTCATG CAGCAAC CT AAAAAAAT ATGT ATG AG TTG ATG TCTTTG TT AAAAT ACTTTAAGGAGG C C AGG CATGGTGG CT CACG 
C CT AT AATCC CAGCACTTTGGGGGGGCCAAGGCAG G TGG AT C ACTTG AGG T CAGGAGTT CG AG ACCGGT CTG AC CAACGCGATG AA 
ACACTGTCTCTACTAAAAATACAAAAATTAGTTGGGCGTGGTGGC

GAATCGCTTGATCTGGGAGGCGGAGGTTGCAGTGAGCOSAGATTGTGCCAT^ 

TCT C AAAAAGT AAAAAT AAAAAAAAAAAAAGT ACTTT AAGGG CAAACTGTTG C CCTCAAGGTAAAGTCCAAATTTTTCAT C CTGG C 
CTCAAAGGTTCCAC^TGATCTGGCTCCACATCACTTTGTGACCTCA 



HUMAN SEQUENCE - mRNA

GCAAAGGCCAAGGCCAGCCAGGACACCCCCTGGGATCACACTGAGCTTG C(I^^ CGCAACCACC 

AG CCCAGGTTAATCCCCAG AGGCTCCATGG AGTTC CCnX5GCCTGGGGTCCCTGGGGACCTCAGAGCCCCTCCCC(^GTTTGTGGAT 

CCTGCTCTGGTGTCCTCCACACCAGAATCAGGGGTTTTCTTCCCCT 

CCCGAGC!ACAGCCACCGCTGCAGCrGCGGCACTGGCCTACTACAGGGACGCTGAG^ 
ACCCATTGCTCAACTGTATGGAGGGGATCCCAGGGGGCTCACCATATGC

TCAACTGTGTGTCCCACCCGCGAGGACTCTCCTCCCCAGGCCGTGGAAGATCTGGATC 

TTTGAAGAC^AGAGCGGCTGAGCCCAGACCTCCTGACCCTGGGACCTGC^ 

GGGGCCCTGACTTTTCCAGTACCTTCTT^ 



CCACTACCTATGCAACGCCTGCGGCCTCTATCACAAGATGAATGGGCAGAAGIAGGCCC
TGAGTAAACGGGCAGGTACTCAGTGCACCAACTGCCAGACGACCACCACGACACT 

TGCAATGCCTGCGGCCTCTACTACAAGCTACACCAGGTGAACCGGCCACTGACCATGCGGAAGGATGGTATT 
CAAGGCATC!TGGAAAAGGGAAAAAGAAACGGGGCTCCAGTCTGGGAGGCAC^ 

TGGTGG CTGGGGG CAG CGG T AG CG GG AATTG TGGGG AGGTGG CTTCAGG CCTGACACTGGGC C C CC CAGGTACTG CCCAT CTCT AC 
CAAGGCCTGGGCCCreTGGTCCTGTCAGGGCCTOTTAGCCACCTCATGC
CTTCCCCACAGGCCCC^TGCCCCCCACCACCAG 
AGGASC^TGGTGTCCTTCTCCT 
TCAAAGCTTTTGTAAAATAAAACCACCAAAGTCCTG 






HUMAN SEQUENCE - CODING 

ATGGAGTTCCCTGGCCTGGGGTCCCTGGGGACCTCAGAGCCC CTCCCCCAGTTTGTGGATC CTGCTCT 

ATGAGGGGTTTTCTTCCCCTCTGGGCCTG3UK3GCTTGGA 

CGGCACI^CCTACTACAGGGACGCTGAGGCCTACAGACACT 

ATCCCftGGCCCCTCftCCATATGCC66CTGGGCCTAjCG6CAAjGACGGG6CT 

CTCTCCTCCCCAGGCCXJTGGAAGATCTGGATGGAAAAGGCAGCACCA^ 
ACCTCCTGACCCTGGGACCTG CACTGCCTTCATCACTCC CTGTCCCCAAT AGTGCTT ATGGGGGCCCTGACTTTTCCAGT ACCTTC

TTTTCTCCCACCGGGAGCCCCCTCAATTCAGCAGCCTATTCCTCTCCCAA^ 

CAGGGAGTGTGTGAACTGCGGAGCAACAGCCACrCCACTGTGGCGGAGGGACAGGACAGGCCACT 

TCTATCACAAGATGAATGGGCAGAACAGGCCCCTCATCCGGC^ 

ACCAACTGCCAGACGACCACCACGACACTGTGGCGGAGAAATGCCAGTGGGGATCCCGTGTGCAATGCCTC 

G CT ACACCAGGTGAACCG G CCACTG ACCATGCGGAAGGATGGT ATT CAGACT CGAAACCG C AAGG CATCTGG AAAAGGGAAAAAGA 

AACGGGGCTCCAGTCTGGGAGGCACAGGAGCAGCCX3AAGGAC 

AATTGTGGGGAGGTGGCTTCAGGCCTGACACTGGGCCCC 

AGGGCCTGTTAGCCACCTCATGCCTTTCCCTGGAC^ 

CCACCAGCACTACTGTGGTGGCTCCGCTCAGCTCATGA 
Table 26

MOUSE NOMENCLATURE 
ICSGNM Fkbp5
Celera mCG18519 

HUMAN NOMENCLATURE 
HGNC FKBP5
Celera hCG17659 

MOUSE SEQUENCE - GENOMIC

ATAGTGAAACTGT CTCAAAAACIAAAAC CAGGGCTGG AGAGAT AGCTCAG TGG TT AAG AG TATTGACTGCTCTTCCAGAGGTCCTGA 
GTTCAATTCCCAGCAACCACATGGTGACTCACAACCATCTGTAATGGGATCT 

GT ACT CACAT ACATG AAAT AAAT AAAT AAATCCAATTTAAAACAAAACIAAAACAAAC C CAAACAACAAT ATGG CCT AAAGTT AT AG 

ATTCATTC CTTTGTCTCTG C CCTT ACCAAGTGTATGTTTTATG AAG ACACATTTTCT AG AACIAATG C CTG AAG CAGCACACTTG TT 
T ACTGGATTCCCTT C CAGGGACTCAAACT AGGGC CTCAT AAAT AAAT AAAT AAAT AAAT AAATAAAT AAATG^

TATGTGAGTGGATATTTTTCCTGCATGTAAACCAGGGCATCCCTGAGACCCCAG^ 

CTGCAGTTCCAGATGGGTGCITGGGATTGGATCCGGGTCCTCTGGAA 

CCCTCAAGGAAGCTTGAGTTAGGJUVGGAGAGCACTG 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 
2 0 NNNNNNNNNNNNNNNNNNNNNNNN^^ CTGAGGCTGAGTT AGGCT AG CAAG TAAGGCTC CGCAGGTGGAGGG AGAC 

CAGACTCCTGGCAACTGTCCTGATCACCACATCTGTGCTATA 

GTTTAGGTTGGGTTTGTTTTTGAGACAjGGGTTTCT 

TAGTGGGGCTCTGCCTGCTGAGGATCTTTGCAGACATTTTAGAATACAT^ 
TCATGTCACAAATTGATCTACATGTGC(^TTAT 

CAGGCAGATCTCTGGTGAATTTGAGGCCGGCCTATTCTACATAGTGAGTTCCAGGACATCCA©

GTATTAGTGTATGGGGTGGGGACGGTTTGGCAGCTTTCTAGTTTTTGTT^ 
AATGGGTAAAGTTTCACTTGTTCCAGGCAGATGGGGTTTGTAGCTC 
TGACATTTAGGTGTCAGCCTGTGCAGTTGTAATCAGGTATAAACTGTGT 
TCTCATCCGTAACn'CrTC^GTGTCTAGGGGGCCAAGTCATCACTCATGCCATAGGGACA 

CACTTTAATGAAAGCAACAGTGTTCTTTTTGAATAAACA

TAGAGGTAAAATAATCTTCAATTGGACATAGATTTAGCTCATCTTTTTCAC^ 
TTTGCACTTTCATGATTATTCACITGAGGAATTGTTTTTTT^ 
TAGCTGTCTTCAGACACTCCAGAAGAG<3GAGTCAGATCTCGTGATCGGATGGTT 
TCCTGACCTTCGGAAGAGCAGTCGGGTCCTCTTACCCACTGAGCCATCT 

AGATTGCTCAGTAATTGAGAGCACTGACTGCTCTTTCAAAGGATCCAG^

TCTTAGAGGAACTCTCTGGCCTTTGTAGGCACCAGGTATGCA 
ATAAAATCATTTTTTAAAATAAGAGCATATTTTTAA 
GCCTGTGGATATTTGATTCACAGXK3AGACCTGTACGCCT 
AAGTGAAGCATGACTGGTTCTTTGTACACTTAAACCTTTCCTTCCCTC 

AAACGTCAGACTTTCCACCTCTCACCTTGGCATCrCTATCGACCCCTCTCC^

TCTCTCTCCCAATCTTTACTGTTGCCCAGTGCCTCTGAAAGGCACTAGTGA 
AAACAACCTTTAGCCAAATACTCATTCTTCAGAAATGTCCT 

ACAAAATTACAGACAAAGTACTOVGGGAAGTGAGCAATCACATAATAAAAGATGGTGA 

TGTGACACACT AAAATTTtSTCCATAGCTGCTGACCAAAT AATG AC CT AGAAATT CCAT CATCAAAAATCTCT C C CAACGT AG AGTG 

4 5 AAAGGACAGTGATG&AAAGGAGAGAAGCGGTGGCTCCAG 

GCCTGGATGCTGACTCATCCTGGCCCTCTCGTTGGAGTC^ 

CAGTGTTAAAAAATGACTAAGAAACCTGAACGGTAAGACAGGTCTGAC^^ 

CTCCAGTGTCTGTGCGGGGAGACCTGTAAGGATGGGAGACCaiT 

TACTGTAATTCTGAGAAGCAAAAATACTTTTGT ACTTTC CAT AGGGAAAAT ACAGG AGCCACACTAGGAGCAAG AGAAG CCTGACT 

5 0 GGTCTATGGGACCCAACATAGGTCCTGCTCTT<K5CCCCTGAGTTCTCT 

ACTC CATGGT CAGGG CACCAGGGAAG TGGTTTGCTTCT CTGTTGGAT CCATTGAT CTCGTCTGCTTT CT ATT CC C CGACTTCTCAC 
AATCAGTGCTGCATGCCTACCTTTTGTTGCTGTTTTCTGGA 

CTTCAGTG CAGAAAACAG CTTGTGTAGTATGACCTTG AGAAG CAAAAGCTTCAAC CTGCTTG AAGCC CAGGAGGGT AAAAT AGTAA 
GAGGTAAAATACTTTATTCAGATTTTTCTTTATTGCATAATGTC 

TGTTGTTTTGTTTTGCTAGCAATACAACCCTGTGCCTCCTGCTTACT

CTACTGCTCAGGAATCAAGCAGGAGGAT<ZTCTGTGAGTTCT 

ATTCACCTGG CTAAAC CGCCCCAGCAAAG CAAAGCTGCACTGTGAG AATGTGTTGGGGGTTCTAGGCAAGGAGCGTCTACTTT ATC 

TCTTCCCAGGAGAACAGTCCCGTGTTGCTCAACTCTGCTGTGGTGGACATGG 

AGCCAGCTCAGCAGGTCAACCCTCACAACAACGGAGTGACAGACAGAGCCACATT 

GAGTCCTCC^AAAGCATTGTGTATATTGAAGACTAGACTGAGTAG

CCTTACTGGACCTCTCAGAGCCCTTGGCTGAGAGGCTGATGGAATCACTCCC^ 
AACTGAGGAAGGGAGGAAGACCTCAGAGGGACTTGGGACTCAGGACT 

CTGG C CGCTCCG AT ACAT CTGTCTG ATCAAG CTGTTT CTG TTG ATTTTT CACTTCTT AT AAAAGT AT ACATG CAT AC G T GT AT GAT 
AAAATATACTGCTTTAGACATTTAAATGTTCCTAAGTGGCATTAAGTACACTCC 

CTTTTCATCACCCCAAACAGAAACTCTGTACCTTACA^

ATGTOTTGTATACATGTGTATGTGTTGGCATGTGTCTGCACTTGTGTACAGGTCT 
CGTATTTTATTTATGATTGKjTGTGTATGCATGTATATGCCACAGCATTTGTGTAGAGGCCA 

TTCCTTCCATGTGG AATC CAGGCATCAAACTCCAGATTTCAGGCATGTGTAGCAAGCG CTTTTACTTTCT AAGCCATCTCTCCAAT 
CCTGCACCAACTCTAGACTGTCTGGCCAGCAAGCCCCTGAGATCCTTCTGTGTCAGCC 

TGTACTGGGCTACAAACTCGGGTCCTCACATGTTCAGCTTGTGCTCT

TCTTATTTTCCCATGATATCCATTCTCATGGGTAG 

GTGAAACATCITGTCATTTTCTGCATGAATGTGTATGTGTATATATATATATTT^ 

CCCACCACCTCAGTAT AGAAG AAAGACA1 T TCT 1' KjCGAAAAGTAG CTGAATAATTGT AGTGTGT AATTTCACGGAGCTGTGTAGT 
TGTTTCCTCTACCTGTGTAGTTAGCTCl TlT 'l T T' r ' l 'l'l I rCT TT TTTAAAGTGTATTTCTTGTCTGCCAATAAATAACTATTACTA 
TCTCTAGCTCTAGGTTGGAATGTCTGAGAGAGGTCAGAGGTT

TCAAAATTGAGGTCCACTTAGCATGCCCTGTGTTCTGAGGTTGAGGTATC 

AAGCTATAGACAACAAGGCCAGGAGTTGGCGGGGCTTAGAAA^ 

TGTCCGATGGGACACCGTGGAGGAAG AGAAG CACCGGTCACATG CACAGACT CTG CAAAGCTCAGGGT AGCTT CCCACG TG AGAGA 
CCGTAGTTACCTGAAGTTGTCAGTACCTGTTGATGAGACACTGTCTTCTCAT
ACTTTGGAGGGTGAGTGCCTATCTGCCATGTCCATAACAGGAG 
CCCTTAGCTGGTGCAGTACAGTACGCITTATCATGGAT^ 
GAAAAATACTGATATTTCCCTAG C1TTTC1 M 1A AAGCT 

TT ATTTTG TTTG TTT ACTT AT AAAAATTT ATACTTCT AG TG ACCTACATG AATTT AAGGGGAAATGTTT AGAAAAT AAAAGGATTT 
GGGAAGGGAGTTGCTTTGGTTTTTGTTTGTTTGTTTGTTT

TG C CAACTGAGTG CTGGGATTAAAGTTGT ACAGGATCACCG AC CAACTT AGTTTATTT CITT ATTCCTT AAGATTT AAATG GCTTT 
GAACCTGGAAAAAAAAAAGGTTGTTTTCTGGTTTTCCAGACT 
CAGTCTCAGATTTGCACGCTCGCATCCTTAAGTTGATGAAGTTCC^ 
AGGGAGTTAAGACTTACATGAATAGCAAGCCAAGTTGTTCTTTTCT^
TATATGAATACTGATTTTACATCATTTCrrCCTTTAATTCCTC 

AGCCTGTTGAAGTCTTACAGTlTTGCrTGTATGGGTTTAGA 
GCCTCCACCTCCTGAGTGCTCGTATTGCAGGCAGCATATGCTGCTGTCACCAG 
TCTTTTTTAAAAATTAATTTTGTGTTGTCTGCATGGATGTCT^

CTAGAACTGTGTAACGGACAGTCGTGAGCCTCCATGTGGGTGCTGGGTATCAAACAT 

TCATCTCTCCAGCTTTCCCATTAAGTATTTTTGGAAGAAATAGCACATAGATAAAGTCCG 

TTTGTCTAATTGCTGCTGTCACAGGTTTGGTTATGGCG 

GGAGTTTTTGAGACAGGGTATTCCrGTGACTAGGCTGGCCTCAAACTAAGAA^ 

TAAGGCATGCGCCAACACACCCAGCCCTAGATCCAGCTTCTTAAATGTTCCTACTC

GCAATCCCAGACATGTGGGCATATAAAATAGACATACAAGGAGAACATTTATTGATAAT^ 
TTGCTGTGCAGATATTTTTACTATTTATTCGAGATAAGCAAATATGCA 
GAGTCCAAAGATAGACTGTGTTTTGATATTACCTGCCGAGAGGTGGCAGTAG 
ATGGTGTTTCTATAAGAGCAGAAACCGTTGTGCGTGCGTGCAGCAGAAACACTG 

GGGGCTGTGAGCAGCACCGTTTAGGATTGCATGATC

GCATTGCTGCTGCTCAGCTGGTCTCCCCGGGTGTGGGCTCACCACTTCCTCT^ 

TATTGCTTTGTTGGAAAGGCCCTCTCACCTTTTCTGGTGGGTAA 

CTAGTGCCTTTAATGTGCrrCAGTGCTTCTGTCCn'GTTGTGTTGCTAGAGGC 

AGGATAAGTCGATGAGCAGCTCCCTCCAGATAGTTAAAGACCATTCCCTGCTGTTTAGGCGCTTCCTATA

ATGTGTGACQUVGTCCTAGAAGCTGGGGGTCGGCCCCCCG 

AC1TAACTTTGGTTGTTTGTTTGAGAACAGCTTGTTCTAGGCTTTCCCCTACCCTGAGCCAAACATGTGGA 

TAAACTGTG CTAGGAAATCCACACAGGGGCTGGGCCCTGGGATTGTG CTGCTCTCTTCGAAAATCTTTCAGCACAACAAACAGCTC 
AACCTGAAGAAAACTTAGCTTGTTTGGAAATGAGTTAAAACACTCATATTTTAGT^ 

GTTCTrrACAATACTGGGAGCTGTTTTGTGTTTGGTTTGACTGTAAGCAGAGGGCCCAC

GTCCTTACCCTAGTGTGGAATAGCTTTAGAAGTGGCCCACAGTGAGCCGGTTTTGGTAA 

CTGAGGAGAGAGAAATGTCCAGTTTGAAGCCAGCCTGGGCTACATAAGAACCTC 

TTCATTTCCCTGCTCAGTCAAGGAGCAGTAGAAGATAATCCCGGCAGTAGACAAGT^ 

TG CAG ATTGG CTTTG CAG GAG AAAAGG T AACAG CCCT CAGGAGG CAG ACAGGTGGAT CTCTGTTAGTT CAAGGC C AGGCCAG TCG A 

TACATTTGAGTTCAGGACAGTGGGTCTCTGTGAAGCAGAG

AAG T AGG AGCG AAAGGCAG C CCTCCAG AGTTCCCTG CTGAAGACTGT AATATG CAAAG CACACATCC CGTGTTCT ACAAG ACCC^ 

TATCTACCAGGAGTGCTTCATAAATATCCACATCCAGCTGAGAAGAGCTAAGGAAGGGTAAAATGGCAA 

TTGTGGATCTGGCTCATGAGAGAAAGAAGAGAGAGTAATCAAATGGGTGTGGCT 

GTGTGTGTGAAGGAGAGTGGCAGAACACCAGAGGCCTCTGTTCTCTCTCAACAATATGGCTGTAGGAGAA 

CTGAGAGGGCAGCTTCTCCCTGTGTTGTCCTGTTAAGGAATA

TGGCTAGACGGGAAATTGAAGAAGTGGGTGATGTTTTATTTCCTTGTCTGATGGCATCAA 
TATTGTTTCTGTGTTTTAACTCTATTAGTCTTATGTAAGCTTAGTTCT 
TTGAAGGAAATGCATTTCTCTTGGCAGGTCTTCTACTTACAAAGGACAATG 
CCCAGC^GCCACCATGACTGAGCAGGGTGAAGATATCACTACGAAGAAAGACAGAGGA 

AAGCAGGGAGGTCTCTGTGTGGGAGCTTGTGGGTGCTTTATGCAGCCTGTCTTGTGGACGGGATA

AGCrCACTTCCCATCTAGCTTCATAGCTGGGCGTAATCTCAGATTATAAATGGTGCAG 
CATCTTAGGCCCTGCAGAACAAGTTCCAGTTATGCGTTCCCTCA 

TTTAATATGC CATCTTAGTGTGCTAAAG CTATG CAGTG ACTG CAAAAGAGTCTGTAGAGAAAAAGGAGAGGACTTAGTTTTTATAT 
ACAAGTACTGTTTCTATGTGTCATGATGTAAGGACACAGAATATATATTTGGATCATCT 

CATTTGTGTACACATATGATGTATACTTGCTCCTCTGTCAGGATATCTTCCTCAGTCACTTTTCT

GAGTCTGTCACCAAACCTAGAAGGAAGCCTGTTTGAGTAAGACTGGGTAGCC^ 

CAGC^CTGGGGTTGGAGACACACTCTTTCATGATTATGGTTTTACATGGGTCTAAGGAATCTA 

CTCCATAGACATTTTTTTAATTTGCGTCTTAGCAACAGTGTTATGAGATAGTACCT 

TAGGGTCTCACT AAGTTGCCAGGCTGGCCTTGAATT ACAGTGTAACCCAGGCAGGT CTTGAACCTGCCATCCTCCTCCCTT ACT CT 

CCTGAGTAGTTGGGATTACAGGATTGTGTCCCCAGGATGGGCCAGGTTCTCGTCTGTGGTTCT

CTTACCTTGCTCTGTGCTGTCAGTCCATTGAGATCTCTTTTrCCTGC^ 
TCCTGGAAGCTTCTCAAGACAGTGGAGATTCTTCAGGCTGTCCGCAGCGTCTCCTACCT 
GAACTGGTCTCCCAGGTCCTCAGACCCTGTGTATTCrrGTAACCAAGTACTGAG 
AGTGCTGTAGTCTGAGAAGTCCAGGAACAGAGCATTCCTGCATCACCTTGCTT 

GGCAGGCAGGCAGGCAGGCAGGAAGATCTGAGAGGCACAAGGACACGTGG

TCCACAGGGAGCTCACAGAGG CAAGACCTTG ATTCCACCT AACTTCCCTCCTAAAGG CCTCGTATCC CAGCACTGTCCTGGTAGCA 
GTTGATCCAGTGTAAGCTTTTGAGOK3ACAGGTCATATTCAACCCT 
TCAGAGCTCTTGGATAGGCAGTATGCTATAGTCTCCTTTGTCCTAG 
CTGTGTGGTTrGAAGCTCTTTAGGTTATGACAC 
TATGTTTCACAGCTGTGGCCCGGAGATTTGTCAGCCTCCCrcGGTTAAGGCTTCT
ATAAAAGTTGGTTTTACATTTTTCATATAAAT^

TTGCTGGGAAGATAGACACACACATAAGAGGGCACTATCACGTGTTAGCTGCTGTCTTCTGCACCA 

CAGATGATTGACAGACCGAATTGCAC CACCACACGC CTGGGTT ACA'iT 1" i' CI T T T CCAAG AATTGAC CAACTGCATT AGT ATCTTG 
CTCTGCCAAGAATAAAAGAAGTAGTCTGGATTTTACAGTGGATAAATGTT^ 
TTTCCTTTTTCTTTACTAGAAAACAAGCTACTTGGTATCGTCT

TGTTACATCAGAGCATACAGCAGAGCGTTTGGACCCTATCTTTAGCAAAGGAGGGA 

TTAGGAAGTCC^CAGTCCTTTGGCTAGTTCAGTTTGTGTACCTT^ 

TGGCTTTCAACCAACQUVGTGAGGTTTACTGG 

CTGGAACTT ACTGTGT AG AC CAGG CTGTCTC CGCAGTGATC CTTTTG C CTTG CTCTGGCTTTCTATTTTGAG AGACTAAAT ATTGT 
GGTGAGCTGTAATAATAGTCATAACACAAAGGGTrAGTGAAAGGCAGTTTTCT
GGTGAATCTGCCCATCTCTCTGGGTGCTGGGGACTAGTCCTTC 
TTCTTATGTTTGTCTTTCAGATTGTCAAAAGAGTGGGGACT 

AAAGGGATGTTGTCAGATGGAAAGAAGTTTGATTCCAGTCATGACAGAAAGAAGCC^ 
GACAATATTTTCAGGGTTTGTGTAGGTTrGATTCTC 
TATTTATAAATACCTAATAAAAATGAAAAAAAAAAAGAAAGTTCTAGAGATAGATGTGCAAAG
CGACTCTGGAAATAAGAGCTAGGTTGTAGCTGACAGTAACCCATTA^ 

TGGG AAATTTCATG TTACAT AT AT CTT ATT ATT ACCTT AAAAAATGGG CAGATGT AAAC CTAATT CTGT C CACAT ATACTTGG AAA 
AAAAAAGATATTGTAAATAAGAGTAAAAAAAAAAAAGAGTTATTTATTTATTATATC 
CCAGAAGAGGGTCTCAGATCTCATTATAGGTGGTTGTGAGCCACCATGTGGTTGCrrGG^ 
^TCAATGCTCTTAACCACTGAGCCATGTCACCAGCCCCATGATTCT
TGGATCCCTAGGTGTCTCTGTGGTCCTITTGGGACCATAGTGTAACriT 
GCTCACGCTCACACTGGCAACCTAAAATCrrcCAG 

CCCATTG TG TTGTCACTG C CT CTCCTGTTGG CTGTCATCTCAGTG CTTC CAAAGT CG C CCTCAGGAAGCCTGCT AC ACTTT CAGTT 
GGGAACTGAGCTAGCTCAGTGTTGCCGTGAACACCATTCTTCCITGCAAA^ 

CTGAA.GGAGAATGAGATAAACCTGTCAC!TTCAAGGAAAAC^

TTTAAAATATTTGTGGCTGTCCCATGCTTAACAGCTTCCrGAGCCGTAAG 
ACTTGACITCTGGACAGTGAGGTTCGTCIACT 
ACATACACCGTCAACTGTGGACAAGCAGTCCATTCAGGCT 
ATGACCrGCTGAACCTCTTTTGATAGTCATCrAGGAATTCTGTGACGCTGCCCT 

CnTTTCTACATCTTCAGTCAGAAGCAAGCAAATGACTTCCTCITG

GTTTCCGACTGTAATGCCAGCACITGGCAGATTGGAAGCAGT 

AAC^TAAGCCAGCATAATCGCACACACCTATCTTCTCAGCAGTTC 

CGATGACCCAAGGCAAGTTGGGGAAGAAAAGGTTTATTTGGCTTACACTTCCA 

GGAACTCCAACAGGGCAGGAAGCTGCAGGOVGAAGCTGATGCAGAGGTCA^

TTG CT CAG CC AG CTTTCTT AT AG AACCCAGG AT CACT AG CCCAGGG ATGGCAG CAT C CACAATG ATT CTC CC CAATT AAC CACT AA 

TTAAGAAAATGCCTACAGCTAGATTTACCXIATGCATTTTCCTGTTGAGTGCTCT 

GGCAAAGGAGAGGGCCAAGTTCAAAGCTGTGTTGTATTTCACAGCA^ 

CATTT AG ATTTT ATATT AAAT ATT AT AAC AG TTG CTG CTTCATCAAAAT AAAAAAAAAAAAAAGG AAG AAGT ATAAATACTTGT CT 

GTGAGCTCTCTAGTCCCTTATAGTATATGTCTCCCAAAGGAGTATATAGAAATATTCGAAAGAGACGTTCCAGG

TTCTAGTATGACTTCAGTGTTTTGTTTTGTTTTAAGATATAGTCTTCC 
CTCGAACTCAAAAGAGGTCTACCTACATCTCCTGAGTGCTGGGGCTAAAGATATGCAA 

TAAATAGGACTATTATGAGCCTGACTATAAAAACTTTATTGAAAAAACACATATACAATAGAGCACTGCCCCTCCT 
GCCCACTGCCTTCCTGTACTGACTGAGCTCATCCCCTACACTGTAAATGACTGTTCTTACTACCAACCGCT 

AAGATAGCACACACCACTATGTACATCAGCCCAGCATAGTGCAGACCTGGGT

TTTAAAGTTTTTACTGCAGTCAGAGTAAAAAACC^TTGTAACTTCTT 

TTGAAACAATGCTTAGCCAGT Cl ' T 'rT' l 1 T 'l' T ' l 1GTCTGAGGAAGTTGGGAAAGAGAAACAGAATCAGTCTAAGACCTGGTTGTTAG 
OGCAGAGGACTGTGTCCXZAGGATAGAAACAGGATAC 

TGATTTTTTTTCTCTGAAGCACCCTTCCTTGATTGTGAAGTACTAAATGATAGAAAACTTCAA 

ATGTCTCTTATCTCCCTTGCTGAAAATTACTGAGTATACACTTTTAAATATATGCTTACATT^

TGACAGCTTGAAAGTGTTTK5TTTCTACCTGAGGAGGCAGAGGTACA 
GCCAGGACTCACACCCCAGGTTGTGTTCCTGAAGT^ 
CAGCCTGCTCACACCAGCTGCTTTCCTTTTTGCIAGTGATGCATTC 
TGATGATATTGTGAAGTTAACGTGTACITTAAAAGAAGTGTCATACTTTCT 

GTTAATAGACTTATTAAAAGTACTGAAACTTGGTGATGGTGGCACACGCCT

ACATTTTGTTTCAGTCAGGTGCTGAAGGAGACFr^ 

ATAAACATGAGACCCTCTACACAGTTCTTTTTGATCACGATCCAGAAGTGG^ 
CCTTTTTCTCTAACTTTTATTTATTTATTTAT^ 

CTGGGGATTAAAACTGGGGTCCTGAGGGAAGCAGCCTGTGCTCTTAACTGTTGAGCCGTCTCTCGA 

TTTCTTTTGAGACAGGGTTTCTCTGTGTAGCCCTGGCTGTCCTGGAACTCACTCTGTAGACCAGGCT

CTCCTGCTTCTGCCTCCCGAGTATTGGGATTAAAGGCATGCTCCATCACATGGA 
TCTGGCACATGCTATATCAGTGATGAGCCCCAATCCTGAACCCGAAAGACATCCTACTAAATGAAA 
AGCTGTGTGCCTCCACTTGTATTACATGTCTGTAGAAAAAAGTCTTTGCTGGTGGCTGAGAGGAG^ 
ATCAGGTATAAAGTTTCAGACGAGTAAAACCAGAACGTTCTGGAGATAGGTCGCACAGCGTATACATGTAGTCAC^ 

ACTGAAAAATGCTTAAGATGATAAACTTTGTTTAACCCCAGCGAAACATGTTTGAAGTCAGAGGAGTTAC^

TAGTCAGTTATTAATAAAGTAGTTTTTATTATCAAGGGA 

TTTTTAAC CAGAAAATGT AT AT ATCACTACAGAAGAAAGGAAAAAAT AT AG TCCATTGTTTTT AT C CTGGG CT AGTGTACATTTT A 
CTTACATATTTATTTGTGTGTGTTGTG CAG ATGTCTGTCATGTTTOTACGCATGTGTGTGX^GTTCAG CCATGGGAGCTGTCCATG 
CTGCTTTTGAGGAAGGAGTTCTCTTTGGACCTAGGACTTCC^ 

GACCGCTGAGCTGTTTGCCAACATTGCCTTTAAACTCA

GATACTTTGTTACAAAAAGAATAGAACTAAAATGGAAACATTGTGTTTGAG^ 
AGGACTCAATTTTTAAAAAATTTATTTTTGGAAT ATT CTGT 
TGCTI^TC^CCACTGGTTCTCCTTCTC^GGGACTAC^CCT 

CTCTAGTTTTGTTTGCAGATAGCCTGAGTGATACAACTAGTTCCCATGAGCTC^GCG^
CTTGGCATTGGCCCTCrCCAGCCACAGACCGCCAGGAGCACACTACAGTTGAAC^

ATACAGTGGGC^TACAGGGTTGGAGAATGCATGTCAGAAAGGATC 

ATGTTGGGGTGCATCTCAGAAAGGATGGTAGGTACGTGTCAGGAT 

ATGTTGGGGTGCATGTCAGAAAGGATAGTGGGTACATGTCAGGATGGTGG 
CTCT CGTGTGGT CTGGG AGAGTT CTG AAGAAG CAGAAGTTTG CT CTGGAGT AGAAAGG AT ATTG AGT AATCCTGTGCTGG AG AATG

AAGACrTGTGCCTAGAGCAGAGTGAGACGATATTAATCTATGGTGAGGTGCAACAGTTC 

TTGATATTTTGTAGGGTGTACAGTGACCTTATCTTTTATCCAAGCTTAGATGA 

TGATGGTTATGCAGTGACCTTGTCTGCTGTTGGTGTTTTATA 

GCCCAGGCTCX3CCCCAGACAACGGAGCCTGCAGCGAGTGTCAG 
GAGTCTTTAACCTAACTACCCATGTATACTTTTTCCAAGTATT^

GGAGGAGGAGCTGTCTGAGAGACTTGTAGGATACTCATACAGACTTGTTACG 

CTTAGAGGTGGATAAACAGTTAAAGTAAGTCGCCTTCAGAGCCCGTCACCAGGAGAG 

ATCACGTTAAAGGAGTTTTTCTGTAACCAGGATTGGCCATACTC^ 

CTGGCCATATTTTAAATTITAAAATGAAGTATATGTTGTTTT 
CTGGTCCTATCAGTTACAGGCTTTCACTATACAAAG

CAGCGACCTGGACTAGJVGCTGGAATACTGTGGTTAATAAAGGGCACGTC 

ACT ATGTTGAG AAG CTACGGAG C CAAAT AATGAATCATCT CAGTTCT AAGAAT AGG CGTGTAAAGGTTT AAAAT AT^ 
AATT ATGTGT AT AATTT AAAGG AAATT AACCAGGAAATTG ACAATTT CCTT GG CAG AAGTTTTCAAATGGTGTT C 
TCTATGTTTCAATGTTTTCTTGGTGTTGCITTTCAGAAATTT 
TAGAGGGGGAGACAGGGGAGAGCAAAGGAGAAGGAGGTCCAGGGTTAGCCTTGGTGA
CATAAGATTCTCTCAATAATGACAGCAGTAACAATGCCAGGCCCTT 
AAGAGAAGGGTGTACTTTTATATTACAGTATCATTTCCTACCAGT^ 
TCCATTTTAAAAGAAGAGTCAGATAGATGGGGCTGGTAAGGTGGCTGGGTC 
GAAGGAGAAACCAACTCTACCAAGTTGTCTTCTGGTGACCTCC 

CATTAATCACACAGGAGTGAGACAGAGGAGCGCTGGCTCATITT^

AGATGTACTAGAAGAGGGCATCAGATCTCATTACGGGTGGTTGTGAGCCACCATGTG^ 

GAAG AACAGTCAGTGCTCTTACCCACTGAGCCATCTCACAGCCC CGG CTCATTTCTT AAGATGAGGTAGAAGATTACGAGTTG AGA 

CTAGCTGGACTGTGTAATAAGACCTTATCTCAAAAGGCCAAAAGAGAGGGAGTTATATC 

TTCTTTATTTTACTAAATAATTTATTCTTTACTAAATACCAGACTCTTTG 

TGrTATTATGTTGTCTGGGTCAGGAAACAGATTAAACTTGCACAGGTTTGCAAAGCAAACCT

CCCCTACAGTGCATAATCCTCCCTTTAGTAATCAAGAAAGAAAGTGAAGCTATTAAATAAAAAG 
ACGCTGCCATATATTGAGGTGTTGGTTTGTATTGTTTTGTTTAGGT^ 

GGCACTCTGTATGGACCAGGCTGGCCTTGATCTACCCACCTCTGCCTCCCAAGTGCTGGGATCA 
GACAGCTGTGAACTGCCAGGTGGTTGCTGGGAACTGAACCTGGGTCCTTGGGJ^GAGTGGC 

TCTCCAGCCCCTAAATGAGACATAGTTCATGCTGCACATGATAAGAGTAT^

GGAGTCCTAGACCCAGACACCATCTGCCAGAATTCAGAGTTCTAGGCCACCITGTGCroCATTAT 
AACAAAGACTIX3TAAATATTGACACTGAACAGGTTGCCTGCTACAGGTCCATAA 
AAGAAAGGAAACTTCCCAAATGTTACAGTTCITATGATTTGGC^ 
TTTGTTTTTGTTTTCTTGTTTTCrCTTAGGCCAGGTTATC 

4 0 G C CATTT ATT ATGT AAACCAG AAT ATGCTT ATGG CTCGGCTGGC CAC CT CCAAAAAATTC CATCAAATGCAACTCT C1 M ITTl"!' GAG 

GCAAGTATGTGTAGTGCTGTGCATGTGCGGCCATGACACGGAGCTGCGCTGGCCGCT 

CCAAACAAACAAAAGT CACAGATGTGGAAGTAAAAGTTCTGGTCAGT AG AGG AAGTAAGT CATATGCATCCTGACAGGCTGGAAAT 
GGTGGTGGGGAGTGTAAAGTTGTGAAGGTTTCCGACTGTCTCCTCATATCCATOVGGAGCTGTC 

CGCTGAATTAATTACACATAATTGGCCAAACTGGATCT

CTTAAATCTTAAGGTGGCAAGGTCACAAGTCTGGGGTGGGAAGCGGTTGTCCTC^ 
TTGTCCrCCTGGCCTTTTGCCCACAAGGCAGTTGTCCTACTATTTCCTGTCT 
AGACGGTGTCATTACACGGAGTTCCTTCCAAGAACTGCAGAGTACATCGCTCTCATGTCCCAC 
GATAAAGATGTAGCTTAAAGTTTTATAAAAATAACTGTTAAACTGGCTTTCATCTTTG 

AGGTGAGGATTTATTTGAAGATTCAGGCXiTTATCCGTA

CGGTAAAAGGTGAGTAGTGAGTACTGCCGGGAGCCAGCCTTCCCTGAGGAAGGAGACATGCTT 

GTCTATAAGAAGAAGAAAATCTATCTAAAACAACTAGAATGATTTTCTTGATTTTAGCATGTATAT 

ATTACACAAGTCTAGTTTGGGTTTAGTTATTTTCrTCT 

AGCTCrCAACCAGGAACAGGAAATTGGGATCTAAGGCTGGCTCTGCACCTCTGGGCCTCT 

GAGGCTGTGCTGGGCTTCTTGGTCTCAAAGGCTCTGCGAGCCCCAAAAACCC^

CT AGAG ACAG CTGTG ACACAG CCACTT AAAAGCTT ACTCAT AACATC TTTTCAT CGTTT ATT ATG AAAAATT CAAATCTAT ACACC 
AGGGTTTTTCCTGTGCTGTGGGAACTCCTCCAGAGTCAGTGATTC^TAACAACTTGTTTT 

CATTTGTTGAG AC CAGG AAG TTAAT AGGG CCAT CC C CTGTCCC ACTCT CGTT AC CAT ACT AACTT CTTTT CT AGT CTGG AT ACAAA 
TTCAAAACCTGGAGCTGCATTTAGCCCTCATGGCTITATTCT 
CACTTTTTTGTTTTTGTTTGTTTGTTT

GCCTCAAACTCAGAAATCTGCTGCCTCTGCCTCCCGAGTGCTGGGATTAAAGGCGTGTGCCACCATCG 

TTTGAAGACAACTTTCCATTTGCTAG AGAATCC CATAGTTGTTGTCTTATGCCCACAGGG ACTAGATTGAGCTATTAGAAAATG CC 

CTGTGCGTGGCGCTATGCCCTCTTTAGCTCATCATTATCATCATCACCACCATCATCACCA 

TTGGCACTGGTGACTCAGGGCrAGGGTAGAGCCrCCCATGCrTCTCT^ 

AGAAACGG AATGCTGGGAAGCCCCCAG CCTCTCCACAGCGTCCACAATG C ACTGCGTCCACAATGCACTAGTGAGGTCTCTGGACA

GCAGTTATCACTGTGCTGTGTGATTGGTGGTAACTT^ 
CTCTTCCCTGTGTTGATTCAGCCTTTTACnTTGTCACCA 

GCTCAGATTGCCCCAGTGTTGACTC^TGGGAACACCTTCCTTCCTGGGGTTCACCACATTGCTGCCT 
GAGAACATCCTTAACTGTTTTTGCTTTGTCTTCCTCACTTTATGGTC 

CATCTTAGCAGACAGCTCGGTAGGACACTTCCAAAGTATAGAC^GCCT 
ATTTAAAGATTTATTTATTATTATATGTGAGTACACTGTAGCTGTCTTCAGACACTTTTT^ 
TTGATTGATTGATTGATTGATTGATTGATTATGTAAGTACACTGTAGCT 
AGGTGGTTGTAAGCC^CCATCTGGTTGCTCGGATTTGJA 

CTCTCCAGCCCCACATTTGTATTTAAAACATGAGTACTAGGCATAGCAGGTAA
CAGTTCTCAGTGTGTACATGGCAGCCCACAATT^

AACAGG CACACATG TGGT C CACAAACAT ACACGCAAG CAAATG ACC CATG CACGT AACAACTT CTT AAAAG CTGTG AGT ACACACG 
GTGCCGTACAGCACCACACGTTTGTTCTCCTCCCCTCATCCAGAACrTCTCTTT 
TCCTAGACTGCATCAGGCCACACATTCTGGTAAACACCTAAGAGTC 
CTTTGGAGGTCTTAGTCTCAACTATTGAATCTTGCAATTACTC
AGrriTClTGTGTGTAAATGGGGCTGATTTTTCCTGTTTGAG 
GCTTTCCCTTCTAACCCTCTCTGCTGTGTTTACATAACT 
CATTTCCTACACCTAATCCTTGGTTAATTTCTCTCTAGTGAC^ 
GTTCrTGAGTGAGTGGGCTACATGGAAGGCCTGCTACATCATGGTCTTCT 
AAAACAGTGTG CT ATTT ACTCTTATCAG AATT CT CACT CTTACAAATCAGCCATGGGTT CT CCCGTG CT1TTT CTTGC CAGAT AAA

ATGACTGTCTTGTCGTTTGTATTTC^GTGTTGTCCTTTTTCC^ 
TATGATGACTGACACCTTAGTCGTCACACTTGAGCTTAGGTGCTAGCCCT 

CAGTCG T ATT ACACTGAAGTAGGGCT CTG AGT AG C C CCCATCCGTCTGG T ACAAACTTGCATTTTGGGTTTCAGCTTGTTAGTGTT 
GGTGATGGGAACTGACTGTGGGAGCGCAGGCTTGTGCGCTCCTT^ 
TGjGTGCAGCTCCAACTGTAATCTCCAGGTTCCTATTCTGGGAATT^
TCGAGAAGGGAGAGATTTACTCTGTAAAAAGAGAAGGTCCTGGTG^ 
AGGACCTCTGTGAGTTCAAGGCCAGCCTGGTCTACTATCGAGTTCCAGGGTAG 
ATAAAAAGAAGAGAGAGACAGAGAGAGATGGATTAGAATTGTGAACTTATGAAATATATA^ 

AACTCCAAGCTCACAGTCAGCTGGTGCAAGGGAGCCACAGCAGGCTCT

AGCCAGCCGCACCAGGAAGCCTCTGCTGCAAGAAATCTCAGTCTTGG 

TAAATATATCTCAGGGGAGAAACTGGCAGAGGCAAATAGTTTTTAAACTCCCCAGTGACTCTAGTG 

GGACTGCCTGAGGGTCTATCTCACTGTGATCTCTGCTCAAAAGCTAAAGCCTAGTTCT 

CAGGGGTCAGAGGTAGACTGCAGAGCAGAGGAGTGTGCAGTGTCTTGGAA 

AATAGTAATGTTGTGAAGGGTTCTTTCTCrAAAAGGAATCTTTT

TATAGCTGCCTAATAATACATAAGGAATCTGCACCTATATCCTTCACAATCACCCTGTTGGGAATG 
AGATATCAGTGACAGGTAGCTGCAGTCATTTGAACATCTTTAGGTGTATGTGTGCAG 
CGGTGTAAGGGAGATGAACAGAAAACTCAGAAGTCGTTTAAGAGACAATTCAGAACTAAC^ 
GAAAATGGGGAGGATAAGGAAGTCTGGAAATTGATGTTCAGTAACATTGACCTCCAGAGTTAGAGGAGAGGCGTG 

CCTCTAGTCCAGACTGGTGATCTGAGCAGTGGGCAAGGGAGCAAGGCAAGAAGTGGGGGTGGGGAAGA^

AAGGGTTCATCTCATC^GTCACAGACAGAAAGTCTGTTAAGACTTAGAAAG^ 
TTaVTTCAGTTATACTCTTGATGAAATTGCTAAATTCTTACAA 
AACACTTGAGGGCTGGAGCCCAATGCCTGACACC^TGTTGGGTGACACACAA 
TAGGCCCCTAAGGCATCTGTACATGTGTGCATATACACATACACTGACTGC^ 

TGTGGTAGAACAGGGGAGAGAGGCCCTTAGAGTTCATTGGC^

CCTGTTTTTAAAATACTAAGGTAGAGCCTAGCATAGTGGCATCCGTCTTTAATGCCA.G 
CTGTGAGTTTGAGGCCAGCCTGAGCTACACAGAGAAACCCTGTCTCAA^ 

CAACCACT GG CCTTCAC ATGACATGCATG CATGG ATG AATGACAG CTG CCAAAGG CAC CAGG ACT CAG AATGTCG ACATGG TCAGG 
CT ATTT ATTCCTTGTACTrGAAGTAAGAAAAACATATCTAAAATT AAA C ATGCTAATGTC 

TGAGTGAGCTGTCTGTTGTTCACATGCCTTTCAAACATTGAGTGATG^

TTGCTTTTTATCrTAGGTTTTGTTTTGTTTTAAGACAAGGTTCCTCT^ 
GCTGTCCTCAAACTCACAGAGATGCCTCCGCCTCTGCCTCCACCTCC^^ 
GCCCAGCAAAATCACAC^TTTTTAAAACTATTTCATTGTGTCT 
TAGCTCCTTTTTCTAGAAAGTATCTCCAGGTCTTCAAGTGGAGTGAATCA 

CAAACAAAAATCCAAGCCACTAATGAAGTGTTCATTTTAC^ATTTCGTACA^

CTCCTGG AG CTTGTT AAAATCAG ACTACTGGACAGACAAAACAAAAGC TG TG GTGGTAGT G CAC ACATTTCG GC C CAG CG CT C 
GGAAGCAGAGGCTGCTGGATCACAAATTTGAGGCTA 

CCTGTCTCAAAAAACCAAAAATAAGTAAATAAATAAGGATGTTGACTGATAGAT^ 
ATTGCCACTGTTTTCTCTGCTTTTCAGATTAGTCATTAAGTAAATACTTAT^ 

TACAAAGGCAGTGTC^AGCTCTGGGCTGGCCTGCTGGGACT

AGGGATCCTAAAGTGTTCATTGGAGGGCACCTCCCTACTAATGTGAGGAATAACCCT 
CCCACACGTAATGCACTGCTTGCCT ATGAACAGATG AATG AATG A^ 
TCTTTTTTCTCTTTTACAATTAAAAAAAAAAATCAGTTGTATGCT^ 

ATGACTGCTGCCTGAGGAGGGCAGAAGAGC^CATCCTATCCTCTGGAACTC^AGTGATAGGTAG 

AGGAACTGAACCCAGGCCCTCTCTACAAGAGCAACAAGTGCTAACAGCCAAG

AAGGTCTCCTGTCACTAAGGCTGGCCTTGAAATACTGGTCCTCTTGCCT 

TGCCCCTGCCACTATGCCCACTCTAAACTCTTAGTTATTTACTTTC 

AGTAAGCACTCTACTCTCAGTCCTATCACCMGCTCCT 

AGAACAGGTCTCACCAGGTAGCTCTCAACCTCCCAGCTCTTTATGAAGCCCTAG 

CTGCTTGCCAGACGCCACTACACTTAGCTTTGGAATGCTTCGCTTCTGCTAGCAG

GAGGAGAGACCACAGACAGAGTTGTTAGGCGGCAGAATGACA 

GGAAGAGCAGTTAAGATCATAAAAGCCATTCCAGTAGTCAGGACCCGCGCCTGCCCTGGGCT 

TCCCCTGAGTTAGAGCAGGAAGGGAGGCTCTGTGGGGGGAGTCTGAGCACCATGCCTGACAGGAATGA 

TTAATGTACCTCGCTGCTTTAO^AGAGTCCaSGCTTGCC^ 

GTCATTGACTGGTCCCACCCTGGGTTCTGCTAGCCAGATGACCAGTCT

GCCCATCACCCAGTCTTTGGGGACGAGTGCAGAGCTATCAGACACCAGCTACTATA^ 

GCTCACATCTTAGGCTGGGGCATTTACAGCACTTTTTTTAA 

GTACAGTTTTGGGGTTCTCACATTTCAGTTTTTGT^ 

AGCACATCGAGTTCATGTGCCAGCCACATTCAGAACAGGGTCTTCTGTGCTCT^ 

CAATAAGTCAGAGCTCAGTTTAAGTGATGT CTG CAG TGACCAGACT CACT

GGCTAGTTTGGTTTGCTTTGACTTTGTGGGGTTT G 1 
TTGCCAGTTTTAAACITGTITGTTTCTC 
TTCACTTGGAGACCATTCTAjGATCATTTCCT^CCTCT 
«&GCTCCCGAGTGACTCATGGTGGCTTGTATTAAATATTTATC 
CAGGTCTTCCAGACTCMAGCAGGGGAACrn3GGATTACT






GTGCAGAGGCCTGCCAGGGCCCTGGTGCTGCGAG^ 

AATC CAGCCTGTGGTGAC CTTTCT CC ATGT ACATGGAG ACAAT CATGGGCTG TGTTT AC CAG CTCTGACC CTT CA r rxi'T CCTG AC 
TTAACATTCCATCATGTCCTGAAGTGATTGGTGCATGCATGtATGTGTGAGGAATC 
TTATTATGTATGTAAGTGTGCACGTAGAGTGAGAAGAAATG^ 
XGAGGTGCAG^CAACCTTGGGCTAGCTGCTGATT^

GCTCTGATGCCCTCTTCTGACCTCTGTGGGCACCTGTACATC 

TT 'ITT T T '1 '1 CCTTTTT AGATTTTT ATGTGT AT AT C T TlTlTr GTCTGAATGT ATCCATG CAG TGC CCATGG AAG CCAGAAG AGGGC 

ATT AG ACCCCAGGAGCTGG AGG CACAGACAGTT ATTACAGTT ACTGG CTT C CAT ACAGATG CT AGG AATGG AACCCCAGCT CTGTG 

GGRCAAGCAGCAGGTTCTCTTCCCCACTGTCTCCAGCCCC^ 
TTACTTTTrGCrTTCTTCTGACATTCATTC^

TACAGACTCTGGGAATCTCACTTAGGTCTTTGGTTTT G l^ ^ 

ACCAGAAGAGGGTATGGGATCCCATTACAGATGGTTGTGAACC^ 

CAGTCAGTCAGTCTGCTGAGCCCTCTCTCCJ«3CCCCCrG 

GTGAGTAGTTTAGAACAAGGAATGAGTCAGCACTCAGGTGATTTCCTC 
AGCTGATGATTGGGCAGATAAAGGGAAGGTGGAGAGGAAGGTGAGAAGAGAAGAAGGAG

GAGGAGAAAG AAGAAAAG CGGAGT AG AAG GACATGGT CTGGAG AAACCGCAAGTT CTTTTTTTTTTTCCACTTTTT ATT AGGT ATC 

TAGCTCATTTACGTTTCCAATGCTATACCAAAAGTCCCCCATACGCACCCACCCGAAACC^ 

GAGAAGATGGCAGTGTAGCGGTAGATCTGCCCAAACTAGGCACACATCA 

AG CAT ATTTGGGTT AGAG AAATTT AC CGCAACAG CCCCCCTCCCCACACA.CTTAAGATTTTTAATTACAC1TTTATATATTATGTA 
QATAAGCTTGTACATAGAGGTAAGACATTCCACTCTGCAA^

TTTTGAAAACACATTCAGATTTGATCAACOVAAAGCCACAACAA^ 
GGGAGGAAGGTGATGGTGTGACAGTAGACCTCAGTTCTCTTCCCTCGA 
CMTACCAAAGAGCTGCTTCCGTGGCTCC^CAGGAGTTCCTGC^ 
GACAAATTTAAAAACAGTAGTTACATGAACTTATAGTTTT^ 

ACGGAGCTCAGGCTCTGTACCTGTTAGGC^GGTCCCTAGCACC

TAGGTAGGCTATGGCTGTCTGAAGACCTGGGCCTACAGGCCC^^ 
CO^CCTCTTCTCCACAGAGAACCCCATAAGCCTCAGGGTGTGCCCAAGCCTCT 
GTTTAAACTAGAAGATGCTGTTGACTGCATCTGCTGCCACCCAGCTGGGT 
TTTTGTTTTTCCTGTTGTATGTGCACTAAAAGTGGGGGGGGGGGGGCTGTA 

AT ATAATATAT AG AAG T CATCAT AT AAATGAGTATGG CT AT AAAACAAG AG ACAGT AGG CG ACAT CTG AG AG CTAGG CTGCTG T AG

CCTTTGTCTT<3TAGGAACrGACTTAGGATITGACTTGACAGC^ 

TCTG AGGT CCTG ATG T ATAG CTT ACGG CTG TTT ATCTTC CAAATTTT CTCT C ATTT CTCAGGCCT AG ACT AATGGTG AT C CG CTCA 

GATGATTAATTAATTAATGTGGAATTAGCTTTGAAGCTAGTTTCCAGGACGA 

GTTGAGATCAGATAGTAAAAGTAGAGTAAGTTGTAATTGGATTGCTTGG 

G TTTCTCAAG ACT AAGG C AAATT ACTGGC C CAAAAT AAG TT AAG CGG C CTTG T CAAG AAAT ATTTT AGG AAATT ACT AT ATTTGAG

GTAGAAACTACTCTTCTTGTGAAAGAGAGATCATGGTGCCTATCGAACTATCT 
GCCrAATTGAAAAGGCACATAGCATGAGTATTACAAGTAGTTGAGCTGGGGGGTCACTTA 
AAGTCCCTGGAGTCTCCATGACCACACAAGAGAACTGGATACATGCCrrATAGTACrrAATGCGCTTGGA^ 
GAAGAGTTTCAGGCCAGCCTGAGCACGTGACCTTCATCTAACATGTATGC^ 

GTGTACAGTATTGGAGAGCTAGCATGGAGTCCTAGGCTTTATTTCTT

TGTCIACAGATGGTTAAGCCATTGTATGTATTCAATACTAGATATATGGCATTCT 
TTGACTCTGTGTCACAGATAAGAAAATAAGTATACATTTAACTTAAAATGGAAAA 

TGCTGAGGGGTCGCTTCTGACTGTCCTGTGTCCTCACCCTCAGTCCACCTC 

AGATGTGGTGTTCGTTGTTGGGGAAGGAGAAGACCACGAACATTCOSATTGGGATCGACAAAGCCCT

GAACAGTGTATTCTATATCTTGGACCACGGTAAGACGTGAGCTGCCCGTTTCAGTGC^ 

CTGCTTGCTCACTCTCATTCTCACTCTCTC^GTGCCTTGTTGGCG 

AACTTGGGGCCTCACACCACCATAGAAGTTTGCGAGTGAGTTGGCTCAG 

TTG ATTTTGATTTGTTT AG CTGCCAGGATGCCACATCAGCAAAGATTGACTTGATCCTGTGGCTCTC CTG CTCTGCTTTTCTCTG A 

ATT CAG CG AGGG CTTTGGG AACTG CAGT AGTG CATG ATTT ACT ACACTTGAAT C CATG CCACG TTCTTTCT ATGG AGTCCCTGAAGG

AAGAAAGAGCCAGCTTGTTCTrCCrGGACCTAAAGGCAGCGGAGATGA 

CATTAGCACTGCCAAGCITAGAACCATCCGGCATGACAGTCTCCTG 

CCATAAATTCATCTCTTCTATCTTGAGAAATGGGCCCCTGAGGATGCAGG^ 

ACTGCTAAGACTGAG ACTCTGTGGGCTGCCCTCACCAAAGGG AAC^ CTTG CCTTGG AGGGGTTTTGC 

TATAATTCACTCTCCAATAAAGCTATTTCATTTTTATTGAAAGAGCTTTCCCC

ACTAATCTGTTTTTTGTTTGTTTATGAGATGTATTCTTTTATGTTGTAGCCCTGACTC 
GGCCTTGAACTCACAGAGATCCACCTCCTTCTGGCTCCCAAGTGCTGGGATTAAAGT^ 
AAATT ATAGTAGCTCTACACTCAGGTATCTCACGTATGGTCTTACAGTTC^ 
CAGAAACACATAGCCCAGATCTAAGTTTTAACCTTGCGGTAGATGCTTTGGAGAGCAGACT 

CAAACTGCCTCTGTTTATTAGTGCAGCTAGTTCTCT

GCACCTGGGCAAGTCACCTAACTCCCCAAGCCTTGCAAGGTTGCAGAG 

GCCCCACCCTGCCCATTAGCTGTTACTGTCTCAGCGATGGCTTTCATTCATTTT 

CAACAAATGGTATTTTTCAAATCTTTTCAGGCATA^ 

TTGGGGGTCGGGGGTGTGAGTGCATATGTTAGTACATCTCCAGG 
ATG CTCTCCATACTGTCTCCATGTGTGTGTGTCACACTGCCCATGT AGGT AGAGC!ATCCCT AAG CA

AGTCCACCGAGTCTGGGTTGCTTATAAAATGCAGATGGAATTCGTACTCA^ 

TCACCCCCACCTTGAGATCTGTAGGCrCCTTGGCAGGACGTTTTCATAAGAA^ 

ACACTATCTTTTCTTGGTAGTGTTCTTTAGAATATTGTGGCAAGGGTC 

TCAATTTTTAAAAATTTGCTGTTCTCCTCAAATAGATAATTAAATATTGTGACTGAT^ 
TTTTTCCTTCCTTTTTCTTTATACCTGTTTTGTAACAG

TAGTTATCTTAGGACAGGAAGGTCTGAATGTGGTTCTGCCTCTATATCACCT 

CJVCCTGTAAGAAGTAGCTGGCGAGGGCATTCACAATGCTGTGTTCT 

CTCCAAGGCAGCTGCCGTCCAGAGGC(nTGACTC^ 

ACTCAGGACCTCTGAAAGAGGTC^TGCTCrTAAC^CTGGCCCAT 
CCTCAGTACTATGGATCTTTGTGGTGGATCCCAGTTGTGGCAGAAGAGGGTTG
GACACCTACCTGCCATCCAATAGCAACTCTAGG

GACATGGTTCCAAAGTGATGCCAGGTCATCTGTCATATTCTTTCCTCCCT 

AG ACTTGAGAAGTACCCATCTGTATG CCITCTTTGCAGCGAAGAGAGGGACAGACCTT ACCACAGG CAACACACACACATTCATAC 

TCCAGGGAAAAAACCCTGGCTACGTTTCCAACATTGCCACTGGCCATTGCT 
GCTGCCACCCAGTTCTTCCCCAGCra

TCGAG ACCGGGTTT CTCTGT AT AG C CCTGGCTGTCCTGGAACTCACAACTCAC^ CTGGC CTTG AACT CACAGAG 

ATCCATCCATCTGCCTCTGTCCCCCAAGTGCTGGGATCGAAGATGTGTGCCACCACCT^ 

GCCACGAGACTTTAGGATCATGAGAGCTGTTGTCTCTGCT 

GAAGTTGACCCTTCCTTAAAGTGAAAAGCTTGTTTTGAGTACACATG^ 
AGGGAGCTGGGGTAAAATGAGAAATTCTATATACAAGTACGAAACTGTAACAGAACAGTAAAAA

ATGTTAAAAGTATTCAAACAGAATTATAGTGATTTITACTACTATAATAATTATA 

AAGCITAAAGGAAAAAAGTCCTTGTAACCAAACAGAAAAGAATAAAGTCTAAAACCAGG 

CACTGTCCTAGGGAGTACAGGAACTGTGCGCCTGCGCTAAGAGAGCATGCGCT 

ACAGCAGGGATTCCACACAGAAAACCAAAGAAGTATACCAGTCCT 
GGAGAAGCCGGG AAGCCTAAGTTTGG CATTGACCCCAATGCTGAG CTTATGT ACGAGGTCACCCTT AAGAGCTTCGAGAAGGTGAG

CAAACCCAACTT CGCTT AAAG CTGTGGTGTGGGGT CTTCT CATC CTGTGCCTGT AACTG TGCGGT AG ATGACTTGG CT AAAT AGTG T 

TTGTTTCTGTGTTTGCTTTTCTTCATCCATG^ 

ATAC^GGAGAGGAGCTGGTGTTCTTATTTTTTACTGTGTGTTAGCCTL I 1 1'ITTTCI 1"! 1 AACTTTTGGGTAGAATCCAAG ACTTG 
TTCCTGTATTTGTTCCTAJ3GCTTG TTCTTT T TG TA CTTITTTTT 
CCTTAGGGGAGTTGGGGTGCAGCCTTTCAGTGCTGAAGGAATTGGACA

AGGTTACAATCCCAGTGTCTCAGCCTCGGGACCAGCAGCTGCAACCAAGGAGAAGA 
TGTCCAAGAACAGGAGAGGGAACTTGCCGTGGCCTTCAC^CTGGTGTAC 
GGCTCCCACCTGTACTGCACGGTGTCCCCATAAGCrGAGTAAGCTGTCATCCT 
ATTACGATTTTATTCCTCTGAAATGACCCCATGCTCTG 

GGAACCTCTACCCTTCTCCGTGCTTTGTGACAACATGGCATGCCTGCCATCT

CAGCrTACGTAGTAAATGGAGGCTTGCCTGAGATACATGCAGTGCTGTCT 
TGCAAACTGATCCTGTTAGAGGCCATACTGCATCTGGTTCTGT^ 
CATGCACGTGCAGCTTGGCCTGGACCATAAGTAGCAATCTCACT 
TGACAGCAGCCCATAGTGAGGGAGCTTTGGGGCTCCAGGTTACGATCCCAGTGTCrCT 

AAAATGCAGTCTGTGAGAGTGCTCCGTGCTTGTCTTCCTTGGACAACACAGGTCAGACC^

TGGAACACCGGTGCATTGGCCTGCTGGTATCCCAGCAGCGGCCCACAGTTGCn'GGGGTCTCA 
CATGAACTGCTGCGTTTCCrCCTCTCCTCAGCTAACTGACGCCACAAATCTCTACACAGGCCA 
AAAGAAAAGCTGACGCAG^CTGC<^TCGTGAAAGAGAAGGGAACTGTCT 
GGGGATACCTACCCMTATC^TCAGCACCCTGTGCTGAGACGCACCCT 

AGTGAGCAGAGTGAGTTCGTGTGCAGGAGGTAATGCAGAGGCCTCTC

CGAGGTGGACCAGACCTGGCTCGCTCGAGTGTGTGTGTCCCAGGTCAGAGTTCACAGGGGCGGCT 

GGTGAGACAGGAAAGGCCTTGCAGATCCCCCAGTTCACTTTGGATGATCT 

CTAC^TGAAGGCAAGTCTCCrCCTGATTGTTrrAATTATAATTTTCA^ 

TT AAG AG CATTGG CTGCTTTTCCAAAGGTTCTGAGTTAAATTCCCAGCACCT^ 

GATGCTCTCTTCTGACATCCACGTGCGCATGTGCAGAGTACT

ATGTTTTAAAACATTTTCGCrrcnTTAAATTAAAACAGCAGC 
TTATCAAGAAATATATCATTACTTATCAGTACCACT^ 
Cn^CGAGATGGTTCATCGGGTAAGAGCACTGACTGCTCTTCCAGAGGT 
AACCACCC^TAATGAGATCTGATGCCCTCTTCTGGTGCGTCTGAAGTCAGCT 

AAAAAAAAAAAAAATGATACTGGCTGGGTACCAAAAAGGCCCTGCAGACCGGCATGGATG

GATGAGGAGAAAACACTGAGCCTTGTGGCCACAGTGATAAGATGTCACTTAC^ 
CTGTCTCCTCAGGACCC^GGAACCAGTGCTTTATCCCCCAGGGCTG^ 

CATTGCTTGCCAGCCCACTTCCAAATTCCTGCTGTCTCCACATC7A 
CTGCTGGGCCAAGTGATGCTGTGCAAGTGGCTGACTCCTGTCTCTGCC^ 

GGGACACCTTTAAGCACAGGGCTCTGGAGGCAAGTGTATG

ACTAGGGCAACACAGTGAAGCCCTGCCTCAAAAAAAAAAAAAAAAGGACTTG^ 
TTTACCAGCAGTCTTTAAACAGAATTATTGGGGTITTTGTTTGT^ 
AGTACACTGTAGCTATCTTCAGACACTCCAGAAGAGGGAGTCAGATCT 
ATTTGAACTTCAGACCrTTTGAACTCTGGACCTTCGGAAGAGCAGTCGGGTG 
GGTTTTGTTTTGTTTTGT TrTGr i TlV r i T mA TAGCACTATAAAGC^

GAAATGCTGTCACATCTACTACTGACTTACATTAGGGGGTTGGAACAAGAAC^ 

AGT CAC CT (ZAAAAGAAACACTGGCAGAGACAGTGTGT ATTTGT AACAAAGTT AT AAACT AACGTATG ACT CATGGGCAG CCAAG CA 

GATGGCGATAGGAGCCCTGAGCAGGGATCTGGAAATGGAGCTGTCCCTT 

CCTTTCAGGGGTGATAAGTATATTTCAACATGTTTCAAATCTTCCT 

CATACGGAACACATTATTTTTAACATGTAATCTGATAGACAGGATCTTTCAAAACATCIAGTCT

AAGGTTCAAAATCACTGACCTAAGACITACCCCAGAATGTCT 

CATTCTAGGTGAGGAGC^U^GCACCGCCCACCGCCGCCCC^^ 

GACTGCCTTCTGTCTGCTTGTCCCTTCTCCCTrCAAAGTTTAAGGAGCCT 

CCCAGAGTATCTGACGTGATGTGTGCCATGTGCAGAGTGTGGGTGACTGTAGGGATGGGCTGGACTACAGTTCTA 

ATCTGCATGATCTAT^CACCTGCCAGGCCAGTTTCAGCTGTGGGCAGAGAGGGACT

TCCITTCAACCTCTGAACACTCATGC^ 
CTGTCACTGCACAAAGGAAGGCTAAGGACGCTCTGCTGTG^ 
CCGCGGCTGGCTGATTGTGCTTGAGAGCTGGAGTGTTGTAGGAAG^ 
GTCTGCCATCCCTTGTACAGTCGTACAGGAAGGGTCCTCTGTATTCAAGCTTGTAATTA 

TTTGTTCTTTAGGGCATGTGCTATGT

TGGTTACTCTGAAACAGCGCCTCACAGGGGTCACATAT CAGAT ATCAGGTGTCTACGTTCTGATT CAT AACAGTAGCAAAATTACA 

GTTATGAAGTAGCAACAGAAAACGTTTCATGGTTGGGAGTTTTATG 

GCACTGCCTGGGAGCTGAGGCAGGAAGACTGAGTTTGGCACTAGACTTGTC^ 

ATCAGCCTAGAAATAGAGCAGGGATGGTGCTTGCTTAGTATGCATGAGGCCT
AGCAGGACAGTCTCTCACACGGTAAGCTTGCTCTTTAGGGTTATGTACACTGTGTGCTATCAATG

TCTGGCCGAGCCACTCTTAACAATCACATGGTGCCGGGGTAGAGCCCCG 

CTACTGAGAGCCGAGCGTTTGCACAATCTGCTTTAGACAGGCCTGCT^ 

TCAGTGCTGAATGCCAAGAGAAAAGAG<3AAAGTCACCTCCTCr^ 

ATGTTGGTTCCAAACAGTGTAACXTCCTGCTGCCAGGCTGAAAGAA 

CACCTCACAGGCCAG CAG AG CTTCCG CAG AACT AG^AGGGACAGGAGAT AG AG TGGGGCTGAC ATG 
CACTCGGCTAGGGCAGGCCACGCTCACACTTCACTGCATCCTTTCCA 

GGCAAGTACACGCAGGGCGTGATTCAGTACAGGAAGATAGTGTCCTGGCTGGAGATGGAATAC^ 
GCTGCGACAAGGTGGGTGAGCCGTACTTCACTCTCCGTTTATCTGTGT^ 

GCATGAGCTGGGACGCCAT AAAGGATGGGGCTTTTTGTG CACT CATTTTACAT AAAG C CATT AACAG AATTGAT 
AAGAGTCTTCATCTGTGGTCTTCCTTACCCTGGTCCTTTGTGGGAGAGGAGACA 

TTTGCAAGG CTGGGGATTGAACCCAGGG CACTGTG CATG CT AGGCAAACACTTT ACCACTGAG C CAT ACAG C CT ACC C CCAG TG AC 
ACACTTCCTCCAACAAG CCACACTTC CT AATTT CT AATC CTTCT AAT C CTTTG AAACAATTC CATTC CCTGG TG ATG ATGT ATT CA 
AAT ATATAAGCCTTGGGGGGGGGGGC^GG AGACATTCrTTATTCAG AG CATTTACCACCATGGTTAATG CCTATGGTCTCAGCATTTG 
AGAGGCAAAACAGAAGAATTGAGAGTTCAAGGCTGGCCTGGGCTA 

T CAGCTGG T AAAGG CACrTTGCTGT CAAG CCTGATGACAGG AGTTTG ATT CCT AGGAC CTTGTGGGAAAGAAAG AACAACTTTGTCA 
T ATGT CCTCTGG C CT CTTC CT CATTCAT AGTATT ACATGTG C^TACACAAAGGT AA 

ACITGAACTC^GAATCCCrGGAAGAGCAGTCAGTGTTCTTAACTG 

AACAAAAAAT AG AT ATAAAGTGG CAAAAAAGACATTTT AT AT AAGAGTTT AG TTGGCTTGTTTTG C CTTG AGAG AGG AAT ATT ATT 
G TT AATT AT ACTTTTT AAAGTTTG TG CTGTTT' 'ITT 1' I T T AAAT AT AT AGCTG TT ACAAAGTTGAGAATTT AAGCACACAAG AAAG C 
CCAGCTATCAAAAAAAAAAAAAAAAAAA^ JTTT 
T T TT T TT TTTTTTTTTTTTT GQGGTTTTTG1 

CTGTCCTCAAACTCAAAAATCTGCCTGCCTTTGCCTCCGAGGGCTGGGAT^ 

AAAAACAAAAACAAAGAAAAAGAAAAAAAG CCCAGCTATAAAATTCCCATCAACTTTCTCT CC CAAAGCCACTCATTT AAAG ATTT 

ACTTTATCTTTAAAATATTTGTACATGTACAGATAATATTTGTGTTATCT 

TGTGTGTGTGTGTGTGTGTGTATGCGCAGGTGCTCCCTGGGTCCAGCAGAGGGAGTCT 

TGTGATAGGGTTGCTGGGCTGCTGGGAACTGAACTCGAGGCTTCTGCA^ 

CTTTAGACATTTACrATGGTAGGAGAGGTTCTCTGCATGTGCGTGTC 

TTCCCCCATGCTGAAGGACCMGCTCACTGAAGCTCATCA^ 

TTTCTOT AT AGAGTCCTAGTTTTTC CAAATGAAG AATAACTG CTTT ATTTTTTTCAGT CAAA CTCT CATA TCCCAGTTT 

GACTTGCTTTGTAGCTGAACTTCTGGTCCTCCTGCCTCCACCTCCrc 

GGTGGTGGTTTTTTGAGAGAGGGTTTCTCTGTGTAGCCCT 

GAAATTCACCTGCCT CCACCTCCCGAGTG CTGGGATTAAAGGCGTGTGCCACCACCGCCCGGCTGTTTTGTITTTAAGATAGGG AT 

AAAATTCAGTATTTTGTATATACTAGG(ZAAGCTAAGATAAACCCCAGACTTTGCATATGTATTCT 

TGGAAAGGTGGTCAGAGGCTGTAAGAGTACTCACTGTGCAAGCATATAGGGCCCGAGTTNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NinWNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNTATATATATATATATA^^ 

GTTATTATGTGAAAGCAGCAGTCTTTCCIACGTCAGATGAGTGCATTCTA 

CCTCTTTAGATCCATTCTGTGAACCATGGGGTTTCACTTAACTG^ 

GTCAT AAT AAT AACCATAATAATAATAAAGGCAGTGGTGGTG CAAG C CTTTAATC C CAGCACTTGGG AGGCAG AGGCAGGCGGATT 

TCTGAGTTCGAGGGCAGCCTGGTCTACAGAGTGAGGTCCAGTCAAAGCTCT 

TGGAGACATTTTATACTAAAATACTGACAAGTGGGTGGACCGTGGTTGAT^^ 

CGTGTGATTGC^CTGCCTCAGCACTGAGGTGACAGGAGCGGCAGTTGTCCCT 

CTTAGGCCCTTGGACTGGACAGTGCaVATGAGAA 

GCCAAGGGCGACTTCGAGAAGGTGTTGGCAGTCAATCCTCAGAACA^ 

GAAGGAGCACAACGAGCG<K3ACCGCAGGGTGTACGCCAACATGTTCAAGAAGT^ 

AGCCCCAGGCTTGTAAGGACGCTAGCCTAGGGACAAGGTTGGGCAGCCT 

GGGGAGAGACACAGGTGGGAGGCCCTGGCGTGTCCCTGAGAGCTGAGGGCrGACTCCT 

TGTTTTGTTCTTAAAATGGTTTCTCTTTGTGG ACCCAGCTAGCTCATCT CTCTGG CGATGAGCGTTAGGAT 

TATAGGCATCTGCCACCAAACCAAGTTCTCTTGTTATTTTTAAAAAATAA^ 

TGAAAGAATGACCATCCAGAGACTGCCCCACCTGGGGATCCATCCCAT^ 

agcttgctgacaggagcctgatatagctgtctcctgagaggctctgc 

ccattagacggagcagagggcccccaatgaaggagctagagaaagtacccaaggagct 

caacaatatgaactaaccagtacccccagagctccctgggactaaaccaccaa 

AGCTGCATATGTAGCAGAGGATGGCCTAGTCAGTCATCAATGGGAGGAGAGACCCTTGGTCCTGTC 

cggggactgccjvgggccaggaataggagttaggtgggttggggagc^^ 

ACTAGGAAAGGAGATAAAATTTGAAATGTAAATAAAGAAAATATCTAATAATGAATTAGTTAAAAATA 
GTGGTGGCGCACAGCTTTGATCCCAGTGCCTGAGAAAC^ 

AAACCCTGTCTGGAAAAAATGACAACAGCCCTAAGCCCTTCAGTGTTGTTTCTCT 

TTGGAAACGGGTGTGCCrGTGGTGGCCTTTACTGTATTCTCTGCrGTCATOSGGAG 

GGTCAACTTCCTGCCCCATGGCCACCCTGTGGTCCCAGGTCTGTCCTCCATCT^ 

CAGCMCTGCCTGAGGTAGGCAGCTCCTC^TTGCTTGCCACCCCTACTTA 

CTGCAC3ACCTGATGGACCATCTCGGGAAGGCAGTGGTTGAGTGAGCTCCTCTTCT 

GGGAGCAAG AAGG CTGTAGAAGGAG CCGCTGG CAAACAACACG AG AGTCAGGCCATGGAAGAAGGAAAG^ 

AQ3CTGCGCCACGGAGGGAAGAGAGTCCTAATGAACTCGGCCCTCCTCGCTGGGCT 

GTGTAAGGTTTGTTAC^GTCTCTGTGATTCTG^ 

GTOKWGTGGGGGACATGCO^^ 

GTTTGTCACCTTTCAGTGTCCAGCACAGCATCCCTGTGAACCTAGGG CCCA^ 

^j^^n^Qp^j^^J^ 

GCACT^CTGACTGGCCTGCTAAGAAC^
TTCCAAGTCCTTTCGCAGACTCTTGAGTGTGX3CITTCTGTC
CATTAGTGACAGCTTTCTCTCTGAGTTTCTGTGGTC 

AGGACGATGTTAGCGCTCAGCCATCCCTGCCCAGCACAGCTGCrCAGCCTCATGCTCCCACCC^ 
AGGCTTGCTCCCGTCCCAGACTCGAGTGGACGGACATCCATGTCCACCCAGGATTGACGAAGG 
ATGCAAAAATAGTTrTCAATTTTTACTTT'l i'AGCTTGGGTTTTGAAAGGCAAACTCTAGGAGGCTTAAAACGGGCAGTTACTCTTA

TCAAGCCAAAGTCTCCAAAACTTGGGCATTGAATTAGGGCAAATCTTAAA 
ATTGAACGTGGCTGGCTTCTCCACAGCCTAAGGAACCCCCATGGGGACACACCCACCACCC^ 
CAGCTCACTTCCAGCCCTAAGGACCACTGTGCTCAAGATGTGTGGCTGTAGTAAGACAGTCT 
TTCCTTCACACCTGCGGGACAGCGCTTCTGAGGCTCCAGTTCTGCTCT

ATGTGACTGGGACCCTCGCATCTCACACTCGGCACTCTGCCTCTTTCCATCCATGTTTGAGAGCCCTG^ 

GAGCAGGAGCCGACTCTGTGTGTAATGCTCTAGTGTTTGTAGTTCT 

TTGAGCCTAACATGTTAGGTTTTTACGGGAAAGCTCITTTTCCAGCTGT 

GGGAAGGGGGGATGGTCAAGGTTCCAAGCCAGAAATGGAAGGTGCCCAGCACCCCTGGATTTTAC^ 
TCAGACGATTT ACACGG AAATCAG AAGATGTCTTCAAAT ACC CTTAGACTGG AT AGAG TG ATGG^ CAGATGATGCATTGAAA

TTTTGGTTTrTAAAAAATCACTTAGATGCTGGGAGTAATGG 

ACAAAACAAAAAATCATTT AG ATGCATTTTCTGTTC CTT 

TTTTGTTGTTTGTTTGATTTTTGAAGTTTACTTT^ 

GACTTTGAATCTCCTTGAAATGAAGATACAGTTGTTAGCCAGA 
AGTG CT CTT AACTGCTGGGCTG CCC CTCTCCCAC C C CT C CATTGTTTGGG TTTG ACTT AAACACITGATTT^

TAAACAGATTTATTTATTGTATGTATGTGAATACGCTGTTGTTGTCTTCA 

TGGTTGTGAGCCACCATGTGGGTGCTGGGAATTGAACTCAGGCCCTCTGGAAGATCAGTC^ 

TCCAGCCCCTTTTTATGATTTTATTTTATGTGTATGCACACTTTTTTC 

CCTACCTCAGGTAGGGGAGAAAGAAGGTTAGAGGGGGGAAGGGGGCATAGATCTTT^ 

GAGTTCCTTGGGTCAACTTCAAACTTCATTGT CAGAAT ATCCAATCT ACT CGT CTCTTTG CT CACAG C CTTGTGACAAG C CACAGC

AGCAACAGACACAGTAGCCTCCTCTGCAGAGCGCACTCCCACCTGATAGCC^ 
TTCCCAGAATCCCAGAGTTCACAGAAACTGTCTGCAGCTGGCAAAACCACG 

GCTGTG AAGCAGT CAC CT AT CTCACAACATAGTCACATGACATAC CTGGGTTTTTT AGG AAACCG AAATT CTCACATGTTT ACCTG 
CATGCAT AT ACATACATCATGTGCATGTAGTGCCCAGGG AAG G T AG AAG AAGGCATCAG AGT CCATGG AACTGG AG TT ACAAG ATG 

GTTGTAAGCTGCCGTATGGTTCITGGAACTGAACCCACATCCTATGCAAGA

AGCCCCTGATTAAATAGTATAAAAAGGAAAAAAAGGTCTACAGCCATCATGCCCTAAACCCACTGAA 

CT AAACAG ACTGG C CTT AGT CT ATTGCTG TG AT AACACACCATGAC CAAGGT AAT AAT AT AAG GAAAAG ATT ATTTGG AG CTTG CT 

GTTTCAAAGTTAAGAGAACCCATGTAGGGGGCTGGAGAGATGGCTCAGAGGTGGAGAGCA 

TTCAATTCCCAGCAGCCACATC^TGACTCACAACCATCTGTAATGGAATCTAATGCCC 

CAGGCAGACTATTGTGTACATAATTAATAAATAATCTCTCTTTTCTTAAAGAGTACATG^

GAGAGAGCTATCTGGGAATAACATGAGTTTGTCAAACCTTGAAGC 
TTTCCCAAGAAGTTCCACCAACTAGAGACCAAGTATTGAAGCCCATAAGCCTT^ 
CAGAAAAGCTGTTGGGTTGAAGAACTCTAGTAGAATTCAATTGAAGAATGTGG 
AAGAGCACTAGCrGCrCTTCCAGAGGTCCTGAGTTCAATTCCCGGAAAAC^ 

TGCCCTCTTCTGCTGTCTCTGAAGAGAGCTACAATGTACTCATATACAATAAATAAATAAACAAA

TCTCCAACTGAACCAAAGTGCATCCATTTTGCTGGGCTGTCTGGCCAATAAGGT^ 
CCCCAGTTCCAGATGTGGTGCCCTCTGTACCCAGCTGTGTGTGCCACrrATC 

TGTAAAGTAGGAGCTTTCCCAGCTGAGCCATCT CCCCTGTTCCTTAAAG CCTTTTTGACAG ACAGTCTCC ACATTTGTCTGAGTTT 
CCACTCTCCAGACTTTGCCTCCTGTTTGCCAAACCACCT 

ATGAAACCTATCTACCCrrCAACAAGCATACCAGTTAAGCAT^

TGTTG GG AACTGCTGG AAAT AG AAGGGT ACTGAAC CAC AAAT GCCC CAT CT C CTTGTGTG CACAAAG AGCAAACATTTGG CTTCAT 
AAATTGATTTTGGTTTAGTTTGGTTTTGCGTTTGCACTTGA 
GTGTGCCTGGTACC<3AGGTGCATTTGCACAGGTGTGCTTGGGAG 
CGGTTAAGGGCACTAGCTGTTTGCTCTTCCAGAGGTCCTGAGTTCAATT 

GGGATCCGATGCCTTCTTCTGGTGTGTCTGAAGAGAGCAACAGTGTACTTACAAAC^

GGAGGGAAGGAAAAGAAAGAAGGAATGTCTGCCTTTTTCTTTCTTCCCAGCTTATAC^ 
AGAACTTGTCAAGTGAGCTCAAACTAGCCATCITGCrrCGGAGGTCCCCrrGTCCCC^ 
ACCCTTGCACTTTCTTCAGTATTTTCGTTTTTAGTTCCAATTGGACAT^ 
CCTGGTGTAAAAGGCACTTCCTGTGCCCTTC^GGCACAGCTTTCTTCCCT 

GAGAACGCTCGCACACATGATGACACATGCACACGAGTGAGTtS

TGAACCATTTTCCTCTGGGGCCGTGGTCCGCGAGAGGAAGTCTGGTAAGGTCC^ 
ACAGTAGGAGCTTCACCTGCTCCCAACATCAGGCCCCTGTC^ 

AGGCGCAGCACCCCCCAAACCCCTCCACATGTAGATGAGGCATCCCTAACATCTCAGACCAAGCCAATAG^ 
GACCCCACCCCCACCCCAAAACGGTGCATAATATCCTATCCAGAGGAATAAAGGTGTGTC

AT CTGTCTTAGAAG AGCTGCTGTTACACTGCCTGGGAAGAAC AGCAG CAGCCAAGGCAGCAGCAJGCAGCAG CAGCAGCGCATCGGT 

TACAGACTTATCGX3CCCTAACTTCCCn7CrGGCrrGCCA 

TGATGTATTTAAAAAAAGATAGAGAGAGAGGGGAATATGGTCTATC^AGATGTGATGAGGTATAGGGGAGGGGG 

GCCCRTGTCAAGGCATCCCTTCCAACTGAGTGACCAGCCACATGATGGTATAGTATAGAATAGAGTTTATTCAG

GGAGTTAAAAGGGTAGTAGAGGCAGAGAAAGGCAGAGAGAGGGAGAAGAGTAATAGGGAGAAGTAGAAT 
ACATC^ACAAATGGGGAAGGGACTAGGGAAGGGAGGGGAGGAGGGAAACGCCTACGAGGGCA 
GGGGCAAGCAGCTCCTTTTATAGTGAGCCAGCCCTACCTGGCTATTGCC^ 
ACCTACGCCTGTGCACCACGTCTGTGGACCACATGTGTACCCAOTGCCTGTGGAGGGCAG 

AGTCACAGGCAGTTGTGAAATGCCACATGGGTGCCAGGAATTGAACCCGCGTCC^

AGCXKKK3TCTCCAGGCTGCATTTTTTGTCTTGATC 

CTTGCTAAAGAGTCGGCTTCCTTCCCACATATTTGTATCCTTTTCCAACT^ 
CACTGTGGTGAGAGTGCACACCATGTACATTCTTTGTTTGACTT 

GTCACCCCCACCCTCCCCCCGGAGAATCCGTCCTGCTTTTAAGAAGATCTCATACATGr 
AGTTTCACATTTTATATTGCTTGCG^GATATACIATGTGTGGGTGTGCC^
ATCCACCCCCTGGCCCAGGGTC^CATTAATCGC^
TTGTGGACGTATGGTATTTGCAGCCTCTGTCCTGACTGATGAACT 

ATTTTGGGGCTTGGTGTGCACACGTGTTTCTAATAAT X T CT IT TTCTTTTCTTTTTTTTTTAA 

AGATCTATTTATTATTATACATAAGTACACTGTAGCTGTCTTCAGACA 
TGAGCCACCCTCTGCTTGCrGGGATTTGAACTCAGGAC

GCTGCCATCTGAACTGCTCCCCCGACTCCAGTTTGCACGTGTTCTGTCTCACT 
TTAGTTTGTGTTACATAAATATTTTCCAGTGTGTCATTCCAATTTCCT^ 

T ATCTTAT AT ATGT AAGT ACACTCT AG CTGTCTTCAGACACAC CAGAAG AGAGC AT CAGAT CC CATT ACAGATGG CTGTGAGC CAC 
CATGTGATTGCTGGGAATTGAACTCAGGGCCTCTGAAAA

T CAT AATTTAAT AGG AT ATAATTTGAG CCCAT ACCGATGTAATTTCAAT AGCCAAG CATGGGGGGCAATG ATGGTG CACG CT CTTA 

ATCCCIACTGCAGAGTCAGAAGTAGGCAGATCrCTGAATTCCAG^ 

GAGAAACCCTGTCTGGAAGAAAACAAACCAAACAAACAATCACACAC^ 

AGTTCCAGTCCTTCAGATAAGAAAGCAGGAGGGTTAGAATTTCAAGACCAAC 
AAATCTATTTCTCTATCACTCCTTTCTCCCTATCGCCCCCTCnSACAC

TGGGTACATATX3CTGCTAGCTCCTAGCTAATGJVGCCATCTCTCTATTC 

TTGGTGTTCCTGCCCCCCTTCTACCCTTCCCCCCAGCTATGGGGTTTGGG 

ATTTAATATGAATACACTGTAGCTGTCTTCAGACACACCA^ 

GCTTGCTGGGAATTGAACTCAGTACCTCTGGAAGAGCAGTTAGTGCT 
TTTGTTTTTAACTG AATTTCAACAG TG ATCAAATCATTTGCTCCTTTTT ATGGAGTTT AG AT C CTTG CAAAAGAT CTGGGGGCAGG

AGGGTTTTATTACCAT AAGG CCTTGCTGGGGGACACCAGGAAAGAGCTTTGTTTTCTCT 

TGCCCATGTACATGGGTTGCTTACAGCAGCTCTCCTCTGCTCCAGGATTC 

ATAGACATGGGGTGAGCTTCACCAAGTTAATGCAGGAA 

TCGCTCCCCTCCACCCCCCAGGGTGATTTCAGCTGCCAGCAGTGAGCCTCT 

CTCAGCCACTGTCCACCTGGACGGGAAGAGGGTCCCCACCTCTCTCT

CGCAGCGGCA.GGCGCGCGCCCACCAGTAGCCIACCACX3TGAGGCCCGGGCT 
CTGCCCTCCCCTGCGTGCCTGGCAGCTTCCTGTAATTCCAGGTAAGCACCCTCCACG 
TCTGGTGACTGGCTAGGACGTGGGTGGAAGTCTGGAAGGAACCTGGGCCCTTTGGTT 
GCTAGGGGTGGGGACIATOVGAGAAGATGCAGAGTCACCTCTGCCAC 

GGGACAGGGCAGAGATGCACCAGAACCTGGCrTTAGGAAAGAAAACAAA^

NNNNNNNNNNIJNNNNNNNNN^^ 

GTGGCGCACGCCTTTAATCCCAGCACTTGGGTAGGCAGAGACAGGCGGATTTC^ 

TT CCAGG ACAG C CAAG ACT AT ACAG AGAAAC C CTG T CTCG AAAAACC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAT CAA. 
GCTGTATAGTGCTGGTGAGATGGCTCAGCGGTTAAGAGCACCGACTGCTCTTCCAGAGGTCCTG^ 

TGGTGCCTCACAGCCATCTATAATGGGGTCTGACTCCCTCTTCTGGTGTGTCT

AAACAAAC AAAC^GACAAGTAAATAAATATCAGCCTCCAGGGATGCCAGCT CTGGAGGAAAACCTGACACAGGG CAGAGGCTGCAG 
CTCAGTTCTCAGTGTGCGACCAC^AAGCACAAAGCCCTGACTTCCACCCCTAT^ 
ATCTTCAACTACATTGCITCITAGAGCCTGCTCGCCTTGAGACCCC^ 
TGGTGAGATGGCTCAGTCXX^TAAGAGCACCGACTGCTCTTCTAAAGGTTCAA^ 

GTAACAAGTATCTGACTCCCTCTTCTGGAGTGTCTGAAGACAGCT^

AAAAAAAAGAAAGAAAGAAAAGAAAAGCCAGCCGGGTGTGGTGGCACLATGCCTTTAATCCC^ 

AG ACCAG AATGG TCT ACAG AGTG AGTTCCAGG^ AAGAAGAAG AAGAAG AAG AAGAAGAAG AAG AAG AAG AGGAGG AGGAGGAGG AG 
GAGGAG^NNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAG CTGAAGATTGAGCGG



AGTTGGGGAACAGTTGGGGTGGGGGGGATAGTAGGTAGAGAAGCTAACTGAGCCCTGGAGGATTTGCTGTTGATCCTG^ 

CCTTTCCCCCAGGCTGAGCTCTGCCCTGCAGCCTCTTGCCCCAGTGACCTC 

AGTG AGGGAGG CC CAG CTTGG AGG CCATG CCTG 1 






MOUSE SEQUENCE - mRNA 

GTCCGGCGCGCGGCTGAGCGACCGAGCGTGCC^ACGGAGCGGCGGCCTGCTGG^ 

CGCX»GAGCGAGGGACX3CGGCGGCG<3CGGACGCGGCGACAGGTCTTCTACr^ 

ACAATC^AGAGAACCCAGCAGCC!ACCATGACTGAGCAGGG 

AAAAGAGTGGGGACTAGTGACGAGGCCCCAATGTTTGGTGACAAAGTT^

GTTTGATTCC^GTCATC^CAGAAAGAAGCCATTTGCCTTTAGCCT^ 

CTACTATGAAGAAAGGCGAGATCTGCCATTTATTATGTAAACCAGAATATGCTTATGGCTCGGCTGGC 

TCAAATGCAACTCTCTTTTTTGAGATTGAGCTCCTTGATTTCAAAG 

CJVAACGXaAAAGGCGAGGGATACTCAAACCCAAACGAAGGAGCIAACXa 

TTGATTGCCGAGATGTGGTGTTCGTTGTTGGGGAAGGAGAAGACCACGACATTCCGAT^

CAGAGAGAAGIAACAGTGTATTCTATATCTTGX3ACCACX3CT 

TGCTGAGCTTATGTACC^GGTCACCCTTAAGAGCTTCXSAGAAGGCCAAAGAATC^ 

AGGCTGCCATCGTGAAAGAGAAGGGAACTGTGTACTTCAAGGGAGGCAAGTACACGCAGG 

TCCTGGCTGGAGATGGAATACGGCCTGTCAGAGAAGGAGTCCAAAGCCTCAGAGTCGTTCCTCCT 

CATG TGCT AC CTG AAG CTC C GAG AGT AC AACAAAG C CGTGG AG TG CTG CGACAAGG CC CTTGG ACTGG ACAGTG CCAATG AG AAAG

GCTTGTACAGAAGGGGCGAGGCCCAGCTGCTCATGAATGACTTTGA 

CCTCAGAACAGGGCCGCTCGCCTGCAGATCTCCATGTGCCAGAGGAAGGCGAAGGAGCACAACG 

CAACATGTrCAAGAAGTTCGCAGAGCGGGACGCAAAGGAGGAAGCCAGCAAAGCTGG^ 

GCAAACAACACGAGAGTCAGGCCATGGAAGAAGGAAAGGCCAAAG^ 

TGAACTCGGCCCTCCTCGCTGGGCTCGCCTCCAACTCAGGACr^

GGAAGCAAATGGCATACCAGTAGCTTCCCAAATGACCAC(nt3CTGCT 
GAGAAGGCCGCTGGTGTGAAGAGACOIGGCCAGCAGCTCAGTCCAGCCCATTTCAG 

CCCTG TGAAC CT AGGG CCCAG CTG CTGTGG G TTCT ACATCGG C^CT AGGGTCACACTG CAG AAACCGTTGAT AAA^ CAGT 
GLVTCTCTGCTTTCCTATTGGTGGGCATG^ 
GCCCACAGCCAGGGGCTCCCTGGTCCACAGCTGGGTCT(^GGCCCCTTACCTGCCT
TGGTGTGGAGAGTGGGTAGAAGTAGGTTTAT
CAGCACAGCTGCTCAGCCTCATGCTCCCACCCGCACCCTC 
GGACATCCATGTCCACCCAGGATTGACGAAGGAAGGAACTTCACACT 
AGCTTGGGTTTTGAAAGGCAAACTCTJVGGAGGCnTAAAA

GCCTCTGTATTCTGGTGGGAGCTCTAAGGGTTGCGGGCTTCOSGTGTCC^ 
ATTAGGGCAAATCrTAAATAGTrTTGGGGTTGGATTTTAGTCCTACA 

MOUSE SEQUENCE - CODING 
ATGACTACTGATGAGGGCACCAGTAACAATGGAGAGAACCCAGCAGCCACCA

AGACAGAGGAGTATTAAAGATTGTC^AAAGAGTGGGGACTAGTGACGAGGCCCCA^ 
AAGGGATGTTGTCAGATGGAAAGAAGTTTGATTCCAGTCATGACAGAAAGAAGCC^ 

ATCAAAG CCTGGGACATTGGGGTG TCT ACT ATG AAGAAAGGCG AG ATCTGC C ATTT ATT ATGT AAAC CAGAAT ATG CTT ATGG CT C 

GGCTGGCCACCTCCAAAAAATTCCATCAAATGCAACTCTCTT^ 
AAGATTCAGGCGTTATCCGTAGAATCAAACGGAAAGGCGAGGGATA

GAAGG CTGCTGTGG TGG AAGG ACATTTGATTG CCGAGATG TGG TGTTCGTTGTTGGGG AAGGAGAAGACCACGACATT CCGATTGG 

GATCGACAAAGCCCTGGTGAAG ATGCAGAGAG AAGAACAGTGT ATT CT ATATCTTGGAC C ACG CT ATGGTTTTGGAGAAG CCGGG A 

AGCCTAAGTTTGGCATTGACCCCAATGCTGAGCTTATGTACGAGGTCACCC^ 

ATGGAC^CCAAAGAAAAGCTGACGCAGGCTGCCATCGTGAAAGAGAAGGGAACTGTGTACT^ 
CGTGATTCIAGTACAGGAAGATAGTGTCCTGGCTGGAGATGGAATACGGCCT

TCCTCGCAGCCTTCCrGAACCTGGCCATGTGCTACCTGAAGCTCCGAGAGTACAA 

GGACTGG ACAGTGCCAATGAGAAAGG CTTGT ACAGAAG GGGCG AGG CC CAG CTG CT CATG AATGACTTTGAGTCGG C CAAGGGCG A 
CTTCGAGAAGGTGTTGGCAGTCAATCCTCAGAACAGGGCCGCTC^ 

AAGAAGGCTG T AG AAGG AG CCGCTGG CAAACAACACG AG AG T CAGGC CATGGAAGAAGGAAAGG C CAAAGG C CATGT ATG A

HUMAN SEQUENCE - GENOMIC 

C!AAAAACCAG CAAGTTTTT ATT AGGG ATTTT CAAAAGGGGAGGGAGTGTGT GAATAGTTGTGGGTGACAGACATCAAGTACTTAAC 
AGGGTAATAGAATATCACAAAGCAAGTGGAGGCAGGGCGAGATCACAGGACCACAGG^ 

GTTTCGGGCACCATTGTCATTGATAACATCTTATCAGGAGACAGGGTTTTC

AATTTCCTCTTCCTAATAAGCCTGGG AGCG CTATGGG AG ACTGGAGTTT ATTT CACCTCTGCAATCTCGACCATAAGAGACAGGT A 

TGCCCCGGGGGGGCCAGTTCAGAGACCTACCCCTAGGTGCGCATTCTCTTTTCT 

GTAATATTTCTCCCATTTGCTTTTGAAAGAAGAGAAATATGGCTC^ 

TCTCTTATTCCCTG AACAATTGCTGTTATCCTGTTC T < r i '' r r r CAGGGTG CCCACATTTCATATTGCTCAAACACACATGCTGTACA 

ATTTGTGTACTTAACGCAATTATTAC!AGGGTCCTGAGACGATATACATCCTTCTCGGCTGACA

CAGGCATAGGAAATCACAAGGGTATTGATTGGGGAAGTGATAAGTGTCCATGAAATCT^ 
AAGACAGGCATAAGAAATTACAAAAATATTAATTTGGGGAA 
TGGCTTCAGCCGGTTCCTCCGTTTGGGGTCCCTGACTTCCTGCAACACCT^ 
TCTGGGTTGCAAAAGAGGATTTATTTTTTGCACCTAACCATAAGTCTTT 

TTTTTAATAATTTTGTGCATTAAATAAAATTTGTGTACCTTC

GGATCATGTTAGAATTCAAAAAGTTTCAGAGTTTCX3AGCATCTTGGATTT^ 

AAGTAAGACTTGGATATTAAAAGTTTGAAAGCCACGGTTTTATTTGCTTAGTAGTATATAAC^ 

TATTGTGTAGAATCTGATGACTCCCCAACATATAACTGTCACCC7VCA 

TGGACAT CG CACAAG CACCC CTTG CTCAACAT AC C C CAG CCAGAG TT CTCTTG CTCT C CAT AAAAAGCAAT CTG AAATGTTCG AAT 

TAAATTGAAGAG AAAGAATAAAAAGTCCTGATGAGTGGAACCATGAAATGTTTCAT CTTTTTTCC CTAAGTTT AAAAAGATCATG A

AAG T AAT ACATGTTT AATGTT AAACT ATTT AAATAAT AT AAAAAT AAAATCT CCTTCTT ATCTT CCCAAATC 

TC ATT CT C CAGAGGT AACACTGTTTG AG CAGTTTGG TAAGTT ATCTTTC CAAACTT CTTCT AT AT ATAAGCAAAT AT ACAAACACA 
TATATACTTTTTTCCTCTTGGAAATAGGATCATGTTACAAAT 

CATCTGTCTTTTCATGTGAGTATTAGAAGATTGTTTATGTGGAATGGGTAAGACGTCAC 

GATT AG CTG TCTT AAAGGG AAAACAG TG AAAAAGACATTTTGG CATG T AATG AACAAT ATTTG CAT AATTGT AATCCAGTG CTCT A

AAATGATCATACCCACCCACCCACO^AAATCTACAAACAACTGGG 
CTGTTTGTGCCAGGTGCC^TGGCTCATGTCTGTGATCCCAGCACTTTGGGAGGCTGAGG 
TCGAGACCAGCATGGCTAACATGGCGAAACCCCATCTCTACTAAAA^ 
TCCCAGCTACTCAGGAGGCTGAGGCAGGAGTATCATTTGAACCTAGGAGGCGGAGATTC 

TCCAGCCTGGGCAACAGAGCAAGAGTCTGTCTCAAAAAATTTAAA

GCCCATTGTGGCATCATCAACTC^CTTTTTTTTTTTTCAAT 

ATACACTTCATTTACTACAATGCTATTCTTTrT^CC r r'lTTTT i rrixriTTGGACAATATCTTGCTC TGTTGC C CAGGC TGCAGT 

GCAGTGGTGCAACCATGGCGTGATACTCCTGCCTCAGCCTCTCCAGTAGCTGGGACCAC^ 

GATCGGGTCTCACTGTGTTGCCCAAGCTGGTCTCAAACTCCTGGGCTCAAGCCTTGACATCCCAAAGTACT 

AGCCACTGCGCCTG^CATTCTTTTATAATGGATGAACATTTACAAAATATTTTCCCAA

CTCTTAGCTAAATGATTCTCTACTGTATAGATTTATCTCATGCTTTTCAAAAACTG 
TTAGTCTAACATTCCCATGACCATCCTTTTAAGGAAAAACTTTGAAATA 
TAAGTGATCTGTGTTTAGTTGATGAACTC^TCCATGCTATTCTTGCATCATGACTTTT^ 
TGCATTTAAGCCTGTCTGGTTCTTTTCTCAAGCCTTTGCTCTCT 

CCCTAGTATAAGTATCAAACTTCTCACCTGTGACTTTTGCCTCTCTCTACATCTCT

AGCTACCITTCTCTTATTTTOUnTCTTACCTTCT 
TACACAAAATGCCACCTCCCAGAAGGTCGAGGCATTTTTTT^ 

AAATGCTCCATTTTATTCTCAGTAGTATCTTCAGAAAGATTTCAGAGGTAATT GCATCTAC^ 

ATGAGCAATCCGATAAGAATATGTAAAATAAAAAACATTATTACTAAATTCri'Trrri'l 1"! r TTTTTTTTTAGACAGAGTCTTGCT 

CTGTCGCCO^CTGGAGTGCAGTCGTGCGATCTTGGCTCACTGCAAGCT

CTCCCAAGTAGCTGGAACTACAGGCGCTCGCCACCACGCCCGGCrrAATATTTTGTATTTTTAGTA^ 
AGCCAGGATGGTCTCGATCTCCTGACTTTGTGATCCGTCTGCCTTGGCCTC^ 
CCTC^CCTTTTTTTATTTTTATTTTTTGAGATGGA^ 
TACAACCTCC^TCTCCAGGGTTCAAGCGATTCT 
GCCAGTTTTTGTATTITCAGTAGAGA
CTCTGGCCTCCCAAATTGCTAGGATTACAGWCTTGAGCCACCTCACCT

AATGATATATTGAATATAGCTGATGACCAAATGGTGTTGTAAAAATTCCATCAGGCCAGGTGTGGTGGCT 
AGCACTTTGAGAGGACGAAGTGGGAGGATCACTTGAGCCCAG^3ATCACTTCAA 
CTTCTGAAAAAAATAATAAAAAAAAAACATTAGGTGTGGTGGCACATCTCTATAG 
TTGCTTG AG CC CAGGAATTCAAGGCTGCAGTGAG CTTATGACGGTACCACTGCACTCCAGCCTGGGCAACAG AG CAAGACCCTGTCT

CPTAAGAGACAATAAGAAATAAATAAAAATTCCATCAAAGAATTTCTCT^ 
GAAATGTCCAGTGGTTCTCAACCATGTCTGACTGCACATTAGGATCATCT 
CCACAGACCAATTAAATCCAGACTCTCTGGGCTGGAGTC 

G CCCAGGTTT AG AAAAAT ACTTAAG AAAC CTGAAGAAT AATAC CAAAATGTCCGAAGTCT ATCT AATAAT ACTT CT AGAAG T AGAG 
GACAGAGACATCAAAGGAAAAGAAATAATGCTAGAAAATTTTTCTGGAGAT^

AATATCAAGGCCCCTGCAGGAATTAAGGAAAGAAGACAAGTACCTTGACATATCCT 

TTATTATGTCTTCCAGAGAAGAAAAGATGGGAGCCACAGTGTGCACTCTGGAAGGAAM 

CACAGTTTCTGCTTTTTCCCCCAGATGCTGCTCTCCAGTGCATTCCCCTAGCT 

GAGCTTCTCCAGGCTCCCTGGGAAGAGCTGCTCTTAACAGTTGGTGTTTGGG^ 
CTGACCTCATCTGCTGTGTAGTGTTCAGAAATTCCCC

TTGCAGATTTT AAAAACTTTTTCTTGTCAT C CCAGCAT CTTC CTGGGGAG AGGAAGTTGATG ACTT AG T C CTG T CGTTCAA CCTTT 

TTGAGGCCAAATAGGGTAAAATCATATTAAGGGGAATAATCCTGTATTCAGGTTCATCCTT^ 

AGTCTAATATTTTCTGGGGTTTTTTTGTTTGTTTTAATATAATCCTT^ 

AGCACTCAGAGTCAGGGGAAGTGTGGTCTCTTAGCTTTGTTTTCT^ 
AG AAAT ATCTTTCTG AAAACT CTC CTGTT AGTAG CATAATCTG AGG CAAGACACTG AT ATTTT AACAAAGAGAACAAT ACAGTAAT

TTACATAATATACATAGGGAATTGGTAGATATGGCTTTAAAATGACTTCTCATTTG 

AAATTTTTAGTACACTGTTGATAAAATATTAAGTGTATTCTCATTTTTCTGTTA 

GAGT GT ATG CTT ATCTGC CATGTCCAT AACAGAACGAACTGTG TTCCCTTTCTGATG CATTGGGT AGCAATTTT CCGCCTTTGG AC 
AGTTCAGCACAGTAAGCTTTATCATAGATTAATATCCATAGCTACATAGTGATTTTA 

GAAGAAAAGTCTCTCTTATAAAAATGGTGAAACTGTCTCACTGTGTCATTACA^

TTATATAACATTCAATCAAG^AAAAAGTTTTATTTGTTTTGTTAGCTTACAAAAAC 

GGGG AAAT ACTTGG AAAATTC AGTTT ATGTCTTT ATTCTTC C AG CTTTATCTG CTTTTGAAAG TT CAAAAG TTTG TTTT CTTGTTG 

TTTTTATCTTTGTGTTTTAATTATTCATAGTACTTCTCT 

AAGATGACTTGCATCCCTTGAAGTGTTGGAGCCAAAACATGAAGAGC^ 

CAGCTTATACAGGTTGCCTATTTACTCGAATTTA

TTTTTTTCCTCTCTCTTTTTTTTGAGAAAGAGTCTTGCTCTGTCACC 

CTTCTGCCTCCCAGGTTCAAGTGATTCTTGTGCCTCAGCCAC^ 

TTTTTGTATTTTTAGTAGAGACGGGGTTTCATCATGTTGGCCAGGCTGGTCTCAA^ 

AGC CT CC CAAAGTGTTGAGATT ACAGG CGT AAG C CACTG TT C CCAG C CTGAAAACATTTCTTCATTG AC AGT ATTTTC ATTT AAAA 

ATTTAGCTGAACTCCAAATGATGAACA1TCAAAAGAAAGCATTGTC1TT

TCACCCCTAAATACTACAGCATGTGTCTCCTATGAACAAGGATATTACTCn3CATAA 

G CTG ACAT ATAGTAGTAATAAG CT AAT ATGCAGTCTCCTGTTCC^AGTTCTTATATTTTGTCCC CTAATCCAGG ATC^ 
AT(^CTG ATTAC^CITGGTTGT CATAT CTCT ATAGT CTCCTTT AATCTGG AACAG ACCTCCCGTCTCTCCCTCTCTCTCTTTCTCT 
CTCTTTTTTT ATCTTTTAAGAG ATGGGCTCTCA^ CACAGTTCACTGCAGTCTC 

AAACTCCTGAGCTCAATCAATAGTCCTTCCATCTCAGCC

ATGTTGTTAACATTTTTTCAGGAGTCTAGGCCAATGGTCr^ 
TTOVGGTTAAACATTTTTGATAGGGATAGCACATAC^ 
ATAGCACTTTTTTTCATTATTGGTGATGCAGAGTTTGCTTATGATC 
TTC^AAGGTCTTTTTCCTTTTTTAATAATAAGTAATCTGTGGGG 

AGTGCTTTACACCCAGAGATCTATTGATGGATGTCAGCATCTCTCTG

GGCATAC^GTCOIGGACCCTCCAAATATTTAAGAGATCTTTTCCCTCTGTTAT 

GAAG CAT ATTAT AAGCAGTGGGTTTCTAGAAG CTGGGAACCAATCCCGTTCITCTGATTTTCCTG ATTATCTAGCTCACCTCATGT 

AGAGCTAGGTACATAATTTTTGTTGTTTGTTTGAGAACGATTGTTCrc 

ATGTTTGGTATAAAGCCTACTAGAAACTAGAGCAGAGAGACTTAGCCCCTGGT^ 

AACAATTGGATTTGTGTCTCTGTTGAACTGAATAGTGCT

ATTATAGTTGTATCAATAAGACATTTTCTGCAAGTGCAGAAAGACTCT^ 
GGGGGCATTAAGTTTGTT'ATTAGCCTAATGTGTAATAGCTGTAGGAGTGAT^ 

TGTGGAATTCATCATGGCAGTGGTTAGGTGTGTTTTGCTTAG CCTATGATTAGCCTGTTGG CCTTGCATGAAA 
AAATGTCTGTCAGGTGACTACCAATAACAATAAGAGCTAACATTTGAATAGTATACCTTAA 

GATTATCTGTGAAAACAGATATTTTTATTCTAATCACATAGGTAAGGAAAAAATCTTAGAG

GCTTGTAAGGAGCAGAGGAAGAACACCCCCAAATTCCATTACAAATTCCTTGCACT 
ATTCTAGGGAATCTAAATATGAACAAAGCATGGACCTTGCTCTGAAGGATCTTTTAAACCT 

CTT ACAG ACAACAAATCGAG AG AAGAG CTAAGGAAG T ACGT AAAAGGG CAAAATGG CCTT ACAAAT AG AGGGT CAAGGTTGGT AT G A 
CAGAAGTAGGATTTGAGAGAGGTGGATGTTTTGGAAACAAGAAAGGGAGTGACCAGAAGAATGACTATGOTA 

TTGTAGAGGTGGAATTGGCCTAGTTTGGCTGGAGCATGAGCTGCATGAAAGAAAGT^

GATAGATTAGGGCCAGATTGTGAAGGGCCTTGAATGCCACAT^ 

CCCTACAGAAGGCCCAAAAATGATGATAAATGGTTAAATATGTTTAAGTTACCAATCT 
ACTGTAACTATGAATAGTGTACTGTGTCCATTTTGCAGATGAAAAA 

ATTAAGAACTGATAAGGTCTTAATCCTCTGTCAAGAATATGGCTCTAGA^

CACCATGTTATGTTAAGGAATGTAATCACTACCCAACTTGACTTCTACTGTTCTCT 

AAGAATTGGATGATGTTTTATTTACATGCCTGATGACATCAATACAT<SCTCA 

AGCTTATTC^CCCTAAGTAAATCTTAGGTTAGGTAGAGCTCAGTTCCCAGGGGCATTCAAGA 

CAG CTAAAATATTTTTCTT CTT ACCAGGTTCTCT ACTT AAAAGACAATGACT ACT CC 

CCACAGCCACTGTTGCTGAGCAGGGAGAGGATACTACCTCCAAAAAAGACAGGGGAGTAT^

ACAT ACAAC^GAAACCTGCTGTCTGGAAAG ATTAT AG CATCTCT 
AAGTGCCAAACACTAGAGCTAGATTCTCATTTTGTCTTCATAACrTCA^ 

GGCTGG TCTTGAGCTCCT GGGTT CAAGTGATCTTCC CATCT CAG CTTCCTG AG T AGCTGGGATTTT C"l"l" 1 TT AAATTAATT AATT A 
AATTTTTGAGAGAGGGCATCTCACTATGTTGCTCAGGCAACTCTTG 
AGTAG CTGGGATTATAGGCACGAGCCACTGTGCCCAGCTAGAGATrrTAATT^
TGGGCT AGG TAAATT AACTTG C CT AAT ATCAT ATATCTT AGT AATG CAG ATTT AC CTGGGGTTCAGG T C CAAGTTAT AT CTCACTC
TAAAGGGCTATATTTCCCTATTCTCAGAGTATACAAATAGGTT^^ 
ACAATTTTGAAGACTTTTTAAGGAAAGAAAAGCTAACT^ 
TTTATTTTATTI^ATTTTGAGACAGAGTCTCCCr 
GCCTCCTGGATTCAAGCAATTCTGCCTCAGCCTCCCGAGTAGCT

GAGACAGGGTTTCACCATGTrGGCCAGGCTGGTCnTGAACTCCTGACCT 
GATTACAGGCATGAGCCACCGTGCCCGGCCCAAAGGGGTTTATTTTTAT^ 

TTG CGATTTC CTCTT AG AATAT AT CTT ACTTTTG AATT AG CACTAT ACAGT CATTCAC CAT ATAATG A 

CATTTCAGTCAGTGACAGACTGCACCTT CAG CAGT<K5TCCCATAAGATTATAATG<^CCGAAAAGTTCCTATGG CCTAGTGGTGC 
CATAGCTGTTGTAGCATGGGTAGTACAGTCTATTATGTCTTTGTGG

TTATGTATAGTACATAATACITGATAATAAATGTTTATATTACTGGTTTATGTG 

TGCTTCTTATTTTTTAAAAAGTTAACTGTAAAACACCCT^ 

CCAAAGCAAGAGGATCACTTGAGGCCAGGAGTTCAAGACC^ 

AAAAATTAGCCAGGCATAGTGGCATGTACCTGTAGTTTCTACTA 
CCAAGCTGCAGTGAACTATGATCATACCACTGCATTCTAGCCTGGGTC

AAAACAGCCTTGX3GCAGGTCCCT CAGGAGGTATTCAAGAAGAAGG CATTGTTATTATAGGAGACAGCAGCTCCATG CATGTT ACTG 

TCCTGAAGACCTTCCAGTGGGACAAGATGTGTTGGTGGAAGAGAGTC^ 

TATGrTGTTTGTGTCITAGTTTTAAACAAAAAAAAGTTT 

ATAAGGATATAAAGAAAGAAAATATTTTTGTACAGCTGTACAGTGTTTGTGTTCT 
AAAGTITT ACACAGT AAAAATGTT AT AGT AAG CT AAGATTGATTTATT ATT AAAGAAAG AAGAAAAATTTT AAAAT AAATG T AGT

AGGTGTCCAGTCTTTATAAAGTCTACGGTAGTGTATGGTAATATCCTAGGCCTTCATG 
CCAGAACAACTTCCAGTCCrcCAAGTTCCATTOVTGGTAAGTC 

ACTGTAC CTTTTTCTATGTTG AGATATGCTT AGATACACAAAT ACCACTGTT^ CAGTATTCAGTGCAGTAACAC 
AATGTACAGGTTTGTAGCCTTGGAGCAGTAGGCCAAACCACCTAACG^ 

AAGTACACrCTCCGATtSTTCZACATAAGGATGAAATCATCT

ACTGTGTGTG 1TT CT T CACTCTATAGTTAATCATTTTCACGTC 
CAGAGCZATCTCTTGATTCTTCCATCTTATAAGTGAATAAACTGAATCTGAG 
TTTCrCCTACACCATTCTTAACACATOTAAGTTTTGTATAGTTAAGT 
ATTACCAGGATGCTGAGTTTTAATATCTCTTGTGCCAGCAGTA^ 

GCCTTCTGTGCCTGGATCTCTITCTGAATATATGTGAATTGCTATTATGTGCT

AGGGCTTTATGCTATAGATTTACATCCTATTGTCTTTTITTAGAA 
CZAGGAAGCTTCTAGAAACAGTGGAGACTTTGCAGGCTCTCCTCAGCAT 
CAGAGTGCCCATGCCCAGAGAACATGATCCCCAACTTGCCTCAGACCCTGTTCT 
TAGGCTTATTAAGAAACATTTGAGATTTGAAGCCCTITAAATTAT 

CTTTTGGTTAAGATTrAAATAGGTCTAGTAACTGTGCAAGCT

AAATTAGAATAATAATGCCATCAACCTCATAGAGTTATTAG<3ATTAGGTGAAGCACTTAGCATAGT 
TCTTATTAACTATTACGTATTTTTAAAAAAAGAACCAGTGCTGGTTTGGAAATTCAGT 
TTATATTTTAAAAATGAATAGAACCTAGATTGTGTTCTTAAGGCAACTCTA 
GAAGTGATACC^TAGTACCACTTGCTGGTGTGGATGGCTCTCTGACTCTCTGGTAGCC^ 

AT AGTCCT AG AAATG CAGATT C CTTCCACGTG AGGCTT ACAGTTG AG CAT AG CCT C CCAAAT ACCACATGCAGTTCGTG AACATGA

GCACTTGTAGAATATTGTGTAATATCTTTTAGATTA^ 

TT CTITTGCCTITAAAG71AT ATTTGGGGCTGGGTGCG ATAG CTTATACC^GTAAT CTCAGCACTTTAGG AGGCTGAGGCGGG AGAAT 

CATTGGAACTCAGGAGTTCCAAACCAGCCTGGGCAATGTAGCAAGATCTCATCTCT 

TGGCATGTGCCTCTAGTCTCTGCTACTCGKKIftGGCTC 

ATTCTACCACC^CATGCACTCTAGCCTGGGTAAAAGAGCAAGAC^

TAAGAAAGCTGTGGAATTTAAGTTC^CAGTGTTTATGTGATATTTATGTTAGGTGGA 
GCAAATAATCGAATTATTTAAATAAAACTCAAAGCAAACTCCTTCCCCAAAAAGATG 
TATCTACCAAAAATGATTAATTTGGCATAGATTACAAATTAACAAATTACACC^ 
AACTGACCAAATACATTAGCAGCTTGCTCTGCCAAGAACAAAATAAATACAC^ 

ACAAGTTATTCAGTATCATATCAAGCAGTGGAATGATACTCATG

CCTCTTTTACCAGGAAATTGGGCTTTCTGAAAATAAACA^ 
AAATTACACCTTTGAAGCTTTGCTTAAGCTTCTTTTTC 

TGGACTAGCTCACAAGCCTGTGTTAGAGAGTTTAGTGTCGGGCGCGGTGGCTCATGCT^ 
GCGGGTGGATCACGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATCT 

GAGCACX5GAGGCGCACGCCTCTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATTC

TGAGCCGAGATCGTGCCATTGC^CTCCAGCCTGGGCGACAGGGCAAGACTCCG 
TACTGTCAAAATATAGGGCATTAAACCAATGAACAGGATTTATTGGTTAATTCT^ 

AGTAGATATAACATAAAAGACAAGTTGGTAAAGGACAGTTTTCTAACAATGAGTTATTGTGAGGAATGAAGGGAGGC^ 
AATGGATAATAGTATATTTGCATATCTGCCTGGATGTCTTGGGATGGACTGTGTGACCITTTTAT^ 

TTTGACCTCCTTTTTCTTAATGTTTGTCTTTTCTAGATO

AGTTTATGTCCATTACAAAGCAAAATTGTCAAATGGAAAGAAGTTTGATTCC^ 
TTGGCAAAGGTAAGAGGGTATTTTTTTGAGTTGTGTTTTACT^ 

CCTTTCTAAAAGTAAAGAAATTTTAGAGTAGTAGTACTCAAAGAATGGTCTGAGAATCCCTAGGGCTCCCT 
GGTCCCAAATGTGATTTATTTTCATAACAATACTAAGATATTATTTGC 

AAAAG CTG CAG TTGT AAAAAAACTG CTGGTTCCTTAACACAAATCAAGG CAGTGGT AAC CAGCTG TT CCAGT AGT CATGGT A

CCACTCCCACTCCrTCACTATTTACATAAGGGAAAAAGCT 
TTATTGAAACCCAATCTTTGAGTTTCACATCACTCT^ 
AAGTATGATGGTTGTCTCAAGGAAAAGTACTTGTGCTCTTGTTTT 
ATTTTTACTTGAAAGGGCAAATCTGATTATGCAGACTTGATTC^ 

GTCATTTCAAGGAAAAACAACTGACAATGTTITTTGCCATTA^

ACTGTGGGCTT ATTreCCACTGTGGGCTTGACAG CTTCCG AAG ACTTTT CTGATGAG ATTGGTCGGT ATT AATGAATGTG<3TTTTA 
AATATACAATGAAATGTGTCCACATTTGGAAGATCTGTATAACTAAGAATGAATATTTTCT 
GATCATGTGTGAATAAACAATTCATTCAAAGTTAGACCAACAGATTTTAA 
TTTCCACATTGTAATTGACTTTTATGAATCTTACAAGTTC 
AAGGTTG TT AAAGGGCTC CT ACCCTTTT CAACTAT CT ATCTTTGTGGGTTTGGATTTTCTT CT CI" ATGT ACi " TCAACCAAAATTGC
WICAGATGAATGCAGAAATAGGAGAATCTGGTTGT

GACACTCGTCTCACTAA T1"! 1 T ' l 1 1 1 1 ATTTTGGAAAATACAAAATATACAAATACTTTGTACAAAATTAAATTACTTATATTAAC 
ATGGGGTTATTCTAAAATGAATTATTAAATAACTACTTTTTtACTAAATCTCAGTAGAT 
CCTCAATTTTTAAGATGTTAAGGGGTCCTGAGGAAAAAAGGTTGAGAACTACTA 
TATTTTCAAATGCCTTTTTTITTTGAGACAGGGTCTCGA

GTGATCCTCCCAACTCAGCCCCGCAAATAGCTGGGAATACAGGTGCGTGCCACCATC 
TGTAGTAGCACCATCTCGGCCCACTGCAGCCTCCACTTCCGAGGTTCA^ 

ACTACAGG CGT ACACCACCACACCTGGCTAATTTTTG'ltiTT T TTI^GTAG AGATGGGG^'i~i"it5CCATGTTGGCCAGGCTAGCCATGA 

ACTCCTCACCTCAAGTGATCCGCCCACCTCAGCCCACCCAAACACC^ 
TTTTTTTTTTTTTTTTTGTAGAGACAG

TCAGCCTCCCAAAGTGCTAGGATTATAGGCATGAGCCACCACACCTGGCCTCAAATGCT 

TCTTATTTTTTAATCTTGTAGGAATAATAGCCXATACATCAAA 

AGTTACATCTITTTATTCTTGTCTGCCACAGACAAGCAACTGTAG 

ACCATGGCTATGTGTGGACTTCATTCCCTTGTCCTTCAGCTCTGTATCTAGTTGAA 
ATATAGTAAGCAGAAACTGAGGAGTTACTTTCTGTCTCTTGCCACTCAG

ATTCTGCTTCCCGTCCAGCTGTGCTCTTTCCAAGCTCCAGTTCTGGCCACGC 

T AATGG CT CATG AGAT ATT AATTTTTTGTTTGAAT AAG ATAG CAGT AAAAAT CAAAT ACAGT CTTGTGG CAGAT AATT ATG ACAGT 

AGGTGACCTAAGAGTGACTGACCTTGTCAGACCCCCTTGTTCCTGCCACCTGCTAAAT 
TAAAGAAGCTGAGAGTTAAGATAACAGGTCCTTTTAGCAGTTATCTTGGTGAGCAG^
ATCTTCTCTACTGAGATrTTCTCAGTAAAACACTATGGTGAATGAGTTTCCT 

TATTITTTT AAAAT ACAAGTTGGGAGAAAAAAAGAGG CAACTTT CAGTTG ACTTTGTAC CTCTG ACACAC CTGTCT AGAGT CAG TG 

CGGTCTCATGGAAAAAGCTTCCAGCCGGGCATTAGGAAGCCAGGGTTCCTGTCTTCCCTGT^ 

AGCCACTTAATCTTACCTGCTTCrGTAAAAAGAAGTCTGTGGAGAGCTTAT 

GGAACAAAAACCATAAAAGACTAATACCAACACAAAGAGACCCTTAGAGGCCTCCTT

GCAATATACCTCTGGTAAAACATCTATAACATAACGTATTTAGTT^ 

TGTGG<^ T T TTTTTTTT C CTTTTTTTT T^ 1111 ITGAGATGGAATCTCACTCTGTAGCCCAAGCTGAAGTGCAGTGGCGTGATCT 

TGGCTCACTGCAACGTCTGCCTCTGGGGCTCAAGTGATTCTCGTGCCTCAGCCTC 

CACGCTTGGCTAATTTTTTGTGTTTTAGTAGAGACAGGGTTTCACCATGTTGCCCAGGGT 

TCCGCCTGCCnTATCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGTGCCCAGACTATAGrTTTATA

CATTAGGTAGTTTATCAAAATGAAGAAAATAATATAAATGGACAGTAGTATAAATATTATTCAT 
TTGTACCTCTCCCAAGGGAATAAATAGAAATATTTCAATGGAAAGTC^TCCTAATTTGGG 
CATAGAACTATAAGAGCCTATCTGTAAAAATCCCATTGAAAAGATAAACATATCATAAAATACACAAATTCAA 
TTGCTTTGTATATGTC^TTTGCITTGT^ 

TGAGACGGAGTTTTACTCTTGTTGCCCAAGCTGGAGTGCAATGGCATGATCTCAGCTCACTGTGACC

CAATTCTCCTGCCT CAGCCT CCC AAGTAGCTGGGATTACAGGCATG CGCCACCACACCTGG CT AATTTTGT ATTTTT AGT AG AGAC 
GGACTTTCACCATGTTGGCCAGGCTGGTTTTGAACTCCTGACCTCAGGTGATCCAC 
CAGGCATGAGCTACCGCGCCTGGCTAGCCGTACCATCTTTTAAACTCTATTCTTTAC 
TATACTACATATATTATATATGATTATATAAGTATATATTATTATGTTGAGGATTA 

CCTAACAAATGGTAAGCACTCAGATGTTGATTATTATTGTGGCAG

CTGTGTTATAACATCTTGGGTCCTGACTTCATCTGTTGACTAT^ 

AGCCCCAGGAAGTTGGGAAAGACAGAATCAATCTAAGGGAACTGATTATTACTACTGTGGAATATGTTACAG 

AGGTGATGATTTCTCTTACTAGGTTTTCTTAACTAAATATTAGAGGTTGTGAAT^^ 

AGCTGAATTAAGAACAAGGATTTTCATACATACTTCTATAATAGGGGTTCTGTTCTTTTACCCTA 

TCACTAAAATGGAAGAGGGCAATAGATTTGGAGGATGAAAGAATTATTTCTACCCTT^

CTTGCTGTGTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTCCGCTCATTGCAACCTCCGCCT 
CTCAGCCTCCTGAGTAGCTGGGACTACAGGTGCATACTACCATGCCTGGCTAATTTITGT^ 
ATGTTGGCCAGGATGGTCTTGATCTCCTGACCTCGTGATCCACCCACCT 
CCACACCCAGCCTCAATCTTACTATTTTTTGGATTCTTTGTACAATTAGTATAAT^ 

AAGCCACTTTGGTTTTTTTATTGAGGTATTATTTACGT

TATTTCTGTAACTACCACCACAATCAAAATATAAAACAATTTCTTTACCCTAAAAAGTATTTT 
CTTCCTGGACCCTGAGCCCCAGGCAGTCACTAACTTGGTTTCTGTTACT 
TCTTGTATTATCTTTGGTGTTGACTTCTTTCACTTAACATAATGACTTTGGGATTCAT 
CAA1~TTGGGAATCCCACAGTTTGTTCACCCATTCAAAAATAGGACATTTG^ 
CCTTATTAACACGTAGTATTGTCGATCTTTTTAACTTTAGCCATTCTAGTA^

TCAGATCTTCTGCCCATTTTAAAACATCAGTTGTTTGTCTTACTG^ 

CCCAGTCTATCTrGC Cl ' l ' ri ' T A Tlll ' Cl ' TA ATTGGTCTTCGGAAAAGTAAAAGTTTT^ 

T Tr TCTTTTATGGTTTGGTTACCrACTGTTATTAACCATTTT^ 

CAGAGAAGCTCGGTAAATTGCCCAAGGCTATACAGTAAGTGGCAGAGCCAGGATTCAAATC

AGATCTTAAGGTTTAAAACATAGCCCAAACCAGGTACAGAAATGAAGTAGGTCCTGGTTAGCA 

CAGTAGAAATTTCTCTTTTTTTTTGAGACGGAGTCTCGCTCTCACTCAGGCTGGAGTGCAGTGGCACGATC 

CTCTGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTC 

TTTTATATTTTTAGTAGAGACGGGGTTTC ACCATGTTGG CGAGG ATGGTCTCAATCTCCAGACCT CGTGATCCGCCTGCCTTGGCC 

CCCAAAGTGCTGGTATTACAGGGGTGAGCCACTGCACCCAGCCACAGTAGACATTTCTTACCAGTAATGAT

TTCTCTCCAGAGACTTGCTCTGTTATTGGGGCTGGAATCCAATGG 

GTG ATCCTCCTGCCTCAG CCTCCCAGAGTGCTGGGATTACAAG CATG AGTCACTGCACTCAGCCAATATATTCTTAGTATCTGTTT 
ACTTTTTTTT ACATGCAG TTT AGGAG CATT CAGG AGATTTTT AAGGAAAG C1TTTAGGCT AAAAT ATTTAATTTTTTTAATTGT CC 

CCAAAAGTGTGAGCCAGAAAGACAGCAAACTTTAAGAAAGT^

TTCCAATTTCCACTGCTGAGTGGATATCAGGGAGGAGCTGATTTAGGCT 
CGATATTACATCATGCCTTTGTTCTCTTTTTGAGTACAGTGTTGT^ 
CTACCTTCAAGTATGTAACAATTTAGTCTGGCAGAGTTGATCCTTAAGG 
TTTCCCTCTTTGTTTGTTTGTTTGTTTTAATTATAAGCCATGAAGGTGGGGA 
GAGAAATGAOUUVAATACAATGTAACATAAGATATGAA
GGAGGCCAAGGCAGGAC^ATCAGTTGAACCCAAGGGTT

CAAAAATTAGCCAGGCGTGGTGGTGCACGCCTATAGTCCCAGCTTTCTCAGGAGGCT 
GTCCIAGGCTGCAGTGAGCTATGATGACATGCCTTCACT 

ATT AATTTGTTT AATTCATGATT AG TT ACT ATTGGT AGGG AGTAGGGATGATT ATT AG AG CCCTTTT AT AG CT ATTTT AAAAT AAA 
ACTAGGCCAGGCGTGGTGGCTCATGCCTGTAATCC^

GACCAGCCTGGCCAACATAGTGAAACCCTGTCTCTACTACAAATACAAA 

CTACTGGGGAGGCTGAGGCAGGAGAATCACTTGAGCCTGGGAGTGGAGG 

TGGGCGACAGAGCAAGATTCCATCTCAAAAAAACAAAAACAAAAAAACCTAA 

TTGCTTTATTTAGTATAAAAAGAAGAAAGGATTTCTGTATTATCTGTT 
G AAT ATG CAAGT AGTT AAAAT AG AG ATCAG AAATTGGTACTGAGTCCT AC<5 AA CAG

ATGCATTTTTATTACCTGAAGCTTATTAGTGAT^ 

TATTTACCTTTCTTGTTCTTTTCTACTCCCATATGAAA^ 

TGTTCAAGATGCTCATATTCAAATGTAAATGTACATTTAA 

AAAGAACAGTCTTTTCAACAAATGGTGTTGGAAAAGCTAGATATCCACATGTAA 
CTGCAAAAATTAACTCAAAATGGATCAAAGACCTAAATGCA^

AAGGCCGGGCAGAGTGGCTCACACCTGTAATGCCAGCACTITT 

CCAGCCCTGACCAATATGGTGAAACCCCGTCTCTACTAAAAAAAAAAAAAAAAAAAAAAA 

ATAATCCCAGCTACTTGGGAGGTGGAGGCAGGAGAATCGCTTGAACCCGGGAGGC^ 

GCACTCCAGCCTAGGCAACAAGAACGAAACTATGTCTCAAAAAAAAGAAAAAAGA 
GACATTGGATTTTGCAGTGGTTTCTTGGCATGACATCAAAAGTACAGGCA^

T CATAAATTAAACAT AAATTAGAAT ACAT CAAAATT AAAAACTTT^ CAG C CT ATGG AA 

TGGGAGAAAAATGTTTG CAAATCTT AT AT CTGAT AAGGGATTAAT AC C CAGAAT ACACAAAGAACTCCTATAACT CAACAAAAACA 

ACAAAATAGCCCAATTTAAAAATGGGTAAAGGACCAGTTGCAGTGGCCATACCT^ 

AGGGTCACTTAGACGCCAGGAGTTGAAGACCAGTCrGGGCAACATAGC^ 
AGGGCTTGAAATTTCTCCAAAGAAGATATAAAJ^TGGACAACAAGCATAT

GCAAGTCAAAACCACAGTGAAGTATTACCCTCATACCGGTTAGAATGGG 

GG ATG TAGAGAAATTGGAACCCTT ATG CGCTGTTGGTGGGAATGT AAAAT AGTG CAG CTGCTGTGGAAA^ 
CAAAAAATTAAAAATAGAGTTACCATGTTGATCTAGCAGTTCTACTTGTGG^ 
AGAGATATTTGTACACTCATGTTTGTAGCAGCATTATTAACAGCAACCAAAAG 
ATAGATAAACAAAATGTGATATAAACGTACAGTGGAATATTTACGTTTTAAAGAAAAATTCTGGCAGATGCTA

CCTTGAGGG CATTATACT AAGTGAAGTAATCCAGTGACAAAAAG AAAAGTACTGT CTG ATTCCACTTATATGAGTTATCTATCCAG 

AGTAGTCAAATTCATAAAAACAAAGTAGAATAGAAGTTGCCAAAGATTAGGGAAGGAGGAAAT 

TTTTGCAAGATGAAAAAGTTCTGGAGATTGGTTGCACAACAATGTGAATA 

T AAGATGGT AAATG CTTTTTTTT ATC C CCC CACAATT AAG CGTTTTTTTT AAAACT 

TCTGAAGTGGTCACTATAGTCTTTACTATAAATGAAACTGTTAAGTGATTAAAATGTTGTATCTTAAAATATGT^

TTAAACCAGAAAAAAGTGTAACATTACAGAAGGAAGAAAAAGTATAATGCATTGCITTTATAC 
TTTTCACCTGCCTATAGTTACATACGTATAAAAAGCATTTAAATTTAGTAATAG 

TGTTTTATATTTTATTACAAATGG AAT AAGACT CCTG AAATTG AT AAATTGTGTATAAG CT AGTAAAACTG AAGTCCAGATCAAGT 
GTAAGAATAAATTGACGTGGCOVGGCIACGGrrGG CTCACGCCTGTAATCCCAG CACTTTGGG AGGCTGAGGCAGGCX3GATCACGAGG 

T CAGG AG AT CGAG ACC AT CCTGGCT AACACG ATG AAACT CCAT CT CT ACTAAAAAT ACAAAAAATT AGCCAGGAGTGGTGG CT AG C

G CCTGTAGTCCCAGCTACTCGGGAGK3CTGAGGCAGG AG AATGGCGTG AACCCGGGAGGCG CTTGCGGTGAGCAGAGAT AGCG C 
CACTGCACCCCAGCCTC^GCGACTIGAGCGAGACTCCATCTCAAAAATAAATA 
TTGGTGTGTTTITTTTTGAGACGGAGTCTTGCTCTGTCGCCCAGGCrrGGAG 
CCTCCCGGGTTCACGCCATTCTCCTGCCTCAGC 

GTATTTTTAGTAGAGACGGGATTTCACCGTGTTAGCCAGGATGGTCTCAATCCrCTGACCTCG

AAAGTGCTGAGATTACAGGCGTGAGCCACTGCACCGGCCTATTCTTC 
TCTTTAAAATGTTTATGCTTTG<^GGTTGGGCAT 

ACTTCrAGC^CAGGAGTTTGAGACCAGCCTGACCAACGTGGTGAAACCCTGTCTCT 
TGGCACGTGCCTGTAATCCCAGCTACTTC^GAGTCTGAGGCACGAGAATTGCTTGAACCCAG 

GATCGTGCCACTGCACTCCAGCCTGGGCAACAAAGTGACACCGTCT

TCTACATCTAGGAATTTAACCTAAGG AAATGTGT AGAAGCTTGCITAACCCTCTC 
ATATATACTGAGTGATACAACTC^TTTCTTGTTCTCTGGTGTGGTCTCACACATTGCTCCC 
CCTGCAGCCACAGACCACTAGGAACACACCrATAGTTGAACAC^ 
GTGGAGTATCTCAGTAAGAGTGTTAGGAGGAATATTTTATAGGACTTGTGCTTCT 

GAGCTTTCTCGATTGGATGCTGTTAGGATGTGGGGTGATTGATGTTTGGGTATCT

GAGTAGAGTGAGACCAAAGCTAGAACTGGTAGAGAAGCAGCAGTCACTCA 
AAAATGCTCAGTGACCTTGTTTTTTTCCTCTGGGCTTAGACAAAAA 
TGGCGTGATCTCAGCTCACTGCAACCTCCAGCTCCCAGGTTCA^ 
GCGTGAGCCACCGTGCCCGGCCAGACAAAATTTTGAAGTATTTTTGT^ 

CATGTrGATATTTTGCGAGATTGTTTATGTCCCACAGGAGAATAATATGGCCTCCCT

TGAGGTCCAGCTGTAAGTGTTCAAGGAGTTTCCAGACGTTGGGGAGGCTTTT^ 
CCIAGGTGCCTGTAGTCTCAGCCACTCAGAAGCTGAGGTGGGAGGATCACTTGAG 

CAAGACCCTGTCTCTTAAAAAGGAAAAAAACTGGG CTG ATGCAATGGCTCACACCTGTAATCCCAGCACT CTGG AAAGCTGAAGCA 
GAAGTATCACTTGAC<5CTAGGAGTTCAAGACCGACCTGGGCAATGTAGCAAGACTCCATATCTA 
CCTGTAGTCCTAGCrACTAGGGAGGCAGAGGCAGGAGATTCTCTTGAGCCCAG



CTAAGGTGGGAGGACG3CTTGAGCCTGGGAGGTTGAGGCTGCAG 
GCAAGACCCTGTCTCTTAAAAAAAAAAAAAGTGATGTAGCCATTTCGTAAA 
TATCACCTTTGCACTTCCAGGAATTTACATTCCAAATATACTTATG 
TTACTGTTGGTAATATATGCATCAATAGAAGATTGTAGTATTATACACCCATTAAGAATGAG^
ATGATGATGTCTTTCCTTTATCAAATACATATT^ 

GTATTTTTAOTGGAGATGGGGTTTCACCGTGTGA 
A^GTGCTGGGATTACAGGCGTGAGCCACCACGCCCXXKZCTTATCT
TTATTATTTACATTTTGCAGGTGAGGAGACTAAG

TGGTATTCAAATCCAGGTATTCTGAAGTTAGAGTATGCTGCATTTTTGACCT 
AAGAGGATGAAGATATATAATAAATGAAAAAGTAGTTTAGAACTGAATGTCTAGTT^ 
AAAGTGACAGCACACAGTAGGATACCAAAGAAACGTAACAGTTACTTATAAGGAGCAGA 
ATCTCTTTCATGGGTGTTTTAATAATAGXSCATC^

GTCTGGAGTGCAGTGACATAATCATTGCTCACTGTAGCCTCTT^ 

G CTGGAGCTACAAG CATATGCCAC CATG CCG CACT AATTTTTT AAATTTTTGT AGAG ATGGAG TTTCACT ATGTTGCCCTGGC CGG 
TCTTAAAACTCTTGGGCTCAAGTGGATGCTCCTGCTTTGGCC^ 

ACTTACAT AT ATTTTT AACTGAAAT AAATGT AGCTGACATTTTG A CT AAAG TCATG AA 
TAAAAAATGGTTTATtTTATCCCTGCTTACATACTGTTATGTAGGGTTTCCCTTC
TCACTTCCTCTGGGGTrCATCTCTTATCTGATCCTACTTTrGAATA 
TCCACTCTGCTGCCCAGGCTX5GAGTGCAGTGGCATGATCTCAGCTCACTC 
CTCAGCCTGCTGAGTAGCTGGCAAGCACCACCACGCCTGGCT 

TTG CCTCGG CTGGTCTCAAACTCCT AAG CT CAAGGGATCTG C C CG CCTTGG C CT CCCAAAGTG CTGGG ATT ACAG CTGGGATT ACA 
GGCGCGAGCCAACGTGCCCAGCCCCCAGATACTTAATATTTCTGATTGACA^
TTTTTTGGAGACAGAGTCTTGCTGTGTCGCCCAGGCT 

TCAAGCAATTCTCCTGCCTCAGCCTC CTAAGT AGCTGGGATTACAGGCACGCACTACCATGC CTGGCTAATTTTTGT ATTTTT AGT 
AGAGACGGGGTTTCACCATGTTGGCCAGGCITGAATTCCTGGCCTCA 
AQVGGCGTGAGCCACCATGCCCCACCAAAATGTAATTTAGCCTTTTAAATGG 
ACTGCTTACAGTAAGATAATGATGCACATCATAGGAACTATAAACATTGAGAT^

G AGCAGT AAATTT AACTT AGTTTC CTGG AAGT AGGCAAAAAAT AAAAT ATTTT AAGTT CTGT AGTTG TGG TTTGACAAAAAGACAC 

ATATGTTTTTGTTTAAAGGAACATTTTTCTGATTCTAA^ 

GTTCTGGTTACTTCTTGTTTTGTTGTTTTTTT^ 

TATTTTTGTTCrrATCTCTGCCTAGAAAATTTGCGCTT'ATTATAGTTG 

TCATGGATAATGTTACCACCAGTTTTCAAAGACTTAATTTTTGAGTAGTTTAGGA

AATC Cl ' TTri 'GAGACAGGGTCTCACTCTGTCACCCAGGCTAGAGTGCAGTGGCATAATCATC 

GG CTCC CAGG CTT AAG CTCT CCT CCCACCACAGCCTC CGG AGT AGCTGGG ACT ACAGGTG TG CACCAT ACC CAG CTGTGTT CTTTG 

GTTCTTTGGGGTGTlTlT^fTriTTTTTTAGAGATGAGGTCTTGCTGTGTTGCCCA 

TCCTCCCACCTCAGCCTCCCAAAGTGCTGGGCCGATCAACAGGGAATCTTTTOT 

GTATGTAGCACTCTGTCATCTTGTAGCCCATACTTTCTAGCTGAGTATGTAGTATGA

GAAAGTTAAATCTTTTTAAATAGGGCTTATAGCTTTCAACT 

GGGGTGGCTACCATGAAGAAAGGAGAGATATCCCATTTACTGTGCAAACCAGAATATGCATAT 

AATTCCCTCGAATGC^CTCTCrTTTTTGAGGCAAGTATG 

TTGAAGCAACAGGCIAGTGGGAAAAGGAGCrrAAGCITGCTATAGAGAAT 

ATTGTTTCCC^CAAACCCAAGGGTGCGCGTGCCAGTGCITTCTTAGAAACG

AAAAGACGCTACACTTAGAAATAAAGATTCTGGTCCATGGTGTGAGTGTGACATACTCAAAAT^ 

GTTGGCGATAAATGTTGAAATATGCCTATGTCACGTTGCGGGGTATTTACATAT^ 

TCTAGAAGCCGTTGGTGGTGATAAAACATTATTCATCCTATGTTG 

CCAGCTAGGTAAAATACACCTTTAATCAGTCGTAGGTCACATCCATGAGATTTATTCTTACATT^ 

CCAGGATGGGAAAGTTGTGGCCTCAAACAAAATAGTTCTACACC^

CTAGTATTTCCTATCITTGCACTGCAGTCCCrrTAGGATATAGTTCr 
AGTATAGTGGGTTTTTTTTTTTTTGCTACTTTC 

GTAAAATGAAATAATTGGTTGCCCTTGCCTTCACATATGTTCAGTTGGCITTA 
TTCTAGATTGAGCTCCTTGATTTCAAAGGAGAGGATTTATTC 

ATATTCAAATCCAAACGAAGGAGCAACAGTAGAAACTAAGTATTAAAGT

TGGACGAAATCGAGACITAAAAAAAAATGTTTGTAGAAATGAATATATCGG CTGTGCGCAGTGA 
TTTGGGAGGTCGAGGCGGGTGGATTGCCTGAGCTCAGGAGTTTGAGACCAGCCTGGCCAGC^ 
AATACAAAAAACTAGCCAGGTGTGGTGGTGCACGCCTGTATTCCCAGCTA 
GGAGGCAGGGGTTGCAGYGAGTCGAGATGGTGCTACTGCACTCCAGCCTG^ 

TTGAATATATCTCAAAGATAGTCTTAGCTGGGCACAGTGGCTCATGCCT

CTCTTGAG CTTAGG AATTCGAGACCAGCCAGG ACAGCATAGTGAGGCCCCATCTCT ACTAAAAAG CAT CAAGAG AATTAGTCAAGT 
GTGGTGTGACGTCCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGTGGAAGAATC^ 
GAGCCGTGATTGTGCC!ATTGTGCTTCAGCCTGGACAACAGCAAGATTCTGTCCA 
TAATAArrTTATTTAGTAAATATTAATTCAACATCTACGTGTCAC^CATGCTACCAG 

TAATTCCATCCACAAGAAATTCTCAATCTGTGGACTCTCTGGTGTGCCAACA^

TTTCTCACAACCTTAAGAGGTAAAATTTCTTGTCCCCAG^^ 

AGAGTTACrAAACAGAGCAGTAGATTATAATCCCTTTTAGGACTCAAATACATCTCATCAGCTGTCATCAAGATTA 

CATCATTTAGATCAATATTGGATAATAATGTTCATTCTATTCCATATCAGGAGACCTA 

AGTAACCTGAAGCTATTCCGTTTCTTCATTTGCTAAGTGGGAGTAGGGGCAGCTCTGA 

CCCAGAGCAAATTATATGTACCT^CTGTGTATACTACTTGTCAC

TGAGTGCTTGAGGACCAGAATCTTTCTTCATCCTTTTOTG 
GGCTTTCTGCATGGATAAGTGCTTGAGAAGTGTCATTAGTGAGCTC 
TTCCATTAAGAAOIAAAATCAAATCATTCAACAAGGATGGGAACATC 
AAAAGCCAGGTGTAATGGTGTGCACCTGTAGTCCCAGCTACTTGGGAGGCT 

G CTATAGCGTGCT ATGGTT ACACCAGTG AATAGCCAGTG CACTCTAG CCT AGGTACCATAGTGAGACCTTGTCrCCTTAAAAAAAA

AAAGTAAAGAGTTCATTTAAACAGAAAACAAGAATAGCTGA 

AAGTTTTTGAATACATGAAGAGCrrTTAAATGTAGAATACCAGGGAAATAGCTCCAG 
CAGCCCTGGCCTCAGGCTGGCTTCAACTAGACTTCACCT 

TGATTTTTAAGGTCCTACTAGCCCTAAAACTCCATCATAGAATTCTAAATATrrA

TCACTATATTAGTAAGCACTTAAGCTTGTCTATAATGTGTATTTTAAATTTTTCT 
AGAGGTTTACTCCTATGGATACAGGCIACACCCTAGATTCAGTAATTTATAATA 
GAGGAAAAACTTTTCCCAATTATAGACAATATCACAAAATTACCGCTGTGGAGATGA 
TTGCTTTTGGTAATACTTGCGATCGGAGTCAATTTTTT^ 
AACTAGTATAGCGCGTTCCCTTATATGCTTTACACAGCTTCCTCTAAATGTTAA
TCAGGAAGTTAACATTGATGTAGTACCACTAACTG

TGGTCTAAGATCAAACTCAGGATCCAGCATTGTATTTACrTl'l 11"! 1 1TIT1 IT'n'TTl'l 1 1 1 1 GAGATGGAGTCTTGCTCTGTCA 
TCCAGGCTAGAGTGCAGTGATGCTATCGCAGCTCACTGCAACC?TCCACCTCCT 
AGTAGCTGGGACTATAGGCGCACX5CCACCATGCCCGACT 
GCTGGTCCTCAAGCTCCTGACCTTGGGTGATACACCCGCCTCAGCCTCCCAAAATC
AGC CT ACT CTACT ATTTTGAAT ACIAACTGAC CG TTTTGT AG AAT AT^ 
ATGCATTTTTTGAAAATACCACTGAAGTGATGTGCCITTCATCATTAT 
GTTACCTTCGAACACTGGGTTAAGGTGGGAAACCATGTTTCACTACT 
GGAGTGAAGTACTATCTTGTTTTGTATC^CAACTTTACACA 
TGGCATTTGACrAATGGTAAATTTTCTATTGCTGTTATTCTCTCT

ATATCATTATGGATCTGTGTATATTTATTCCATAAGTTATTATTCATTACT 

TTTGACCATTGGAAACTCCTTTAAGTTTGTTTTT 111 AAGTGCTTCCTTATTTTCTGG 

TACCACAAG ACACTCATGGCTCATCTTAAAATTTTCCTGCC CT AGACCTGG AACCAACCATTTCTCCAAAG AG CCCTG ATTCCTTT 
TATTGGAGAAAGGCATTTAGAAACCAAGATCTGAGTGCTAGGTTTGCTCCTGCTATGGGAT^ 
AGACAGAGAGATAGAGAATATATGGACCTCAACATACATACGTATACATCTATATGTATT^

GTACAGTGGTGCAATCTCAGCTCGCTGCAACTTCCGCCTCCCAGGTTCAAG 
TTTACAGGTGCCTGCCACCACGCCCAGCTATTTTTTGTATT^ 
ACTCCTGACCTCGTGATCTGCCTGCCTTGGCCTCCCAAAGTGCTGGAATTAC^ 
TATATGTTAAAAACCAGGGATACTCACTGGCAGTTG

TGAAACTCCATTCTCCAGAAGTGAGAAAATGCTCTCATTCTCCACAGCATACT 
TTTTAGAATTGCTAACCTAGACCCCTGTGACAAACTGACTTTAGGAAATATTTGTTTATAGTA 
GTATATTCAAAACACTTGCTITCCAACTGAGGTAACITCTT^ 
AATTATTCATTTGTTACCCTCTATATTCCA 

GTGAAACATTACCATGATTTT AAACT AT AT AAAAAACG AT AT ACT CAAAAGTGTCATTTCCC CCTCATCCTT ATT AAC CTGTT C C C

ACTTCCCCATTATTTCCACCCTATTCC!AGTCTCATTAGTTATATCATTAGATC 
TGTCATTTTATTTTTTTCTGTTT CTGTT AACAAAGGGAGCCAG 

AGG CAGGAGGATGG CTTGACCT CAGG AATTTG AG AC CAT C CTG AG CAACAT AGAG ACCTTGT CT CTGCAAAAAGTTT AACATTTAG 
CCACATGTGATGGGACATGCCTCTAGTCCCAGCTACCCAGAAGACT 

AG AG AGTTGTGAT CATGCCACTGT ACT CCAG C CTGGG CAAC AG AACAAG G CTCTGT CTCAGTTTGAAAAAAAG AGAG AAAGAACAA

AAGAAAGGGCAATGACTAATCTTTTGTCTTTC^TCTTGTTAGAATGTA 

TTTTTCACTCAAAAGACCGTCTTGGTTTTCTGGAACAAATCTATTTC^^ 

AAAATGTCCTGGTTTTTCCTCCITTAAAGAAGTTAGCTATCTC 

ACCAAG ATCACCTTGTTCTTTTTGAAATCTTCTCAGTCT CTTTATAG CATCACCAAAT ACACTTAG ATG 

TGAAACCIAGATATAAAACCTGGAACTAGGCCAGGTGCGGTGGCTCACGCCTGTJ^

ATCACGTGAGGTCAGGAATTTGAGACCAGTCTGGCCAACATGKSCAAAACCCCATCTCT 
TGGTTGTGGGCACCTGTAATCCCAGCTACTTGGGAGGGTGAGGCAGGAGAATTGCTTGAT 

AGAGATT ACGC CACTACACTCCAG CCTGGG TG ACAGAGG G AAACT C CGT CT CAAAAAAAT AAAT AAAT AAAAT AAAT AAAAC CTGG 
AACTAAAGCCGGCTATAGAGTGAACAAGAACAATAGACACTGATCACCCATTACAATATGTT^ 

TTTCTGTCATTTCTGTGTAAAGAGACAATGAAGTGGTATGGAGGAGGGGACAGATAGALTriTGT

ATCTTGGAATTGCTACCTGATACTTTTATGACTTTA 

GGGAATAGTTTTCACTCCCAGAGGTTGTGTGAGAGTGATAACGGTTTGGCTGTGTGCCT 

CCCACAATCCCCACGTGTCGTGGG AAGGACCCACTGGGAGGT AACTG AATCGGGGGGG CTCCCATGCTGTTCTCATG ATA 

GAGTGAGTTGTCACGSAGATCTGATGGTTTTATAAGGGTCTGGCAGGGAAACCCTGCTGGCT 

GAAGGGTGTGTTTGCTTCCCCTTCTGCCATGATTGTAAGTTTCCTGAGGCT

TCTTTTCTTTATAAATTACCCAGTCTCAGGCAG^ 

TCAGACATTTAGTGGACACTT'AATTTCTTCTACCCTATATATTGTGTTrCACT 

AAAAAGATCCTTGACTCCTCTTTTTCTTIT^ CAGTGGG CCAGGCTGG AGTGCAATGGCGCATCTT A 

GCTCACTGCAACTGCTGCCTT CCG<K3TTCAGGTGATTCTCCCACCTCATCCTCCCAAGTAGCTAGGACTACAGGCACGTG CCACC A 

CACCCAGCTAATTTTTGTATTTTTAGC^AAGACGGG

CACCTACCTTGGCCTCCCTAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCTGGCT 
TCATCCATCAGCAAATCCTCTTGGCTCTACCTACCATGGTTATTAGAGACCTAAAATTOT 
ATATATCTTAATAGTAATCTTATCTAAATCACATTTCTCAAGGTTCCGGGGAAAGAA 
TGAATTGTAAACTTTTACTTCCTTTTCATAATTTGTCTCT 

TCAAGGAAATTTTGATTTTTACTCTTTATATGAGGATCCTAATGG

TTGTGTCAGTTAAGATGACCATCTT'GTAATTTTTATTCAAAGTTTT 

TTGATATGATTATGATGACTGATACTTCAGTCTTTAAGTTTAATTGCAGGTCTTGGCTACATTC 
TAGAAAATCAAATCTGACTTACATAAGGTTCTAAATAGCTACCAGTTACTCTTCCCCCATCC 

TGTCAACTTT AG TGGTTGGAAG AGG CC CATC AAAATG CTGGCTGT ATGTGGTTCTCT CG C CT CT CAG T ACTTT CAG CCACATTTG A 
ATGCAAAGAATGCAGACAGTCAGCAGTATTTTTAAATGTGAAATTTTCAACAGACT
CCTTTCCCTCTCCAAATCTCACTGAAATGATAGATGGAGGGAACAAAGATTTTCAA 

GAAAGGAGAGGAATGTAAGAGTGGATCAGAAATTATGGAGAATTTCTGGAATATGTAAAGTTGATGAAGTTGAAC 
AGTCTCAGCAAATCCAACCCGAGACAGCTGTAAAGAGGACTTAAG 

TCCAAGCTCATGGTCATCTGGTGTGAGGGAGCCCAAGCAGGCTATGCAGCAATCATCAGGGT^ 

GGGGCCAGTAAACCTCCAGTGCAGTAGTTCTCAACTTTTCCTGAATTTAGA^

GACCAGTTAAATAAGAATCTCACATCTCAGAAGATGGAACTCC^ 

GCAGGCAAGGACTGAAAACTACTAGACTACGGTCTGACTGTGACTTITTCTCCCAA 

AAAATAGAAAAATAATAAAATAAATTTAAAAATAAATAATAAAACAAAATGTGCCTCAAAAGACT 

AGGGAAAATGAATGTATTTTITGAAACTr i T l TTrrr r rri"r'l l'l' 1 T T TTTGAGACGGAGT CTTGCTCTGTCACTCAGGCTATAGT 

GCAGTGGCGCGCTCITGGCTCACTGCAAGCTCCGCCTCTCGGGT^

ACAGGCACCTGCCACCACGCCCAGCTGATTTTTTrrCTATTTTTAGT 

TCTCCTGACCTCGTGATCCACCTGCCTCAGCCTCCCAAAGTGCTGAGATTATAGGTGTGAACCACCA 
TTTTAAAACAAATTTAAAGGGGAAAAAAGTGTAAAGACAGCTTTTC^ 
TTTCCTGTGCCCATCATGCCCAGCTCATAGTAAATGCTTAAT 
ATCCTTTTGCAGAAAGGCCTGCCATGGAAACATATCCAACTGCX
GAATCCTTCCTTTATAATACGGCCAGACAGCAAAGAATC^TCAGG
GAAGTAGAAGAGAACCTTAAAAATACTAATTATTGTCTTC^ 

TCCCTGAAAAAGGAAAGGGAGGACAAGAAAGTTTCTGGAAGTT AGAGCATG ATTAAT CAAAACTGAAACTCAG AGGAAGGGTTG AA 
GAG CAAAATGGACAGT C C CTGAAATCAGGTG AT CTGAAAGAT AG CCAAGGATTTCTCAG AACACAAGAAAAAA^ 
TGAAAAAGAAATTAGAAGATATGGATAATAAATCCAGGAGAGGTAATATGCATCCAATCAAAC^
GGAATAAAAGAAAAAGATTCTTAAGATCGAAAGGGCCAGTTGATAGTGGTACCCAGGAGGAAG 

TCCTGGTGAAATTTCTGAATT CTGCAGCTTACAAG AAAATT CTG AACT CTTC CAGGG AAGAACAAG CT ATGT ACAAAGG AAT AAG A 
ATCTTATTCATTACACTAAAGACAATGCTTTTGA 
ACTGTCA.TTTATGTATGAGAACAAAATTAACTTTTTGGA^ 
AAAACAAAAAAATAAGT CAAGAAGCAACAACTCAAGG CAT AAG AAACAGT AG CGAGAAAAAAAAAAT AATTT CCATATTGGAT ACA

TTTATCCTGAAAGATCTCTGAAGCAGTAGAGCTGTAATGTTAGAAGTA^ 
CCACACCTGTAATCCCrTACACTTTGGGAGGGCGTCACTTGAGGCCAGGAGTTCAA^ 

CT CT AGT AAAAATACAAG ATTT AG C CAGG CATG ATGG CACATG CCTAT AATCC CAG CT ACT CAAGAGG CTGAGG CAGGAGAATCAC 
TTGAACCCAGGAGACAGAGGTGGCAGTGAGCCAAGATTGTACCACTGCCTAGGTGACAG 
AATAATAATAAAACAATAATAACAGTCTGGATTCTGG^
AAATTGTCTAAGATTATTTTCTTTTTTAAAACATTT^ 
GTTGTt3GCTCACTGCAGCCTCAAArrCCTGCCCTCAAGCC^ 
CCCAGCTAACTTTTTTTATCCTTTTAGAGACGGGGTC^ 

ATGCTGATTAATTCITTTATTTTAAGATTCAGGGGG

GGGCTTCTATTGAACCCATGATCCAGATAGTGAACATAGAATATAGCCCTTGTACCCC 
AGTGTCTCTTGTCCTCATCTTTATGTCTCTGTATACT 

TGTTTCTG TGTT AATTCACTT AGG TTACTGG C CTCCAGCTG CATCTG TG TTG CTGT AGGGGACAT CCTTT CATT CfTTTTT ATGG C 
TGCATAGTATTTCATGGTATATATGTACCACATTTTCTTTTTTTT^ 

GAGGGCAGGGCAGAGTGCTCACTCCTrTGGCAGCGGCn^

TGCAT CCCCACCCTTGTCTTGATGAACTTG AGGG CCCATTTGTCCTTG<*AGACC1TCAGT AACTCCATGGTGCACTOCTCGTATGG 
GGTGAAGCCACACAGCTCTCAGATCATGTCCGGCACGAACTTGGTATATTTCGTC^ 
TCCTCATGTTCTCGGTCACCrTGTGGCCCTTGTTGAGGCCCATGGCCATAGGGCAGCACAGAGCCATGG 
TGGCCGTGGCAGAAGGAAGCCCACATTTTTATCCAATCCACCATTGATGGACACCTAGGTTGACTCCG 

TATAACTGTGCTTAAGAGTAAGTGCTGTGTGTGGCTC^TGCCTGTAATCCCAGCACriT

AGCCCAGGAGTTTAAGACCAAACTGGGCAACATAGTAAGACCCCATTTCTACAGAAAAAAAA^ 
GCCTGTAGTGCCGGCTACTCAGGAGGCTGTAGTGGGAGGATCATTTGAGTCACATTTATA 
AGCGAGACCCTGTCTCAAAACAAAAGAGGCTGGGCGTGGTGGGACATGCCTGTAATCCT 
AT(IATTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCTGCCrCT 
GGTGGTGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCrGAGGCAGAATTGCTTGAACCTGG

ATT CTG CCACTGCAGTCCAGCCTGGG CAACAG AGTGAGACT CTGT CT CAAAAAAC AAAC AAAC ACACAAACAAT GCTGCAGT AAAC 

ATGTCAGTGTAAATGCCAGCTATCTTTCTGATAGAACGATTTATTTTCCTTTGGTTGTATACCCAGTAATGG^ 

ATGGTACTTTGATTTTTAGTTCTTCGAGAAATTTCCAAAACTC 

TTTAAGCATTCCCTTTTCTCCAC!AACCrTGCCAACAT 

TTTGAGACGGAGTCTCATTCTGTTGCCCAGG CTGGAGTGCAGTGG CGCG ATCTTGG CTC ACTGCAACTCCCGCCTCCCAGGTTCAC

GCCATTCTC CTGCCTCAGCCTCCCGAGT AGCTGGGACTACAGGCG CCCACCACCACGCCCGGCT AATTTTTT AAAATATTTTTAGT 
GGAGACGGGGTTTC71CCGTGTTAGACAGGATGGTCTCGATCTCCTGACCT 

ATTAAAGGCATGAGT CACAGCGCC CGGCCTTGACTTTTT ATTAATAGTCATTCTGACTC1AGTTGTG AG ATGGTATCTTC^ 
TTTGATTTGCATTTCTCTGATGATTAGTGATGTTCGGCATTTTTTCATGTTCGTTGGCTGCTTCTTC 

T AATGGGGCT ATTTG CTTTTTTTCTTGTTGATTTGTTTAAGTTCT CATAATTTG CAAAT

GTTTTCTCCCATTCTGTAGTTTGTCCATCTGCTTCGTTGATAG CTTATTTTGCTGTGCTGAAG CTCTTTAATTAGGTCCCAACTGT 
CAGTTTTTCTTTTTGTTGCATTTGATTTTGGGGACTTAATCATAAATTATT 
TTTCTTCTAGGATTTATATAGTTTCAGGCCTTACATTTAAGCCTTTAATCCA 
ACGTCTXX3TTTCCTATTTCGATGCCTTTTATTTCTTTCTCTTGCCTG 

GCTCAGTACCATGCTGAATAGGAGTTGTTAAGAGTC^ACACCCTGTTTTTTTCCAGTT

CGTTCAGTATGATATTGACGGCTGGGTGCIAGTGGTTCACACCTGTAATCCC^ 
AGCCTAGGAGTTCAAGACCAGCCTGGGCAGCATAGTGAAACCTTGTCTCTACT^ 
TGCACCTGTAGTCCGAGCTTACTTGGGAGGCTGATGGAGGA 
GCACCACTGCACTCCAGCCTGGGCAATGGAATAAGACCTTGTCTCAA 

AGGTGGCTCTTATTATTTTGAAGTATGTTCCTTTGATGCTTAGTTTATTGAGGG'r

GAATGCTTTATGCTGATTGATTCTGAAOITTGATAAAGAGGTATATATATATATCT 

GTACTAGTAAGAATAGCATGTAGAATTTCCAAATCACCCCATGTGGAGTGGGGAAATAGAGGAACAAGGAAAACTTCAT 

G CAAAAATTGTTTTAAAAAAT AAG AAAACACGGGCCGGGCACGGTGGCTCACG CCTGTAAT CCCAGC ACTTTGGG AGGCCGAGGCA 

CGTGGATCACGAGGTCAGGAATCGAGACCATCCTGGCTAACATGGTGAAACCCCATCTCTACTAAAAA 

CATGGTGGCGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCAGGAGAATGGCGT

ATCACACCACTGCACTCCAGCCTC^GCAACAGAGCGAGACTCCATCTO^AAAA 
AAGAATAAAACCACTCATATCAGTGATTACAAAAAATATGAATGGGATAAAGTCAC^ 
GAACAACAACAGTATCTAGCTATGTGTTCTTTATAAATGAAACACT 
TTACGTCTGCCGATATTGAGATCTGATGGAGCTGTGAGACTTTACATTTTTGTTTTT 

CTGGGATATAAGTCTAGGAATAGGAGCAAATTCTTGAAACCATTTTATTATATGTTTTCACAAAATAATGTAA

CTAAAGTGGCCAGTGTTTATTCTTTCAAATATTTTGGATTTATATAATTTCTTTCT^ 
GAATAAAATCATAGTAGAAAATAAGTCITTCTGGCCAGCCATGGTGGCTCACGCTTG^ 

GGAGGGTTGCTTG AGG CCAGG AGTACAAGACCAGCCTGGCAACATGGTG A CATTCTTAACAACAAGAACAACAAAATAGCC 
AGGCGTGGTGGTGTGTGCCTGTAGTCCCAGCTACTAGGGAGGCTGAGGAAGGAGAGTAGCTT 

TGAGCCATGATCACACCACTGCACTCCAGGCT

CGGTGGCTCACACCTGTAATCCCAGCACTCTGGGAGGCCAAGGCTC^ 
AACATGGCGAAACCCCATCTCTACTAAAAACACAAAAATTAGCTGGGCA 

CTGAGGCACAAGAAGTCCTTG AACT CGGGAGGCAGATGG TTG CAGTAAGTGAAGGTTGTAGTGA^ 
CCAGCCTGGGCAACAGAACGAGACTCTGTCrCAAAAAAAGAAAGA 
AAAAAAAAAAAAAATTCCAGCCAATTGGCCATTTATTTTAC^
TAGCTTC CT AGAATTT ATTAAAAT AGG AAT ACT AATAG ATT CAAAAAGG AAAATTTGT AT AAATT CTGT C

CTTAATGCCCTATTTTAAGATTAGTCATTAAATAAATATTTACTTGCTC 

GAAGTACAAGATACTGCCTTTGCCTACTAAGAATTTTATTCATACTTTGAC^ 

ACAG AGCTCTAAAACATG TTT AAT CTGGT AT AATTT ACCACTAGG CACAGGAAAAT AATT AT AGCT AACATTCCTTT AGTTTG TCT 
CTGTATATCCACACTTAACAACAGCATTTTCACATGATAG^
ATTAAAAGCTCTAATTCAGTATCTAATTTAATCCTCTC^ 

AACTGAG CTTCAG AT CACTTT AATT AC CTTGCC CAAAGT CAT AATTGA^ CAGAATGGG ACTGG AAT CCAGGTCT AACTCAAG 
TGCTATAGTGGTGTGAAGGGAAGACTGGTAAGGGTTGGTGTTCATGGAGAATTTGTTGCT 
CTGTTTTGAAAACATTCAAACTTGATCAACCAAAAGAAATCATTTGC^ 
GTTAGCATCTGTCCGTCAGTTCrGTAAGCAATCTTACAGTGGTG

GCGGACAGCTTAGGG^AAATCTGACCACCATGCGTGTTGGAAGTATTTCTCTTGT^ 

CTTTTTCAGGGGGTTATTAAGATCCTACTCTCTTGGAT^ 

CATGAATTAGGCAA.CAGAGAAAACTCTTGCTCAGTGGGAGTAACCTG 

CGAGTAGTAATGCCATTACAACATTCCCATTC^GATCCTCATGACCCAGTAATTGTTTTGAG 
CAACTACTATAATTAGAAAAGGGAAAAATAGAGACTTACCAGAGCTAATGTCTTT

GTTTTTTAAGTTATGGAGCCTCTTTCTCAGTTTTGCTTCCTAA 

TTACTGCTGTGCACT CTCIT l'C CTriTlT CCCCCCT 

CCACATTCAGAACAGGGTGTTCTGTCCTCTTCAAAA 

AGTGATGTTTTGCAGCTCGCTTGATGTGGTTACACTCCGATTC 
GATTTGGAGGACCTCCCTCCCCCTTTAAGTTACATAGTCTTTAAGTCAAGAAAAAAA

TTCTCTACCAAAAAAGCCCTAAGTAGAAGCATCCTGTATGTACrrTTATTCT 

AATCATGCTGGTTGGGATTAGCAACAGACCTCCTGAGCCCTACAGGCTX3 

TATTAAGTTAGTATTTGCACTAAAAGATGCCAGGTCAAAAAGCAACCACAG 

TGTTTTGTTTTGGTAGGAATTTTTAAGATAACATTAAAAAGAATAAGTGGAGCTATAGG^ 

CAATCAGGGCCCCTACTGTGGCAATCAGCAGTCTTTCCGTGGCT

TTCACATCAGTGTTCCAkGTTTAGAAACGAGTTAACCCA 
TCACCATGAGGCTGTGATACCrCCTTTTACTCTGACATTCCGAATGTT 

AACTGGGGTGTATATGTCTGGATGTATATCCGTTTAAGATTTTATAGTACTGCAGGGTCCAATT 
AAAAAAAAGATTAGTCCACATGTAAAGTAGAAAATATTTATAAGAGCTATCACAAATGTTTACTGT 

TAGTTCTGTCATTTATTGTCTCTCTGAGAAGACTAGCTACAATGAATGT

TTTTTTTGGCATGGTTACTTCATTTATCTTTCCCAACATGT^ 
ATCTAAAGAGAACTTCTGAAATTACTCATTTAATTTTAATATC^ 
ACAGGCAGGAATCCTTGCAGAGTCTCTGGATTTCCTC 
GATAATTTCCTCCCCCACTAAATGTACTCTCCTTAAGGGAGGAAGATGAT^ 

TAACTAAG TTGTG AGCT ACT AGGACAATAAAT CTCT AT ACTTGAGACATTC CACAATACATG AGAAATTT ACTTTG CT CAG CAAAT

CrTTTTTTTTTTTTATGAAAAGTGTTATTAGTCTAAATTTC 

CTTAACCTCCTTAG AAATATCAATTTTTACT CCAGT^ CTAG 
ATTACAGTGCCGCAAT AACAACTCACTGCAGCCTGGAACTCCTAGGTTCAAG CAATCCTCCCACCTCAGCCTCT CGAGTAGCTGGG 
ACTTATAGGTGTGCGTCACCACCCTCAGCGAATTCAGTAAAAATATCTTAAGCCTCAGACTCCCTGTA^ 

TGTACAGTCATATG CTGCAT AACATTTTGTTT AACAG CAG ATTGCACAT AGG ACAGTGGTCCCCTG AG ATTAT AATTCCAT ATCTT

TACCTTCTCAATGTTTAGATACACAAGTACTTACCATTGTGTTACAGTTGCCTACAA 
TTATAGCCTAGGAGCAATATGCTATACCACTTAGCCGAGGTTATAGT 
TGTTCCCACAGTGATGAGAATACCTC^TGACAGATCTCTCAGAATGTATCC^ 
GTTTAAATTTGAATTTATTTTTTAAATGTTTTCCAATTCAGAATGA^ 

TAAAAGTAAGTCTAACAGAAGGAATTATCAGCATTTATGAGTGTGGGACATAAACTA

CTCCAACTCTAGCCCTTGTTTGGAGGAGCTG ACTCAAG AGT^ CCCTTTCTAGTTTGG CCCAAGTTTCTTAG ACCCCATTCTGT 
GTCTGCAAGGGGAACACATCAAGATGGTATTAATACTGGTGAGTTAAAAAAGAGTAGTGAGTACT 
AAAATTGGAACAATACAGAGAAGATTTAGCAAATCGAAAAAAGAGAAGAAAA 
ACATTTCGGCCCCTAGTGTCTACACCATTTCTGTTCATTCTTCA.GTTTTTCT 

AAATT ATTAACTGCAGGAATTAGATTTGAAGCTGG TTT CTC^ACTATACTGATATTGAGAGTGATA^

AGAATTTTAG AG CTTCATTG AGATCATGT AGTTCAAAAGTAG AATAAG TTAGATTTGAATTTTT ATTTGGAAGGTAAT AGTTGTTT 
TATTGGGCATGGAATGAGTATAACAGAACGAGTGATTTTTAGTCTCTCTACAAGTTGTTC 
AAAAGAAGCATTTTAGGAAATCTCTAGATTTGAATATAGGCTATTATGTTACAGAAGTAGGAATAAAGG^ 
TTTTTATTTATTTATTTTTTTATTTATTTTGAGATGGAGTCT 

TCAGTTCACTGCAACCTCCGCCTCCC^GGCrCAAGTGATTCTTGTGCCT

CACCTGGCTATTTITTGTATTTTrAGTAAAGATGGGGTITTG 
CACCCGCCTCGGCCTCCCAAAGTGTGGGATTACAGGCGTGAGCACCGCGCCCAG 
ATTGCTTGGTGACTGTTATGCTGTCATACCAGCCATCCATTTTTCTTATTACATCAAAAACTTAAAA 
AAAGTAGTTGTAAAGITTCAGAGAATTGTTTTTCCTAGAAATAAAGA 

CGCTTTGGGAGGCTG AGGCGGGTGG AT CACCTGAGGTCAGGAGTTCGAG ACCAGCCTGGG CAACATGGTGAAACCCTGTCTCTACT

AAAAAT AACAAAATT AG CTGGG TGTGG TGGCAC ACACCTGTAAT CC CAG CT ACTTCGG AGG CTG AG ACAGG AG AATTGTTTGAAC C 

CAGGAAGTCGAGGCTGCAGTGAGCAGATATCACGCCACTGCACTCCAGCCTGGGTGAGA 

CTAACTCATCAAGAAAAAATAGATATCTGTATGCATTTTTTCTATTTTGTCTTA 

AAATT ACTCTGGGAAAG AG CTTGGTAAATTGGGGTGGAACC^GGG AATT AGGCCTCCCAAAAT^ 

GTTATTAAGTAATrTGATTTGTGAAGTTATAGTAGACTATTGATTCTTTCACACT

CTAAGATAGTTATTGATGTTGTTGTCTTTGGATTTACCAT^ 
AAGGAGCATCTGTATTCATTTAATCTTTGTATATCATTC^ 

AGATGCAGTTACAGATGTTTATATATTTGATATTAAATTTGTGGCATTTGATAACTTTAT 
GTATTATAGGTGAGGAAACTTCTGTCACATAGTAGGCTTAC^TAAACCTGTGTTTCTGGTTTGTTACC 

GATACAAATATCAGTAATTACAAGTTTAATTTAGAAATATGGTTGCTC

CACACTGTGTGTGTCTCTTC^TGAAATGAGAAAGGGGAAATGGTATTTGCTACATTGTC 
TAGCCAGTCCTAAGAAGC^CTCTGTTGACTCTCCTCTCTCCTTGCC 
TGACTGCAGAGATGTGGCATTCACTGTGGGCGAAGGAGAAGACCACGAC^ 
AGCC^GAAGAACAATGTATTTTATATCTTGGACCAAGGTAAGCAGCATACC^ 
GATATGCTTGC1TTTTCATTCAGTGTTTTGGCATTTT
TCATCACGTAAATTAATGGGCCTTACAATAT

CAGCCACTTTTTGCTTTTGTGATGATTTGTTTTGCCTG 

CTTACCCCCTCTTCCCTTTTCTCTAAGTTAGGCAGGAGXSCAGGAAAACAT^ 

CTACCACTTGAACTTGTGCCACnTGTTCTTATACAGAGTCCCTGAAAAAA 
TGACT CATTTTTTGTCT AAGTTTTTCCCATCATAAGGAT AT CCATT AGCA.CTTT CAAG CTT AG AAC (^TCAGACACTG ACAG TCTC

TCTTKJTG AAAACATGTTT AGT AGT ATTTTT AGATCTAA CTT CTGT CTTGAGAAACAG ACC CCTG A 

GAGAACCCGGGGTGTTTrGATTTTGTCTGCATGAGAGTC 

CAGTGGGATTCTGAGTTACTCAGAATCTTGATGACCTG^ 

CTCTGTAGTAATCCAATGGCAGTTTACCTTTGC^ 
TCTGCCACCCAGGCTAGAGTGCAGTtBGCATGA

CTCCCX3AGTAGCTGGCATTACAGGCACCCGCCACCATGCCTGTCTAATT^ 

GCCAGGCTGGTCTCAAACGCCTGACITCAAGTGATCCTCCCGCCTCA 

GCCCAG C C CATAATCAT ATTTTTG AGGTGATCT AGCCATGCAACT ACCACTTGAAAAT 

TCATAGCCGTC^CTGAGGCACCCTTTGTATAATGGCTTCTTTTTCT 
AATGTGTCTTATTTATATTAATATTTACrrCrATATTTGTAGTTTAl

AAAATTC^GATAGAGCATAGAAAAAGCT AT CTCTTCCAT ATCCAAA TT T T T T TTGGT AACAGG ATCTC 

ACTCTCTCGCCCAGGCTAGAGTCCAGTGGCACAGTCACGCTCACrrGTAGCCT 

GTCTCTAGAGTAGCTGGGACTACAGGTGTGTCCCACCACACCC 

TGCCCAGGCTGGTCTCCAACTCCTGGACTCAAGTGATCCGCCTGTCTCACCCT 
CACTGCGCCAGGCCATCTCTGATTACTTTTAATCAGCATT^

ATGTATATTTTCAAAATTTAAGCrrCCCTrAGGATAAATTCTGTATGTATTATGACAACAT 

ATTACATTTTAGCCTGGCTGACAAGGCAATTGTGGGCACAG 

TTTTTT AAOGCAAACftG TTCTrTT CAGATTTAA^ 

CACTTAACCCTCCTAAGCTTCAGTTTCCCCTCTGTAAATGGGGATGACAGTAT 

TCCTGCACATATAGGGCACACATGAGCCCGCAGTGTGTGAGCITT

TCGGACCACAAGAAAAAGGAAATCCTGTTTAACATGAGATCAATCTTTCAGATC^ 

ATAAATTTCTAGGTATAAAAGTTTATGAGTTTTATGGAAATAATTTAGCATATTTAGAAA 

GGTATGTTGGAGAATTATGTGAAGTGCATTCTCTGTAATATGAGACT 

AAACCCCTACGTGCTATATATGTGTGTGCGTATGCATACTGTGTGTATAGAAAACCCCCrACATGCTATATATGTGTC 

OITGCTGTGTGTGTGTATAGAAAACCCCTACATCCTAAGCATGTTTTCAGTGOIGGCAGTG

CACTTTCATAAAACATTGCATCTAAATGTTGCTATAAGTATGGCTGTTGCAGAAAATATCrc 
ATTTCATTCTCCAGAAAATTTCCAAGGCAGACCrrCTCTGTGACTCTGA 
GTCCCTCCCCCACCCCCCGAGACAGGGTCTCACTTTCACCTAGGCTGGAGTAGGGTG^ 
TCAAGGGATCCTCCCACCTCAAGCTCCTGGGTAGCTGGGACTACAGGTGTGCTCCAT 

GAAACAGGGTCTTACTATGTTCTCCAGGCTGGTCTCAAACT

GG ATT ACAGGTG TGAG CCACCACAC CTGG C CT AG AATGT CTT CAT AAT AAG AAAGTCAT AAAACATTT CCAG ACAAT CCATTTTTC 

TTTCTCCTCATTGGGACAATATCTTCCCTTTGTATTCnTTTAA 

ATAGTGTTAACTTTTTTTTCCGTCCTTTCCTTATGTTT^ 

GGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCG<K5AGGATCACT 

ACATGGTGAAACCCCGTCTCTACTAAAAATATAAAAATTAGCTGGGCGTGGTGGTGCATGCCTGTAATTCra

TGAGGCAGGAGAATCGCTTGAACCOWSGAGGCGGAGGTTGCAGTGAGCTGAGATC^ 
TG AAACTCTTGT CT CAAAAATAAAAATAAAAAAACAG ACTACCAAG AGCC2VT AATGTCT 
ACCTGATTTTCTGGGGACACACAGGGTCTGAATTTAAACCTGGCT 
TCTTTATCTGCAAAAAGGGAGCAATTGGTAAGAGCTTGTAGTCCAA^ 

AGTCATTATTTGTATCrCTATTCTCrGGAGGAAACTCTTAATTCAAGG^

TTATCTTTTGGTTAGGTCTTGGGATTCCTCTTGTGGTGTTTTGTT^ 

CAGG CTGAC AACITT CTGTGT AAT ATGGAAGT CAGGGT AGG TG CTG C C AG AAT ATCTTGG CGTGGG CTTCC ACCT CTG CAG GAT CA 
TGTGCGCCACACGTAT CTTGACTGAAT CGCTTCATTTTAACCTGGGAG AGATTGT AG AAATAAT ATTTTTCTCT 
TCCCATCAATAATAGGACACTTGGTGCCTCCCTTGATACAGCCCCACATTCCCCACAGGCAGATTTT 

ACACTTTGGGCTTGCTAATC CTTGGTCATAGGGGGCTGTC!ACGTGCATTGTAG AATATCCAGCAGT ATTCCTGGCCTCTTC CCACC

AGATACCAGTAG(^CCCCCATAGTTGTGACAACCCATAATATTTCTAG 
ATCAACCTGGTTGAGATCTGCTGGTCTAAGGCAAATGCCAGATCA.TTTG 

AACATGCTCTCAGAAAATATGTGCTTAGCAAGGACTCACCTACCTACCTGGGTAGTTAAGGAATTT 
CAGAAACAAGG T AG T AGGTAT AT ACAT CT AAGAG CTCAAAGT AG AAAATT AGG CAT CACTT CTT AAAATTC4G 

GTCTTTGTGGAAGATACCCTTTGCCCCGCTCCrCATTCCTA

TTCTGTGTAACAGTGGAGGTGAGTGAGTAAGCATTTTTATTCCrGGAAAATCC^ 
AAATTTGAGTGTTCCCACTCTAGTCCCTTTGCCCCAGAGAAAACTGGAGGTATTAGATT 
TACCTTGAAAGCTTGAGTTTACTAAGTTTTTTGTAATACCATTTCATATATTAAAGAAC^ 
AATATATTTCAGTGAACACTTGACTAATATCCrcGG 

TACAG CrCC(2ATTCTAGTAAATATCrGCTTCCCGAATACATTTAATTTG CTGGAG ATCTT ACATGATCCCAAG AAAACAAATGTAT

ATG T C CT CAGTT ATTTTTTTC AG T AAGTTTTGGCT AAAAGGT AT AG AAT CCT ACACT CTG ATAG CT AT AAGCCAT AAGAAT ATTT C 
TCTAGAGGAGGTGTCAGAACTTGCTAGGGCAGCTATTTTTGAGTATATCATTATGTTATTTGTG 
AACTTAAATAGTGTGAAAACTTTTCTCTCCTGATGTGTGAGCCTTTCCCCTTCACT 
GACAGCACTTTAAAGTTAAAAGCTTATTAAAAC^TACCTCCCGTGAGTACCACTCTAACAAT^ 
CTTTTTGCTGATAAAACrrAGGGGAAAATTCCTGGCtAG

AGTAGAGAAAAAGACAGTGACATTTAAAATCCCATGAATCAAGCTGCCCCAAGG 

GGAGAAAAG AAAACT AAACCT AG C CTGGTTT CTAAG AAAACT GT AG CAGTGAAGATACTT CACACTGT ATCTT CTC^ 
C^TTTGTCTTCAC^GGTCTCTGACACATGT AG CTTGCAAATG AT AG CTCT 
TACACCAATCCTTTTAAAGTTTTATITTTTACTCTCCATTTCAGA 

ACCTAATGCTGAGCTTATATATGAAGTTACACTTAAGAGCTTCGAAAAGGTGAGAAAATACCAACTTCCT^

TTCTTCTCCATCCACTTCCTATAAATCAGTATCTTATCTTAG 

TCTTGCCTCCAACACTGCTGCTGAAGAATAGAACAACTGACATAGCACAGCTGATATACAACG 
GTTCCTATGTGTCAGCTTTTTTTTTTTTTTTTAACT^ 
CTTTTTGTACTTTTTTTTTCCTGGGrTGAGGACAGAAAGGTTCACGT^ 
TCAGAGCTGAACXKUITTGTACCCTTCACAGGWSTC^^
CGAGGGCCACTAGGCTCTGTGACCAGAGATACAAGAAAGAAAGCAG
AGAAGAGAGAGAATTTGTCAGTTGCCTTGGATTCCAGAAACAG^ 
GCTCCCAGCAGCAGCCGACCTGGACACACACAGCATATACCT 
CCAAAGCCCCACCCTTTGTGGAGCTCAAGGACTGCATTTGTGTTCCr 
TTCAR CTT T TTT TT T 1 TTGAGATAGGGTCTCACTCTrGTCACCCAGGCTGGAATACAGTGAGCG CGATCATAGCTCACTCCAGCCT

CAACTTCCTGGGCTCAAGCAGTCCTCTCACCTCAGCCTTCCAAGTAGCT^ 
TGTTTATTTTTTGTAGACATGGGGTCTCACTGTGTTGCCCCAGCTGG^ 
CTrCCAAAGTGCTGGGATTGTAGGGGTGAGCCACTGTGCCTGA 
ATTTTGTTAGATTCAGAAAAGAAATCATTGTCCTGTGGGCCTGTACCTTTGGG 
CACCCCCAGTAGTGACATTCCTAATCATACCTCTTCTACTTCATGTTCCCCCAAACA^
TTGTCATGTAACCTCTTCTGAAGGCCTGCTAGCTCAGCACTGATCCCCCAA 

T AAAG AGAGAGAAGAT AATCTTTAGCATGTGGTT ATTTAGACCT ATGG CTTT ACTATTT C CATTTTTGTAAAGOVGCCTCTTGCCC 
CTTCCITCTCX5TGGCCCMTGGACATTGCTTATTTAGGAGCACT 
TCCAAGAGTATATTCATGCACTGCTGCITGGTATATTTGAAATCACATGAA^ 
CCTCTCCCTTGTT AAGAATAGACT AGGCCAGG CTGGGCG CTGTGGCTCACGCCTGTAATCCTAG CACTTTGGGATGCCG AGGTAGG

TGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAAACCCATCTATGCT 
CGTGGTGGCAGGTGCCTGTAATCTCAGCTATTCGGGAAGCTGAGACAGGTGAATCGCTTGA^ 
GCCGAGATCATGCCACTGCACTCCAGCCTGAGAGACAGAGCAAGACTCCGTCT 

T AT AAAAAT AT AT ATAT AT ATAAAAAT AT AT AT AT AAAT AT AT ACIACACACACACAT ACACACACAT ACIT AT ATT CCAGG CATGG 
TGGCTCACACCTATAAATCCCAGCACTrTGGGAGGCCAAGG CAGGATTACTTGAGCCGAGGAGTTCAAAACCAG CTrGGGCAACAT

AGTGAGACCCCCCTCTCTGCAAAAAAATTTAAAATTTTCCTGGGTGTGOT 
AGATAGGAGGATGGCTTGAGCCCAAGAGTTTGAGACCAGC 

AAATTAGCTGGACATGGTGGCACATGCCTGTGGTCCCAACTACTCGGGAGGCTGAGGCAGGAG 
AGGCTACAGTGAGCCGCGATCGTGCCACTGCACTCCAGCCTGGACGACAGAGCAAGATCCTGTCTCAAAAAAAGAAA 

AGTATCCTGTCTCAAAAAAAGAAAAGACAACAGAGCAAGATCCTGTCTTAGAAAAAGA^

GAAGCCAACCTAAGAAATACAAAGAGACTTATTAACTAGGTGTTCTTTGTTTTTGTTTT 
ATAACTAGGTTTTTGAGTAATTTGAGCTACATTATTAGGAA 
TTCACTTTTAAAAAGTAGAGGATAGTCACTCGGCGCGGTAGCTCACGCCTGTAA 
GGGAGTTTGAGACCAGCCTGACCAATATGGAGAAACCCCGTCTCTACTAAAAATACAAA 

CTGTAATCCCAGCTACTCCGGAGGCTGAGGCAGGAGAATCACT^

TTGCACTCCAGCCTGGACAACAAGAGCGAAACTCCGTCTCAAAAAAAAAATAATAATAATA^ 

GGGGCTGTGACACCAGGAACACACACTCGTTATCTTTTGAATAGAGAGAATT^ 

AGTAAATAGCACTCACCACACCCACTGTCACTTTTCCT 

TGTAAATGGCAAGTGCAGTTATTTCTGGATTrcTATCTGTAATGGTGGAAGTAATAA 

TGAAGTCATTTATTTATGGCTCTAAAGTGATATTTTTGCATGAGCTTTACTCC^

ACATCTGCAAGTTGATACAATTTAGCAGTATTATTAGTATATCTGGTTGTTTTAGAGTGGCA^ 
AAATTCTGTATGAATATACAAACTACAACTTGACTACAGTCATGTTCCCA 
TAGTGGAGAGGTTTGCAGAGGGAGATAAGGCATGGCCTTAACTTTGTGTGGGTTCACC^ 
AGTGCTTACCAGAGTACGTTCCrrACTCTGTTTTTGCGGCTCCTCT 

GGAAAGTGAGTTCTTCAAACAGTATGAAGAAGCACTGGCCTAGCAGTAAATTGG

TTCAGTTTGGAGTATCTTCTAGAATTAGCTGCTCTTTGTTCCAATAGGCCTCA^ 

GTCACAAATTTACAC ACAGGCCAAAG AATC CTGGG AGATGG AT ACCAAAGAAAAATTGGAGC AGGC^ CCATTGTCAAAGAGAAGG 
GAACCGTATACTTCAAGGTGTGTGAGTCCCTATGCAATCCTTTCCATCTCAAAGAATTTATTCAAGTG 

ATT CCCT ACT CTGLAATTTTGTGAACACTGCTTGGT AAG AAT AAG CTG CAATTTTT AT AAATCTGTCA 1 ri i" CAATATGAAAAAT GG 

TGAGAAAACAAT CATGTGG AAAAAAAAAAAAGT ACAGATT CCTCTTTT ACACATT CTG CT CT AG ATTGT AACCTTC CATACAGG CA

GACCAGGTCTGGCATCCCCACCCACCACAGTATCACTCACTCCTAAAG 

TTTGGAATGAAAAAAGAATGAATAGAATAATTGTTAGCAAAGAATGGAATTCATAAG 

CTTTTTTTTTTTTTGTATGAGAGACAGGGTCT 

TCAGCTTCCCAAAGTGCTGAGATTACAGGCGTGAGCCACTGCACCGGGCCCATCTTCCTTTAG 

GCATATGATAAAAGGAGAGAGGACATGCACAAATTTCAAAACAAACTGACAG

TCCTGCACACTTG CTTTT ATG AGGAATCTC CGG AT ATTCCTG AATGATATTCATCAT AAATTCCTT CT 
CAACTAGGAGTGTGATATTTTCTCACTTAGATCTTAAACCTCTAGATCT 
AGTGTTCTACCCATTGCTTCCCCACCCCAAATATATGCCTTCTCCATGTCA^ 
AATGGGAGTTCTOTCAAGGTTTCTTTCGTTTTCTGCCACT 

ATAAGTGTACTTTAACAAGTCTTTATGGGGCATCTCCTATAAGACATGCTA

GCAATGTT AAT CCAAATG AAAGAACT AGAGAATT ATGT AAGAAT AT AT AAT CAACC CAAAATT AACACAG C CAG AGGG CAC C ATGT 
GTTTTCAGAATTTGAGCTGAGACAATATTGGGTCATGAAATCAGTTTTGTGGA^ 
GTTGCATAAAACTTTGCTTGTGTATGAGTAAGTACTGGATTGCATATTTC^ 
GGGTTTAGATAAGTATAAATATATAAATATGTTATTTATCATTATTTACTATGTATACTAAA 

GGGGGTATGATAGAATAAGCTCAGAATTGGGAAAAATGGGCTTTATTGCT

TATATTTTCAAGTGAAAAATAGGGGAGTTAGATGTTCCCCAAGTTTCTT^ 
AACAAGTGTTATTTTTTTGGTTTTGTTTTGTTTTTAAT^ 
AAGTGGGTTATTTGTATATCCAAATACATAATACACTAGAGGGAAAA^ 
AGTATTTTCTATATTTCCGAAGTCCATAAAAATCCTTAACTATATTTTATATATTTCCTC^ 

TTTGGGAAGATTTGAGGCTTTTGGCCTTTGACTA

T CGG TG CT ATG AT ACAGAATTTTG G AGAACATT ATTGGT AATGAAAT AATGTT ATGAAG CCATCTG T CAT ATAG CTG TGTTCTGAA 

TATATACAGAGAGATGGCCAGAATTAGTGATCAGTAGCAGAAAGCAATGAGTGTCTGTACAGTT 

CATCACAACTGCAAGAGTGATACTATTATGAACCTACTTA 

GGAAAAAATTATTTTTAACATGGAATTTCATAGGCAAA^ 

AACCACTGACCTGAGACTTTTCTCCCAGAATGTCTCTGAACATTTTGTACCACT

AGGTAAAAGATCCCAGCAGTAGTCCTAGCGTTAGCTGTAAAACTTCAAAATGGTTAGTAGATAGTTTC 
CAACTCAGGGCAGGTTTCAATCTGCCATACTGCCCTCCCGGGCCTTC 
ACAGAGTCCAGATGTGTAGGTGAGAGCAGAGATGGGCTT3AACTAAAGATCT 
CCATTGATCTGTTGGTTTGGTTTATACCTAAATGGGTAAAGCCTCTCTCCTGAGTAGGAT^ 
AATGATTTTCCCATTGAAGATGTAAAGTCCTCATTTCAA
GAAAATAATTATTAAGTACAGGTAACAGCATAATATTAAGCCAGGCATTGTGCAAGCGAG
TAGCTGACAGTGTTCTGCAGGACAGGAGCATTAACTCACACCTGGTCAGTGr 

G AAGG CT AAGAACACTGAAGTGATGTGGGATTCTTG CGCC CCCTTGTG CTT CT CCTGGGAAGAAAGCATTT CTAAAACAAACT AGT 
TGAGGAGAATTGTTCAACCTCTAAACTCTTCAGATAAATCGGGCTTTGCT 
AATGTGATTTAGAAACAAGCCGTTTATGACTCTAGGATTTGATGTTTTAAAAA
CTCTTTCCTTTAGTGCAGAATTTTTCAACCTCAGTACTATCAACATTT^ 
GCATTGTAGAATATTTAGCAGCATTCCTGGCCTCTGCCCATTAGATGCCAGTAGCATCCT 
TCCAGACCTGCCAGGTGTTCCCTGGGGGGCAAAATCACCCCCCAGTTGAGAACTGCTT^ 

CAG ACTTTT CG CT AATTGGAAGTTGAAGTCT AAATAAAGTT ACATG AT AT AT ATG CAAGAGCTTTCAAAAAATTAAAATGCCCTAT 
GT G AAGGAACT ATT AAGAAGTCT AAT ACCTG 1 1 TTri GTG 'm ' TG TT'l I XSTTTTACCATGGCCTTCTACCATTCTGATTATTCAGA

ACTT ATTGAAATGTT AAG AT CT AGGTTTTTGT C CTCCCIX^ CTCTTCTTTT CTTT CC CT AT ACAATATATT AGGAG AT AAT CAGAA 
AAGTTAAGTTAATCAATGCATTAGAGAGGGAGGAGGAGAGAGGATC 

TCTCATCTCAGTGAAACAACACTTTT CATTT CCACCCACACA1T"1TTTGCTCC C^^TGAT AGCAAAAGG I'll 1\>I TG TTGG TATTG 
TTTTTAGAGAGAGAGGGTCTTACTCTGTCACCCAGGCTGGAGTG CAGTGGCACAAGCATAGCT AACTCCAGCTTCGAACTC CTGGA 
crTAAGAGATCCTCCTGCCTCAGCTTCCCAAAGTGCnKKSA

CATATTTACATTCCTTTGATTTCAATATTTACTCAACAAACC^ 

GGAGAGGACAATGCTCCCTGTCCTCAAAGACCAGCAAGAGAA 

CTTTATTCTTTTTTTTTTGTAGCATCCAAAAAAGGA^ 

TG C CACAGT AAAATACCAT ACACTGGGTGG CTCCAACAAC^ CTGGAGACTGT AAGTC C CAG ATCAAGGTGTT AGC 
TGAGTGGAGAGAGGGCAGAGGGCAGCAAGCACCCGAGTTTCT^ 



GGGGACACAATTCAGTCCATAGCAAATCTAATTTCCTCAAGAATTTC^ 
AATGACCTGTATCACACACTTCTGAGGCAGAGAGGGTGACAG^ 

TGGK1AAGAAGGATGTTCGAGGGATGTAGCCTGGCCCTCATTCACAC CCAGG CCCTCCAGCTGG AACAGGTCTGCCCCATCTTCATC

ACCAGAAAATCCACACCAGGGCTTCCfTTTCTACAGGCAGAGCTCGTTTCT 
TCCTCITAGACCTGCTCTATGGTTTTTCTCTTACTCA 

GTGAGATGTACGCGCTGCTTGGX3CTGTGGTAAATCCTAGTAACATGGTCTTAACACT 
ATAGGTCTGGCTCCTCCTCTGCCATCTGGCAGGATGTA^ 

TGTACTCCTGTTTCTATTTGCGTATTTACCAGCGATTCCCCAGTGCTTTCATTTACCT

CCCTGGAGGGGTTTTGTTTTGTTTTGTTTTGTTTTGTTTTTGAGGAGTCT 
CTTGGCTCACrcCAACCTCTGCCTCTTAGGTTCAAGCAGTTCT 
ACCATGCCTGGCTCATTTTTGTATTTTTAGTAGAGACGGCCAGGCTGX3TCT 
CCTCCAAAGTGCTGGGATTACAGGTATGAGCCACTGTGCTCAGCCCCCAGCGT^ 
TATATATATTATATATGTTTATATTTATATATATTATATATAACTGACAGACCTTTCTGATGTATAT^
ATGTTGTATGCTACTATCCAACTATTCATATTCCCATTGCCTATAGATCAGGGGTCAGCAAACTA 

TCCCTG CCTGTTTTTGTAAGTACAGTTTAACTGAAACACAGCTG CACCC ATTG ATTT ACAT ACTGTCT AG AGCAG CTTTTGCACTA 
CAAATGATAAGAATTTAGTAGTTGCAACAGAGACTGGTTCAAATAGGA 

ATTTGCTGTG AAGTGCTG AAAAG AAAAGT AACTTCCTTTTCATT^ TGG CAT AGTCT AAAACTG CTCACAAT CCTTTTGAA 

AGACTAAAGAATAGGCCAGXSTGTGGTGGCTCATGCCT^

AGTT CAAGAC CAG CCTGG CGAACAT AGTG AAACG CTGTCTCT ACT AAAAAT ACAAAAAAAAAT AG TGGCGG CCACC 

TGTAGTCCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTG^ 
TTGCACTCCAGCCCTGGAGACAGTGCGAGACTGTGTCTCAAAAAAAAAAAAAAA 
ACCAAAGAACATAGCCCAGAAGCCTTGCTGTACTACACCTGCTGAAGGGATGGATGGAAA 

TCAGGCACTGCCCCAAACCAGCCCACAGGCCATCGAGGCCTCCAA

AGAAAGAAACCAGCTGTACCAGGGTTTGCTGGJ^GGGTTGCTTCA 

AGTACAAGTAATGACGCTTTTTTATCCTAGGGAGGCAAATAC^ 

GATGGAATATGGTTTATCMAAAAGGAATCGAAAGCTTCTGAA 

TGAAGCTTAGAGAATACACCAAAGCTGTTGAATGCTGTGACAAGGTAACATCGTGTCT 

ATCTTTTCTCCATCTCTGTCTTTTGCrrCCTCTCAGCCAAAGG

TTGGGATTATTTGTTTCTCAGTATGAATTGGGAAATTCAGCAAGATTTTTATTACAGT 

T CAT AATG CATTCTG AAGAAAAGTTTTGATTAAAATT AAAT AATTTTT ATC CACG ACC^ C CTTAT AACTGT AGT ATTT AT ATCAG 
AGTAGACGTATAACTCCTCAGGTTCAGTTTAATAAGAGAAGATTTAAOT 

T AT AAAGCAT ATAT AT AGGTGAATTTTTATTATAATCATTT ATTCACTG AT ATG CA^ C CACAG AG CATCA 

GCAGCCTTTTTATTTTTACATTTGTCTTATTAATACTATCTTTGC^

ATCAT CTG T AT CGTT ATG TATCGTTCTGAT AT ATTT AT ACAC CAG C CGAT ACT ATTTT AGG T ATCTGG TCTCAG AGTG AACAGTAT 

ATATTATACTGTGGGTAAAAGTAAAAACTAATCATTGGATGTATCTTAA^ 
TTTTGTACATTCTTTCTGAGATTATTTTTGAATTTTG 

TGTTTTAGAGACAAGTTGTTGCTATTTTGCCCAGGCTC^

AACTCTGGGGCCO^AGCTATCCTCCCACCTCAGCCTCCTGAGTAGCTGGGACTA 

TTAAATTTACTTTTGTAGAGATGGGGGTCTTCCAAAGAGCTGG 

ATTATTTTTCTGTCTTCTATTGACAGAAGCAGTCATATAGGATCACTATTAACCC 

ATTTTCCTCT ^TTTTTTTTTTTT ' 1 ' G GAG ATGG AGTCTTTCT CTGTTACCCAGTCTGGAGTGCAATGGCACAAT CTTGGCTCACTCT 

AACCTCCACCTCCCGAGTTCAAGCGATTCrCCTCCCTCAGCCTCCCA

AATTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATG^ 

AGCTTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACAACCCAGCA^ 

AAAGAAGGTTTTAATGTTTATGTAGTCAGATTTGTCGTCTCCTTT 

ACTG AAGCCAGGCACAGTGGTTTACACCTGTGACCTCTAACCCCAGG^ CCTGAG CCCAG 

GAGATAGAGGCTGCAGTGAGCTGTGATTGAGCCACTGCCCTCCAGCCTGGGCGA

TTATATTTTACATATTTTAAAATAAAAAATAATTGAAACATTT^ 

CTTITGTGCTT'ACAATATGAGGTAAGAA L " rri '' rri '' l '' l ' Tl I IITT T X 1 1 TGGAGACAAGAGTCTCACTCTGTCACCCAGCCTGGAGT 
GCAGTGGCACGATCTCAGCCCACTCCAACCTCCACCT^ 
ACAGGCG CCTG CCACCATG CC CAG CTAATTTTTGTATTTTCAGTA^ 
CCTCACCTCAGGTGATCACCTGCCTCAGCCTCCCAAAGTGCTGGGATTA
TCTATGTGGATAGCTGCCATCCTGGCCCCGTTTATTTTA^

TGTTTTTATATGCAGACACATGACTGTTTCTGAATGATTCGTTCTC 

TCTTAATTTCAGTAGTTTTAAAGTTTGTCITTTTCT 

GCCATTCTTGGGCACTTATTTGTTCCAATGAACTTTAAAATGAACTT 

GAAATTGCATTAAATGC^GAAAATAATTTGGGTAGTAAATTTTC 

CACTTTTTCCATCTCCTTTGGCAAGAGAACTTTGGTAAGCT 

TCTATTCCTAGGCCCTTGGACTGGACAGTGCCAATGAGAAAGGCT 

G AGTCAGCCAAGGGTG ACTTTGAGAAAG TGCTGGAAGTAAAC C C CCAG AAT AAGG CTG C AAGACTG CAG AT CTCCATG TGCCAGAA 
AAAGGCCAAGGAGCACAACGAGCGGGACCGCAGGATATACGCCAACA^ 
CAGAGCCTGATGACTGCAGGTGGATCAAGAGCCTGACCATGCCCCTTTGAT^ 
CTCAAGAGCAGGGAAAACGATTTCCAATCTAATCTGAGGCTGCTTTGGAT 

CACACGTCATG AGAGATGAGAAAACT AAATTCC CTTCAG ATGT ATTAG AT^ TT CAAAAG T AATTGTGGTTTTT ACGATT ACTTT 

CAATGGCAAAAACAGCAATTTTGCACCAACTCCCCCCGCCCGCCCCCCGGGCCCCTGCC 

GACTGTAATGCAGTCGTGCTAACATGGCTCACTGCAGCCTTGACTTCTCTGGATCAA 

GCTAGGAGAAGCACACTACCACACTGGGCTAATTTTTTAAAAT^ 

CGAACCCCTGGCTTCAAGGGTTCCTCCTGCTTTGGTCTCCCA^ 

GTCATTrTTGAAAAATTGTAAACCCATCAGGATTTTTTTT^ 

CCCCATCATCACACCTGTTATCTTGAGGGTCGGGAG 

C CCTACAG CATAGTTGGCATTCTGGT CCTGTCCC CAGTCT C CAGGAATG CC CAGACTTCATTATCCCACAGCTGTCCCATGTGT CA 

ACCAAAGAAAGGTGAAGCTCCCACTCTAACTATTGGTAGCACAGCCCTGACCTACCCAG 

ATACCTCGTCTTTGGGACCTGATACACAGTCTGGAGGAGCCAGGAATTATTT 

TTT CTT CCTT AGG AAGAGG CCAAT AAAG CAAT GGGCAAGAAGACTTCAGAAGGGGTCACT AATGAAAAAGGAACAG ACAG TCAAG C 
AATGGAAGAAGAGAAACCTGAGGGCCACCTATGACGCCACGCCAAGG 

CTTT C C C C CAACTCAGGACAG AACAGTGTTT AATGT AAAGTTTGTT AT AGTCT ATGTG AT^ CTGG AAGCAAATGGCAAAACCAG T A 

GCTTCCCAAAAACAGCCCCCCTGCTGCTGCCCGGAGGGTTCACTGAGGGGTGGCACGGG^ 

GACTGTGGTGTGGAGGGAGTGAGCCAGCAGCTrTAAGTCCAGCTCATTTCAGTT^ 

CTGGAGATCATCCTAACAATGTGGGGCTGTTAGGTTTTACCTTTGAACTTTC^ 

TGATGAATTTCTCCTTTCCTACAGTTGGGTAGGGTAGGGG^ 

GTAGTCTGTTGCATTTTTAACCAACAGAACCCACAGTAGAGGGGTCT 

AGCCAGAACCTCAGAGGCCACTTGCTTGCTGACTTAGCCTCCTCCCAAAGTCCCCCTCCTCA 

TTCTACCACACACAGCCTGTC CCTGGGGGAGTAATTCTGTCATTCCTAAAACAC CCTTCAGCAATGATAATG AGCA GATGAG AGTT 

TCTGGATT AGCTTTT C CT ATTTT CG ATG AAGTT CTGAGAT ACTG AAATGTGAAAAGAGCAAT CAGAATTGTGCTTTTTCTCC CCTC 

CTCTATTCCTTTTAGGGAATAATATTCAATACACAGTACTr^ 

TTGCTATTAAGAATTTAAGACTTGTGCTTACAATATTTTT^^ 

^ T YXTTGTCTTTTTAAGATTATTGGCTCATAAGCATATGTATACTGGTTTATG^ 

AAATTTTGACTTTTTAGTACTAAGCTTAATTTTTAAAAACAAAATCTGTAGGGTTGACAAAT 

GGTTTCACCTGCAGGTTTGACACGCAGTTGCTCGCTTTTCCTGC^ 

GAAGACAGCTTCCCATGCCGGTACACAGCCAGTAGCCTAAATCTCCAGTACTTGAGCTGACCAT^ 

GTGTACATGTAGTTGAATTTGAGTCCTTACGGGTAAACAGATTGAGCATGGCTCTCTATTCCCTCAGCCT 

AATGCATTTGGCAACCCAAGGAACCATTTGCTTAAACCTGGAACATCTCACCTTTTTA 

TTAAATTAGTTTTTATTTTTATGATGGTTTTATCAAAAGACTTTTATTATTA 

ATTTCCTTTTATAATACTTTrCTTCCCCATGGAACAAATGGGATCAAT^ 

TAATTTCTCATTTATGCTTTTGTCTTTTTTATGAAATATTTCTTTTAAAAGCC 

CTGATTTTGCTTACn'GCTAAACTGTTGGGAAAGCTCTGTAGAGCATGG 

GGCCACCTAGCTTTTTGCAGATAACAAACAAGAAAGCTATTCCIAAGACTCAGATGATGCCAGCT 



ACCACTCAAGAAGACACTTCACGTATTCTTGTATCAAATTCAATAATCTTAAACAATTTGTGTAGAAGTC 

CCACCTTTTAGGCTGCATATGGATTGCCAAGTCAGCATATGAGGAATTAAAGACAT^ 

CACTTTTTTGTGTGTTCTTTAAATAAATCCAAAAAAAATGTGACTTCCCATTGTTTTT 

GCTATATTGGTTAAAAAACCATGGTGGGGGCCAACTTGAAGAAATTAGACATCAGAGAGGTCAGGCGTAGTGGCTCATGCCT 
TCCCAGCACTTTGGGAGGCCAAGGCAGGTGCATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAA 

TCTACT AAAAATACAAAAATT AGCC AGGTGTGGTGGTGCAC^CCTGTAATCCCAGCT^ CTGAGGC AGGAGAAT CACT 

TGAACCTGGGAAGTGGAAGTTGCAGTGAGCCAAGATTGTGCCACTGCACTCTGGCCTGGGTGACAGTGAGACTTGGTCTC 

AAAAAAAAAAAAAAAAAATCAGAAAATCGGCAATGGACAGTATCATCCTACACTTTGCTCTAAGGTGAACCTG 

GGTGAAAGTGGCAGAACAATGCATCATTGCTAAAAAGAACCAAGTAATGGGAAAAATAAAAAGGT^ 

GGATATTTCACCTTTTTAAGCrrGATTTTAAGGTAGGCTCGACTrGCAAAATAAAATTT 

CATAAGTAAGGCTATCTAAGGTGGTGGTCCATATTGAGCATGCATTTTCTTTCTTAAGTAG 

CAGACT CTCAAGG CCTAAGGAGATGTCACATATACTG CAAATCCTCATTTTCACACTGC AAAAGAAGGTGGCACAGTG CTGG AAGG 

TACTTAGTAGGACCTCTGAGGTCTACAGACAGGTTGATGCAGGTTCTAGGCACCCTTTATTCCCT 

TGGATTTGAATTATATTTCTCCCTCCrCCCTGTTAAATTATTTGGTAATGAAATAAATTCCT 

ATAACCCCAGCACTTTGGGAGGCCGAGGTGGGCAGGTTGCn'TGAGCCCAGGAGTTCGAGAC 

TGTCTCTACAAAAAGTAC^AAAAATTAGACAGACATGATGGTATGCACCTTTAGGC^ 

GAGCCCTGGGAGGTTGAGGCTGCAGTGAGCCATGATCATGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTA 
ATAATAAATTCCCTTTATTTGCGTGCTTTTTTTTTGTAAGAAAATAATGATCACC 

TCCTGG CTGTAAATGTGTGATCTCAGAAATTCAGAGGCAGCTTGCACAACCTCCAAAACG AGGACTGCC^ CTTTGTCT 

ATACTTACTTGGCCTTAGCAGCACACTGAACTCCACTTTT 

TCCAGTATTGGTGACAAAACGCATTGAATTTGAGCGAGTTAGTCAAAATCCT^ 

TTTCCATCAGCTGGCTACTTATGCATCTGAATGACAGTGACTGACG 

TTGGTAGACCTrCRGTACTTCCCTTCAGATTTTTCTCTACTACATGTCT 

GTAATTGCCCAGAATCTCCACGTGTGACCTTAATAATACACGGCAAGAAAGAGACAGCGTTAAATAC^ 

AGCATCTTGTTTGGGT AACTGCAG CCTAT AGTT AGAAAAATT AAAG CACAGTTT ATAAGGCAGGGATTCTTTGTCTTCTAAAGATG 
AGGGAACCTTGGAGAATGAAGAATATGGGGAGGACCCAG AATGGTG AT^ CCCTT 
GCACCATTTTAAAATAGGTCTTTTTCTAGTATGATGAGTGATGCT 
CCACATTCGGTGGAACTGAAATGGCAGCIAGAGCCATGGGCCTCATA^ 

GTG TT AAG CAG AGGCAT CTTTTT AAAAT ACACTTT AGGGG C CAGGG A'lXKiTGGCTCACACt- 1 G i AA i CCCAGCA^ x j. j. GAGAGGv. i 
GAGGCACKSAGGATTGCTTGAGGCCAGGA^
AAATTAATTAGCTACACATGGTGGTGC^TGCCrATAGTCCCAG 
CGTAGGCTTCAGTGAGCTATGATCACGCCACTGCACTCCAGCCTGGGCA^ 
ATGTGGCTGCTACATTGTTTCCACCrCTCAAA 
TGGGACATGTGTTTCC^VACTTGGCTTAGTTGACACATTC
GAGGGAGTTTTGTTCTTGTTGCCCAGGCTGGAGTG 

TT CT CCTG CCT CAG CCTCCTGAGT AGCTGGGATT ACAGGGT C CG CCAC CACACC CAG CT AATTTTTTGTATTTTTAGT AGAG ATGG 
GGTTTCACCATGTTGGCCAGGCTGGTCTCGAGCTCCTGACCTCAGGTGATCAAC^ 
GGCGTGAGCCACCGGGTCCGGTCTGTATCTAAGCCn M rCTTGAGGTATTAAATATTTCCAGCCTGGG 
TTTTCATAAGATAACTCCTTATTGTAGGCATTGTT

AGAATAAACAGTTGATGAGTATTTGGGGATGCAATGACTAATTCTAGTTACTCTAGTTCT 

GTATCTGCrrCACAACCGTGTTTCCACACTAAGGACTCCTGAATTTTTTTGTCTT 

CACCTGTAATCCCAGCACTTTG<K^GGCCGAGGAGGGCGGATCACGAGGTC^ 

CCCCATCTCTACTGAAAATATGAAAAATTCGCCAGGCTTGGTGGTGGGCGCCCATAATCCCAGCrrACT 
GAATGACGCGAACCCAGGAGGCGGAGCTTGCAGTGAGCTGAGATCG

TCTCAAAAAATAAAAATAAAAAACTTAAACCCATGTGTATCCTATTGCCCTCTGTGGCTACCAGAGTC^ 

CCTTTACCTGTAGTTAACAAAATGTGATTCCCCTTCTCCCACTGAGGGATCAGGAACACTGCCTTACT 

CCAX3TCCATCATCCCCAAACTAGAACATAGTAGATACTC<MTAAGCTGGCTTTGTCCCT 

CCQVGTGGCCCTTCACTCCTGTTACCTGAGATGTGGAGGCCAAAGCCC^ 
CTTGAATTCTTITCCAAGTCGTGAACCAGCAGCA^

ACATTGAT CTT AT AT CTATTTTTAGTCACAATTG AT AACTCAG AGTTCACCAAATACG AAGAT CAAAT AT AAAAT ATTTTC CTTAA 

AAGTATTCTAAGACCACCCCGCCCAGTCTGGTGCCCTCGTCAGCCAGCCCCCAAA 

CTGTTCCACTGAGGTGAAGGATGGGGCCCCCTGGGCTGAGCGACTCGCAGACAAGTGAAAAATGAGGCATG 
TC^CAGAGC^TCTGCCATCTCCTACTGACATCTACAAAATAGG 

CATTCAGAGCCACACTGGAGACCX3AGT CAAGCCAGGGXTCCACCCCACTG CTGACATCCAGCCCAGTGCTACTGGGCAAGCCCTTCC

TCAGCACTGCCCCCAGCTGG CCACTGTTCCACCCCTGTGCTTCTCTCCAAG CATCC 
AGCTATACC^GCCCTCTTGCCCTXKKSCCCAGCCAGGATGGTTTCTCCTC 

AGTCTAGGAGCAGAGAGCTATGTCCTTAGGTCTCTTTTTCAGTTACCTGCCTACTGGAATACAG 
ATTTACGAGACAACTGTACAGATACACCATGCCTAACTTTGCTTACTGCATG^ 

TTTTTGCTACATTITAATTAGCACTACC

GGGAGGCCAAGGCAGGAGGATCACTTGAAGCCAGGAGTTTGAGACCAGCCTAGGAAACAAAGCAAG^ 

TTTTAAAATCAGCCGGGCGTGATGGCGCATGCCTGTAGTCCCTGCTACTCAGATGGATGAGGTGGGACGACTC 

GGTCGAAGCTGCAGTGAGCTGTGATCATGCCACTGCACTCCAGCCrGGG 

ATAGGCTGGGCTCGGTGGCTCTCACCTGTAATCCCAACACTTTGGGAGGCCGATGGCGGCGGATCATGAGG 

CATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGTGGTGGTGGCTGCCTGTAGTCCCAGA

CACTCGGGAGGCTGAGGCAGGAGAATGGCATGAACCTGGGA^ 

TGGGCGATAGAGTGAGACTCCGTCTQUVAAAATAATAATAACTAAGTAAATAAACTTCCTGACT 

TGTCACCAACACACTTTCCCTGCAGAGAAAGCTACACTGCTTGAGTGAGTTGGTAAATGCTCCTCT 

GCAAGGGCACGCAAGGCTTTCTATATGGGCTTCTAAGCTTrCTCTGAACCCAGCT 

ACAATACAGTGAGGTTTGCACCCTTCTACCCTTCAGCCGTTACCAAATGTC

CAAGTCGGGGCCTCAGAACCAAGTTATAACCTGAGATAACCCCAGTTGGCTGGATGATTGCCGAGACCAG 
GCTGCCCACTTCTGCGGTCATAACTGCAAAACAGGGCCCCTCATAATGTCCTCATCAGGACCT 

CCTCCAATGTTTGG CAAGAGCACTGGTTTGGGTCCAG CTCTGTGTTGGG AT CTCTTGTTTCCTCAC CTGTGAAATA AC AATGG CTA 
ATATTTATACAACTGTTTTAATTATAGTAGAAAACACATACCATGAAATTTATCATCT^ 

CAAGTAATTCTTGTTCCTGAGCCTCCCAAGTAACT^ 
TTCACCATGTTGGCCAGGCTAGTGTTAAATACAGTAAGAAGTTCT^ 
TCCCTCCTTTCAAGGCCAGACTTCCTTACTCTCTCTGTCCCCCTGCCCTGGT 
CCACATTCCACATCTGCTACCCACTCTGTGAATTACCCCTCCCGTCACAACGACCCTCCCCGCAG 

AC CAGTGTAACCGCATTCTTGCACTTrTCAAGTTAGCCAACCGGG

CACAGTAGCGGGCACAAGCTGTGTTAGGAATAAAAACCCCTTCCCTCCTTTCTTTGGGGTGCT 
AAGCACCCTTCTGCAGAAGCAAATTTGCCTTGCTGAGAGATCCTTTGTCTCAGTGCT 
TTCCAACAAATTTGGTGGCCCATATGGGGACCTCATTCTCCTCCGGGAAAGGGTCTCTGATCACTTCT 
GCTAACTGATTGCAGTGGCCTCAAGGGTAAAGAATCGGGACCCACCCAGTGCAACCAATAAACCCA 

GAAAAGGCCTACAGTTACTGTGGTCAGCAGGTAACGCTATGCACAGACCAAGGTAAGAAAAACTGTGGGGGCG^

TTGGTGGTAGGGACAGTTTG<3AGGTTAAAAGTGTGTAAATAAGATGCACAATTAAGTGCAAAGCGAGTGTGGAGTAAA 

GTG CAAG AAAT CT CCAGTG<3GGG AGCTG AGCC TC CAGGG AAAGGGAGCAAAAAAAAAACAATCT TAAAAG AGGTTG AG CC AG 

GAGGCCGAGGCGGGTGAATCACGAGGTCAGGAGATCGAGACCGTCCTGGCTAACAC^ 

AAAAAAAAAAAAAATTAGCCGGGCGTGATGGCGGGCGCCTGTAGTCC^GCTACr 

CT GGGAGGTGGAGCTTGCAGTGAGCC^GATCGTGCGACTGCIACT

AAAAAAAAAAGAGGTTGAGCCCCACACACACTCAGGTTGAGCCTTC^ 

G CCCCCATTAACTTCCAGGATGGGAAATAC CCCTTAGTAAAACAAG AACTACAAAGG ACCAAACATAAG AAAAAGCAACAG ATGATA 
AAATATTGTTGTTTTTTTTGAACCAAAGAAACTATTCTTAAACCT 

TTGTCAACTTTTAATAAAGTATGTAAATAACAAAAGTCCTGTCTCCCAGGAAGAAATAGACT 

CTCTCCTCTACCCCCTAAAAACTGCAGGAAAAAAAGCCAGAAA

CACTAGCACATGGGATTCAAAATTTCCCTTTCCCCTCCTCCT 

CTAGGAGGAGGGGAAATTGGCTTTGTAAATGCCCCTTAACTAGTTCAGAGGTCAGAAACCTAAG^ 

GATGAT CTTTTT AGGGTTGTGG ATOVAATT AACCGATTTTTAGGACC^ CTAATGTC CATCTTAAG 

TATTCTCTTTTCGGGAGAGGAAAGAACC^TGATCCGCAGGGCT 

TCCCTGCAGCAGAACAAAAATCXX:CGGCCCAAAACCCCCAATGGGATAAT^

AGGAAAATGAT AG TT AAAXK3GATTCGGGAATCAGTGCCTC AAAC CCAAAAT ATTTC C CG AG CATTTAAT ATG CAGCAAGGAAAAAA 

CAAAGGGCCCACGGAGTTTTTAAACAGACTCAAGGAACAGG 

GGATGTTAAAGCTCCATTTTGTTACTAGTAGCCAAATATCACAAAG 

GGGAATGG AGGGAG TCAGGGAG AGAGAG AAAGAGAGGCAAAGAGAGAAG ATGGAG AGG CAAGGAG AG ACAGA 
HUMAN SEQUENCE - mRNA

GGGCCGGCTCGCGGGCGCTGCCAGTCTCGGGCGGCGGTGTCCGGCGCGCGGG03G 
ACGGGCAAGGCGGAGAGTGACGGAGTCGGCGAGCCCCCX3CGGCGACA 
CCAAGAACAATGAAGAAAGCCCCACAGCCACTGTTGCTGAGCAGGGAGAG^ 
ATTGTCAAAAGAGTGGGGAATGGTGAGGAAACGCCGATGATTGGAGACAAAG
AAAGAAGTTTGATTCCAGTCATGATAGAAATGAACCATTTGTCTTTA^ 
GGGTGGCTACCATGAAGAAAGGAGAGATATGCCATTTACTGTC 
ATTCCCTCGAATGCAACTCTCTTTTTTGAGATTGAGCT 
GAGAACCAAACGGAAAGGAGAGGGATATTCAAATCCAAACGA 
GG ATGTTTGACTG CAGAGATGTGGCATTCACTGTGGGCX3AAGGAGAAGACCACGACATTC

AAAATGCAGOSGGAAGAACAATGTATTTTATATCTTGGACCA^

ACCTAATGCTGAGCTTATATATGAAGTTACACTTAAGAGCTTCGAAAAGGCCAAAGAA^ 

TGGAGCAGGCTGC CATTGTCAAAG AG AAGGG AAC CGTAT ACTT CAAGGGAGG CAAAT ACATG CAGGCGG TGATTC AGT ATGGG AAG 
ATAGTGTCCTGGTTAGAGATGGAATATGGTTTATCAGAAAAGGAATCGAAAGCTTCTGAAT 
CCTGGCCATGTGCTACCTGAAGCTTAGAGAATACACCAAAGCT

AGAAAGGCTTGTATAGGAGGGGTGAAGCCCAG CTG CTCATGAACGAGTTTG AGTCAGCCAAGGGTG ACTTTG AGAAAGTGCTGG AA
GTAAACCCCCAGAATAAGGCTGCAAGACTGCAGATCTCCATGTGCCAGAAAAAGGCCAAGGAGC^ 

AT ACGCCAACATG TTCAAGAAGTTTG CAGAGCAGG ATGC CAAGG AAG AGG CCAAT AAAGCAATGGG CAAG AAGACTTCAG AAGGGG 

GTCCCAGTGAACTCGGCCCCTCCTCAATGGGCTTTCCCCCAACTCAG^

GTGATTCTGGAAGCAAATGGCAAAACCAGTAGCTTCCCAAAAACAGCCCCCCTGCTC 

HUMAN SEQUENCE - CODING 

ATGACTACTGATGAAGGTGCCAAGAAC!AATGAAGAAAGCCCCACAGCCACTGTTGCT 

AGACAGGGGAGTATTAAAGATTGTCAAAAGAGTGGGGAATGGTGAGGAAACGCCGATGATTGG^

AAGGAAAATTGTCAAATGGAAAGAAGTTTGATTCCAGTCATGATAGAAAT^ 
AT CAAGG CATGGGACATTGGGGTGGCTACCATGAAGAAAGGAGAG ATATG CC^ 

GG CTGGCAGTCTCCCTAAAATTCCCTCG AATGCAACTCTCTTTTTTG AGATTG AG CTCCTTG ATTTCAAAGGAGAGGATTT ATTTG 
AAGATCGAGGCATTATCCGGAGAACCAAACGGAAAGGAGAGGGATATTCAAATCCAAAC^ 

GAAGGCCGCTGTGGTGGAAGGATGTTTGACTGCAGAGATGTGGCATTCACTGTGGGCGAAGG^

AATTG ACAAAGCTCTGG AGAAAATG CAG CXXKSAAGAACAATGTATTTTATATCTTGGACGAAGATATGG 
AGCCTAAATTTGGCATTGAACCTAATGCTGAGCTTATATATGAAGTTACACTT 
AT<X1ATACCAAAGAAAAATTGGAGCAGGCTGCCATTGTCAAAGAGAAGG 
GGTGATTCAGTATGGGAAGATAGTGTCCTGGTTAGAGATGGAATATGGTTTATCAGA 

GG ACTGGACAG TG C C AATG AG AAAG G CTTGT AT AGG AGGGGTG AAG C C CAGCT G CTCATGAACGAGTTTG AG TCAG C CAAGGGTG A 
CTTTGAGAAAGTGCTGGAAGTAAACCCCC7VGAATAAGGCTGCAAGACTGCAGATCTC 
ACGAGCGGGACCGCAGGATATACGCCAACATGTTCAAG^GTTTGCAGA 
AAGAAGACTTCAGAAGGGGTCACTAATGAAAAAGGAACAGACAGTCAAGCA 
Table 27

MOUSE NOMENCLATURE 
ICSGNM Rel 
Celera mCG8770 

HUMAN NOMENCLATURE 
HGNC REL 
Celera hCG15154

MOUSE SEQUENCE - GENOMIC

GTCCAAAAAAAGACTCCTITTTCATATTAATAAACCATAGACAATAA 
ATTGTTCTGGTTTAAGCATTCAAGAGATAGATGGAAAGGTGGAAG 

AATGGGCTGGAGCG CCAGAAGAAGACATTTGAGCCACIAGTCTCTAG AAACTGTT CTCAGGAT^ 
AGCAGTGCTGACTGAGGTGTATATACCAAAGTGTCATACGCTATTTGACAGAAGGAAG 
AAGAGACCATGCTCCACTGCC1TATGCCCATTCAGGTCTAACCACTTTAACT
ACACTAATGCTGGGACATAATCTCTCTCCATACTTAGAACAATGTT^ 
ACATTTATAGCTTGACTTTTATTTATATATTCATAATCCCATGACAACTTACAGA^ 

TTTATTCAGT AAAT CT AGAG ATAT AAAAGT ACACTTTT ATCIAATTGT ACTATT AGTTCAAG CAAGCACCAAAT AACACTGAAGGTG 
T ACCT AAACACAACATGGGTATTC CC CATACCT AAGAGG AGG CAGG AGGATTG AG ACTTTG AGGT CAGC CTGAG AT AT ACAATGAG 
ACATTGG CT CAAATAAATAAATAAATACATAAATAAAT AT ATTT ACTATG A

ATTCAGTGTTAGATAAAAGTTATACTTGTCTTCAA^ 
AAAGCCCAGATrCCAGTTCATATTTTTCCATTCTCCCAATCTTCTAACT 
GCACTCTCAGAATGTTCCCAGTTTCCTAGAACAGCTGCTTTCCA^ 

AATGGCAAAATGCAAAACAAAAAAATATGTGCATCCTACATCCGCCATTTAGCGAGACACCCCCTTC 
TGAGGTTGTCAGGAAGTGTAAACATTTCCCTAGAAACTGGAGGGAAAAAGAAAAAGTCGA.CT
TAGAGTGTTACAGAAGAGACAGATTATCTTGGGCTATTTTT^ 

CAAAGTT AATTCCAG CACT CAGG AGG CAGAGG CAGG CAGAT C AG AG TTCAAGG C CAGCCTGGT CT ACAG AATG CGTTCTGGG ACAG 
CCAAGGTTACACAGAGAAACCCTGACTGGATAAACAAATGTATATATAATTCATTTTCCAC 
TAACTGTCTAAAACTTTAAAATGCAGTTTTAACAGACTTCTAATCTAAGAAGACCCT 
CTCTGTAACCAACAGACCAGATACAGATGAAGTACCTAAGGGTGTTCCGTGTCCTCACTIT
GTCAGGAGTTCAGX5TATGGCGGCACATCCTGCCCCACAAGTTCAACATTGGTT 

AAGTCTCTGTCCAGGGAGGTCTG ACAG ATGGATCCCACGTG AC CTCAGATGTCATTCCTGTTACAGAAATGAAGTCAC CCTGTGTC 

ACACTT , ACAAGCX5CTGCTATATGCACAAAACTGGCTTCGTCCTCTAGGCCTTAC 

TTTCTGGATGGTTTACGACCTGCCTTTATCAGTTCTCATGTACTCAAATTTA 
TTCCTTTTAGCCTAGCTGAGTGGGGCCGCTTTCTCAAAGCTGTTTCTCACCTTGG

CAAAGGGTGTGCACG CCCTCCTTGTTGG ACAGG AGAGGTT ACAG ACAAAGCTGATGGCAGGGTTCAGGTG ATG CCACTGTCTG AAT 

TATCACTAATTTTTGAAGTAGTCATACATATTTCTTGCTGTACCCA 

AGTTCTCTAAATCC^TTCAGTCAGGAAGGTGGACAGCTTCTCCTCGATCCTTTCCT 

CATATGCTGGCTGGCCGAGGCTTATTTGCTTCACCACAGTGACTTCTACA^ 
CGCAGCCTCTGTTACAGCGCTCCCTC^n-TTAAGAGCTTGTCACATTTCTCT

ATAACCGAAGGTATGGGTTGGGCCAACATAAATTTGGCAACAGTCAAAAGTT^ 

TTAATTCAAATTAAACCTCATAAAGACTCTGCTATGTTTCCGAGGAACTCACCAGCTT 

TG T CTGG AAC ACACATGTG CT AG CCACAAACACATG CCATCACAC AACG C CAGGC CAGT AT CTG CAG C CGAAACGGTTCG T ACTTT 
ACCTGTGCTTTGATTGTGTGGATGAGTTTTCTGTCCTTATAGAACATTATAGCACCTAATAGCATTCC 

GTTAGTCTGTCTGTCTTTCTTTCTTT C lTrCTTTCTTTCre

TrCCTTCCTTCCTTCCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTTCTTCT^ 
TGAGCTGCTAATTTGAGTTTCATTTAAAGATCATTAAAGTAATTTTGATCTTGTTTAG 
AGACCCACACACGCTTCTGGTATTTTGCATC^CATATTTATAAGGCAGCATTTT 

TCAGGC C AGAAGAGGGCATCAAATCCCATTG CAGAGGGTTGCGAGCCACCATGTGGTTG CTGGG AATTG AACTCAGGACCTCTGGA 

AGAGCAGTCAGTACTCTTAACCACTGAGCCATCTCTCCAGCCCAAGA

TTGCTATTAATAAATGGTTTGTAAACCTCTTTGTGTGTC 

TAATAAATACATTACTCAAAAAAAGAGCACCTCCCTCGTGAAACAAAATACTC(^TTGTCAATAG 

TTTCCTAATCTAATATATACCCATATTAGTCAATATTAGCTACTTTACCAAACATAGTAGTTCAAT 

CTCTACCTTTTTTTTTTAACTGTCTGCTCTACA 

CAGCAAAGGGGATGGAATTACAATATAGGATCAACAGCTTGTA

TACAAGCCTCTGGCTGAGGGCAGGTGAAACATATCTCTGATGAAGAAACGCTGAATGOT 

TGAAGGCAGCrCTCCCAGAGGGAGGCTTAGTTGTTGACCCTGTGACCTCTCCAGATATGGGAAA 

TGAGGAACTGTATTCAAACTCTCCTCTGTTAAATAAAATTAAAAGAAAAGGAAAAGGTGA^ 

G TT ACAAGT AG ACATTTTGGTG AG AAG AGGTATT AAGATGAGGT CT CACT AT AT AG CCCAGG CTGGC CTCAAATTC ACTGTGGAAC 

CCCATCTGACTTCACCTTATGGCAGTCCTGCCTTAGCCTCTCAAGCAACAACATCTC^

GAATTATAACCAAGTTTCAGAATAAACTCCAGTATAGACCTTCATAGGCTCAGAACT 
GTGTGTGTGTGGGGGGGGGGGTCACAGCTAGGACAAGGTGATTATTTACA^ 

CTAGAAAGGAGT ATTCTGGGCAGAGGGAG AAGAACG AAAG CAGATAAAT AAG AAAC CAT CTTGT TT AT ACAT CT CAGGGTCT AT AG 
AGCAGTTCAAAATGTATAAAATATAAGATATAGATTGTAAGAAATGAGGCATGGCCTCCTGGAAGCA 

TGGAAGCCATGTATTTGAACCCAAGGACGGTTTTAAATTGCAAATTTGGTGTGGCTTGATGTGTATCCCC^

GCAATATATTCGAATATATTTTTAAAGGGGACAGAATTAAAAAACACTTAGAACACT 
TGTCCCAATGATGATGGCAGTCAAAATAAAGAAGAGGCATAGCTGTTGGC^TAACTCAGTGGCA^ 

TATGTGCCCTAGG AT AGCTATGGGG CAACCCAACACAAAACCATGAACACTTAAAACAG CATGAGCAGATTGTTTTTGCAAGTTTT 
TTTTTTT CAAT CT ACCT ATG CTTTT CCCAAATGTGAACTTT AT AGGTGACAGTGT AT CAT AACAATG TCAAAAGACTG TACATTT C 

TCCAAGCCTTAAATTTAACCCCTAGCAACAGAAGAAGAAGAAGGATTCATCACAAT^^

AGGCTGAGGTGATAGGAGGGCCAGTCATGGCTCTCAGTCTGCAGAGATATCTCAGCAAGTAAAGT 

AGCAGAGTTTGGACCCTCAAGAGAAGCCAGATGCZAAGTCTGTAATTCCAGA 

CAAAACAAAGAGGCCATCTTCCATCCCACACTTACACAGCT 

GTCGGGGGTCAAATGCl* CCrrrTAC^ ACATCAGATATAv. * GTACCATGATTCATAACTAGCA 

WO 03/008583 PCT/US01/51291

AATTACAATTATGAAGTAGTAATGAAAATAATTTTATAGTTGGGGTC 
GGAAGGTTGAGAACCACTGCTTTAATAAAGTCTTTTA^ 
CTTCCTTGGTACAGCAGAGGTATCCTGTTCTAGCACCATCTC5CACACT 
TAGTCCTGACTTTACCCTGCACGATGAATGGCCTGGGAACCCAACTGCCATGCCAGAGA 
GTrTGCTCCTTCTGCATTTTCAGTCTTACTCTGTAGCCCAC^
TAGGATTATAGCTATGAGCCAATAAGCTTGCTTCTCATTT^ 
TATATGCTGTCCIXK3TGTGCWTTAATAGTAAGGCAGACTGTGCCTC 
TGGTAAGGCAGCCTTAAAGGTGTTAGACATTTGTTATAGGCACGTGTGG^ 
TC^GAATGTACTGAGTAAAAATAAATCCACAACCTACATCAGTTTTGTTC 
ATCAGTTCTTTTAACTGTAATTATTAGCATTAATTCTCAGACCCACAGTATATA
GGATAAATGATGTCTAAGAGAGGCTACAGAATTCTGAGGCTTTCC^ 
TCTGCATGCATTGGACTAGAAGTCITACAAACTGCAGGCATGGACCG^ 

AGGGTCAGTTTGATGTGAGCCAAGTGTCACCTGCCGGAGTCTCTTTGCTTATGCTGGAGACTG^ 

GATATAACAATT ATCCTAT ATGAGGCTTATG ATAGGT AG CGGGGACAATTGTCATAC CTGTCATGTTTTGGTCTTGGGGCACTACT 
TCTTTTCTAGTCCCAGGTCAGGGGATGGATTACATGTAACATGTCAGTTCATGTGTC

TAGCTCTTGGTTTATGTTTCACACTGTGCTGG 

ACTCATACTCAAAACAGAGTAACTCTGGTCATGGTTGTTGCITTGn^ 

GAACACTCCCATGGGCTC^TATAGTTTAACTTGGTCTAAAGTAGGTGGAACTATTTG^ 

TAGGCTCTGATGTTTCAAAAGCCATTCCCTGTCAGCTTTCTGACTTGTG^ 
CAGTGCTCTGGCTTTCATGCTCCCCACCAAGATGGCCATGJ^

TACAAATTGCCTTGGTCATTCCTTTGTTTTATGACTGTAATAGAAAAGTTACTGA 

CCACATCATAGACCTGGTCAGAGCAGTCCCAGCCCATTCTCTCAGCATTAGTATG 

AGGGAGCCAGAGGTTGTTACTGATAGCCCGCGCACACTATAAGTCCACACCCTGCATCAGCTAATACT 

ACACTCTT CT AAAAGTTT AT AC CT ACCACATCCATCATT CT CTCCT CAACCTGCTT CAATTTGG CTTTT AT AATGATCCTTTT AAA 

GGCACACTTCTCAAGTCACAAATGTCCTCCATTCTTTATCTATTTCATAG^

CTTCAGAATGTCTGGCATGAGCCTGCCTAGTCCTGGGATTACTGGAAAGAGCTATGCC^ 
TACTC^TTTAGAGTTCATTCAGAAAAGAATCCTCAACTGTTTCACITCT 
GTCTTCTGTACAAACACTTGTCTAAATCACCTTTCTAAAATTTCCATCATGGCA^ 
TACTAACAGTGTACCATACCAGCCAGGGTAAGTCTTCAAATGTACTGCATCAAC^^ 

ACTCAGCTCAAACATGGCCTCTACAGAATGTCCTTCTAGAGAGCCCAGTTGGTTTCTACT

GAAATGGAGTGACATCAAGACTTATGGATGCTTC 
ATAGTGGAGGATGACCTAGAACCCCTGCTGGTCCTACCTCCACCT^ 

TT ACCACG TGTT AGT CT CTG TAAAC CCCAATG ACAAGCAG CCTT ATCAGG TG T ACAG TG ACAC CTT CCC CT AG CACAAGGTCT AAA 
AAT AT C CIATGAG AATTT CAACCT CCT ATTT ACTTTTTCTTTGAAT AAAG ACAGT AAG T AG AAACTATGT CATTCAGAT ATGGTTG A 

GTTAAAAGCTGAAGTATGCCTGCATTCATGTGTGTGTGTGTATTGGTGTGTTCAGTGTTTTGTGGCT

TGATTCTGTGGTCTTGACCTCTGGAGCGCTGAACATAGTTCACCCTGCTAATTC 
TATGATTGTGTTTAACTAAAAACACAACTGCTATTTATACTCTTTGAGAAGACAATTTGGAAC 
CACITTGGGGGAGTTGTATAACAGGATGAATT1TITATGACAGGATAGTATATATTTTAATTAT 
ACACACACACTTTGGCX5TTCTTTGAAGCGCATGTAGAGGTAAGAGGACAACrrTTT 

CTCAATTTGTCAGGCTTGGTAGCAAGCGCAATTATTCTCTAAGCrrATC^

OVTTACTTGTTTTAGTCAGTAAATGACTGGAGAACGTAAGAGGTGGA 
CACCTACCTCAGCCTCCCGCGTGXrrGGGAGTGCAGTGCGCTAAACGGGGCTGGATAATC^ 
TAGAACTCTAAAGTATGGACAAGCTACTCAGCTTCTCCAAACCCCAATCAACGCTAATACTCGTGTATA 
ATCATAGACTAAGAGAACAGGTGCGTATTAGGAACTGAAGGCAAACAAG 

AGG CCTCAGGCCGTGGGTATCT CGGCCAATCTCTGGACGTCGTGTACATTTATACAG AATGGGGTTAGCGG CAGTGCAG CT

GAAAAACTTTGGTAGGTAAAAGACGTAATGTAGATATAGAGGGAAAATAAGGCTGCGG 

CTGCCGAACGCGGGCGCTC^CGCTTCCGCACTTCCCCTCCAG^ 

GCGAAATCGGGAAGAGGGGGAACGAGCGGTCGAAAACCCCGGGCGGGCCT 

CCCXSCCTCCGGCCGCTCGGCCAGCATGACGTCACGAACACGCCCCTCGGCTTTAAGCGGCGCTCTCAAGG 

ATTTCTCAGGG ACCG ACGGCTTCTC ACG CGCGGAGTCGTCGCTGCGCGCCCTCACGGAG CTCCCGGCG CGAGCGTGCAGGCTGTGG

TGGCTCTGGAGGAAGTAAGCGCACTGGCCCXSCGCGGGGTCGCGCGCCCGCCGCGAGGCrCCGCCCCCCGCCC 
CGCGAGTGACGTAAGCAGCGGGAAATCCCCCCCCCTCCCCCCACGTTCCGCCGGGCGGTCCGCCCCGCCCCCTGCCCCTGCCGCGG 
CATCCTGGCAAGGCGACGCCAGGACACTAGTGACCGGAGCGCGAAGATTCGGGGGTGGGAA 
CCGGGAAAGAAGAGGAGGCCCGGGGGTGGTCCGAGGACAGGCGTCCCGGCTGCGG 

CTGACGGGCTAGCTGGCTGACTGCCTGCCTGACCGACTGCCTGGCTGACT

CGGGCCGGACG CTGG AAGCTGCAGCGAGCGGCGCGCGGAGCCG AG CCATGG CCTCGAGTGAGTACCGGGGCGAGCACTCGCGGGTC 
ACGGGGAGGAGCTGTCGGGGGCTCCGTTTGGGGTTAGGACGGTCTGCCTTTCAAGTCTTCTCATCTGGCAAATT 
AAAAGTTGCCCGCTCCGGGGTTTGCGTGTGAAATGAAAGCGGCGCACTGTAATCTGATCTTCTC 
TTCCCAACGGTGTGCTTTTAGGATCACAGGGGGTAATCTCAGTAAAGGAGAGCAGAGAAGCCTCG 

GGAGACCCAGTTTTACATTTTTGAACAAAAAAGAGGAACCT

CXjGCTGACACTGGTACAGAATGTGGCGTGGCTGTGAAAGTCCTCAGTGGCGTTTTAATTCGT^ 
CCCCACGAGCTAAAGATGTTGCTTGTCTAATTTACCGTGTAGAGACCTCGATCCCTT 
TTTTGTTAGCAGCAAGAGGACGAGATTTGAlTATACTTTAAAGTAACr 
TATCTGCCACrAGTAAGTTTTGGATCATCTGCTTTGAGGGGGAGATTTATAAGAAACAGTTT 

CAGTCGTACTATATGTAAAGCAAGCACirGAAGCAGCTGAGATGGTGGTGGG

GCTGACCCTTATCTTCCCAGTGACCCTCAGTAGAAGGCAGCCGACGTCT 
TAAGAGTTGAGCAGAACAATATCATAACTCCTGCTGTAAAGGAAACTGGGGTGT^ 

GTTTTCATGAG AAAACAAGTGTAGTTTAAAAT ATTTCAACTTTCT CTTT ACCCATTTAAATG AAGC CAGGCTTTCAGATGCTCT AG 
TAAATTTGTAAATTTAACTCATGTTCTGATTAGAAGCACAAAAAGGC^ 
TGTATGACATTTGTAAAGAGTGGCAAGCTTTGrrAAAACAGTTCTGTCCTTGTCTACCT
ATGCAATACTTGAATTTTTTTCAACAGAAAAGTCATTTATTAAAGAACT 

AATTTG CCT AAGG ACAT ATATTTTGTGCTGTT AT AG AAATG ATT C CAAAAGTT AAAGTGGGG AACTTTTT AAAAGAAATGGCAAGT 

TTTAAGGATTTAGAAGTTGGGTTGGGCTTGCCCAGCTTCTGTGAA 

AATCCCAACAAACATAGAAAACAAATCTGCAAGTTGTAGTATCATTATCTTCT^ 

ATGAGACTACTTAAAATACACCATGTCATTTAAATTGTAAriC'T\J
TGGGTACCTTATATCAGTTATAAAGACCAAATAGAGAAAAATTATGAAATAAATTTAAA^

T AAAAGCAACAAAACTGTTT AACTTATGATT ATGT AATTT AGAT CCTTTTTTTTG CAAT AT AG TACTGAAACCT AGGAAAGTATTC 
ATATTGATTACCAAGCATTGAGATCATTTGTTCATTCCTCTCCGATGTCATATGTAAATAGGTATC 
TTATAAATGTAAAGTGTCGTAGATAATAATCTCCTC^CAGAATTATATAAATGTTGTGAAAAGACAG 
TAATTATTGTTCAGGTAAATTAGCAAGTTAAGACTTC

TTTGTTCT AG CTTTTTGATAAG ATT AATT AT AG CAGG CAACATGGG AAAAAAATGTTC CTG ATCAAAGATCTGCATATTT AAG CAG 
TCATTATTCAAACAATTGTATTCTATTTAAGTTAGCTGTCTTGTGTCC^ 
NNNNNNTGGAGGGAGGTGCACAGTTGCCTTGTAAGATACATACTA 
AAAATTT AG AAAGTTTGT ATTT AGT CACATTTTTTTTC 

TATTTTTGTGGACXTTACTACTCACCAACTTTTTGTATGAAT^

TGTGTGACAACTT AGACTTTAAT AATAGAGCTGG AAGTT AGGAGTGGCCAG CTAAAACAAATGACTT C CTTTTTT AAG AAGG AAGA 
CTGTT CACAAATACAG ATGTTTCAG AAACGCAG CACATGAAAAAGAG ATAAAGGTGTTGT AATAGTAGGTGATGTAAOUVCAAGAA 
TGAATGGAGAAAATGGTGGAT ACAGGGT ATATG TGGT CTCTTTCT CAAT'ITT TT TGTTTC AATTTTCT CC CAT ATGGG TTCT CTGC 
ATGATTG TG TTT ATGGGCACATGAGCTGAAT ATTGGAG AGG AAG AT AT ATTTT CAAAAAAAGTGTTC CTTTG AAATTTGT ACTTAA 

TTGGATCTCTTGTATTTTTAAG T CTGAAATCTTTTTTTTTTTAAAG ATTT ATTT AT^ TACACTGT AG CTGTCT

TCAGACACTCCAGAAGAGGGAGTCAGATCTCATTACGGATGATTGT^ 
TGGAAGAGCAGTCGGGTGCTCTTACCCACTGAGCCATCTCAC 
AATTTTTTATTTGAGGAAAGA<J1'1'1 1 AAACATTACTTTTCT 

TTTGTTTTGTTTTTCCCGT ATCCACAGCAGTG AT AT CTT ACTGCAAACCGTTT ACACG G CTTTGGTTGG CCn U iGATG AAT ACTC 
AGCAGAAAGTTCTGGATGGAGATGATATTTGGAAGTGTATGATTATTAGCTGCTTGCT
CCTAGGATTTCATGAGAAAGAAGCACACTAGTCAGCTCTTCTCAACC^ 
TAACAAGGCTCAGATATGTTTTATATAAAGGGAAATCAAAGGCCATCT^ 

GCTTGGATCCGAGCATTGGGTTACTTCCCTGAACACTTTGCATTGT^ 

GTATTCAGATTAATTTGGGATGTTTCTGCAGAAATCTGTCTGCACTT^

GG TT AGAT AGTGGTTTGG CTT CTG AT AT ACAGCCCCAG AACAAACT CTGTTCT CAC CTT AT ATT 

CTTAAGATAGTTGTATTTGTTTTAGGTTATAGGTTATGGCCATATCTGTTATT^ 
CATAAATTTAAAAGTTGATTTTATTTTGAAACTTGGTTGTAATTACACCT 
AAAAAATAATCTCATGGAAGAGTTTCACGCTATCATTCTTTCCCT 

TGGTAGTCAGATTTGTTAGTTCCTTGGATTATGACTGCCTCACTGCTATATTATTAAAAATTC

TTTATGAC!AGGCATAGTGGTCCATTCCTTCAATCCCAGCACTTGGAGGCAG 

TTT AT AAAGCAAGTT ACAGGCCAT CCAGACCAT ACAATG AGG CTGT CT CAAAAAAAAAAAACAAT AAT AAT AAT C CTTTT CCT CAA 
AATTTTATATTATTTTGAAGTTTTAGCATCTCrrAAAATGAAGAGCAGTTCT 

TAGGAATAGTATTTTTGTTGATATACCTATTTTATATATAATTGTGGCAATTGATCTATTTTTAATGTTTCT

TTCTAATTCCTTTTTAATCTTAAAATCTAATTTTTAATTTTTTTAATCTTAAAACT 
TTTCAAATAGCATTCTCTTGCACCTAAATGCAAATGAAATCATAACTATAGT^ 
AGTAGATATTAGCAATTCAAAATTTATAAAAATATATTTAAAAAATATGGAGACCAATCCAG^ 
AAAAATACCTTAATTAAATTGTTTAGAAACAACTTTGAAAACATTTCTGTGC 

GTAAGAGTATTTGTTTTCCCTAAGGTTCCTTGGTTTATAATTTTTCCT

ATCTTCTJ^GAGGAATGGCATTGTAAAAGAGTACCAAGGAAGAAC^ 
TAGCTAGGAGGGCTACAGTTAAATCAGAATATGAACACATAT^ 
GGG T T'rn Tr fTCCAGTTATTGAGGAATATAATAAGGAGGGACTGATAATC^^ 
CAAAAGAGCTTTGGAATTTTAATTTTCACACAGATATTAATTTAGGAC^ 

AAGAAGTAAAACATGTAGTTTCATTTCAATAACAAAGCTGGGTATC

GTTCAAGGAAAGCCTCAAGCTACACATTGAGTGAGGCCTGCGTAGATTACTTG^ 

GATAGAGT AG CTGCCCAG C CTTG AGGAGCCCTGATTGGACCGAGCATTGTACAAAGTCTGTGGT^ 

CTAGG AAAGTGG AGGCCAG AGGCTTAGAAGTTCAAGATCAG C CT AGGCT ACAAAACAAGTTAGAGGTT AG CCCTAGATAG AT AAGA 
CACTGTCTCAAAACATAAAGAACACAATAGTTAATTTTCCCTAAACT 

TGTTTTGCTTTGTTTGGAATAATTGCTCTTAAAAATTACC^

GTAGACTATAAATTCTGAAATTrGTTTTTGAGTGAGACCT 
TTTGTTTTTTGAGACAGGGTTTCTCTGTGTATCCCTGGCTGTCCTGG 
CCACCTGCCTCIGCCrrCCCAGTGCCAGGACTAAAGrrGTGCCCAACTATTTCCTGGCT 
ATAAATATAAATGCATGAAATTACATGACAATATTTAGGTAATTCTTATTAC^ 

(TTTTGTATTTCATAGTTTTATAAATACACrCACn'GGTTTAAAGAAATGTGTAACTTGCTT

TTTCTCCTTTAAAAAAAATCCAGCTATCATTAATGATATCIATTAATTATCAATTTCTCACCAG 

GG ATTT ATTTG ACCATTT AGTT C AAC CAT AT AAGTT CTTT AAG ATG C CACTG AAAAAAATCATGTTG AATTT AACACATTTG AAGA 
GAAAGGTACATTCTCTTAAGTAGTCCTACAAGAATGCATAGGCATATAAGATATTTAAAATTACG^ 

TGAT AC ACACATTG T AAAAT ATT CATTTG CCTTAT ATTT C CAGT AATT ACT ATGG ACTTTGTTGGATT AAAGTAGTTTT CT CTTTT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTGTTGTTTATTTGTGT^

TTTGTACCCAGATCCTTCAGCCTGCTTTCTGTCATTTGAATAAGCAAATGAGTGATA 

TG AATAAAATT AAAGGTAG CAGCAATAATTTATTGAAAAACCAC ATGGC CACAATTGATTCTCCCCTTT^ 

GTCTCTCT CAC AGGGACCT AGT ACT CACTCATT AGGCT AGT CTGGCAAGCCAGTGACT 

CACTGGCTTACAGGGATATGCTACCATGCCTGGCTGTTTTTATGGATGCT^ 

G CATTTTTCTGACTGGGCTGTTTTCCCAGACT CTTACCACTAGTGGTTTATTTCTCTTTGATGTTTAG ATACCAAAAT ATGTCAGT

TTTGTTTTTTG AAAT AAT CTT1T1TCTTT AT AACAGCT AT ATTT CAG ACAT AT CCACATTTTT GCT CAT ATGACAAT ACTCTCCAC 
TTACCTATAGTCTG A TITTTT I 'CAGTTCATTTATTTGTAATCAACTG 

AAT AACTTGTTTT AGT C ATTTGTTGT AATTCT ATTAT ATTGTTG TT AAC CT CTT AT ATGTGTG T ATGT ATGT ATGTGTGTG T ATGT 
ATGTATGTGTGTGTGTGTTTATAGGATAAAATGTATGTCCATGTATGTAAG 

TTTTTAGTCTAACCACTACCATCAGAATTCTTrCTCTAGAAA

TTTCACATATACAACAAAATCC^TATTACTGGATGTGCTTGGACTT^ 
TTGGTGGTTGATGTCATGGACAATTACTTCCCTTTTTCTAAAATG 
TGTTAGATTCCTCTTTTGTTCTTATTATAACTTT CTTG CAG AATTTTGT^ 
GACrTAGTATAGTCATTGCTTGCTGTTGGGTATAGGGTCCTTCCGAGTAGCAGCT 
GCAACATCTGAAGTAATAGTAGTACCATTTACTGATTACCACT^ x 1 i ACAGA
GCATATTAATAATAAGATATTATTAAACACAGGGTCTTGCTAGACT^^

GGCAAAAAAATCACTGCAAGTTCATGGTCAATTGGGTCTACATATACATAGTAAATTCT 

GAATAGACCATGTCCCATGCTATACTAAAAAACTCATTGTTTCACC^ 

TGAAAT6GGTTTATGGTTTGGGGAATTTATACTCCTGAAGTAAAATCACA 
5 ATGGACCTTCT ATTT CCATG ACTTGGCT AAAG GG TGAAG ATG CAT AT ATTTTT AAAGTTTT ATTG ACAT AT ACTT AAAACTCATT A 

TAATATAATAATTATATATAAA C , r ' l M I"r , rr ' ri " r AG'nTrnT , ri 1 1 1 1 T TGATATGAGATCTCATTATGT AG CCCTGCTGGCCTGG A 

ACTTGCTATATAGTTAAGACTGGTCTTGAACTCACAGAGATCAGCCTACCT 

ATCCTTCAAACTTTCTCATCTACCAAGTTTAGACA 

TCATGTAAATGGATCATATAGTATGTGCTTCTTTTACTCAACACAC^ 
1 0 TACGTATGTTGAATTACATTTGTTATGGAAGGTAAATGACTTTGTTATCT 

GGGACCGATTTGAAAGTTCAGTTGAGCTTAAAGTCAGTATACTGTT^ 

CCTATGCAAGGACGTAAGTCCACTTTAATTTTTAATGAGTGT^ 

ACATTITACTCAGTTTTGGAATATAATGTACTTCTGAATACTA 

AATTGCCCTCATGTGGACAGGTAACTCTAAGAGTTGTTATCT 
1 5 AACTCACCAGTTATGTGTTAATACTTACATAGGAATTTAGTTTGTATA 

ATTTT AACTTTGTT ATTT AAT GTTGGG AAGGAAAAAG AT CTTTGTTTTTTTT AAAAAAAAAT AT AT CCATT AATGTCT ATT AT ATT 

ACTATTCTAATCTCATTGCACTGTTCTTTTTAAAATTT^ 

AGGGGAATGCGGTTTAGATACAAGTGTCAAGGGCGATCAGCAGGTAGCATC 

CCCGTCTCTCCAGGTAATGGCCTTTCTTCAGTATCTCACTTTTAAG 
2 0 GTGTGGTCCTGGCTGTCCTGGAACTCATTCTGTATAGACAAGACTGGTCAG 

TGCATTACCACCACCTGGCAGAAGACTCAGTTTAAC^ 

TGTGTAGGGTTATCTCTTAAAGGAATCTTTCCTGCnTTTAAAGT^ 

ACACTGACAGCTTTGGATCGGATCCCTCTTGTAACACAAAGCAT^ 

ATTATTGTATAAGTTAAAGATGGATTTGGAACATTTTTTGLACCTTT^ 

2 5 ATACCTTTATGCTTACCTCCCAAACTCTGAGTTATTAAC^ 

GTGTGTGTGAATGTGGACACTCGGGTACCTTAGTGTCCATGTGGGAGTCAGGACAGCATTCG^ 
TTTTGAGACAGGGTGTCTTTGCTGTTTTTCCACT 
CATCTCCTGATAGGAGTGCTGCTACTACAGGCTCAAGCT 
TATTAGGAATGATAAACAAGTGTTCTACCACTGTTATATATCCCTAGCCT 

3 0 AAATATACCATTTTTATAATGTATGTTTATATTTGATATTTATAAATTACATTTCCT^ 

ATTCATGTTTAGTGTTTTCAAACAAAATTGTTAGCCCCAAACTTGTAGTGATCTTGCCT 
TACAGACATGTGCAATCATGCCTAGCTAATTTTTATCGTTTAAGTGCCAATTAAAATCCT 
TGACAAGCATTGGGAAGCATAAACAGGTAGATATTGAGTTGTGGACTACTTACCT 
TTAAACAAGACCTTGTCTCTACAAACAAATGAAAACAAAGTTAATAGA^ 

3 5 TGGGAGGCTGTGGTGGGTAGATGCCTTGGGCCTAGAAACTTGTATGGATAAAAGGAAGACTATCT 

CTTAATTTTTCACCAATAGATCTGAAATTACTGTTTTGTAACTAGCTTATT^ 

ACAAAATTGCCAAGTTATATATAGATTCCCAAATTTCTGTTTTTATATTTTTAA 

GGGATGTCATGTAATATATAATACATGTATACATCATGTAATGCT^ 

TT C CTTGTGGAAACATTCAAAG TCCTC CT AGTTTTTTGAAAT AT AT AGTG T ATTTG TACAAT AAT ATGC CTAAAGG CTTTTGT C CT 
40 ATCTACTGTACGAACTTAATTTTAAGAAAATATTTTATATATGCACACACATACA^ 

GTGTGTACATGTGTGTGCACGCATAGGCCCATGGAGGCCAGAAGAGAGTATCAGATCC 
AACTGTGTACCATGAATTCTGCGGTCTGAACTCrrGGTCCTCTGGAAGATCAGC^ 
CCCATAACTTCATGTTTTTAAATCACACTATTTGCAAT 
CAATGAAAAGTTAATTTAAAATTCTTTTTTTCCAACATTGTGAGTCATT 

4 5 TCCAGAGCAGTGTGTCTTAACTCCTTACTCCCCTACTCCAGTTCATCCTGGTAGACCTG 

GTTTCCTGCTGTCTACAGGGTGCAGCCAGACACACAACCAGT^ 
TACTTTTGAAAGAAGGTGCCTCTGAAGAGGCAGGTCTTGCTCTCACT 
CCTTCACCTCTGGCGTGGGTACAGTCAGGGATGAATTTCAAAACCT^ 
GCTATGTGGTCTTAGAGGACTCAGGCTTT'rrrin'l'irrTTTTTTTTTGACACTTCT 
50 CTCTGTCTCTCTCTCTCTCrCTCTCTGTGTGTGTGTGTGTGTGTC 

ACTGTGAGATACAGAGATTGAAGTTGTTAGTGGCAAATGCCTTTACCTGCCX3CACCATCTTACCAGCCC 
ATCTTACTTTATTCATCAAAATTATATGTACAGTTTGTATTTTTTCTGTCCCAACAA 
GTTTCTACTCTGCCTTGGATCATTTAGTTCCCAAATTAAAAACTTAAAACC^ 
TGGGCAGATATCTACCCTCTAAGTTATTAGTATCTACTT^ 

5 5 TTGGCCAGCCCTTATGGCCATGCTTTCATGATTCACCTACCCCATGGT^ 

GTCTCCTGCCTCAGTCCCCACGCCCAGGAACCAAAACTTTACCCATCTCTCTTCT 
ACCAATCAGGGATAAGTTGGGGGTCAAGGTTATATTTGGTGTACATC 
AGCATTACAATACATAGCAACAGACCAAACCTCAACAGTTTTAAGTT 
GAACTATTTATTTTGGATTTTTGAGGCAGGGTCTCTTTGTGTTGCCC^ 

6 0 TGAACTCACAGAGATCTGTCTG CCTCTG CCTCCCAAACTCTGGGATT AAAAGTG CACCATAATG CCTGG CCTGCTTACCTGCTTTA 

AAAGGAAAATAAACAGATGATAAAGCTAGATAAAATAAGGGCAGAAAGAAACATTTTAGGAAT 
ATATATTCTCTGAAAACTTTAATCTCAAGTAAACTGCTGACTATTTGAATT^ 
TAAATTAATGGATCAATTCTTTTTATTTGTTTGCTTTTTAAA 
TGATCCTGCTCAGCCITCCAAATGCTGGGGCTATAGGCATATGCTACAATACCT 

6 5 TATATTTAATTTCCCTTTTCAGATTATGAACTACTATGGAAAAGGAAAAATAAGAATTA 

AGCCTCATCCTCATGATTTAGTTGGAAAAGATTGCAGAGATGGATACTATGAAGCAGAAT^ 

T AAGT AT ACAGTG ACAGAAACCTT C^GCAATAGGACAAAGAT AT AATTTT AGTTT AT AG CAAAAT AATTCTGT ATTT AACTT ACTG 
T AA£^TGT ACATT ACATTTT AATT ACACTTTG TT ACCTG TGGGGAGGACACACGTGCTTGGGGTGCATGTGGAGGT CAGAG TTGG C 
TCAGCTGGTGGAGTCGGTTCTCTTCTCCACCACGTGGACCCTGAGGATTAAAC^ 

7 0 TCTCACCAACCTGATATTAAAATTTTTGTTCTTACCACT 

TTCTTCCTAATGGCACCTAGACTGGGCTCAAACTTACTATGTAGAATGTCTT^ 
TGACATTAGAGGTAACACCACCCTGTCTGGCTCAGTAGGAGTATTTTTTA 

GCCTCCCC^TGOT 
7 5 GG^TTGCCTTCCAGAGGATTCAGATACrGAGAATTAGAGXjTCT 
ACGTAAG ACTTT AT ACATGTTAGG CTAGAAATCTG CCAACAT AT CC ACCACTTCTT A 1"1 1 1 T T AACTTTTG ACTGTTTTG AG ACAG

AATCTCACTGTGTAGCTCAGCTTAGCTTCCAACGTGTAGT 

TGCATTACCATGTCTGGATAGGTATACCTCITTATATTG 

GTTCTACAAAATAATTGTGACTATCCTGTAATACACAGACATAATCTCCAAAAAGT^ 

CAAAAAGCCCTAGAAGTACACACTTGACAAGAAATCTTAACGCCTGCATGTCTTC 

GTCATTTGlt i TCl'ltjAATACTCriTlTl^ 

CCCTG GTT ' r T^ATCTTTTGTCCCACrCTAGTGATATATATATATATATT 



TGCCTCCCAAGTACTAGGTCTAAAGGTX5TATGCCTCCACTGTCCAGCTTAGTGC 
1 0 CCTTCAGTCTCTX3TTCACAGTGTTCTTTCAGAGAGATGTGA 
TGTTTGTTTTTAAATTCrAGTAGTATCATAGGCCTTGTA 
AAAAGATATGTGAGTGAATGAACATGCATGTGTTCCAATACAAACTTTAT^ 

TTT AGAG TTTGCCAACCG TC CATTCCAG ACACT AAGTT C CATGAAGAG AGGAAAATGT AT CTC CATT C CCTTTGGGAGTTTTAGTG 
CACAATTGGAACTCCAGAACGTGTTGAATGACTATGAAAAAGTACATTG^ 
1 5 TAAGTCAGTGCITTCGCCTCTTTCCC V^ ^ 

ACTACTGAGCTAACCCTCTGTGCATACCCGATGAGGTTATTCAGATGAAAGAAAAC 

TTTCTTG CAGGATTTCAAGAAAGGATT AAGAATTG ACAATTT CAAAGTT 
ATTTTAGGTGCTAATGAAGTCACAGGTGTGAGTGCCAAGCATGAAGCATGAACA 
AGGCAGCCTCTGGCTGCAAGTGCCAAGTGAAGTTTGAAGAAACAGTGATGrr 
2 0 CACC CAT CT CAGATGATGTCATGGTTTTT CCATGTGGG AACAAG T ATGACTGG ATGGTGGT AGGGGAG TTCCCC CATCAT CA.TTTG 

TATTACAC^TATGCAAATGAGTAATATTTGTAGCTATTGAGAAG 
AGTATTAACCCTCTCTAATAAAAATACTTCATTTCCTATATTGTAATCATATAA 

AAAAATTGAGG CCAGGCT AAAGT ACACAG CAAGACCTTGTACAG CTGTG TTTT CATAT ATTTCATGGT ACATTTTTT AG TGATGAG 
CTAGGCAGGCATCATAACTCAGCAGAGGGAATCAAATATAAGAAGGCTAGCCCGGTTCCT 

2 5 TGGGGTTACAGGTGCTCAGCTCTAACAGATTGTCTCTGTG 

AGCTGATGCTCAGCTCTGATCACCTCACTCTTAAGTTTGGTAATGTATA 
TTTTTTAAAAATACATAGGTTAACACTGCTATAAGTTTAGCAAAACATACCTG 

TG AT ACAC CAATTTT AGTAAATTGATCATGAGTTTGTTTGGAAG GTGAAACTTTGCTTCTn^TGGl"rri > GTTTGl ,, rrrACTTGTA 
ATGTTTATCTACCTTGGTATATTTGACTAAGCCTATTGGCAAAATGTAATTA 

3 0 ATATACTTTTTTAGTTTGGGAGAGGTTTTCCCATAA 

TCTCCTAGTGCTAGAATTAAAGGCATGTCACCATGTGCTCTATTTTTCTATTAACAATTCT 
CTTCTATCTTCTTTATTATCACAGTCAGAAACAGGATCTGTGACCTA^ 
TGATTTTTGTCATACTTTTGTTTAACTATGATGTGATTTTGAAAGTTTC^ 
TCTGTAAAGAAAAAGGAAGTGAAAGAAGCTATTATTTTAAGGATTAGTGCAGGA^ 

3 5 AATTTCTACTATTTGTATTATCAGTGGACTGTGATTTCTATTTAT^ 

CCCCTCCCTTGTTGTTTATTTTGACGTGAGGTCTTTATATGT^ 
TGAACTCCTAGCAATCCATCCTCATTGGCCCCCAGGTTCTGGGAT^ 
CAAAACTGAAACCTTCTCCAAGTGTTTTTGATTCnTTCAAT^ 
CTTTCTGAGTGTTGGATGGACACTATGTTATAAAGTATTTTCCTC 

4 0 AGCCAGCTTGAATTGCTGCTTTGCTGAGGCCGGCCTT 

TACCTGGCCACTCAAAATATTITTTAGACTTCT 
ATAGCTGGAGGTCAGATCTCTCAGAACTTGAGTTGCAAATGTA 

CTGCAAGAACAACAAGTACrCTTAGGTACTAAGGCATCTCTTCAGCCCTTCAAAATATr'r 
ATAAATCTTACTGTTTAGTACATTTTATTGTGAATAGCACTTACTAAGTGGTTTC 

TTTATAGTTTAAGCATTTGATGTTTGTTGTTGTTTT^ 
TCAAACTCACTATGTAGCCAAGAATGACCTTGAATTTTCTGATCCCCTG 
TAACACITGGCTTATGAATCATTGGAGATTAAACTCAA^ 
GGCCTTTATGACACTCTTGGCTATGGC^TCTGGTACCTTTAGCATT 

5 0 GTTATT CTATGTACCTATTTTAGTAGTTTTGTTTG CCTAGGCATGGTAGAACACACGTGTAATTCAAGCTACTTGAGATTGCATCC 

TTAAACTTACTGTGTAGCTACGGGTAACCTTGAATTCTGTTTGAGCCTT^ 
TCCTATCTTCAGTGTTTAGTGAAAATACAAAGATGTAGCAGATACACCCATT 
GTGGCACTTGTAGGTGACAGTCATGTAAGATTAAAACCTAAGTAC^GGGGG 
GATCTTCCCGGGGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTC^ 

5 5 GTACTTACATATAATAAATAAATAAAATCTTAAAAAAAAAAAAACCCTAAGTACAGATCTGGTC 

GATATTTTTTGTATTCCAGGATAAAGTTTAGGCTTTATAG 

ATGAAGATCTGAGTTCAATCTCAGCATCCCTTTGACAAGAGCTAGGTATTCT 

GAGCTGGCAAGGCTCAGTGGGAGACCCTCTCTCAAAAATACTTAGTGCT^ 

T ATCTG T ACAT ACTG T AT AC ACACACAAAG CAG TT ATT AATTG AATTTTCAG CTTTT ACTCTG ATC CAGATATC CCTTGT G ATTCC 
60 * TGGTTGTCAGTAGTTTTTAACCATGTCGTACAGAGATGGGAATAATATTGATAAA^ 
TTCAAAGCAAAATATCCCAAATTATAATCTTTTGTGA 
ATATTTTAGTTTATTAGCCATTlTATTTTTCTrTAATAAATCA 
TACAGTCTTTCATX3TAACCTCAAGGCAAAGAAACATTTCCCCCACT 
CCAGCTCTTGCCCCAGTCAACAGCTCATCTCCTTTTTGT^ 

6 5 T^ATATTCTATATTCTTTCCrcCCTGGCTCCACTTGGTCTTACTGGTTT^ 

TTCrTTATTGCTGAGTATTTGTTCATTGTATGACTATACT 
CACCTAGCCATTCATTGTTTTTGTCTGTCTGTCTre 

ACTOAAACCTTGAGCTCTTAAATTTATATTGCTATTAAATGAGACTTTCAGTAGTCCTG 

CAACTGTGCT ACTGTG ATACAGGTG CTG C C CTCTT AG CATTTTGAACCT CAT ATTTT C CCCTTGAACATGGT ATGAGGGTGCGT AC 

7 0 CTAGGTATGGAGGCAAAAGGACTGCCTCAGATGTTTGCCr^ 

GGCTTAGATGGCGACCAGTACCCTGCTTCTGCCTTCCTGGCACTGAGAATA 

ATTGCATCGGTTGGAGGGTTGAGCTCATACTTGCAAGGAAACT 

TCTAGTTGTACAAGTATGTTTTCATAC^CTCAAATGTTAGGCTT 

CCTGAGACTATTTGACrGGCrrTTGTGTGAGAT AlXjlTr ' ^ 1 1 1 AAATAAGTTATCACCTGTAAGACT<XyyiAATTTAGCTAATTATT 
7 5 TTTCATATTTAATCTTTTAATTTTAATACTAGTG^ 
TATGG CAT AT ATT AHTAACAATTT ATTGCT AT AGG CTT AAAATATTTC AAAACG C AG CAAT AT TTTGTTTT AG AAATGGG CTTCTT

TTCAAATATCAATG CAAAG AGACT ATTGT ACTAT ATTCTGTGGCAAG TTTTTGT ATTTC I I I r i CAGAACTGT AAAAATG CCCTGG 

TGTGCAGTTTG TTTTT AAG CTC ACTTGACTT ACACAT AAAGTTATATCATCTGAT ATC'I 1 J i l u rCACATTG CAT CT CT AAAATG CTG 

GTGCAATATGTTAATAAGTTAATATGACCCTCTGCTTTCAAATTTC 
5 AATGT CAT CTC CTT AAAATCAATG T CTT ACCT ATAAGAACAGGGATGATGGTG A 1 1 Hi GCTAGATTTAGAACGTAATTACAGACT 

GGGCCAAGTGTTATGTTAAATGAAGTATX5CATGTTTCCCATTGTCT 

CAGTTTATGATTCTGCCCTTAAGTTCCTAGTCCCIXOTATAGTCT 

GTAACTGTACAATAATGAAAACTCTTCTTGAGATTGAACCCCATCr^ 

TGTTCTGGGCAGAGAGAAAGACAGCAGGATTTTGCT 
1 0 ACTGTTCAGTCCCTGAACAGCAGCTGCTGGACATTGAA 

GATGAACATGGTAACITCACAACTGCTCTG CCTCCCATTGTTTCTAACCCAATTTATGACAAC CGTAAGTG^ITTAA i-rrrCTT AT 

GTCTGT ACTTTT AGCTTGTTTCATTT CCTT AACTGCTG A^ CAATTTGAAAAGG AGT ATGTTAGGTGGG 

ATTGGTGGTTTGTGCATATAATTCTAGCAACCAGAAGGC^ 

ACCTTGTCTCAAGAAACAAGAAGAAGCCAGGGGTGGCGGTGCATGTCTTTAA 
1 5 TGTGGGTTCAAGGCCATCCTTXSTCTACCCTGTGATTTCCAGGATTAC^ 

ACAAAGCACTGTTCATTATAGAAAAAAAAACCCTCAAGCATATATAC^ 

CT AAATGG CCACTGTCCTCAAG CAT ACACTGT AGTC C CTTTG AAT AAG CACAGACACCACCATCTTCCTCAT CAACCCCATTCACC 
CACrrAAGTTTGGCAACTTAAAAAAAAAGATCAATC^TATTTTAAAAAAAAACA^ 
CTATAATAATAGCACTTGGGi^GTCGAAGCAGGCAGATCAGGAGTTAAG 
2 0 CCTACTGTGTCACATGGGATCCTTCCTCAATAGATAGAGACAGAAAA 

TT CAATGACTTTTTCTTTT CITTTT^ CAGGACATTGT ACAGAG CT AT ACC CCAAACC CCA 

ClTCCTrCTTCTTCCTTTCTTTGTGTTGCTT^ 
TGGTTGTCCTAAACTCACTGTAGATC!AGGCTGGCCTTGAACCACAGAGATCCACCT 

2 5 CTGGTATTAAAGGCCAGTGTCACCTTGTTCAGAAGCATATTT^ 

GTATAAAAAATAAAAAAGATGACITTTAAGAATCATATTTAAGAATATATT^ 
GTAGATATATTTATGTGTGCATTTAAAGACAACTCTCTATGGATCATGTATCT 
TACAGGACTTGGCTGTGCTCTTTCATCATGTGTTTCCCTGAGATCAAACrCAAGTTGTCAG 
GGCTCCTGAATGTTTATCTCTTATGCAAATATTTCTGAC 

3 0 TATAGAAGTTTATTTGAAAAGATTTTAAAACTTTCCTCCAAAATATTTTX3TGTA 

CACCTGATTTGCIATGTAGAATACTGGACCATAGGAGTATAGAGGTAGATTATTCTTAACAGAGAGCCTTG 

ATCATAATAATATTCGGAAGATAAATGAGAACTGCATTGCCCTAGGGATATAGCCAGTGATATAAAATGACTCATTTATTCACAGT 
GTGCCTTTACCAAGAAACAGGAGGAAGAGGCAAGAACTGCTCAAGTTAAAAGGCC^^ 

AAAGAGACTGTCT CAAAAGAGGT AGGAGTTGAGGG CT AGCAAGCACCTG AAGTTGTCCTCTGACCTCTTCCACACAT ATGCTC 
35 CATATGCTCACCTGGGCCTACATATAACCACACTCAGACATACTATAGTTATAATACATA 
CATAGCTATAATACACAC^TAACTACACTCIAGACATACCTCAGCATAGCT 
CATAGCTATAATACACACAAAAAACATTTAGTTTCCTTTATCTATACATTAAAATTTTATCTA 
TTCTTCCCCTTGTACITGTACTGGCTAGTTTTATGTCAACTGACAC^ 
CTCCTTGGAAGATCTGGATATAGGAAAACCrrGTGGGGCATGTT 

4 0 AGGGCCGCCCCTGGGCTGGTAGTTCTGGGTTCTGTAAGA 

TCCAGGGCCTCTGGATCGGTTCCGGCCTCC^GATTCCTGCCCTCACTGCTCtf 
AGTACACAC-TTGCTCTTTTCAGACATGCCAAAAAAAAAAAAAAATCATCCC 

GG AATTGAACT CAGG AC CT ATGG AAGAG CATCTG AAC C CTCTCT C C CAT C CACC CTCACTG CTTCTGATG ATG CACTGTT ACATGG 
AACTATGAGGGAAATAACCCTTTTCTCCTCAGGCGACTTTTGGTCA 
4 5 CCGAGACCCAATGTTTTTTTTTTTAATCCTTTGCATTTAGGTCCCCCA 
CTGTGGAAGTGTCAGGGGAGGAGATGAAATATTTCTGCTTTGTGAC^ 
TAGTAGTACATATTTGATTAATATATGTATTGTACCGTTTCCCCATTGTTAT^ 
AAGTTCCTTTTGTGTTGAATGACTGGGAAGCCAGAGGTGTTTTT^ 



5 0 TATGGATTTCAGATACCTGCCAGATGAAAAAGGTATGGCAAGTTGGGAGTAATG 

TGTGTGTG CTCACTAGAAACAG CTATGTAGCTACAGATGGATCTT AG CTGAGTGTGGGGATTACAGCJ^TGTGCCTTGTGCTGTCC 
CTGCTGCClTrTTAAATCTGCTGCTAGGAATTGGACTTAGACTTTTA 

CTGCTGAAGGACAGATCGTCTAAAGCTGATCATGCTTGTAATATTGTAAAATAAACATTCGAGTGCAGTTTT^ 
TGTA TTTTTTTT CCCCTAtSAAGTTTrCACT 

5 5 AAGAGTTTGAATAATTTTAGAGT^^^ 

CTCTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCGAACTCAGAAATCCACCTGCCT 

TTATATTTTCCATCTATATATGTATTTTTGAATTTA 

AAGTTGTATCTATTTTGGTCCAATGTGTTAGTTCATGTTGACTTACAGT^ 

AAAATTAAGAGTAAAGTGTCCTTTATAACTGGTACTGGATGTATTGTACAGTAGC^TACTTCCAG 

6 0 TTAAGC^TTGCCACACTCTGATAAGGCGCTTGTTCCTGGTGCAGGAAGGAAGAATC^ 

CTTAATCACATGCATATGGCAATAAATCAAAGAAACAAAAA 
ATTTTGGATTGAGTTATAGTTAAGTAAATGTGTCTTATTTGCTTTAT^ 
ATTACACTTAATAATAGAAAAACTTATCATACCAGGTCACTTTA 
GAAGATTCATCAAAAAAGGTATTTTATCCACTTTTTAGGAATAACT 

6 5 TCACTATTGAGTGCAGCTAGGCTGTACCTCTGCAGGTTGTGTTGGAACTAATTATATTTGAGGGATTT^ 

ATTTTCTCCACACACTGGTGCITTTGCTTTT^ 

TAGCCCTGGCTGTCCTAGAACTCACTCTGTAGACCAGGCTGGCCTCAAACTC^ 

GATT AAAGG CG TGTG C AAC CACTG CCTGGCTCATTGGTGCTTTT ACT AATTTG ACTGTT AC CATGT AAG AAATGG CATGGATG TGG 
TAGGCACCTGTTTATAGCTGCTTACACTTTGAACAAGGCTGTGGCCCTTGGGGTCATGTACTG 

7 0 GGTAGGACCACAGGCATGCTGACCAC CTGCTGTCATGCCCAGACCTAGAGTCCCCTCT CTCTTCTGCTACAGCAACAG AGCTGGAC 

AGTTCCCACAGAGATTGCTTCCCACATAAACCTTAAAACrc 
CTCTCTTGTACCCTTACATTGGTGCTTTATCCTGGTTTCTGTAGGGCT 



CCATTCTCCCTAAAAGCAGAGACTAGAAGATCATTTTGGCCACTAGGTACCTG 
7 5 ATATGGCAa^AGTUWSCTGGATTTTCATATTTAACTUGAA^ 

TATATGGAAATTTGCATGTTACAGACTTTAGTTTTGAATTCAAGTAATAATTTTA 

CT AGAAAGTTTTT AGTATGTTT1T AT AATTTCAATT CAG AAAT AATGC1TT AT ATCAATTTATTCAT CTT ACTCT AT AAAAT AT AT 
TTTTTCCTCTACAGAATCAAACITATTTTCTCATGGTACAGtT^ 
AACCTTACTATTCTTCATGCGGGTCCATCTCGAGTGGATTGCCACATCAC 
5 AGCTGGCCACCAGTGACTCACCCCACCTCACACCCAGTCAGTA 

TT CACAAGGT ATCCTT C CATTC CT AGAAGGGC CTGGTGT CAG CGACTTG AGTG CATCT AATT CGTGC CTTTAT AATCCTGATGACC 
TAGCTAGAATGGAAACGCCATCCATGTCACCAACTGACT^ 
GTGATGGCACCCAGCACTGATGGCATGGGGGACACTGACAACCCAAGACTTGT^ 
CAGGCTAGGCCOVAGAGACCTGAGACAACTACATCAGATGTCTCCTGCCAGTCT 
1 0 TTTCACAGTCAGATGCATTTGATAGATCAAACT^ 

AATAATCCTACCTITGTrCAGAGCAGTCACTATTCAAGTGTTAACACT 

TTTTT r I AAAATATAAGTTGT AGGACTT AAAT AAGTAT AT AGAAT ATT AAG AG AAAGTTCT^ CCGCT A 

CTGTTTCTTACTTTGAAAGATATGCCTAAAGTCCACGTGCTTGCTC 

TTTGCTAATGTATTAGTTAAATGAGATGATATTTGCTTTCAAGCAGATTTAAGGTG 
1 5 ATAAACTCATTATTATTGTTGAGGCCCATAGGCCAAAGTGACCCCTAA^ 

GTGTGTTAAAGTAAGCTTTCAAGGCATGGAAG'rmTT'l 1 Til 1 1 CTATTTTCTATT ATTTAGGCTTTT ATCTTTAAAGTTACATG 

TGAAATT AG CTTGAT ATCTGT ATTTTCAAG AAAAG T1TT CTT GAGCCTITT CTTT1T AG ATGTTCTCATTCCGTTTGATCTC CT AA 

AGCCTGTCTGCTTCACTATAAAGAATCTAACCAGGTATTTTGGGCTC^ 

CCTGCCTGAACTACATAGTAAGTTAAATACCAGCTTGAACAGCTTATGGA 
2 0 ACTCAGTGGAGGAGCTTGCGTAACATTTtSCTAGGCCCTGTGTTCAGC 

TCAGTATATCCATACAGAGTTTTGAACTTCTCTCCCAGCCTTGTGTCTCC^ 

CTTCCXXSTTGCTAAGCACCTTGGTGCGCCTTTATTCT^ 

TTTC CTCAGT ATGGAT CCGAAG CC CAC CGTCrCCTTTGAACTTTTGTGTG AG C CAACGA 

TGAACTCAGCCTTCAGTTCTCCTGOTCTCCCTCGGGAGTGCTGGGATTACT 

2 5 TTGTGTGGTGCTAGAGACAAAAACAGGTCTTTCCTGGGAGAAGTA 

AGCATTTTGTATTAGTAAAACTGATAAACCCAGACGCAGCTAT^ 

AGTTTTTCCT CTAAGTTGAACT AAGGTCTGTCT ACATCTG CATTT AG AGTATTTTCAATTG ACGTTTTATTTTCCA 

CTG ATC ACAT CATTCAGTTT AT AC CTT ACAATG CATTT AG CAACATGTTTTT CCTTTCTCT ATGG CTG CGTTTGT ATTTCACATGG 

TTTTGAAAGAATTGGCCirCTTTTCCAGATTCTGCTTGTGCACAGCG 

3 0 TCCTTTGCATAAACAGAGTAAGTATCTTCTTGAGCTTAAAAGG 

CCACAGTGCCCTGTCCTCCACATTITGCCTCrTCCAT 

GGGACGGGGCTGTGTTGTCTGCACTCTCCTCAGCACCGCAGC^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 

3 5 TTAAGG T ' l ' T T T T 1 1 "! 1 1 1 IGATACCTTATTCCTCTTAGACGTTGTAGTTC^AACATGTTGAATTTATTCCCAC^GGTGTCTATAGA 

TGTAACCTATTCTGCCTGTGCTCCCAACACITAGCTTTT^ 
GGTGCAGCTCTGCCTCTCTCTGCTCCTATATTACAGACCrrC^ 

TCTAGCATTTGACGTACACTCTGGCCrAACTGGTGCTGTATAAATAGTAGCTATTTATT 
ATTTGAGGATATAGTTITTAATGGGAAAGGTGCTTCIACCCTGCAGGAAAAGTAATGA 

4 0 CTTCTCTTCTGGTCGACTCCTGCTCTTCCTCCTCCTCT^ 

TAT ACAG ACACTGG CACATG CTT CATGGT AGAGCATGG C CCTGTG TTT GAT ACCC^G AATTGCAAC AACAACAACAAAATT AT AAA 
ACAAAAGTATAAAAACTACCCTGAGGCTCTTTGATACCCAAAGCTGTTCTCTTGGAAA 

AGAAATCCTG CCTCTGT ATGG AAGGTTAT ATCCTACAT ACCCCAGCGGT ACCCCTTG AC^ CATAGACATGTAAGTA 
GCCTGTATTCATAACATTTTTGTAACTGTATATTGTACAATGTACCCTGTTCCATC 

4 5 TTTGTTCTCAAAGATATGTTTCTTTTCTTATGAAGAATCACATATATAGTCAA 

TTGAAGAATTTTTGATTATATTAAAACAATGCAACCACTATG 
ATTTTTGAGAGCCTGCTGACTTCCTATCCITTTCTTTTGC^ 
TGTTTTACTATGCTGTGTCGTGTTTCCACAGTCTCAGAAAAACTATATTTTTG 
TAATTGTACCATATCAAGTTTTAATCATAAGTTGGAATATTGAAAGTTTACCATAAG 

5 0 AGAGAGACTGATTTTAGTCAGTTAGCCTAAATGGACCTGCTGGTCTTCAGA 

C^GTCTGCAGAGTGACTTCAAGGGAACAACTTTGTGAGCTCCTATCrCAAAATATAAAG 
TTCAGTGATAGAGCACCTAGC!ATGTACAGCTAAAGACTCGGTGCCTAACTCCAAAGAACTAA 
GGTGTGGTTTGTGCTAAGAAATGTCTGAATAGAAACACCGAGTTAAAGATTGTC 
AACATTCTTAACCACTTGATACCCAGAGAGAAAAATCACAGGGTGGATAAACACTTT^ 

5 5 TTTAGACAAACX3TCTTTAAGTTGCCCCATATGGACCTGGCTCTCT 

TACTTTTGGTATTTTGTGACITGATTTCTTTGTG 

GGTTTCAGGTCTGTTCAGAGGTTGGATGTGCACGGAAGAGGATAGAATGCTTTTGTGTCTAGGAATG 

GGAAGCCCAGCAGTGGGCCATAGGCATGAAGGTGCGACCCAAGTCTATCCAGTT^ 

GCAGAATTTAAACCTCAAGGTCAACAGTGGATGAGATCAAATCTGAA 

6 0 CCGAAACCCACGCATGTCCTTCATC1ATCTTTTGTCTTTTTGGTTGG 

TTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTT^ 
ATGGAAGTCTGCTTCTGCCTCCCAGACTTTATTTATTTTTTT^ 
TAACACACTCCTTTAAAACCAGCACCCAGGGGGCAGAGGCAGGCAG^ 
TGTCAGGCCAGATGGGGATACACAGAGAAAACTTGTCTGAA 

6 5 ACAAACAAAATAAAAAGAAATGTTGACTCCTCAACAAATTGAGTTTTTAGCAGCT^ 

TATGGTAATTTTTTTCTAAGCTTCTTTAGCCAAAATGTAACrTCT 

CT ATG ACTT CACTG CTCTT AG TGAAG CAGCCAT AGTG AT AT AT CCTG T AG CTC CAAT ACTTGG AAGG ATGAGG CAGGAGGGATG CT 

TGAGGGAGACCTCAGCTAGTCAGTC!ACCAGAGAACTTTAAAACACrA 

TTTGCTGTTGTTTTGACCTTAAATTGAATATTGACCTGCCTCTCCCT 

7 0 ACCTAATGT AAAAT ATTTAGATTTCAG AGAGGAGGG GAGCAT CATG CTGGGATTG AACC CAGGGT CT CATACAAGTT ATGG TTGGT 

GTTAG CTCTCT ACCACT AAGTTGTCTG CATTTCAAACTGJVTTTrATATGCTTG ATGATGAATG AATT 
ATTTTTTTTTAAGTTCTCAAGATACTGGTACTTAAGAAGTTA^ 
TTTATTATATAGCATATTGTGTTACTTTTGTAATATCTT^ 
AATGCTTGTTACAGTTTTGCTGTTTGCTTACGAAATATCAGATTTCTTTTTT^ 
7 5 GGGGTAGCTATTTGGGATTGTTTAGGAATAGTAGGCATTTTTATAAAATGTTC^ 
AATAGTAGTCTGAAGAAGTACTTGAAATGCCCTTTCAAGGAGTCT^

ATTTCACTGGGGTTTTGGG CATGAAC CACCATGCTGTTT^ CACTG AGCATGGAAT CT AGGGACTCATGC ATG CTAAG CA 

AGCATTCTGCTGCCTGAGCTACATCTTCAGTCCTTGCAGGCTTTGGTCT^ 

TGCTGCATACTTCAAATTCATGGTCTTCCC^CCTCTGCCATCTGAGTCCTAGGATTTC^ 
5 ACTTCATTCTGGTTTAAGCTCACTTCAATTGCTTGGTTTTTCTTCT 

TTGGAATTATGATTGCTTTTGTCATATTAACCCATCCTT^ 

TCAGTTTGCCTTGATAGCATCTTAAATTTTTTCATGT^ 

TGGATTCTGCCATCAGAATTTGGCAGCAATTCCTCTCTGCT 

CCCAGAAACCACATGGTATTCAAGAACTAAACITACT^ 
1 0 TGCACATGATAAAT AT AAG AAATTT ATT CTAGAAAAAAT AC CATTTT C CACAAATGGAACCT G ATCTTTTT AG AGAAACTG C CT AT 

TTTGTGGCTAGTGTAGAGGAAATACAAAGCCTGGGATATTTTTATT 

AAGGGCATAGGAATGAGCTTGAGAATTCTCACTCCTCAG^ 

TAAAATGAGAATCTATTGGTGACAGTGCTGCTTTGTCTTGGTTCAGTCCCT 

ACACAACCCAGGCTTCAGTCCACATCCRTAAAGTAAGGAAATGAGAGAAGGCCCT 
1 5 GAAAAGAATGACACACATTAGCAXJTTATACITA^ 

AGTGAAAATCTGGTCTATCTGCAAGGCA.CTGAGTAGAGAGGGAAAG 

CTCAGGGACTAGAAAGTTTGGCAAGATGCAAGGAGGAAGCAT^ 

CTGG AG CTGAGCATAGC CCACT AAAT AACTGGGCTG AAG CTGTGCAACCGCTTCATG AAG CTT AG CATACACCGACT ACAAAGCAG 
GAGGGCTCGGTCATGAAGCCAGTGACAGACAAACTGATGGACAACCT 
2 0 GCTCACTTCCTGGTATGGTGGGTGGCCTGTGATTTACAAGATGTCrTCT 
GTGGTGCTGACTCAGCACCTCTGTGGGAAAGTTCCTTT^ 
AGATATGAAAGTAATTCTTITTTCATAATAATAAAACACAGAT^ 
TCTGGGAAGAGGGAATCTTGCITACTGATTGATGTGGGAGGGCCCGGCCC^ 
CTGGATGCTATGCAAGAGCAGACTGAGCAAGCCATATAGAGCAAGCC^ 

2 5 G CCCTGACTT CTGT CAGTG ATGG AAGTGT AAGAG AAAT AAACCCT'IT C1TC CTCATGT CCATTTT AAAAC CAG TCAGAAAAG CTT A 

CATCTGGACTGGGGGTTGTGGCTCCTGAGTATAATCCAGACACTTTGAGAG^ 
CAAGCTACAAAGTGAAACCCTCTCTCriTGTGGCCTCCCr 
AAGACTCAGGAGTAGAAATAGTAGATGAGCTGGCrGATGAG 
AATCATGACCTCATGGTAACTGAAGATGCTTGAGTGGGTAGGGAGAGA 

3 0 TGATAGATGCTTGAAGAGGCATCATCTTAAATGTTGCTTCTGTTTGCGAGTTCA.CT 

TTAATACTGTAAGACCCCCGAAGCTGGGCCTCTGCTCAAGTCCCGGGATGAGCTGGCCAGCACC 

CACCTGATGCAAACTGCAAGAGCCATG CAGGGCAGGGG AGTTCAACCCAGAGCXK3TGACAGTAGGGGGCTTTT AACAGCT CTGA 
GGAATTGGGAGGAGTTGGGGAGAGAGTTCTCTTCAGCATGTCC 

GAATTTTTATCTCCGTGTCCTCGGCACTGG AACGCAGGCATCTCTGG TTGT AAAGTATAGAAACAAAAAGGCTTTTTGCTGG CTAT 

3 5 AGAGAACAACGAGCTTCTTTCTGGCTGTATTTTTAGAGATCAG 

GAGAAACGCTCTAAGTTGGGGGTCTCACAATACCAGTGGGTC^CAAAACA 
CTTTTGTTGGGGGTGTTGGGGGATAAAGTAGGGGTCAATGTGCTAGTACAGGCCATGAAT 
ACTTGTGAGCTCTAGGACAACOWX3ACTA<^GAGAGAAACC<^ 
CTGTGTTATATTCATCTATGAGACTGGCAAAAATAAATTTAAAAAC 

4 0 TTTAGGAAATAG AAATTT AACTTGTTAAGTTACTrCATCTAACrn'G 

TCAAGTClTTrTTTCCTTGCTTTCTCTCGCTCTCCAGAGTTC 

AACACAGATAAGAAGAGAG AGCTCATTCTG CAGATGACATAAACATTG CTT CTAAG TTAAACAAAG CATAGAGAAGAGGAT ACACT 

GTAAGCCAAGTGACACAAACCCACAAGACTTCCTAG^CCTGTGTCCT 

CAACAC^AAATAATTCAAGGAATGCTTTGTTGGAAGTTCCTG 

4 5 AAAAACAATGCTCAAAGACTTTTCCCAAATTATTTAATCTTTTAG 

TT AACTTTT CACTTTT CATCAG TT AGTTTCTGTTATCT C CTTTTCAATTTT CTGACAATG AAAAACCCTTTG CT ATT ATTCACAC A 
TGATGAAACAAGGTTTCAAAGGAGCAGGCTGCCCTTTCATCGCAATCAAAA^ 

T ATGAATTX3TGTGCAAAG ATGAG AAACGTGAATAATCAAATG CT ATTCAAATCAATG CATCAAAAAAAGTCAAGAAGTTGCACTAG 
CTGATGAAAATTTTACTTAAGTCAAAATTTTATTGAGCTTCCCTTCT 

5 0 AAGATTTCTCTTCACCATGCCATTACAATTGACAATTATGAGATCATATA^ 

GATATCCAAGTTCCAGAAGCCAACCTCCTC^CTTTGTAAGCCTAACTG 
GCTCCTGACAAGGGTTITTAGAAAGGCTCCTGGAGACACAACATCAGCTCC^ 
ACTTTTGTTTCCTTGTTTATGTAATCAGATGTTAGATAGACCACCT 
CCAAAATATGTTTGTATTTGTTTAGCATTATCGAATTTTCAAATTTGTA^ 
5 5 TTTACAT AAAAGACAAAAG AGG TAGG CAG TGTCT AG AGT ACG CCTCCAGGGGT AAGGTGT CACTTG CTTTGCAGGAG ACACAT ACT 
G ACAATTTT CGATTGT CAAAAT AG TCACTTG AGGTCAT AAGG ACT ATGACAAATTGTTC CT 



MOUSE SEQUENCE - mRNA 

CACTAGTGACCGGAGCGCGAAGATTCGGGGGTGGGAAGGTGTGAGCCGCAAACCCAGAGGAGCC 
GGTGGTCCGAGGACAGGCGTCCCGGCTGCGGAGGCTGCTCIAGCCrCCrCACTGGTT
CTGCCTGACCGACTGCCTG<3CTGACTGACTCACTGACTGACTGACT 

GAGCGGCGCGCGGAGCCGAGCCATGGCCTCGAGTGGATATAATCCATACGTAGAAATAATTGAACAGCCAAGGCAGAGGGGAATGC 

GGTTTAGATACAAGTGTGAAGGGCGATCAGCAGGTAGCATCCCAGGGGAGCGCAGCACAGACA^ 

CAGATTATGAACTACTATGGAAAAGGAAAAATAAGAATTACATTAG 

AGTTGGAAAAGATTGCAGAGATGGATACTATC^GCAGAATTTGGACCAGAACGCAGA

GGTGTGTAAAGAAAAAGGAAGTGAAAGGAGCTATTATTTTAAGGATTAGTGCAG 
CTGCTGGACATTGAAGACTGCGACCTCAATGTGGTGAGGTGTGTTTTCATGTTT^ 
TG CTCTGCCTCCCATTGTTTCT AACCCAATTTATG ATAACCGTG CCCCAA^ 
ACTGTGGAAGTGTCAGGGGAGGAGATGAAATATTTCTGCTTTGTGACAAAGTTC^ 

AATGACTGGGAAGCCAGAGGTGTTTTTTCACAAGCTGATGTACA

AGCTATACTGGAGCCTGTGACAGTGAAAATGCAGCTGCGGAGGCCTTCT 
TGCCAGATGAAAAAGATGGATATGGCAATAAATCAAAGAAACAAA^ 
CACTTTACTGAGAAACCAAGAACTGCCCCTCn'GGGATCAACT 
TGGTACAGTTTTACCAGAAATGCCCAGGTCTTCAG^ 
GTGGATTGCCACATCACCXACCAGCCATrCCCTCAGTGGCCCATCAGCCT
CCAGTCAGTACAAATACC CTGAGT ACTTTCTCAGCGGGAACACTCTCCTCTAATTCACAAGGTAT CCTTCCATTCCTAGAAGGGCC
TGGTGTCACCGACTTGAGTGCATCTAATTCGTGCCTTTATAATCCTC 
CTGACTTATACAGTATCTCTGATGTCAACATGCTATCT^ 
ACTGACAACCCAAGACTTGTGAGCATCAATCrTGAAAACCCCTCATGTAATGC 
TCAGATGTCTCCTGCCAGTCTGTCCGCGGGCACCAGCTCCAGTTCrGl 1 1 1 TGTTTCACAGTCAGATGCATTTGATAGATCAAACT

TCAGTTGTGTAGATAATGGCCTGATGAATGAGCCTGGACTATCAGATGACGCAAATAAT 
TCAGTTAACACTCTGCAAAGTGAGCAACTGAGTGATCCCTTCACATATGG 
AT AT AG AATATT AAGAGAAAGTTCAGAAG TCAAAACTTT AAAAG CCGCT ACTGT^ 
GTGCTTGCTGTGAGTTGTTTTCTGTATTTCCCCAA^ 
TTTCAAGCAGATTTAAGGTGAAAAACGTGTAATGGCTATGCATTGGGTA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING 

ATGGCCTCGAGTGGATATAATCCATACGTAGAAATAATTGAACAGCCAAGGCAGAGGG 
GCGATCAGCAGGTAGGATCCCAGGGGAGCXCAGCACAGACAACA^

AAGG AAAAAT AAG AATTACATT AGT AACAAAGAATGATC C!AT AT AAGCCT CAT C CTCATGATTT AG TTGGAAAAGATTG CAGAGAT 

GGATACTATGAAGCAGAATTTGGACCAGAACGCAGACCT^ 

GAAAGGAGCTATTATTTTAAGGATTAGTGCAGGAATCAATCCTTTCA^ 

ACCTCAATGTGGTGAGGTGTGTTTTCATGTITTTCCTTCCTGATGAAGAT^ 
AACC CAATTT ATGAT AACCGTG CCC CAAATACTGCAG AATT AAGGAT ATGCCG TG TG AACAAGAACTTG TGGAAGTGTCAGGGGAGG

AGATGAAATATTTCTGCTTTGTGACAAAGTTCAGAAAGATGATATAGAAGTTCGTT^ 

TTTTTTCACAAGCTGATGTACACCXjCCAAGTAGCCATTGT^ 

GTGAAAATG CAGCTG CGG AGGC CTT CTG ACCAGGAAGTT AG TG AGT CT ATGGATT TCAGATACCTG CCAGATGAAAAAGATGCAT A 
TGGCAATAAATCAAAGAAACAAAAAACAACTCTGATTTTC^ 

CTG CCCCT CTGGGAT CAACTGGAGAAGG AAGATTC^T CAAAAAAG AAT CAAACTTATTTT CT CATGGT ACAGTTTT ACCAGAAATG

CCCAGGTCTTGAGGAGTTCOVGGGCAAGCTGAACCTTACTAT^ 
AGCCATTCCCTCAGTGGCCCATCAGCCTACAAGCTGGCCACCAGTGACTCACCCC^ 
GTACTTTCTCAGCGGGAACACTCTCCTCTAATTCACAAGGTATCCT 
TCTAATTCX3TGCCTTTATAATCCTGATGACCTAGCTAGAATGGAAACG 

TGTCAACATGCTATCTACTCGGCCTCTGAGTGTGATGGCACCCAGGACT

GCATCAATCTTGAAAACCCCTCATGTAATGCCAGGCTAGGCCCAAGAGACCTGAGACAACTA 
TCCGCGGGCACCAGCTCCAGTTCTGTTTTTGTTTCACAGTCAGATC 
GATGAATGAGCCTGGACTATCAGATGACGCAAATAATCCTACCrrTGTTCAG^ 
AG CAACTGAGTGATC C CTTCAGAT ATGG TTTTTTT AAAAT AT AA 


HUMAN SEQUENCE - GENOMIC 

AAACCAAAGCCTGCAT ATGAAAAAG CTCTAGAAG AC AACACATACAC ACAT ACT CACACATGCACGT AGGTGATGTCTGTATTAGT 

GAGGCAGAATTATAAGTGAGTTrTCTTCTTTTCTCCAATCTTTATT 

CACCATGTTGCCCAGGCTGGTCTTGAACTCCTAGCCTCAACTGATCCTG 

4 0 G AGCCACCATGCCCAGCCTTTCTTTAATGTTTTAT ATGTGT CTTTTTTTAGGTTG TAATTTAAAAATAAACATATTTTAGCAAAAA 

ATTAAAGACTTTTGAATGTTATTAAAGAATTTAGGTCCGGGAGTAGTGGTTCACGC 

GGG AGGGCTGCTTG AGC C CAGGAGTTTG AGAC C AG CT CTGGG CAAC AAT AG TG AG ATT CCATCTCAAT CT AAAAAT AAATAAGTAA 

AATCACTATCATACCTCTCTGCAGGCATAGTTATCACTTTGGGTA^ 

4 5 TCTAACGCCOVGTGGTGATACATTGTCACTCATATCAATACTGGTC 

ATGAGCATCTACAATGCH'AAAGGCACTGTACTAGCCACTCTAGGGAATAAAAGATGAACAAGAC 

GAAGTAAATGCACTGATTTTTACTTTTCCTACACAGGTATTG^ 

ATGGAACCTGTGAACGCTAAACCTTCATATATTTGCATTTTTATTAGTAAG 

GTCATATTTTGAAATGTGATGTGACATAGTGAAAAAGAAACAAAGAGTCTGGAGGTCTGGTTTCAAGTT 

5 0 T ACCT AATG AACACGGGCAAGG T ACITAAACTTCCTGTC C CT CATGTGT AAAACGAGGGAGATGAACT AAAT GATCT 

AC AGGGTTCCTTC CAACCTTAAAATTG AACT AACTCT CT CCTGTTG AGACATAAAAGAGG CACCAT ATG ACTTGAAGG TG ATG ATG 
ACrrCATTACTTGTTCTGAAAAATGTTAATTCTAAATTGGAACAAGTAATCACT 

T ACAGTTAGAGAAAC CCAAGTT AGCACR.CTGTGT CAT ACTTCT AGG C TG CT AT AT ACG C AAAACT AGCAT ACCACAGT AT AGGG 
CCTACACAATATAAACAAAAAGATTAAAAAGAGATTTTTTTT^ 

5 5 GTGGCACGAGATCGGCTCACTGCAACCrCTACCTCCCAGGTTCAAGCGATTCTCCTGCCT 

GGCGTGCACCACCATGCCCAGCTAATTTTITGTATTTTTAGTAGAGATG 

TGACCTCAAGTGATATGCCCGCCTCGGCCTCCCAAAGTGCTAGGATTACAGGCGTGAGCCACTGCGCCCTGCC^ 
TTTTTTAAATAG AGACAGGGTCTCGCTATCTTGCCCAGGCTGGTTITG A CTGGG CT CAAGCAATCCTCCTG CCTCA GCCTCC 
CAAAATGTTGAGATTACAGGTGTGAGCCGCTGTGCCCAGCCAAGAAAATre 

6 0 TTTTATGTACTAAGATTTCATTTCTCTGGGTGTTCATTAATCTTAT^ 

TTCCTCAGAGGTATAGTTTTTACrrTAGGGACTATCCCTTTCTCCTAAAGTGTGCT 
GATAGGAGGGACAGCAGGCAAAAGTGATGACAGGATCTGGGTCnGCTGCCACACA 
AGGGTCTTGCTCTGTCACCCAGACTGGAGTGCAGTGGTGTGTTCATAGCTC 
CCCACCTCAGTCTCTCAAGTAGCTAATATTACAGGTGCATGCCCCCAAACCTGGCTAA 

6 5 GGGGTCTCACTATGCTGCCCAGGTTGGTCTCCAACTCCTGGTCTCCA^ 

CTCATTCTTTATGG GTITr T r G' l I ' l ' 11 TTTTTTI 1 IGCCCTTAAGTTTTGTTrTTTACAACCrTATAAATCACATAATTGA^ 
AGAGATCTCAAACAATACTAATGCAGTAAGTAAACAGGAGAGCAG^ 
GTGGTTGAACCCAGCTGTTGCAGCTATTTGTCCTrAATAGGAAGGACACCT 
TGCTAACTGGGACTAGGGCATACTCTCGATAAGGTTTAATTTTTTGTAAGAGA^ 

7 0 ATAAACTACATAACGGAAGACAGGTAACTCAGATAATGGGGGTATCAAAGACGAAATTT^ 

AGAACCCCAATGAATCCAACAGATACAATCACTCTGGTTATTAG^ 
GGG CAAAG TACCAGTGT CTG CAACTTATTTTC AAAT ACATTT^ 
TAAACAGACAGGTGATAAGGCAAGTGCAGCAAAATGTTAATGGTGGA^ 

TCTTTCAGTTTGG(nXSTATATTTGAATT Tr iTlTia Tl 1 1 ' 1 ' 1 ' lT ' l ' l GAGACGGGTCTITCTCTGTTtSCCCAGGCTGGAATG 
7 5 CCAGGATCATGG CTCACT ATAG CCTTGACCTT CTGGG CTT AGCTGGAACCAGAGGTATGTGtCAC C ACAC CTGGCT AATT AAAAAA 
AAAAAAATCTTTTGTAGAGATGGTGTCTCATT^

ATCCCAAAGTGCTAGGATTACAGGCATGAGCC^CCATGCCIW 
ATTTCTGAGGGCTTTCCCTGTGTCAGGCACTTTGTATTAGT 
TTACTACCCCTATTTTACAGATAAGCAAACAGGCACAGAGAGGCTAAGGGAC^ 
5 AGCTAAGATTCAAATCCAGGCCTGTGTGTTTGCAAAGCCAGGGCCTTTAGCTGTG^ 
AGCCTCCATTAAATATTACTGAGTATTATAAAACTTCTAGA^ 
AACATTCACAAAAAACACTATCAAA.GAAAATAAATCAGACTTAA.TTAA 
AAGCCACAGACTAGGAGAGGATACTTGCAATGCATATA^ 
CATTCTAGAGAGGATCTGCCTCAGCTTTCAAGCCAAAGCCA 

ATCTCA TT T TCTTT C TA CCAAATTTACCTrCCTC 

ATAACTTTTTTCI X I CCCCTTTATCTTTCATAGATGTTACCCCCAATAAATCTCTTGGACTGTC 
CCCAAAGGACCCAACTGGTATACTCATATCCAGAATACATAAAGAATGTCTACAAATCAAT 
TTTTTTTTTTTTGAGATGTTTTCT 
1 5 CAGGTTCAAGCAATTCTCCTGGCrcAGCCTCCCCCGTAGCTG^ 
TTTTAATAGAGACGGGGTTTCACCATCTTGGCCAGGCTGGTC^ 

GCCCTATCATTATTATTATTATTATCATTATTATTATTATTATTAAGACAGAGTCITGCT 

TACAATCTCGGCTCACTACAACCCCTGTCACCCAGGTTCGAGTGAT^ 

TTCGCCACCACACCTGGCTAATTTTTGTATTTTTA^ 
2 0 TCAAGTGACCCACCTCCCTCAGCCTCCCAAAGTGCTGGGATTATAGGAGTGAGCC^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNKNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNIWNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNIWNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN>^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNIWJNNNNNNNNNNNNNNI^^ 
30 NIWNNNNNNNNNNNNNNNNNNNNNNNN^ 

• NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNl^ 

35 NNNNNNNNNNNNIWNNNNNN^ 

NNNNNNNMWNNNNNNNNNNNNTOn^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

>^NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNl^^ 

NNNTONNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 
45 NNNNNNNNNNNIWNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNMNNNNNNNNl^^ 
50 NNNNNNNNNNNKNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNJWNNNNNNm 

5 5 CAACCTCCGCCTCCCAGGTTCAAGCAATTCTCTACCTCAGCCTCCTGAGT 

AATTTTTGTATTTTTAGTAGAGACC^GGTTTCACCATCITGGCCAGACTGG^ 

GCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGCGCCT 

TCCT CAG AACTTTCCTCTCCCCCAGTCACTTTTCATTTTG AAATTCCTT 

ATGTACATATTTTAAAAAGCTAAATAAGCCTCITGCTAGTTTTGCGTCCTCAGAA 

6 0 TGGAAATGTGAACATCAAGGATGGTCCTCOVTGTTTCTGTCT^ 

CATTATTTTCCTTTGGATAAAGACAATTAATAGGTATGTAATG^ 
TTTTGGTCTTTGCACTCTCITCAGTGGATTGCCT 
ATTCTTTTCTTGTACATTGCGCATTTCGTGGAAAGAACAGTTTGGGG 
CCAGTTTTGGGCACTTGTATGGCAGGCTGATATTCCCCATTACCTCTCCG 

6 5 ATGGAGCTTTGGAGTCAGACAACCCTGGCTTTAAACTGTGCCTTTTATCAGCATTCTG 

GAACCCCGAATACTTCATCTATAAAGTTAGGAATAATAAATACGTCATA 
TATTCAAATGAAGGAAACGAAGTTATCCTAGGTCCTTTAATGCACT 
AAATCTCAGGATATAGTGTCC^CTAAACTTCAGTGCACTGCCTAATGAGACAACXS^ 
GGGTTTCAGTAGGTAAAATACGTAAAGTTCACAGTACCCTCGGAATTTAGATGGAG 

7 0 GT ATG CAAAT ACAGGAATTACAGAAG AAG ACAGGAC CCC CG CCG CAG CAC CACTT C CCC CT ACAGGG CGCAACGTCCCG C CG CTGG 

CGCGGGGGTGGAGATTTGCATGTGCGAAGTCGGGAAGAGGGGGAACCACCT 
AGAGTCGGCCCOSCGCGCCCCCAAAAGCTCCCCAGCGGCCGGCCCTCTCT^ 
AAACCACACTCGGAAGAACAACCTGGAGGAGGGGGATTTCTCAGGAGCCAGCGGCC^ 
TCCCGCCGTTCCCTGCGCTCCOSAAGCCCACCCCAAGCrrCCCGCTCCCTCGACCCCCT^ 
7 5 GGGCGTATGCGTGGGGGCCGGCGGGGAGCGTGOTCGGGGAC 
GAGGTCCXrCCCGCCCACGCCTCOXKSTGAGGGTTG

CGCCTCTCGGCTGGGCCAGCACrC^GCTCTCCCCGCTCCGCCCCCTGCCCCTG^ 
CCGGGGTGCAAGAATTCAGGGGTTGGGAAGGTGTGAGCCGCA^ 
CGGGGGACTGGGGGCCCCGCCGGCAGAGGTCCCTCGGCCTCCTGACTGACTGACTC 
5 GCCTGCGGGAAGGTGCGGGGAGCGGAGCCATGGCCTCCGGTGAGTGTTCATGGGGCGCGG^ 

CTGAAGGGGGTCCTGCCGCAGTGAGTTTAGGGAG CTCAGTTTTTG CTCATATGGTAAAATT 

GAACCTTAGAAATTCCCAACGACTTO 

TGTATATGTATTGAAGATGGTGGTTTTATGTTTTAGA^ 
1 0 AGTTTAATTTCATCAAATGCAAAGAGGTGATCCCCCTTGTACTGACAGTTGTA 

GTATTTTAACACGAATTAGAAAATCCXITGGTATTTTAAGTGTTAATGTTG 

CCTGTATTCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATAACCAGAGGTCAGGA^ 

CCCCGTCTCTACTAAAAATACAAAAATCAGCGGGGCGTGGTGGCGCGCGCCTGTAA 

AATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCGAGATCGCGCCCCTGCACT 
1 5 CTCTATTTAAAAAAAAAAAAAATGTAGTTTTTACTGTGA 

CCTTCCTCCTCGCTGGAAATAAGGGGAAAGAGGAGTTTTAGAATATCCGAGCTTTAAA^ 

TCACTTATGTGACTATATATTTTCCCTTAATAAATGCTCGATCT 

CAAGAAAGTCTCCTATATACACAGATGTGGTTGGGGCTGCCTGGAGCTTAAGG 

AAATCAAGTAGCCTGGGGTCGAGTTGGTCTTTTACCAGGATTTTGGCAAGGTGA 
2 0 GTGAG CAAAGGCAGGGAGGGCT AAATGCATGTT AAATG CTTG T ACT AAGAG AATGAAG AGGGGG C CC CGTTT ATGAACTTGACT CTT 

ATCCCCAAACTGCCTGGTCCAGTGAC C CT CAGAAGT AGG CTGATGT CT AGT AGAGATGAGGAAG TAGGAAG CTT AAAGAAC CTTTT 

TAAAAGGTTGAGCAG AACAATG C CAT AGCTATTG CTGT AGGCAG AAAAGG AAATTGGGGTGTTT CACATGACATTTGATTT CAGTT 

TGTTTGTGGTTGTGAGAATACTAAGTGTAGTTTAAAATATTTCAA 

ATCGTGTAAATTTATGAATTCAAAGCATGTGTAGATTAGAACTGCT 

2 5 AAAGATTTTGTTTGGAAGTAATGGAGAATGCCTAGCATTATAAGTAAGTTCA 

ATTGCATGAATTTAATACTTCTAATTTTTT^ 
TTAAAATTTGATACAATTTAAATTTGTTCATTGAGA 
AAATGTTTAAAGAAATGGAAAGTTTTAAAGGTTTAGAAGTTGAAGG 
AGTTGCATTAAAATGGGACTATTTAAAATGTAACAAGTTATTTAAATTGT^ 
30 AAAAAAAAAAATGTTTACAGGAATCTCTCTACTTTTCCTTAA 

ATTCTAAATTCCTATTCTGTTTTGAAACA.GTAATTTAACCGGAA 
CATTGTCTAGTTTAGATTCTGTTTGTAGTAAGGTAGTATTAAAACCTCAGAAAGTA^ 

ATTTCTTCATTTCTTACCAGATGTCATATGTAAATAGGTATATGTAGTTGAATATGTGATAGCTATAATGTAAA^ 
CATAATCTTATAAGCAAAATTATGTAAGTTAGAATGTAGTTTAAAATCATGTATATAATATGATTCT 

3 5 TAG CAGATT AATT AGGATTT CACAG CTTGT ATT AAG CTG C CTTT CGTTG CTTTTT AAAAAAAT T ACAC CTTG ATCACAGT CTTTTG 

ACAT AATT AAT AAT AGATAT AGTGAAG AAAAATG TTC C C AAACC CATCT AAGGT ATGCAT AG TAGG CAGTTGTTG CTTTTGTAT AT 
TT ATTCT ATTT AAT AT AAGT AAAATTAG AT ATTTT ATGTCT GAATTT AAACC^^ CAAAATGT CT 

TTGTGTGGCATATAGCTCTGTTAACT AAT AATT AGTGCTGTCTTTGGT^ 
ATTTTTCTTAGTATGCTTCTTCTGTGTCACACAATAGGAAGGAAGTACAGAAATG 

4 0 CTATTTGTAGTGTGAATTTCAAGAGAATTGTTGTTTTTCTCGATTAAAGGGG 

TAATAACAGAGCTGGAAGTTAGGAATGGATAGCTACAGTAAATGACTTCTTTTTTTAAGTAA 
TTCAGAAATGCAACACATGAAAAAATAGAAATAACAATGGAG 
GAGTTAATGTAATXaACTAGGGAGGAGCCAGACAATGAATCATAATTAGGA^ 
ATTTAAGGGACTCAGAATTTCATTTTGTTCCAACTTTCTCT 

AAATATTGCATGGGATAAAATTGTTTTTCTGAAATTCATATTTATCTGGGCCTCCT

GGTCTTG C CATTTG TTTTT AACTTT AAT AC CTGT AT AATCT AGG ACAG CTTTT AAAATTTATTT ATTTTACC CTGGCACGTGCAAG 
AGTTTTTTCAAGACTTACCTTTTTTCCCCCTTGGTAAC 

AGTCAGCCTGATATATTGGCATAGCCTTTGGCTGATGAGAGA 

CAAATTGAGTAGAGGTGATATTTGGAAAAATATATATTGTTAGCTCCZATTGGCT

GACCTTTATGAGAGACATGCAGTCTAGTCAGCTATTCTCAACT 
TGCAAGAGTTAAAATATTTGTGTTGTAAATTATTTACTTTTTAA^ 
TTCTTGCTTGCATTTGAATAAGGTTCTTTTTTAAGTGGGAGATAAACACAA 
ATTTACTCACGTCCCAGTCCATATTTGTACCTGAATTTTTAGATTACTTCT^ 

TAAGACAATTATTCTTAATCAGGAGTGTGTTAGAATTTCTTGGA^

AGATTTCTTTATCTTCTAAACATCTCAAGTGG^GTAAGGACATGTATGTTTTG 
CCAGTATGAACTCTGTTCTAGGGATTTCTTTGGATCTAAATACTCATTTGGCT 

GCATTTATTTTAGATTGCAGATTATGTG CTGTATCAGTTATTACCT CTTGTTTTTTAAATTTGGCACTGCC 
ATATTAGCATACCATTCATAATTCATTTTTATTATAACTTCATATAATTT^ 

AATTTCTATTATTTTTCAAATGGAGATTAAAGTTAAGAGGATCAAATGAGAAATGTATATAAAGGG^

TTACAATAATTAGCTTTTTTTTCTGTATACTTCATGCTGGTTGCTAATTCT 
CGATAATTTATCATCCATTTAAACATTTCAATGTAATTTTT^ 
AATAGCACTCTTTCTTGCCTAAGCGCAAATAAAAACATAGATATATCTTACAA^ 
GTATATTAGCCTTTTAAAATACTATTCAGAATATAGGGGTCAGTCCCAGCTTTGCATAGT 

AATTTGTTTAGAGTCAATTTTGGAGACGTTTTGGTG CTTTT C^

TTCCCTACAATTTCCTG ATTTATG TTTT AC C CT CTTG CACT C CAT AGATG ACT ACATTTCAACAC ATTTTAT ATTCAGG AAGAATG 
GCATTGTG CAAAG CACAGT ACTGAGG AAGG AG AT AACTGGTG CATTGTACTG ATGCATT AT ATAAATC^ CTG GGAAGTT 

TTTTAAAAATCATAAACACAGATAAATCAAAATAG^AATGAGTATGAG 
ACT ATTTTTTTTTCCTTAGTAGTGGAATCT AATT AAAAAATCTGACT 

T AAACAAAAG AC CTT AAGATTTTTT AGT CATTC^TTCAAT AGATGTTT AATGAATAT CTCTGT ACAAGG C^

CAATTTCTGCCTTCAAGTAGCTTGAAGAAATAAAACAAGTAGCTTT 
CAGTTATTTGGAATAATTGCTATACACAGAATACTATGGTAGTTTAAAGGAGGGG 
TTACAGTATACTGTGTTACTTGGTTGTAATACTCTACATTCCCCC^ 
AAAATAAATTCCAAGGACTTTAAATACTCAATATTATATGATAACTG^TATTTA 
TTTCAACAACTACTTTrCTCTTCT^TTTC
TTTTTGTGTTTTGTTTGTTTTTTTTTIT^

TGCAGCCTCCACCTCCTGGGTTCAAGCAATTCTCTTGCCTCAGTCT 

GCTAATTTTTTGTATT 1 1 1 AGTAGAGATGGG U ' i ' TT CACCATG' ri ' Tl CCAGGCTGGTCTGGAACTCCTGACCTCAAGTGATCCACCC 
ACTTTGG CCTC CCAAAGTGCTGGjGATT ACAGGTATGAG CCAC CATG C C CAG CCTCAGTGAAATATTTGAATG CT CACT ATGAACCT 
AGCAGTTTTATTCACACTCTCTCTTCCCAAAGTTAAAC^

TTTAAATAAGTAGTGGGCAGGGATTTATTTGGCCTTTACTTCAGCCAC^ 

TTTT AACAGTTTTT AAGAGGAAAATAT AC 1"IT CT T AAAT ACTCCT ACAAGG AT AT AT AAGGGTATGAAATATTTG AAATTG CAATG 
TGAATTTGAAAT CTTT AAAAT ATG CAAGGAT AAAATCGTGAAATACT CAT CTG CAT ATCTT ATGTTTTTGAGT AAAACT^IAAAT AA 
GGATT ATGAACTTGTTTAGGTT AAAGT CAGTTTTCTCTGG CTCTT CAT CTTCTGAATAAACAAAT AGCT TAG ATT CAAACACAG 
GGAGTTTACACCATACAGAATAAAATTGAAAGTTACAGTAATACT^
ATTCTTTAATTTCTGAGCTATCAGTGATTCTT^^ 

AAATAATCCTT CATT CAT CTT ATCACTGCCAT AT ATCAGAGATAC CCT CAT CATTG CC CACATT ATT AT AGTGG TCTCT AAACTGA 
TGACATTGCTTTCITTTTTCTATCTGTTCACTGTGA 
CAAAATATTCATTGATTTCATCACACATAAAATGAAATCCAC^ 
AAGTGTGTTCCATTTACATTGGTGATTCTTAAATAAACCCTCCTCTGTGG 1 CCCTAGAATACCTACCTTT

TTCCTCTTCATTTCCTCCXSCTCTTTTGCTTCATTAACC 
ATTTCCGCAGAATTTTGTTAGTGCTTrCCTTCCTTCTCT 

AGAGTGACAGTTGGGATACCACCATAGTATCACTAAATCATTGTTTTCAAGAAAGCATCTG 
TTTTTTATTTTTTTACACTTTAAGTT^ 
GTGCTGCACCCATTAACTTGTCATTTAACATTAC^ATATCT

TGGTGTGTGATGTTCCCCTTCCIGTGTCCATGTGTTCTCATTGATCAATTCCCAC 
TTTGTCCTCACGATAGTTTCCAGCTTCATCTATGTCCCTACAAAG 
GGTGTATATGTGCC^CATTTTCTTAATCCAGTCTATCATTCTTGGACATTTG 
CCGCAATAAACATACTTGTGCATGTGTCTTTATAGC^ 

TAAAATGGTATTTCTAGTTCTAGATCCCTGAGGAATCGCCAGACT

GTGTAAAAGTGTTCCTATTTCTTCACATCCTCTCCAGCACCrGTTC 

GATGGTATCTCATTGTGGTTTTGATTTGCATTTCTCTGATGG CCAGTGATG ATGAG CATTTTTTCACGTAGACAACATTTATTGAA 

TGTGTATGTATATATGACACTT05TAAGAATTTAGCAC 

GATTATTCCTATTTTACAGAGGAGGAAACTAAGAGATTTTTA^ 

GGTAGGATTCrAAAGGCCrCTTCTCCATACTGTTCTGATAAGGCCCACT

G AT AAG C CTT AGT CAGCACT AAAGTGATT AAGGG ACACATT ACT ATT ATTTTT^ CTT CACAC CCCT ATT AACT C 

CCCTACTTGCAACTACAGTTGAGTGTCTATTTACCTTTTCCCCCACA^ 

GCTTtATTGACATATAATTTATGTACAACAAACTGTACnTTTAAAGTATGC^ 

TCATTGTATATTACTTTACATATCCTAC^TTC

AGCATAATCATTTTGAGAGTCGTCCATGTTATTATATGTAGCAGCAGTTCATTCTGCAGATG 
AGAAAGTGAATACTTTTTTTTCTCTATTATGTCTTTAGAAAAGGTTTGCATC^ 
TCCTGGGGCTTGTTTTGATATACTGTGTGTAAGTCCTTTAATATTTC 
TTTAAATGCATTTTCATTTTCAGTGAATGGTCTTTTCATTCAGTAT^ 

CTC!ACTTTAGGAGTGTGATATTTTACTTCrGAATACrc

TCTCACGTGGGAGGCAGATATCTCTGAAGTGGTTATCTCAGAACTTCATATTGACGT^ 
AACCAGTCAATATGGTCTGAATGACTTGTTCTGACACTACATAGGGACTC^ 
ATCTCCAAGATAACATTTCGGTCTTGTTATTTAATGTTAGGAGGAAAAAA^ 
ATG TT AAAAT C CACT ATT AACTGCT AT ACT AATCTCACTG ACCT C CTCCTTTCT 

AGAGATAATTGAACAACCCAGGCAGAGGGGAATGCGTTTT

ACTAGCTATAGC!AATTATTTTAAAGGGCCAGTTTC!TTC^ 

AAACGGATCTGTCAAAAAGACATCTATTAGATAGATGTCTAAGGACATTTCT 

GCTCTATGGTGCTTGTGTACTAACCTAAGTAGAGCGACTCAATGCT 

GGAAC^CAGAGATAATAAGCAAAACCCATTCTCTGCAGAGGTGAGTGATATTATTCTGTAG^

TCATAATTCTTCTTGGCTAGTCTTAGGGCTAGAACIAlTT'l'TTTC'I'CC ITT T i C ITAACTTTATTGCTTGTCTCTGATTCTCTGGGT 

GAXTGACTGATCTGAAATACCACTCTATAAATAGAGCTCTTTCAC^ 

TGGTAAACATACTAATAGCTATAAGATGTACTAAATCTATCTATCTATCT 

TGGAGTGCAATGTCGTGATCTCAGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGTGATTCTCCT 

GGG ATT ACAGACG CCCACCAC CACGCCCAGCTGATTTTTG T ATTTTTT AGT AG AGACGGGGTTTCACCATG TTGGCCAGGCTGGTC

TCCAACTCTTGACATCAGGTGATCCACCCATCTCGGCCTCCCAAAGCGCTGGGATTATAGG 

TGTACT AAATCTATTTATTTTTCTATTTTCTA CCCAGG CTGG AGGGCAGTGGCGTG A 

TCTC^GCTC^CTGCAAGCTCCACCTCCOVGGCT 

TACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATG 

ATCCGCCTGCCTTGGTCTC CCAAAGTGKTTGAGATTATAGGTGTG AGCCAACATGCC CAG CCCAATGT ACTAAATTTCTAT^

TTTG T AAACT ATTTTT CTTTGACTTT CAATG AAGGG TTT ATC CTTT AAATG C CAATT AAAAACTTT AATGG CAAAG T AGATTTTG A 

AAAGTGTTGCTTTTTGTTTTTGTTTTTGAAAAGTGTTA 

AAAATCTATTCTGTTGATGAACTTAAAGGGAATTGCTtlA^ 

TTCTATTTTTATCACATTTTTGGCAGTAATATGGATTTGGG 

AAGGAAATAAGGAAATACTATTTGTCCAGTCTTCTGAGTCATCTCT

TTTCT AAG AT ATG T ATTTTT AC CATTCCAGTAGGTCCTAAACACITTAC^ 
CAACTACCCTCTCTCTCATACAGAAAGAAATGTAATACCTCCTC^ 

GTG C CCTCT ACCTTGT AAG CT ACC CATT CTT ACTTT AC CTTG CT CCTTTGTGGT ATG CTGG TTTTT AT AGTCC CTGCTTTGTTCTT 

G CT AT ACTCTCT ATGTGAAATCTG ATG TTGCT AAACTCAATCTGATT^ T AAACACTT GGGG T ATGCTT AC

TG AACAAAT ATTTTTGTGAAGTGAAATGTCATATATTAAGTAAAATGATCATATTT ATGTG CT ATC CTT AAAT 

ATATTTTCAAGGAATGTTTATGGTATGACAGTACGTTAAAGAGCTGGGCATTTAGTGA 
GATAAAATTTGGGTAGAGTTTTACTTGTTTTGAAAGGAAAATAAGCCCAGTCAAAT 
TATGTCAAGAAACTGAGAGGTAAGTTTTTCT 
TTTATCTATCTAGTGCCAACTACACTATAACTAATGTAAACTGAACTCTCATCT
TCATGTATTTAATTTCCCCCTTTTTTCAGATTATGAACTATT^^

CCATATAAACCTCATCCTCATGATTTAGTTGGAAAAGACTGCAGAGACGGCTACTATGAAGCA 
TTTGTTGTAAGTACACAGTTACAGACATCTTCAGAAATAAGA 
TTTTTAGCAGAATAATTTTCTCATTCTACTTCTATTTACTGTATTT^ 
CATTCTTT ATTAGTAGACACATCTTATGATAGTTAG CAGGAAG ACATTTTT ATTGTTATTAAAAGTAAACGTAAAC TTCTT GTATA

CTGTGATTTC CriTCriXSGGGTGGACAGTTTTTG AAAT AATT ATT CACT CTGT CTTTTTT TT TT l"i 1 1 ITT'nTTTGAGATGGAGT 
TTTACCCTTGTTGCCTAGGCTAAAGTGCAATGGCGCAATCTCGGCTCACTC 
CCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGCACCACCACGCCTGGCT 
TCCATGrTGGTCAGGCTGGTCTGGAACTCCCGACCTCAGGTGATCTGCCTGCCTCG^ 
AGACACCGTGCCTAGCCTCTGTCTTTTCATTTATAGTTAACTTATTATTATTA
GGCTGGAGTGCAGTGGCATGATCTCGGCCCATTGGAACCTTC^ 

TGGGACTG CAGG CATGT ACCAC CATG CCCGG CT AATTTrrGT Al M rri'l"l'GT AG AGATGGGGTI" CTTGTTG C CGAGGGTGGTCT 
TGAACTCCTGGACTCAGGCAGTCTGTCCACCrTGGCCT 

TACAGTTAACATAAAAATGCCTGGATGTGAGAAAAATCAACTGGAAAACTCTAGAGTTATT 
TTAACATACAACAATAGTTCTAAGATAAAAATAGTCATGTAATACTT"CATGATAGTGAGGCTACCAGAAG^

TGTT AGAATGT AAATT AAT AAAG ACG T CTGGTAGGCAATTT AACT ACATGG CAAAAG CCTTG AAAAATATT AATG CCT CTTG AG CCA 
GCTGTTCCACTTCTAGGAGTTTAGCCTCAGAAAATAATTAAGTGTGT^ 

TTGT AATGGCAAAAAG CT AG AAG CAAT CCAAATAGT AAGCAAAGGGGATT AATT AAAT AAACTG AT ACATTC^ CAATGT ACTC C 
TGTACAGCCTTAAAAATCATAT^GCCAATATGTCTATTGGTATGTAAATATAGTCACAATATAT^ 
TTTGTTTTrrTTTTTCTTTTTGAGACAGTCTTGCTTTC

CACCrCCCTGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCATGTTAGCT 
TTTGGTATTTAAACCATTAGAGA^GGGTTTCACCATC 
CAGCCTCCCAAAGTGCTGGGATTACAAGCGTGAGCCACGGCACCTGGCCAGA 
TTTATTTTTGTTT AAGAT AAATTTAT ACAT AT ACIATGATGTGT 

AAGATTGTAGGGTTTTATG1 M 1T1'TC'I'1TTAACTTATA1TTTATAATTCATTTT^

TGAACTAATGGCAACAGAAAAGAGATTAATAGTCTCCACATATATTAGTAATAACCATTTA 
TTACAATAATAACACAAAAGAAAATATCTAGTATTATACACACTTG^ 
TTACACTGTGATCZACTCCTTACTTGAAACATTTTTAT^ 
GCrGTTCTTTCTTAGTTACAGGCAGTTAAATTTTTGG^ 

TCTCCCCAATTCAGGATCCCATTAAGGATTACACACTATAGGTAAC^

TAACATGAAACATTTTGGGCAAGAAATCTATAGAAGTGATGCAGTATAA 
GATATTATCTTCGCATTCTATTATAAAGAAGAACTTTCCCT^ 
TCrrGTATTTAATATATTATAGTTATTCTGTTATTCrTTTGATTT^ 
TGTCTCCrGTGTU Tin ' l ' l XSATATGCCGTCATCTTTCTTTGAGCA Crrrm 

TTITCTTTTTTCT TT TTTTTTTTC

CCTCACTGCAACCTTCCCCTCCCAAGTTC!ATGCAATTTT^ 
CACCCAGGTAATTTTTGTATTTTTAGTAGATAC^ 

CTCCTGCCTCAG CCTCCCG AAATGCTGAGATTACAGGCATGAGC C ACCGTGCCTGGCCAACTTCTGCTTTGCTTGCCCCAG CCCTG 
AAGTCATCAGCTTTTTCCAGAGATTCTTGGTTCCTTCTGGTGGGGAA 

AATGTCTCAAAAGTACCTTAAGTAGCGGGGCACAGTGGCCCATGCCTGTA^

CTTGAGCCCAGGAGTTTGAGTCOIGCCTGGGCAACAGAGCGAGATTG 
AATATTCCCCAAACGTGATTTTACTGTGATCATATC^ 

GCX5ATAAAAGACCTATAGGTGTCATCTCTGACATTCCTCATCCAGGTGTACJ\CATTGAR 
TTTTGAATCTATCCACTTCACrCAGTGAAATTCATCACCGTCIAT 

TGACTGGTCITGCATTAACCATTTTGTACCTTTrCCAAA

ATGAAATTGTCrTTCCCTTGCTTAAAACATTTTAATGGGTTTATTTGGCTCT^ 

TAAGGCCCIACATGACCCACCTTTATTTCTAGCCrTCTATTAAACTCT 

TTTAGTTTCCTCTGTTTCCACTGCTCTTTTCTAGGGGT 

AGCATAGGCTCCCATAACATCATGTATACCTCCTTCATAAAACTTGTTATGATTGC^ 

AATGTATATCCACTTGACCAGAjGACTACCAGACTTTTTCTGTAAAGTACTAGATAGT

TCT CTGTTGCTCAG CT CTG CTGTTGT AGCATG AAAG CAG CCAT AG AT AATATGT AAAC ATGAGTTTGGCTGTGTTC CAAT AAAACT 
TTATTTGCAAAGACIAGACAAGATGTTGGATTTGGTCCATGAGTT 

AAAGGGATTGTATATCCCCATTACTTTGCAGAGTTTTTGGTATATAGTACGAACCCCAA^ 
AATGAATAATTGATTCAAACAATCTCTTGATCCTTCATTTCAGAATTTCAAAAATTAC 

GTCAAATATGAGATTATTCAGATGAAACAAAATGAAGTAATATAAATGTATTAATATGAAGAATTCTATA

AAAAA^AATCAAACCAAGAATCTTTCAAAGAAGTGGAAATACTTTT 

AAGAATTCAAGATTTAGCTGACCAAGCATAGTGGCTCATG CCT ATAAT CCGAG C ACITTCGG AGGCTG AGGCAGGTGGACTGCTTG 

AGTCCIAGGAGCTTAAGACCAGCCTG^GCAACCTGGTGAAACCCTGGCTCTACAAAAATACAA 

TACCTGTAGTCCZAGCrACrCAGGAGGCTGAGGTGGGAGGATCACCCGAGCCTGGGAG^ 

CACTGCATTCCAGCCTGGGTGGC^GAGTGAGATCATCrCAAAA^

AGGAAGGAAATGTGAAAGATATTTTAGATGCTAGCAAAATTGTAGATGTGAAGAACATGAGCATAAGAC^ 
AGGCAGACAAAGTAGCCTCCGGTTGCAAGTGCCATGTGAAGTTTCAACAATAG 
TC^CGCCCACCCAATTTTACTGATGTCATGGTTTTTCCATGTGGGAACAAGTATC^ 
TCGTCTATGCTACACATATGCAAATTAGTAGTATTTGTAGCTATTGGGAAGTTTTTC^ 

TTGTTAGCATTAAACTTCTTTCTAATACAAAAACATTTTAGTAATTTTTTATGATGAATGGCTATAA

AAACATATTCTACTGCTAATAACAGTGAATTCAGGGCT 

GTGGATCAAGTGT AGG AGG ACT AACAGG CACTGT AG CAAACTAAAG AGC CCTGAAT AAATGGGG CTTAGTCCCT AGCCTGTTGTTT 
CTGTGCCAACCTAGTATTGCCAGGTCTTTAAAATGTACA 

AATGT AGAT AACCTGTGCAGATTTTT AAAAAT AT AGG C AACACTTTTG TGAGC CT AGCAAAACATACCntSTC^^ CTTCAAC CTGTG 

AACATCTCTTTACAGCTTTTGCTTTACATTATCAAACAG^

TTTAATATTGCTTTACTGTTAATCTATATTTTGGTATGTTTGACATAGGCATATTA^ 
TATAGTATATATTATTTITCTTTTAACAGTTCTTAGAAGTACAACCCCACTT 
TTGCCCAGGCTGGAGTGCAATGGCGTGATCTCGGCTTACCGCAACCTCTGCCT 
CCGAGTAGCTGAGATTACAAGCATCTGCCACTATCCCTGGCT 
GGCTGGTCTGGAACTCCTGACCTCAGGTGATCCACCTGCCTTGGCCTCCCAAAGTGCTGG^
GCCATATAATCCCACTTCTTACTCTCrCATCTT^
GATAACTTCAGTATTGCTATATGTTTGATTTTTGCAATATTCCTTGTG 
GTTTCCAAAATTTGGGTATTCG ATGTG TGAAG AAAAAAGAAGTAAAAGAAG 
TTOVATGGTAAGTATGTTTGATAAAATCATTrCTATTTATTTGGGTC 
TTrTTTTTTTGAGACGGAGTTTTGCTCTTGTCGCCAG^

GGTTCAAGCAATTCTCCTGCCTCAGCCTCAGCCTCCGGAGTAGCTG 

ATTTTAAGTAGAGACGGGGTTTCACCATGTT AGCCAGG CTGGTCTCGAACT CCTG ACATC AGGTG ATCTACCCG CTTTGGCATCCC 

AAAGTGCTGGGATTACAGGCGTGAGCCATTGTGCCCAGCTATTTATATCrrCA^ 

CTTGTTGACATGTACCTGTATCTGAGGAATGACGTATACTATTTAAAATAGTT^ 
GAGAGGTrTrrGGTATTGTTGGAGAGTGTCACTGTTTTATATAATAAAACTTCAA

TTATGCCACTTCCAGCAGACAAC^TACACTCCCCCCTTTTGAGTGTTAAA 

GCTCAAGGGTAACACTTAATCCCAATGTGGAAGAAGAAGCATAGAAATAATAGAAATCA 

GTATATGACTAGATATTTACrcTTCAGTGAATTTTATTGGGGATAGCATT^ 

TGCTGTAAGTATAAATGCTTAATACTCCCTGGCTAACTGTTACTTGTAAGCACT 
CTAGATGCTGGCATCCTATACTTAGAGTAGCCTCCATTCTGAAGTCTTT

ACTGT AAT ATTTT CAAAGAAT AT ATT TCTTGGT CAT ATTTGAGACATC CTT AT ACTGTTGGTTTGT ATTITTCCTGG ATGT ATAAA 

AAT AATTT AAG AGAT ATGTTGAT AATTT AAGGAAT AT ATTCACAGG TAATT ACTGTTTT ATGAACCATAAGAT CTG CAT CTCTTT A 

GK3TTTTATTTTTCACACTATTTAGTATAAAGCGGAAAGTAA 

AAGTTTGATGTATATTTTTCTAGAATTTTCrTTATAC^ 
TATATATCCTATTCTACTCCTTCATTTTTTTCCACTCAGG

TTTTTTAACAGAATATTITATTGTTTGGGTAGAA 

GCTCTGTCGCCCAC<3ATGGAGTGCAGTGATGTGATCACAGATCACT 

AGCCTCCCAAGTTGTTGGTGCACCACCATGCCCATGCCCAGCT 

CAGGCTGGTCTCAAAACTCCTGAGCTCAGCMTCCAOT 

CCGGTCTATTTAGTCTTCTATTAATGAACATCTATC

TGTGTGATTATTrCCCTAGGATAAATTTTTGGAAGTGGAAAATTTC 

CAAATT AATAGGC CAGACTTTT AAAG AGG CAGGT AAGT CT AG ATGCAAT AT AAAACAAGGAAATT AT AAGTGAT AG CT AT CATTT A 

TTAAGCACTTCTTGTATGTCAGGGAGTAAACGTTACTGCTTTATGCACA 

GAAGAGAATGCCAAGACTTTTGTTATTTAACTCTCCCAAATCATGCTG 

CCTTGACTCCAGTGGTAAAGGCATTAAAAAGGAAGATGCTGTCTGTCGAATTT

TTGTTTTTTGAGATGCCCGGGCTGGAGTGCAGTGATGTGATC^ 
CACCTCAGCCTCCTAAGTAGCTGGGATTACAGGTGTGCATCACC^ 

TCAC CCAGGCTGG AGTGCAGTGTCACJ^TCTCCACTTG CTGCAACCCTCACCTCCTGGGTTCAAGC G ATT CTCGTGCCTCAGCCTC 
TCGAGTAGTTGGAATTGTAGATGTATACTACGCCTGGCTAATTTTTGTArTTTTAT^ 

CTGGTGTTGAACTCCCAACCTCAAGTGATCCTCCTGCCT

CGAGAGTTTCTGTATTTTTTGTAGAGATGGGTTTTGCCATATTCTC^ 
CCTTGGCCTCCCAAAGTGCTGGGATTACAC^TGTGAGCCACTGGGCCCGA 
CAATTAAAATAGTTGTTGGTAGATAGTAAGTGCTTTATATATTTTATTT CATTT A^ 
TTAATAACTCAATTTTAAAAATAAGAAAATGGAAGCATAGAAAAGTGAACTCAGGCT 

TGGGGGGAT^CGAGATCAGGAGTTCGAGACCAGCCTGACCAACATGGTGAAACCTTGTCTCTACTA

GGACGTGGTGGCACGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAAGCAGGA^ 
TAAGCCAAGATTGCTCCACTGCACTCCAGCCTGGGCGACAGAACGAG^ 
CGTCGTGGCGCACACCTGTCATCCCAGCTACTCAGGAGGCT 
GCOSAGATCACGCCATTGGACTCCAACCTGGGCAACAA^ 

GTAGGCCAGGCACGGTGTCITATGCCTGTAATCCCAGCACTTT

GATCAGCCTGGGCAACIATGGTAAAACCCGATCTCTACAAAAAT^ 

AGCTACTCAGGAGGCCGAGGTAGGAAAATTGCTTCAGCCTGGGAGGTCAAGGCTGCAGTGAACG^ 

GCCTGAGCGACGGAACAAGACCCCGTCTCTAAGTAAAGAAAAGGAAAG 

CTAGGATTTGAACATAGGGAGCCCCTGACTCTAAGGTCATCGTTTTAACAACT 

GAAGGAACAATCCTGCCAGTTAATCCTTCCCACTATTTATGTTTCCTGTTGTTGTTTA

GAGTCTCCCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGATCTCGGCTCACTGCAAGCTCCGCCT 
CTGCCTCAGCCTCCCGAGTAGCTGGGACTATAGGCACCCGCC^ 
TCATCGTCTTAGCCAGGATGGTCTCAATCrCCTGACCrrGTGATCCGCCC^ 
AGCCACCACACCCAGCCTGTTTGCAAATTCIATTCCTTCCCGAGCCTTTCAGCA^ 
TTAGTACTGTTAAAAATGTAGATTTCAGGCCACACTTCCACTAATTTGATAGGGCTGGTATGGGTC
AAATATTTTTAAAATAATAACTTATCCGAGTCA 

CCCCAATTCTTGCCTCCTCAGGAGACAGAAGGAGAGACCGAGACAAGTTTTAG 

G CAGGAAGG AAAGG AAG AAAAATACACTTGGAAG AGGGC CAAG CAGGTG ACTTG AAAG ACAGGTGTG CTGGG AAT CTG CATTTT AA 
CAAACATCTAC^TCAGTATTTCTTAACTCTTTCACCATGACCCACACT 

TACACCACACCCTTAGGACAGCGATACCCTATGTATTAGTCCGTTTTCACTCTGCTG^

AAGAAGTTTAATTGGACTTACAGTTCCACATGGCTGGGGAGGCCTCAGAATC^ 
CAGGAAGAGAAAATGAGGAAAATGCAAAAGCGGAAATCCCTGATAAAACCATCATATCT^ 

CAGT ATGGGGG AAACC AC CCCCATG ATTCAAATT AT CTCCCACTGGG TC CCTCCCACAACACATG AGAATT AT GGGAG TACAATT C 
AAGATGAGATTTGGGTGGAGACACAGAGCCAAACC^TATCGCCCCACTACATGCAGTGTACT 

TCCCTTCTTTTTCITTCTCTCTTGCATTGTTAGT

TTTGAAAATCTCTATCTGAGATGATTCTGATGCAAGTGATCCACAGACT 
TTCAGCTTCTCAAATATGGCCTTrAAAGCCCTTCATAATCTG 
COTACCCCTCCTTCCTCTACCACTGAGTGCACACATGCACTTACTrGTT^ 
ATGTTCTTTCTTACCTCTGTGCTTTTATACATGCAGTTCCTTTGCATGA^ 

CCTCCAAGTTTAAGGATCACATCGTCTTTAAAGCCTTCCCTGACCTTCTCTCT

ATATATGTGTGTGTATGTATGTGTGTGTGTCTGCGTGTGTGTATGTGTGTGTGTGTATATATATATATATAT 
1 Tr rm "n 1 1 1 CTTTTCCGAGACAGAGTTTCGCrCTTGTTGCCCAGGCTCGAGTGCAATTO 
CCACCTCGCAGGTTCAAGCAATTCTCCTGCCT 
NNNNNNNNNNNNNNNKNNNNHN^ 
NNNNNNNKNN^nWNNNNNN^^






NNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
TCTCGGCTCACTGCAAGCTCTGTCTCCCAGGTTCACACCATrCTCCTC 
CACCATGCC CAGCT AA1T1TTTTGTATTT1TT AGTAG AG ACAGGG TTC CAC CGTGTT AGC CAGG ATGG CCT CXXJT CTC CTCAT CTC

GTGTTCCGTCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCTTG^ 

GGTATACTCAGCAATTAATCATTGAAAGTTTTGTGTTA^ 
CATGCTACTTGTGGACAGGGTCTGTGATGGAATCTTGTTTCCATTTC^ 
TCTATCTTCTCAGCTAAGTGCCTTGAGCTATTGAGTTTCCT^

CTTT AT C ' rrrC'ri 'GCCATTTG TT CTTTAAAACT AAGAAACAGT CTCT AG CAAACGGGG CAT AAAGCAGTCT AATACTGTCAAATTT 
GAGGCTTCAAGTAAAGGATTATTCAAATTGTCTAATTGCCAAAGTCTAAAT^ 

ATTTTAACAGTGTTTACTT AGCT AGAACAAGAGTGG CAGTT AGACTT^ CTTAACTGCAAAAGCTGAAAAAAATTTG 
TAATTATATATCATTTTCTAGTTATGTACTTTGGTAATGAAATGTTAGGATT^ 
TAAACTCTATATTAAATTTTGTGCAATAAAATTGATTTTGACAATA

TCCCATGT AGAGAATCTTG AGTTGTT ATGAGAGT AAAAGGCTG CT ATACAGTTG AGTTTGGGAAAACT ACATT ACTTAGT AG T CTG 

AAAGGGAATACAGTTTAATAAGTAGTCTAACATAGATTCTTATCTCTAAA 

CTGAAAACTTAAAGTTTGAAAAAATGGCTCATGTGTACTTCATTGTCC^ 

CATT ACC CT AGGCAGAACTTGTGT ACTTT CTTGCTTT CAGT ATT CCTTG AG CAT ATTTCCATT AT AAAGTTTTTC ATG TTGT ATT A 
TATAATTTTAGTTTATGTATCTATTTCTCATCCAGATACTCATCTCACT

CTTT ACTATGGT AG CACAT AAT AG ATACTCAAT AAGTATGTTGAAAGGTTGAG T AAATT AATG AT AT ACTT CT AGTAT ACTTCT AG 
CACCTTTACTTGTTTGCCrGGCACATGAACIACTTAAATGTCAA 
AACAAAAATGTAAATTAATTATAACTAAATTTTATATTTTTCCT^ 
GAGACAGGGTCTCCGTATGTTGCCCAGGCTTGGTCTTGAACTCCTGGGC^ 

GATTACAGGCATGAGCCACCATGCCCGACTGAGAC^

ATCCCAGCACTTO5GGAGGCCGAGATXKK3CGGATCACGAGCT 
CTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCGTGCACCrCTAGTCCT 
GTTGAACCAGGAGGCGGGGGTTGCAGTCAGCTGAGATGGCGCCACTGCACTCCAGCCTGGCG^ 
AAAATAAAATTATTATTCCATATGCTGTGTATTTTTGCCTTGTTATTTATTTTT 

AGAAATCTTACACTTCTAGAGCTAGTAGACATAATTGATTTTTAAAATTTAAT^

TTCTTGGTTAAGAATGAAATTCTCATTACACACCTACTGATTTTAGGGTTAAAGGAGTGAT 
ATTTAACATAAAAAC^AAGGGGCTGGGCACGGTGGCrCACACCTGTAAT 

GTTCAGG AG AT C AAG AC CAT CCTGG CT AACAGGG TG AAACT C CG TCT CT ACT AAAAAT ACAAAAAATT AGC CGGG CATGG TGG CGG 
GCGCCTGTAGTCCCAGCTACTC«3GAGGCTGAGACGGGAGAATG^^ 

ATCACTG CACTCCAGC CTGGGTGACAGAGTGAGACTCCATCT CAAAAAAAAAAC AAAAAACAAAAAAAAAAAAACATAAAAAGGAA

GGAGAAC^GGAGAACAGTrATTACCTTATATTTTGACTAACTACAAAGTG 

TCG ATTGGT CT CATG CTTC CTTCCGTT AG TTTCTG CAAATGTT AAAACTGAGT ATCAGGGCAGGCAC C CACTCACACCTG T AAT 
CCCAACACTGGGGGCCCAAGGTGGGTGGATCACTTGAGGTCAGGA 

ACT AAAAAT ACAAAACTTAGCCAGG CATGG TGG TG CATG CCTGT AAT C CT AG CTGCTTGGG AGGCTC^GG CG TT ACLAATCTCTTGA 

ACCCTGGAGGCAGAGGCTGCAGTGAGCCAAGATCACAACACTGCACT

ACAAACAAAACTGAGTATCAATTTTGACATTACA 

CTACAGCAAGCATTATCCTAAATAAGAGAAATATCAGAAACATTCGCTTTC 
ACTTGTATTCAACATCGTAGTtlAAGGTCATAGCCAGAGCA 

GCATAC^ATTGAAAAATGAGGAAATAAAACTATAATTATTTGAAATTATTATAGTGATA^ 

TAGATAAGTTTAAAAGAGTGAGCTfTGTGC^TTGGCAGTATAGCC

CTTTTTCCTAATATCCTGCTGTCATGACTTGCAGTATAGTCAGCATTGGCA^ 

TAAAGTTTAAAAGGGTTAGTGGATGTAAAATTAATATATAAAGGTCAGTCATATTTCCATATCTCAGC^ 

ATCATTAAAACAAATATTGTCAGGAGATATGAAGTATGTAGAAACAAATCAAAGAAATGATAAGATTTCTT^ 

AATTTTATTATGAGATCTTAGAGAATACTTAAATAAATGGAGAGGTATTCCGTGTTCTCT 

TTCCCAGGCCGGGCGTGGTAGCrC!ACACCTGTAATCCCAGCACT

GACCATCCTGGCCAAGATGATGAAACCCCATCTCTACTTAAAATACAAAATTA^ 
CTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCCGGGAGGCGGAGGTTGCAGT 
CTGGGCAAAGAGCAAGACTGTCTC^AAAAAAAAAAAAAAAAAAAA 
CTCCATATTAAATGTGATAATTCTAAACAGGAACAGAATATAGAAAAAGAACA^ 

TTAATACT AATTAAGG CAGTGT(ZATATTGGG ATAAGAATAG AT AGATTG ACTCATTGG AGTAGAATTG AGAGC CCAGAAG AGACTC

CAAT CAC CTGTC^C CCTG AACTGGAAT AATTGGGT AAATCATT ATCTT AACTTGTT^ CTTTCTTAAATGT ATGT ATAG 

CTCAGATTTATTTCAATATTTAATATTGCAAGCGTTTTGGTCTTATAAATTTAG 

AACTTAACTCTTATTAATATTGATTAGCCTATGGTAAAATTGGTTT^ 

ACCTATAGACATCATCCTAGAAGACATTAAGTGAAGACTTACTGTGTATATAAGCAC^TTTGAAATT AAAAAT^ 

T G AAGTT G CAACAGTG CT CAG AAGT ATT AT AAAATAG AAAT AAATC ATTTC C AAAAGAG AGATTTCATTTTTG CC CAAAAT CTTT C

TTCTAAGCTATAAAAGACAAAACTGTTCAATACACAGTTTTGTCAATTTAAATCT 

AAAAATGTATAATTATTGAAAAAAGAATTGAGAGCCCAGAAACAGAGCTAAGGAAATATGCGACTTTAGTG 

ATGTCAT AT CAAGAGG AAAGGAGG AAATGTTTGAGAAATG AAG CTTGAAAAAAAAAT AG TT ATTT AATT AAGT CGG ATTCCCTACC 
TCACATCATATACAAATATCAGCTGAAGATGGATTAAAGGCTTTAAAAGTGAATCA^ 

GGTAAACATTTTAATCCTTGAGGTATAAAAAGACAAGA

ATTTCAATTTATTATATTTTrCAGTTCTAGAATTTGATTCCTTTiTrrCA 
CTTGAAT AT ATT AAGT AT AGTT ACTTT AAATT ATATGT CTG AAAACTTC ATT ATTTG^ 
TTGCTTTCTTTTAAAAAAAAATTTTTTATTTTG^ 

AAGAGTAAAGATTTT AGG CCTACAATATTATCTCCCTG CAGAAAATTTTCTTCTGGCTAGTGGCCXKK3TACACTAGCAATCCCAGG 

TCACCTAAAATCAGTGAATGATTTAGGTTATTTGAAGTT^

CTAGGGTATAGCrCTTrCAGACCTCAAATCCCCCAAAGCCrTGGGTATTTAT 
TTATTTTTTCCCTATAGTTTATTGAGAGTTCATCTCAATCTTCTTAACCATCTCT 
GTGGCCTCTCATTCTGAACTGTtXSGTTTTCTTCTCTCATA 
TTAGAGAAATTTTGTTTCATATTTTGTCCACTTTATTCTCATGGTTGAA 
AGTAGAALTTCTTT 1' 1' ATT AACGTTT ACTTT ATG CTTT C T T CAGT AACTTGTAAAAGTATC~lAAftiGT"T TGG i t CAAT AGGAC
tttaagtaattcttcagtcatttatgtta^
cagacatcccttctgacttcatttgaaactttgtct 
cxtttaagggcataagaaaggtataggtoagtcttgca^ 
aggctttatagaaaagaagctttgagctggggtttcagccttacactgaatct^ 

CTTGCCTCTCAGTAGTITTTAACCATGTTGTGCAGAGATGA
GATTCAAAGCAAAATCTCATAAACAGTCATTTTTTTTGAGA 
ATGTTTTTGAACTTTGGCTTATTGACCAGTTTG 

GATACAATAAAATCTACCCGAATATGTCCTGCTACATAACCACAATCAAGCTATGGAACATTTC 

AAAACTGCATGTAAATGGAAGCATACAGTATACrTTTGTATCTGGTTTCT

AGTGTATACCCATAGCTTGTTrTTTTG^ 

CACAATTTCGGCTCACTGCAAGCTCTGCCTCTCGGGTTCACCA 

CGCCACCACGCCTGGCTAATTTTTTGTATTTTTAGTAGAGGCGGGGTT^ 

CATGATCCACCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACGC 
GAGTTTATTCCATTGTAGGTCTATACCACACTTTGTTTATCCATTCACAGA^

AT CT AG CT ACAAATTT ATCTTTTTTGCAT ATTGATT ACT C^TCTTTG CAATTG AG AG ACACT AATATCITG AATG CT ACATTT AT A 

ATGCTGTAAAAATGGATTTTGAAGAGGTAGCTCTCATATTGCCTACACAATAGTC 

TGGTATCATAACATTTTGACCTTrCAGTTGTACAGAGTACAGATG 

AATACTGGGATACTTATAAATGTTGTGATAGCTAAGTAGACTCTCAATGTCATTA 
TTATTAGTTATTGCCTGTAAAATAAGGAATTTGGGCTTAACTATGTTT

AAAATTG TT ATGT AAAT AATGTG C ATG TCTTGCT TT TTAAAAAT ATGACTT ATTGCT AT AG CTT ATGGCATTTG AAAAGG CAAT AT 

TTTCITTTGGAACTGGATTTCTGGTTCAAATACTAA 

GAACTGTAAAATTTTTAATTAAAACATTTTTTAAGTTTGCA 

TCTTGGTATTTCATCCTTAAAATGTTGATCCACATTTAGTAGATACTCITGTGTA^ 

TTCTAATAGATTTCAATTTTTAAAAGCATAGGCT

GGCAAGTAATTCATCTCTITACAGTCAGTTCCAGGCCAGGCTCAGTGGCTCACGCCCA^ 
GGGTGGATCACTTGAGCCTGGGAGTTTGAGACCACCCTGGGCAACATAGTGAAACCCCA 
TGGGCGTGGTGGCATGTGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGATGGGAAGATCACT^ 
GTGAACCAAAATCGCATGACTGCACTCCAGCCTGGGCAACAGAGTG 

GTTTGTAAAATGGGGATGAAAGTAATTCTTGTACAGATTTAGGAGATAATTTACAGAGTCC^

ATACTTGTTGAATGAATGGGTAAATGCCATTCTTTGTTC 
TCAACTTGGGATTTTGCTTTAAATCCCAGGTGAGGGGATAGTTACTTCCT 
TCATTACTTGATAAGAATGATAAAAGGCATTCTTGAGATTGTATACTGTTCTATTT^ 
GTTATTGGCTATAATGGGCAGGGAGGAGGACCTAGCAAGTCTTTAGGTCTATGTGACrATT 

TTCAGTCCCTGAAAAACAGCTGAATGATATTGAAGATTGTGACCTGAATGTGGTGAGACT

AACATGGTAATTTGACGACTGCTCTTCCTCCTGTTGTCTCGAACCCAATTTATGACAACCGT 
TAGTC1TAACTTGTTTTTTGTAATAGTTTAATTCTTAAAATTATTTTG 

CCAGACATTAGAGAAATATATAACAGAAAGGAAATCTTCTCTAATCTCATCCTCCAGAGATAATCACT 
TACCAGTTTCTTCTTCCTGTATATACACAGATACATTGTCATCATCATC^ 

TAAGTGATGATTGTTTTAAAAACTAACTTTCTTCTTACCATATATCTTGGACAT^

CAG CAT ATTTT ATT ATGT AG TTAT AT CAT AATTTT ATT AATT ACCCT CTT ACAAGGCATTTGG TTTGTTTCTTT^ 

TTTTTGGCCTAGCAAACTGTACTGCAATACATATTCTTGTATATTTTTTTTT 

CACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCAGTCATGGCTCACTGC^^ 

ATAGACTTAGATATACGTCC^CCAATGGTGTATGATATACCT

G CATGAAAGTTGT ATTG C CAT AGGGAG ATGT ATTGC CAT AGGG AAAATTG T ATTG CCATGGGGATG TGGT CATG ATTT ATGAAG AT 
TAATCTGGTGACAGTGGGTAGGATTGATTGGCGGAATCAGAATTTAGAAGC^ 
AAATGAAGCCACAAAGTCTAAAGTAAAGCAGTTAGAAAGGAAGTGGACAG 
TATACACTAAAACTTTTATTCTGTGAATGCTTT^ 

ATTTTTTTGAAAATCCTTTATATTTAGGTGCTCCAAATACTGCAGAATTAAGGATTTGTCCT

AGAGGAGGAGATGAAATATTTCTACTTTGTGACAAAGTTCAG 

TAATAGATGCAGTT ACTTTGTTTT C CCATTTTTT T T T T i I TGGTTTCTTATTGACTAGATGACATAGAAGTTCGTTTTGTGTTGAA 

CGATTGGGAAGCAAAAGGCATCTTTTCACAAGCTGATGTACACCGTCAAGTAGCCATTG 

CTATCACAGAACCCGTAACAGTAAAAATGCAGTTGCGGAGACCITCTGACCAGGAAG 

CCAGATGAAAAAGGTATGACATTTTGCTGGTAATAATTTATATATTTCTTGAAG

CATTTGAGTACAGTTATGTATATTCATAATTTATGTTTCTTTTCCT 
TCTTTATTTGTTGGGTTGTTTTTTGTTTTGTTTTGTTTTC 

GCAATCTTAGCTCACTGTAACCTCCTCCrrCCCGGGTTCAAGCGATTCTTCTGCCT 
GTGCCACCACGCTGGGCTACTACTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTGGC 

CTTCAAGAAGAGGTTTGATTAGTATGGAATAATTTCAAAGTATCTAAACTGCTTA

TTTT ACT CTTTG TG CACTTACCACTCAG AG AG T AGTCTCT AT ATT GGTTT CTT AT CAAACCTT ATTG CATT AGTTG ACATT ATTGA 
CTTGTGTGATAAGAAAACAAGATGAATATTTTAAACAGCTTGTTT^ 
TGGAGTACAGTGGCATGATCGTGGCTCACCAGAGCCTCAGCCTCCTGGGCTC^ 
GGGACrATAGGTGCGCACCACCACACTTGGCTAATTTTrGTGGGGT^ 

TGCTCTGTCTTGCCCAGGCTGGAGTACAGTGGCGCAATCCTGGCT

CTCAGCCTCCCAGGTAGCTGGGACTACAGGCATGCGCCA 
TGTTGGCCAGGCTGGTCTCAAACTCCTGACCrrCAGGTGATTCACCCAC 

ACTGCGCCTGGC CAATTTTTCT tiT Tl T I TT ATGG AGACAGGGTTTTCTCATGTTG CTT AGGTTGGCCTCAAACTCCTG CGCTCAAG 
TG ATC CACCTGC CTCAG CCT C CCAAAGTG CTG AG ATT ACTTT AAAC AATTTTTT ACT AAG AG AAAAAT ATCCT CT ACTG AT AC 

TACTGATTTGGGACATATTCTATAATACTTATTAAATCCAACTTACATA

TATTTGTTTATTAAAGGAGAGGATACAATCCrATAATTTTTTA 
AAGCAAAGAAACAAAAGACAACTCTGCTTTTCCAGAAACT^ 
TAGGTTTTGTTTT ATTT ACTTT ATTCAGTTTTTCAAATT^ 

ATCTCTCTCTGTCACCCAGGCTGGAGTGCAGTAGGGCAATCTCAACTCACTGCAACCT 
CCTCAGCCT C CCGGGTAGCTGAG ATT ACAGG CATG TG C CACCACACCCGG CT AATTTTl'GT A'lTTTT AGT AGAAACAG wTTT CGC
CATGTTGACCAGGATGGTCTTGAACTCCTGACATCAGGTGATCCACCCACCTTGGCCTC

CCACCACGCCCGGCCMTTTTTCAGATTTTAACTGATAATGTTA 

AGTTAATTTTCCTGAGAGACCAAGACCTGGTCTCCTCGGTTCAATTGGAGA^ 

TAGCATATTTCTTGTGATCAGAAAGACCAGTACTTTGCACAATATATTGGAA 
AAATTTTCTCTCCAGATTTTTC^TGCTTTCAGTA^

TTTGGGATTT ATTG TTCAAGAAATTACATG TTGTTCAT ATGGT AG CAATTTT ATTAATT ACATTG CX3TTATG AAAAAATGTTACTG 

ATTTGGTTAGTTACCTGTTTTTATAAATAACATTTTACTGGAACACGGC 

TTTTTTTTTAACACTATGGCAACAGGGTTGAATAGTTCCA^ 

ACTTITTACAGAAAAAAAAATTGCCAATCCCTA 
ACATTCTTAAATCATTGCTTTTTCCTGGTTTAGTTTGTAAAGCA

AA l " riT , rT rCTCACrGTGAATTTTATAGTCTCTTAGGGTTCTGTTGTAATATAT 

TGAGTGTTGTCTO^TCCTTCCATTTTCTCCTAAAAGTGAAGAGTAAGAGATGA 

TCTAATATTTATCTTTTAGTTTTCAAAAATACCTTTATAGAGCCTCAGAAACT 

ATGCCAAAGTAACTTGCCAAAATCATTTTACCTGTATTTGTAAAATGGTATATAC^ 
AATTTTAGTGCrTTTTAC^TTTTTTTATTTGAAATG

CAATTTT AATTTAGAAATG Cl^ri'I'l ATAATTTTG TTCATTTT AATTTC^T AAAAT AAATTTTT C CTCC CACAGAAC CAAACTTGTT 

TTCTCATGATGCAGTTGTGAGAGAAATGCCTACAGGGGTTTCAAGTCAAGCAGAATC 

GTGGATTGTCACATCATCCCTCAATGGCACCTCTGC L '' l 'T L '' l ' A 'CAAGCTGGTCATCAGTX^CCCACCCCACCCCACGCTCAGGCAAT 
ACAAACCCACTG AGT AGlTl"fT CAACAAGGAC^CTTC CTT CT AATTCGCAAGGT ATC CCACCATTCCTG AGAAT ACCTGTTGG GAA 
TGATTT AAATG CTT CT AATG CTTG CATTT ACAACAATGC CGATGACAT AGT CGG AATGGAAG CGTCATC CATGCCATCAGCAG ATT
TATATGGTATTTCTGATCCCAACATGCTCTCTAATTGTTCTGTGAATAT^^ 
AATCCAAGACTTCTGAGCATGAATCTTGAAAACCCCTCATGTAATTCAGTGT^ 
GTCCTCTTCCAGTATGTCAGCAGGCGCCAATTCCAATACTACTGTTTT^ 
GTTGTG(2AGATAACAGCATGATAAATGAGTCGGGACCATCAAACAGTACTAAT^ 

TATTCAGGTATTGGCAGTATGCAAAATGAGCAATTGAGTGACTCCITTCCATATGA

ATCCTTTTAAATCITGATACCACCTATATAGATGCAGCATTTTGTATT^ 
ATATAATACTGACTGAGAATATAATACTGTATTTGAGAATATAAA 
GAATACAAAATTGGAAGCTGTCATAAAAAGACAACTC^ 
CCAAGGCGGCTGGATCACTTGAGACCAGGAATTCGAGACCAGCCTGG^ 

ATTAGCTGAGCATG<3TGGTACGTGCCTGTACTGTCAGCrAC

GTTGCAGTGAGCTGAGATCACACCACCX3CACTCCIAGCCTGGGTGACAGAGTG 
CACACTl'rrrTATArrrCl w lTTTATAATGTTTTAATGTATTCTTAAATTT^ 
CATTGAAAAACTTAATTTTTTATTTTTGAGGCCCATGGGCCAAGGTAACC 
TTGTATGTATATCAAAATGTCTTTAAAATGTTAAGTTGGGCAGAAGGCAGTTG^ 

ATTTTATACTATTTCAATCTATGCCTTTAAAGTTGCTTATGATlTTAGCre

ACCATTCGCCTTTCTTAGATGTCCTCACTCCCTGTGATCTCATAAAACrG 

CTCACACTCAGCATATCCAAAACTGAATTCl'lGGTCTTCCCTCCCAAACTT 

CAATGCCATACTTCTGGTTGCTCAGGCCAAAAACCCTCAAGTC^ 

AATCTGGT AGACC CTACCTTCACAAT AT AT CT AAAGCTGAC C AC CT CTTTTG ACCTCT ACT ATT AACG C C CT ATTCCAAGC CAC CA 

TC^ATCTCTTCCCTGGATTGAAGCTGTCATCTACAAAAATTCT^

CAGAATGATTATTTTTAAAACAAGTCAACTCATACCATTCATCTGCTCAAAACCAT^ 

CAAAGTCCTTAAAAGTTGCAGGCCTAGACTCCCTGTCCTACCTCAGGTACCACCATATCCCACCTC 

TTGGCCTCAGTGCTCCTCAAAGCATCAAGTATGCACTTC^^ 

TCCCAG AAAAATG CATAGTTCACTCTTACATCCTTCAGGTGT CACTCCACTGTTACCTGAGCAGGTCGTCCTTGAATATAT ACATC 

AGCATTCCCTTTCCCCCCTGCTTTATGTATGTCCATAGCACTCACCACGATCTGACTTTACT

TTTTCC CAT ACCX1AAAT AT AAACTTTCT AAGGACAG AAATTTTTGT GCTT^ C CT AC 

CTT AT ATTAAACACTCAGT AAATG TTT ATTG AACATTAAAAGT ATTACH 
ATTATAAGACGTATATGATTTTTGCAGTTTTACTTACTGTGACATTG 
AATGGTCATCCAAATTGTTTGAAAGGAAAATAATCCCAGTGGCAAAATGATG^ 

AAAAGAGATTTTATTGAAGGTAAAACATTAGAGGTTCATTGAGAATCT

TTCCAGATTGAGTAACACTTATAACACATTTCCTTTTCAAAGTC 

TTTCAAATGGTGAGCTGAATGCTGGGTAATCTCTACTAGCTCCTTAATCAGATTTAAAATTCTC^ 

AT ACTTT ATG TGAGTTGTT AT AG CTG TAACATTACACTTT ATTT GCT GTTTG TGTTT C GT G ACTTTTGGTAATT CTGGCATTT AGA 
AACCTTTCACTTTGCTTCAAAACGTAGTTATATTTTGGAGTTT^ 

AGACATTGTAAGGGTTAATTAGATCATTATATTTTATTATTACA

TTATGCTAAGTAATTAACTTTACCTAGTTTGTTTTACAACTAGAACCTGC^ 

TGTCATTTTACTTACACGATGTCTAACCAAACCATAACnTTACATA 

ATAAAAACAAAAATTCTCAGTTAAATGATACTGGGAAATAGGGA^ 

ATGAG CAGC CATT ATGAAATCTCACAAGAATCAT AGAATT CAAT AAAAAAGG T AGAAAGT AATTTTTTTACTT AAAAATAT AAATT

AAAATAAATTTTTAAAATCATAAGCACATAAATAGAACTTACCAGGGAGAAAGAAA 
AAATGTGAACCCAGGATGTCTCTAGATGATGATGGATGATAGGTGGGGAGATTTTTTTTTT^ 
GGATTAATTAGCACCAATCAGTTTAAACACTGACTGTTAGAATAGCTGCATGGGTTTTTTT 
CTTAGTATAAGTAAGTATAAGCCGAATTAAGGTTCTGCTACATCTGTGTTT^ 

GTTTTGCGGTGTAAGGAGAATTAAGGTTCTGCT ACCTCT

TTTGACCCTGTATAGACrTTAATTTAAAATGAATTTGGTAACGTTTCTCCT 

TTTAACACCTGCTTTTAGTATCTGAGGCACTTTTrCTGAACT CCAGGCTGTG 

TTTACTTTATCCTTACATCACCACTTAGTGATTCCTTTCTT^ 

AAATAAAACCITrCCTACCACTTGGATGCATTTGCATCCTGACTTCTGA 

TCCTTAGCCCAGCCATCTCTGTCTTTAGCTCCTACAATTTTCTT

AAATCTAGTGTCTTTTTTTCAGTCCTCACACTGCTTGAC 

TTCTGTGGTAGTCAAGATTCCTCCCTGAGATTTATTTCCCATGAGTCTTGACCCCTCCCCT 

GCCCTCCGATG ATCTTATCAG AGCCCACAGGTTCAG'rrrTCT TT CATGCT ACCTGAATCT CCTGAT AAACTG GCTCG CTCTCTTCT 
TTACCTTCGATAATGGCATTACCATTTACCACGCCACC 
ACACTCATACTTAATCATCAAGTCCITrTGAGCTTGfrCTCCT
(XICTTCATTCrTTTTTCCCTAGTAATAAT

AGCATTCCTTGAAAAAAAACAATTCTCTCAGGCCTCCATACCTTTAGCATGT^ 
CACTCJVTCCTTGAAGGCTGGGCTTCTGTATGAAGGTTGGTCCTGCCT 
GACATCCTCTTATTTAAGTGTTTGTCTCTTTCGTCATTGGG^ 
TAAATAGTAGCTATTATTAGAAAAGGAAGGGTGAAATTGACATGGGAGTTAGTAAAATGTATATGGAAA
GGTAATGATTTTCTGGCAGGAAAAGCAGCAATGACAAGATTACI^ 
TCCCTTTTTTACCACACACACACGCACACATACCACAGCCCTTTGA 
ATTATGTAGAAATTCTAAAGTTTTTGGGATTATTTCATAGCCCrGACCTTC 
CAGGTCTCCCTCATACACTTCTCAGCCTCAGCACCrAACCCTCACACA^ 
TTTTGAATGTCCAATGTGCAAAGCACGATGTTGGAAATTATACAGAGGTG^
CITTTTCTTTGCAAGGATAACACATGTAGAGTAAAATGCATAAAGGGGACT 
TGTTAACACCCATGTCACCACCATGTTTAGGACATTTCCAGCACCCCTGAAATT^ 
CCTCTAAATCTATTTTCAATCTTAATGGCCTTTTAAATAACT 

CGTCTAACCTAATACTTCAGGAAAACTCATGATGGTTTCCATGTTAAGAGAGACATC 
ACGC CTGT AAT CC CAGCACTTTGGGAGG CCX3AGG T AGGGGGATTG CTTG AG C CCAGG AGTTC AAG ACT AG CCTGGGT AAT AT AAGG

AAAACCTGTCTCTGCAAAAAAAAAAAAAAAAAGAGGATACAACCAAATGGAAGAACATTCCATGCT 

ATCGTGAAAATGGCCATACTGGCCAAGGTAATTTACAGATTCAGTCCCATCCC 

GGAAAAAACTACTTTAAAGTTCATATGGAACCAAAAAAGAGCCC^TATCGCCAAGTC^ 

GCATCACACTACCTGACTTCAAACTATACTACAAGGCTACAGTAACCA^ 
CACTGGAACTVGAACAGAGCCCTCAGAAATAACGCCGCATATCTACAACTATCTG^

GGGAAAGGATTCC CT ATTT AAT AAATGGTG CTGGGAAAACTGG CT AGCCAT ATGT AGAAAG CTG AAACTGG ATCCCTTCCTT ACAC 

CTTATACAAAAATCAATTCAAGATGGATTAAAGACTTAAACGTTAGACCT 

ACGATTCAGGACATAGGCATGGGCAAGGACTTCA.TGTCT 

T CTAATT AAACTAAAG AGCTGCTGCACAG CAAAAGAAACTAC GATCAC^GTGAACAGGCAACCT ACAA 

CAACCTACTTATCTGACAAAGGGCTAATATCCAGAATCTACAATGAACTC^

AGTGGGCGAAGGACATGAACAGACACTTCTCAAAAGAAGACATTTATG 

GCCATCAGAGAAATGCAAATCAAAACCACAATGAGATACCATCTCACACC^ 

CAGGTGCTGGAGAGGATGTGGAGAAATAGGAACACTTTTACACTG^ 

TG TGGOIATT CCT CAGGG ATGT AG AACTGG AAATAC CGTTTG AC C CAGCCAT CC CATT ACTGGG T AT AT ACC CAAAGG ACT AT AAA 
TCATGCTGCTATAAAGACACATGCACACGTATGTTTATTGCGGCATTATTCACA

AACAATGATAGACTGGATTAAGAAAATGTGGCACATATACACCATGGAATACTATGCAGCCATAA 

TTGTAGGGACATGGATGAAATTGGAAAACATCATTCTCAGTAAACTATCGCA 

ATAGGTGGGAATTGAACAATGAGL^CACATGGACACAGGAA 

AGGGATAGCATTGGGAGATATACCTAATGCTAGATGACGAGTTAGTGC^TGCAGT^ 

GTAACCTG CACAATGTG CACATGT AC C CTAAAGCTT AAAGT AT AAT AAAAAAT AAATAAAT AAAT AAAT AAGAAAAAG AAAGC CAG




TAATG^ CATC AAATAATTTGTT AT ATCTTTAATTTATTG AAGG ATCACCACATG 
GGGACTTTTTACTCAGGTAATACCCAGCCTGCTACCTAACAGGTTGTGTTGTACTAAA 
AAATTATATAGTGCGAATCAAATATTGTTAATAAACCAATATCTGTATATCCTATGTCCCGGATTAA
TTCTAGTTATCTAATACACAGCAGAGTGAGAAAAATCATTATC 

G CIATTAACTAGTT'ATTXjCCCAGCnTTGGAG AGC Cri ' ClTrr GGCTTATCATTTATTATAJ^CCCAGAAATAGGTGACTAATCAG AG 

ATAAATGTATGGGTTGTCTGTATCTAGTTTTATGCCTTTTTTTTGCCTAAGACGTAGTCAA^ 

AGCAGAGTTTAGTTAAGCACAAAGTTAACAGTGGGTAGG^ 

AAAAGCACAGACAAAACCACTCATGCCCTATTAATAAAC^

ATGTCTGTGGCTTTAGGAAAATACCACCAGCTAGTACTTACCTATTTAAAGATGTAGAATTTA 
TTGTTTCCACAACTTTAGTTTACTATTGGACTTTCAAA 

G T AAT ACACAG TAATT AGT ACAGC ATGTTG CTTCTT CAACAAATTGAGTT^ CAGGG AAATCAG CAAGT AAAT GAAAT AT AAATTT 
TTGGTAAAAGTATCAAACATTCATCTTGCCCATTTTTCCTCTTAAACTT^ 

TAAAATATAGATAAGGTTGGAATATTTGAGGATCCATTTGT'GGAACTGAATTTAATGAGACT^

ACTGGGTAATTAGCTAATAATGTTGGTCACTGTCTCACAGTTCAAGTAGCTTTAAGATGA 
GGGTAACCAGCGTTCTTAAATGTATGGTTTTTGACCA 

ACCTTTGGCTGTGATGAATGTAGATCCCAGCAGAATACCAAAATCCTATTTTTTTTGA 
AAATGAATTTGGAGGCACTGATGAAAGTGATTTTTTTA^ 

ATACAGTTGTTTTTAAAATGCCAAAATGCTCTTTATTATACAGAATATTTTATT

TTTCTTGATAATAAATGGAAAAATTCTAAAACACTTGCTGAAA 
TTTATAGCATCTTCCTAACAGGAAAAGGTAGTTTTAGTTGCTTAGAAACATTAGACA 
TTGCATTTGTAACTGTCCTTTGAAATAATTTTGTAGAGTACTGAAGTGCCTTTTTA 
TTAACCAACAAAAATCTGTTTTCAGAGTCCTGGAGGCTGAAAGTCCrGA 

TCCTGGCTTGCAGAGGGCTGCCTTCCCATCATCCTCACATGGTAGAG

TTGAGACAGAGTCTCACTCTGTCATCAAGGCTGGAGTGCAGTGGCACAATC^^ 

AG ATC CT CCCG CCTCAG CCTC C CAAG TAG CTGGGACT AC AG G CG CATG CCACCATG C CTGGCCAATTTTTG T ATTTTTTGT CAG AG 

TTTCATCATGTTACCCAGGCTGGTCTCAAACTCTTC^CTCAAGTGATTCACCCGCCTT 

CGTGAGTCACCATAGCTGGCCTGGCCTCCTCTTIT ^ 

TTGTTCAACATGATGTTTTGAAATATGCATACGTTGTGGAATGGCTAAACT

TTGT AAG AACACTT AAGAT CT ACT CTT AG CIAACTTTG AAG AAT AT ATTTG TT ATT AACT ATAG T CACCATGTTGT ACAGT AGATCT 
CTTAAACTT ATT CCTAT CAAG CTGAAATTTTAAAT CCTTTG ACCAACATCTC CC CAACT CCGCAACCC CT AT CCCACT CTCACCT C 
CGCAGTTCTAGTAACAACCATTCTACCCTACIT'CTATGAGTTCAACT 
GTCTTTCTGTGCCTGACTTATTTCACTTAACATAATGTTC^ 

TGGGTG AATAAT ATTT CATTGT ACATAT ACAC CACA1TTTCTT CATTT AT CT

HUMAN SEQUENCE - mRNA 

CGGAAGGTGTGAGCCGCAAACCCAGCGGAGGGCGGGAAGAAGGAGGAGGCCTCT^ 
AGAGGTCCCTCGGCCTCCTGACTGACTGaVCTGCGCCCGCCrCCGGCC^^ 
GAGCCATGGCCTCCC^TGCGTATAACCCGTATATAGAGATAATTGAACAACCCAGGCAG